MA50290: Applied Machine Learning

Note: The official webpage for this unit is on Moodle.
(since Moodle can have issues, I thought it would be good to set up a backup webpage)

Unit convenor: James Foster
Email: jmf68@bath.ac.uk

Lectures are in 8 West 2.5 at the following times in Semester 1, 2024
10:15am Tuesday (all weeks except week 6) and 1:15pm Wednesday (odd weeks)

Labs are in Chancellor's Building 4.17 at 1:15pm on Wednesdays (even weeks except week 6)

I will be away in week 6 at a research workshop, so there will be no lectures and labs that week. Instead, there will be a lecture in Chancellor's Building 4.17 at 11:15am on Friday 1st Nov and a lab in the Library's Level 4 PC area at 11:15am on Friday 15th Nov.

Office hour: Happy to meet you whenever I'm in my office (4 West 3.37).
I'll probably be there on Tuesdays after the lecture, but it may be easiest if you come up to me after lectures or make an appointment.

Overview

In recent years, there has been a surge of interest in algorithms that can "learn" patterns from data. Whilst such algorithms have traditionally been studied in the area of statistics, modern computers and large datasets have led to a new field of Machine Learning (ML) which lies at the intersection of mathematics, statistics, computer science and engineering. Nowadays, many large-scale ML models (e.g. ChatGPT, ‎Gemini and Stable Diffusion) are so impressive at generating and processing data, we often call them "Artificial Intelligence".

In this unit, we will introduce some of the central topics and algorithms in machine learning, explore their underlying mathematics and develop practical implementations using Python.

Topics covered in this unit include:

• Statistical learning theory: Data as random variables, expected risk and cross validation.
• Data preparation and retrieval: Numerical data as vectors and matrices (e.g. images) and non-numerical data (e.g. text).
• Unsupervised learning: Clustering and Principal Component Analysis (PCA).
• Supervised learning: Linear regression, logistic regression and maximum likelihood.
• Optimization algorithms: Gradient Descent (GD) and Stochastic Gradient Descent (SGD).
• Non-parametric models: Decision trees and K-nearest neighbours.

MA50290: Applied Machine Learning

Overview

Course materials