ST310 (Machine Learning)

Contact
Tip

Internet is a great resource. Use it. Some resources I like:

  1. MIT OpenCourseWare,
  2. Stanford,
  3. Harvard.
Beware, the Crime!

It is illegal to download articles and books from pages like LibGen, Sci-hub or from Telegram bots like @scihubot. Also, DO NOT use VPN to protect your freedom of education (Opera offers a free VPN).

🙃

Course content (Official)

The primary focus of this course is on the core machine learning techniques in the context of high-dimensional or large datasets (i.e. big data). The first part of the course covers elementary and important statistical methods including nearest neighbours, linear regression, logistic regression, regularisation, cross-validation, and variable selection. The second part of the course deals with more advanced machine learning methods including regression and classification trees, random forests, bagging, boosting, deep neural networks, k-means clustering and hierarchical clustering. The course will also introduce causal inference motivated by analogy between double machine learning and two-stage least squares. All the topics will be delivered using illustrative real data examples. Students will also gain hands-on experience using R or Python (programming languages and software environments for data analysis, computing and visualisation).

Material and solutions

I did not write the following material, which was prepared by Joshua Loftus. My role was simply to present the content of the seminars and the solutions in class.

Week Seminar Material
Week 1 Seminar
Week 2 Seminar
Week 3 Seminar
Week 4
Week 5
Week 6
Week 7
Week 8
Week 9 Seminar
Week 10 Seminar