Week 4: Supervised and Unsupervised Learning

Announcements

Tuesday

Supervised learning

  • From an algorithm perspective

  • From a dataset perspective

Unsupervised learning

  • From an algorithm perspective

  • From a dataset perspective

  • Clustering

  • Dimension Reduction

Clustering

  • Overview

  • K-means

  • DBSCAN
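
To make the K-means item concrete, here is a minimal sketch of the usual assign-then-update loop (Lloyd's algorithm) in plain Python. The function name `kmeans`, the toy points, and the choice to seed the centroids with the first k points are assumptions made for illustration; library implementations typically use random or k-means++ initialization instead.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def kmeans(points, k, max_iterations=100):
    """Cluster `points` into k groups; returns (centroids, labels)."""
    # Simplifying assumption: seed the centroids with the first k points.
    centroids = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(point, centroids[c]))
            for point in points
        ]
        if new_labels == labels:
            break  # No assignment changed, so the loop has converged.
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:  # Leave an empty cluster's centroid where it is.
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels


# Toy data: two well-separated groups in the plane.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
centroids, labels = kmeans(points, k=2)
print(labels)
```

K-means needs the number of clusters k up front and favors compact, roughly round clusters. DBSCAN instead groups points by density and marks isolated points as noise, so it does not need k.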

Dimension Reduction

  • Decomposing signals in components

  • Principal component analysis (PCA)

  • Kernel Principal Component Analysis (kPCA)

  • Independent component analysis (ICA)
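
As a rough illustration of what PCA computes, the sketch below finds the first principal component of a small dataset: center the data, form the covariance matrix, and extract its dominant eigenvector. Power iteration is used here only to keep the example dependency-free; it is an assumption of this sketch, not how libraries usually implement PCA (they typically use an SVD). The function name and the toy rows are hypothetical.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (unit direction, variance along it) for the top component."""
    n, d = len(rows), len(rows[0])
    # Center each feature at zero mean.
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]
    # Power iteration: repeatedly apply cov and renormalize.
    v = [1.0] * d
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Variance explained by the component is v^T cov v.
    variance = sum(
        v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)
    )
    return v, variance


# Toy data lying close to the line y = 2x.
rows = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.0), (5.0, 10.1)]
component, variance = first_principal_component(rows)
print(component, variance)
```

Because the toy points lie almost on a line, a single component captures nearly all of the variance, which is the situation where reducing two dimensions to one loses little information.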

The Diabetes dataset

  • Regression

  • Efron, Bradley, Trevor Hastie, Iain Johnstone, and Robert Tibshirani. "Least angle regression." The Annals of Statistics 32, no. 2 (2004): 407-499.

  • Cited by 11138

  • Google Scholar has the Paper PDF file

  • lab 04
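
A possible starting point for working with this dataset, assuming scikit-learn is installed and that its bundled `load_diabetes` data is used. Note that this fits an ordinary least squares baseline with `LinearRegression`, not the least angle regression method from the paper; the 80/20 split and `random_state=0` are arbitrary choices for illustration.

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Features X and the regression target y (a disease progression measure).
X, y = load_diabetes(return_X_y=True)

# Hold out 20% of the samples for testing (split choice is an assumption).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Ordinary least squares baseline, not the LARS method from the paper.
model = LinearRegression().fit(X_train, y_train)

# score() reports R^2 on the held-out samples.
r2 = model.score(X_test, y_test)
print(f"samples={X.shape[0]}, features={X.shape[1]}, test R^2={r2:.3f}")
```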

Read accepted posters

Thursday

Midterm

  • Research questions

  • Literature review

  • Paper format: two-page poster and full-length eight-page paper

  • Timeline

  • Team: student pairs

Concept

  • Loss function

  • Accuracy

  • Train-test

  • Cross validation

  • Overfitting and underfitting
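
The concepts above fit together in a short sketch: a loss function (mean squared error) scores a model, and k-fold cross validation repeats the train-test split so that every sample is used for testing exactly once. The helper names, the straight-line model, and the toy data with an alternating +/-0.5 offset are all assumptions made for illustration.

```python
def fit_line(xs, ys):
    """Least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return slope, mean_y - slope * mean_x


def mse(model, xs, ys):
    """Loss function: mean squared error of the model's predictions."""
    slope, intercept = model
    return sum((slope * x + intercept - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def k_fold_mse(xs, ys, k):
    """Return the held-out MSE for each of the k folds."""
    scores = []
    for fold in range(k):
        # Every k-th sample, starting at `fold`, is held out for testing.
        test_idx = set(range(fold, len(xs), k))
        pairs = list(enumerate(zip(xs, ys)))
        train = [xy for i, xy in pairs if i not in test_idx]
        test = [xy for i, xy in pairs if i in test_idx]
        model = fit_line(*zip(*train))         # Fit on the training folds only.
        scores.append(mse(model, *zip(*test)))  # Evaluate on unseen samples.
    return scores


# Toy data: y = 3x + 1 with a deterministic alternating offset as "noise".
xs = [float(i) for i in range(10)]
ys = [3.0 * x + 1.0 + (0.5 if i % 2 == 0 else -0.5) for i, x in enumerate(xs)]
scores = k_fold_mse(xs, ys, k=5)
print(scores, sum(scores) / len(scores))
```

A training error much lower than the held-out error suggests overfitting. When both errors are high, the model is likely underfitting.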

Textbooks

All textbooks are freely available online and are optional, not required.