Learning Machines

Key questions

375:
- How can systems improve from experience?
- What can be learned from data vs interaction?
- How can we evaluate learning: does it generalize?
376:
- How can we learn without labeled data? (self-supervised learning)
- How do foundation models learn generalizable patterns from massive datasets?
- How can generative agents learn to improve their behavior from feedback?

After this course, I will be able to:

375:
- Learning Theory
  - I can explain how different ML approaches (supervised, unsupervised/self-supervised, reinforcement) learn.
  - I can explain how stochastic gradient descent uses data to improve performance.
  - I can describe the relationship between loss functions and metrics.
- Implementation and Debugging
  - I apply data validation techniques without having to be reminded to do so.
  - I can diagnose problems in model training, such as overfitting or underfitting, from metrics.
  - I can implement a basic training loop in PyTorch (also in ML Engineering)
376:
- I can explain how next-token prediction implicitly encodes many other linguistic and reasoning tasks.
- I can explain how supervised tuning and feedback tuning can improve the performance and reliability of a model.