Learning Machines

Key questions

375:
- How can systems improve from experience?
- What can be learned from data vs interaction?
- How can we evaluate learning: does it generalize?
376:
- How can we learn without labeled data? (self-supervised learning)
- How do foundation models learn generalizable patterns from massive datasets?
- How can generative agents learn to improve their behavior from feedback?

After this course, I will be able to:

375:
- Learning Theory
  - I can explain how different ML approaches (supervised, unsupervised/self-supervised, reinforcement) learn.
  - I can explain how stochastic gradient descent uses data to improve performance.
  - I can describe the relationship between loss functions and metrics.
- Implementation and Debugging
  - I apply data validation techniques without having to be reminded to do so.
  - I can diagnose problems in model training, such as overfitting or underfitting, from metrics.
  - I can implement a basic training loop in PyTorch (also in ML Engineering)
376:
- I can explain how next-token prediction implicitly encodes many other linguistic and reasoning tasks.
- I can explain how supervised tuning and feedback tuning can improve the performance and reliability of a model.

The content may not be revised for this year. If you really want to see it, click the link above.

The content may not be revised for this year. If you really want to see it, click the link above.

The content may not be revised for this year. If you really want to see it, click the link above.