In this unit we extend our modeling skills to encompass classification models, and start to build the tools that will let us represent complex functions using hidden layers. Both of these objectives require us to learn about nonlinear operations. We'll focus on the two most commonly used ones: the softmax operator (which converts scores to probabilities) and the rectifier ("ReLU", which clips negative values to zero).
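To make those two operations concrete, here's a minimal sketch of each using plain PyTorch tensor operations (the function names and example values here are just for illustration):

```python
import torch

def softmax(scores):
    # Exponentiate, then normalize so the outputs sum to 1.
    # Subtracting the max first is a standard numerical-stability trick;
    # it doesn't change the result because softmax is shift-invariant.
    exps = torch.exp(scores - scores.max())
    return exps / exps.sum()

def relu(x):
    # Clip negative values to zero; positive values pass through unchanged.
    return torch.clamp(x, min=0)

scores = torch.tensor([2.0, 1.0, -1.0])
print(softmax(scores))  # probabilities, e.g. ~[0.705, 0.260, 0.035], summing to 1
print(relu(scores))     # tensor([2., 1., 0.])
```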
Students who complete this unit will demonstrate that they can:
Describe the difference between a metric and a loss function.
Describe and compute cross-entropy loss.
Explain the purpose and mathematical properties of the softmax operation.
Explain the role of nonlinearities in a neural network (e.g., why they are used between linear layers).
Implement a logistic regression model using basic numerical computing primitives (optional for 22SP; see the sketch after this list).
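For a concrete sense of what the last two objectives involve, here's a minimal sketch of binary logistic regression trained with cross-entropy loss using only basic tensor primitives. The data is synthetic and the learning rate and step count are illustrative, not tuned:

```python
import torch

# Synthetic data: 100 examples, 2 features, labels 0 or 1.
torch.manual_seed(0)
X = torch.randn(100, 2)
y = (X[:, 0] + X[:, 1] > 0).float()

# Parameters of the linear layer, learned by gradient descent.
w = torch.zeros(2, requires_grad=True)
b = torch.zeros(1, requires_grad=True)

for step in range(200):
    # Linear combination of inputs, squashed to (0, 1) by the sigmoid.
    p = torch.sigmoid(X @ w + b)
    # Binary cross-entropy loss: -mean(y*log(p) + (1-y)*log(1-p)).
    loss = -(y * torch.log(p) + (1 - y) * torch.log(1 - p)).mean()
    loss.backward()
    with torch.no_grad():
        # One step of gradient descent, then reset the accumulated gradients.
        w -= 0.1 * w.grad
        b -= 0.1 * b.grad
        w.grad.zero_()
        b.grad.zero_()

print(loss.item())  # should be small: the classes are linearly separable
```

Note the shape of the computation: a linear layer, a nonlinearity, and a loss. Extending this pattern to multiple classes (softmax instead of sigmoid) and multiple stacked layers (with ReLUs between them) is exactly where this unit is headed.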
Preparation
The fastai course videos are a bit disorganized here, sorry about that.
This should reinforce what we’ve been studying about how linear regression works and how Tensors work, and give you a preview of how we’ll extend it to a full neural net.
Supplemental Material
We're using Elo scores for intuition a few times this week, but we're intentionally not diving deep into them. If you do want to dive deeper: