Unit 7: Learned Representations (Embeddings)

In this unit we introduce one of the most powerful concepts in machine learning: the embedding. It’s the idea that instead of being explicitly given a representation for something, we learn it from data based on the properties it should have. We’ve actually already seen embeddings of various kinds (the inputs to the last layer of an image classifier for one), but we’ll look at two examples where the embedding aspect is even clearer: movies and words.

We’ll also look a bit at some very useful non-neural models for tabular data, and the sklearn fit-predict API. But since this material is reviewed in DATA 202 and DATA 303, we won’t spend much time on it.

Students who complete this unit will demonstrate that they can:

Preparation

Class Meetings

Monday class
Wednesday Class
Friday Lab

Contents