Unit 12: Review, Reinforcement Learning
Preparation
- Watch MIT 6.S191 Lecture 5: Deep Reinforcement Learning: [Slides], [Video]
Supplemental Material
- Contextual
- Technical
- Using Sequence Models for RL
- Spinning Up in Deep RL - a hands-on introduction to reinforcement learning in PyTorch by OpenAI
- Creativity and Exploration
- Other
- Strategies for Missing Data (a stats reference) - related to our guest lecture from last week
Class Meetings
Monday
- Review of neural network architectures:
- Wiring doesn’t change: Feed-forward (MLP)
- Current sample wired to previous sample:
- Current sample wired to surrounding samples: Convolutional Networks (CNN)
- Wiring computed dynamically based on “self-attention”: Transformer
- Tricks
- Residual Connections
- Dropout
Wednesday
- finish topics from Monday
Friday
- Reinforcement Learning (learning from feedback)
Contents
Due this Week