Unit 3: Data and Ethics

Students who complete this unit will demonstrate that they can:

Identify ethical issues pertaining to the collection and use of data in AI systems.

Explain examples of social impacts of AI systems in wide use today.

Explain examples of biases in AI systems.

Explain the importance of evaluating image classifiers on unseen data.

Preparation

Finish reading Deep Learning for Coders chapter 3 (open in Colab). Complete the prep quiz in Moodle.

Next week’s chapter is dense, so I highly recommend you get a head start on Preparation 4.

Monday class

Trying out sli.do (event code 2557005)
- Review quiz about last week’s guest lecture:
  - What about Colin Davison’s task made it “supervised learning”?
    - He gave the classifier examples of input-output pairs.
  - Why did he need to split his data?
    - So that he could evaluate how well the classifier would do on data it hadn’t seen.
  - What did he need to do to the text to make it usable by his classifier?
    - He turned each sentence into a vector.
  - Which of the following is a bigram?
    - “bi” is a character-level bigram.
    - “a bigram” is a word-level bigram.
  - We summarized the difference between classical ML and deep learning as whether the feature extractor is programmed by hand or learned. (The classifier is the same.)
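To make the bigram and "sentence into a vector" answers concrete, here is a minimal sketch in plain Python. The function names (`char_bigrams`, `word_bigrams`, `count_vector`) and the tiny vocabulary are made up for illustration. This is one simple way to vectorize text by counting bigrams, not necessarily the approach used in the guest lecture.

```python
from collections import Counter


def char_bigrams(text):
    """Return every pair of adjacent characters in text."""
    return [text[i:i + 2] for i in range(len(text) - 1)]


def word_bigrams(text):
    """Return every pair of adjacent whitespace-separated words in text."""
    words = text.split()
    return [" ".join(pair) for pair in zip(words, words[1:])]


def count_vector(text, vocabulary):
    """Turn a sentence into a vector: one count per bigram in vocabulary.

    `vocabulary` is a hypothetical, hand-picked list of word-level bigrams;
    a real system would usually build it from the training data.
    """
    counts = Counter(word_bigrams(text))
    return [counts[bigram] for bigram in vocabulary]


print(char_bigrams("bigram"))            # ['bi', 'ig', 'gr', 'ra', 'am']
print(word_bigrams("this is a bigram"))  # ['this is', 'is a', 'a bigram']

vocabulary = ["this is", "is a", "a bigram", "a vector"]
print(count_vector("this is a bigram", vocabulary))  # [1, 1, 1, 0]
```

Note that the feature extractor here is programmed by hand, which is the "classical ML" side of the distinction above.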
Introduce Homework 1
- Goal: Train and evaluate a classifier on a dataset we collect ourselves
- Next week: Kaggle competition!
Introduce Discussion 2
- Practice and refine our answers to questions we might get asked at family gatherings, interviews, etc.
Student questions.

Wednesday class

Preparation