Homework 1: Train and evaluate a classifier on your own images | CS 344 Spring 2023 at Calvin University

Goal

In this assignment, you will train and evaluate your own image classifier.

Completing this homework will give you practice

Working with image datasets
Training image classifiers
Evaluating image classifiers
Explaining your decisions and their possible consequences.

A famous image classification example is handwritten digits (called MNIST). For fun, we’ll remix that idea and classify handwritten letters. To keep it manageable, we’ll just work with the first 3 letters (a through c).

Try to make the best model you can, under the following constraints:

No more than 100 training images.
No more than 5 minutes compute time (on a Kaggle, Colab, or lab machine GPU) to train a model.
Only use models that are already built into fastai.

Instructions

Do this assignment individually. You may help each other, but use Ed so all benefit.

Collect your own set of images of handwritten letters, one letter per image. (Do this yourself, don’t get it from the Internet.)
- I’ve hacked together this little webapp to let you sketch and share/save. It’s clunky; improvements welcome!
- You can also take pictures of sketches on paper, whiteboards, etc.
- You can share images, but no single image should be used by more than 3 people.
- You should have at least 10 images per letter.
- To get started, you can use this dataset I hacked together very quickly. But it’s bad in various ways, so please collect your own.
Organize your dataset into a folder structure like images/c/c01.png.
- Make an images/README.txt describing how you collected the images (e.g., whether you used a mouse/finger/pen or took pictures of paper/whiteboard/chalkboard/documents you found in the Meeter Center/…)
Train a classifier to indicate which letter is contained in the image.
Evaluate the accuracy of the classifier on the validation set.
- How accurate is the classifier overall?
- Which letter is it most successful at classifying? Give an example of a correctly classified image (show a specific image file and its classification).
- What mistakes does it make most frequently? Give an example of a mistake (show a specific image file and its classification).
- For the previous 3 questions, any ideas about why?
- Suppose someone else gave you one of their images. How likely do you think your classifier would be to get it right? report your answer in terms of a percentage, either overall or broken down by which letter.
- What choices did you have to make in the process of collecting data, processing it, and analyzing the results?
  - What are one or two choices that you could have made differently?
  - What do you expect would be different if you made that different choice?
Share the results (including code and answers to the above questions) in a Jupyter Notebook in your portfolio repo.
- Submit a link to the notebook on Moodle.
- Don’t include your dataset. Instead, link to a ZIP (or tgz) file of your dataset. Then your notebook can start with dataset_path = untar_data("https://your-url.../images.zip").
- The easiest way to get a ZIP file with direct download is to put it your public_html folder on the lab computers. Then you can access it at https://students.cs.calvin.edu/~username/filename.zip (make sure you include the tilde.)

Notes

Include all the code needed to get one good accuracy number.
Don’t try to show the results of every model you trained, but do make a single cell to change numbers for any aspects you varied (e.g., the seed, how many images you used)
Don’t include extraneous code
Use Markdown cells, not code comments, to report results.

Tips

set_seed() for reproducible results. Also, sort your image files, as we did in lab 1.
Chapter 2 has some helpful low-level code for constructing an ImageDataLoader. Alternatively, use ImageDataLoaders.from_path_func(..., label_func=parent_label, bs=2).)
You can use the same techniques you used in Lab 2 to evaluate the classifier. See chapter 2 for examples of how to make a confusion matrix and plot top losses.
You probably need to set the batch size to be smaller than the default (which is 64 images). Do this by passing bs=2 as a keyword parameter to your ImageDataLoader.
Visualize things:
- What does one batch of your DataLoader look like?
- What do the predictions of your classifier look like?
- What does the confusion matrix look like?
- refer to Chapter 2 for the code for these.
Note that from_name_func fails silently with parent_label. (It should throw an exception. I submitted this bug to propose that it does.) Use from_path_func instead if you want to use that approach.

ChatGPT’s Solution

Caution: this is not too helpful. You might note that the imports are incorrect, the dataset loading is not quite correct, and you should probably fine_tune instead of fit_one_cycle. But the general outline is basically correct.

What code do I need to train and evaluate a classifier on my own image dataset using fastai?

Here is an example of code to train and evaluate a classifier on your own image dataset using fastai:

from fastai import *
from fastai.vision import *

# set path to your image dataset
path = Path('path/to/your/image/dataset')

# create a databunch for your dataset
data = ImageDataBunch.from_folder(path, train='train', valid='val', size=224, bs=64)

# create a CNN model using resnet34 architecture
learn = cnn_learner(data, models.resnet34, metrics=accuracy)

# train the model using the fit_one_cycle method
learn.fit_one_cycle(4)

# evaluate the model using the validation set
interp = ClassificationInterpretation.from_learner(learn)
interp.plot_confusion_matrix()

Note: This is a basic example and you may need to adjust the code depending on your specific dataset and requirements. Also, make sure to have your dataset in the format of train and val folders containing images and their respective labels in the name of the folder.

Another attempt:

How do I train and evaluate a classifier on my own image dataset using fastai?

To train and evaluate a classifier on your own image dataset using fastai, you can follow these steps:

Prepare your image dataset by organizing the images into train, validation, and test sets.
Use the fastai library's ImageDataBunch class to create a DataBunch object from your dataset.
Use the create_cnn or create_unet function to create a convolutional neural network (CNN) model.
Train the model by calling the fit_one_cycle method on your DataBunch object and passing in the model.
Evaluate the model's performance on the test set by calling the learn.validate method on your DataBunch object and passing in the model.
You can also use the learn.show_results method to inspect the results visually.

Please note that fastai is a high-level wrapper around PyTorch, that makes it easy to train models using the latest deep learning techniques. To use it, you'll need to have PyTorch and fastai installed, you could use the command !pip install fastai to install fastai library.