Count the Parameters
PaLM, the large language model we looked at in last week's discussion, reports 540 billion parameters. What are all of those numbers used for? This week, we'll count parameters.
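PaLM's weights aren't public, so as a quick preview, here is a minimal sketch that counts the parameters of the much smaller GPT-2 instead (assuming the `transformers` and `torch` packages are installed):

```python
# Minimal sketch: count the parameters of GPT-2 (PaLM's weights are not
# public, so we use a model we can actually load).
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# Total count -- roughly 124 million for the base GPT-2 checkpoint.
total = sum(p.numel() for p in model.parameters())
print(f"total parameters: {total:,}")

# Per-tensor breakdown: where those numbers actually live.
for name, p in model.named_parameters():
    print(f"{name:55s} {str(tuple(p.shape)):>15s} {p.numel():>12,}")
```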
This is the Fundamentals notebook for this week:
Train a Sequence Model on Your Own Data
- Collect a plain text file (`.txt`) of things that you've written recently (perhaps all of your discussion posts in this class, all of the papers you've written in the past year, or even all of the code you've written for a recent project). It should be your own work.
- Split that into two different files on some criterion. You could, for example, split long vs. short, essays from two different classes, etc.
- Fine-tune GPT-2 on your text using the `run_clm.py` script documented in the Hugging Face Transformers repo (see the command sketch after this list).
  - You will need a validation `.txt` file also, but it need not be large. You may even be able to leave it empty; I haven't tested that.
  - Unless line breaks are significant, use `--keep_linebreaks=false`.
- Generate some example text from each fine-tuned model. We did this in a Fundamentals notebook a few weeks ago, or you can use this script (a generation sketch also follows this list).
- Write a brief summary of your experience:
- What data did you use?
- What sort of results did you get?
- How did the generation results compare between the two different models?
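Here is a hedged sketch of what the fine-tuning cell might look like. The file names (`my_posts_a.txt`, `valid.txt`) and the output directory are placeholders of mine, not part of the assignment; the flags are documented options of the Transformers `run_clm.py` example. Run it once per split:

```python
# Hypothetical notebook cell -- substitute your own file names.
# Flags come from the Transformers run_clm.py example's documented options.
!python run_clm.py \
    --model_name_or_path gpt2 \
    --train_file my_posts_a.txt \
    --validation_file valid.txt \
    --do_train --do_eval \
    --keep_linebreaks=false \
    --output_dir gpt2-posts-a \
    --overwrite_output_dir
```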
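And a minimal generation sketch, assuming the hypothetical output directory from the cell above:

```python
# Minimal sketch: sample text from one fine-tuned model. "gpt2-posts-a"
# is the hypothetical --output_dir used in the fine-tuning cell above.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2-posts-a")
model = AutoModelForCausalLM.from_pretrained("gpt2-posts-a")

inputs = tokenizer("In this post I argue that", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=60,
    do_sample=True,
    top_p=0.95,
    pad_token_id=tokenizer.eos_token_id,  # GPT-2 has no pad token by default
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Feeding the same prompt to both fine-tuned models makes the comparison in the last Analysis question concrete.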
Submitting
In the Moodle assignment for this Homework:
- Attach the `.ipynb` file. Make sure you've done a "Restart and Run All" before submitting.
- Copy and paste your responses to the Analysis questions into the text box.
- Copy and paste your responses to the "Train a Sequence Model" questions here too.