Generation by Prompting

Lab 4 Recap

Discuss with neighbors:

  1. One prompt engineering technique that did work, and why. (Can you come up with another example that works?)
  2. An example of a hallucination that you noticed. Why might it have happened?
  3. Your design for the welcome-to-Calvin Q&A system.

A brief history of ML:

  1. Supervised learning: train a model from scratch on a given set of inputs and outputs.
  2. Fine-tuning: same, but start with a model that was already trained on a related task.
  3. In-Context Learning: don’t train the model at all; just give it a few examples of the kind of thing you want it to do.
  4. Zero-Shot Learning: don’t train the model or even give examples; just describe the task in words.

A model’s job: interpolation

Training set:

  • “The first president of the United States, George Washington, …”
  • “The first president of the US, named Washington, …”
  • “As the first president, General Washington …”
  • “Alice: Who was the first president? Bob: George Washington.”

Prompt:

  • “User: Who was the first US president? Assistant:”
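The interpolation idea can be sketched as pure counting. This toy demo is nothing like real training — and the sentence endings (“led the army,” etc.) are invented filler for the example — but it shows how overlapping contexts all point to the same continuation:

```python
from collections import Counter

# Toy sketch: which words tend to follow "first president" in the snippets?
# (The endings after the slide's "..." are invented for this demo.)
training = [
    "The first president of the United States, George Washington, led the army.",
    "The first president of the US, named Washington, served two terms.",
    "As the first president, General Washington set many precedents.",
    "Alice: Who was the first president? Bob: George Washington.",
]

def words_after(phrase, texts, window=6):
    """Count words appearing within `window` words after the two-word phrase."""
    first, second = phrase.split()
    counts = Counter()
    for text in texts:
        tokens = text.replace(",", " ").replace(".", " ").replace("?", " ").replace(":", " ").split()
        for i in range(len(tokens) - 1):
            if tokens[i].lower() == first and tokens[i + 1].lower() == second:
                counts.update(tokens[i + 2 : i + 2 + window])
    return counts

counts = words_after("first president", training)
print(counts.most_common(3))  # "Washington" follows the phrase in every snippet
```

The four snippets phrase the fact differently, but counting makes “Washington” the dominant continuation — so a new prompt in the same neighborhood gets interpolated to the same answer.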

But what kind of context are we in?

Quiz Worksheet

The capital of the United Kingdom is:

The capital of the United States is:

The capital of Canada is:

The capital of Australia is:

The capital of France is:

Quiz Worksheet Answer Key

The capital of the United Kingdom is: London

The capital of the United States is: Washington, D.C.

The capital of Canada is: Ottawa

The capital of Australia is: Canberra

The capital of France is: Paris

Travel Blog

Title: My visit to Paris

The capital of France is a city of art and culture. It is a city of romance and love. …

Next-Token Distributions Depend on Context

“The capital of France is”

What tokens would come next if this document were a:

  • quiz worksheet?
  • quiz worksheet answer key?
  • travel blog?
  • encyclopedia article?

Additional Context Helps Disambiguate

At first, the model doesn’t know what kind of document it’s trying to predict. With additional context, it can make a better guess.

“The capital of France is”

  • quiz worksheet: 25%
  • quiz worksheet answer key: 25%
  • travel blog: 25%
  • encyclopedia article: 25%

“The capital of France is:”

  • quiz worksheet: 50%
  • quiz worksheet answer key: 50%
  • travel blog: 0%
  • encyclopedia article: 0%
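The jump from 25% to 50% is just Bayes’ rule. Here is a toy version using the slide’s numbers; the likelihoods (how often each document type ends the line with “:”) are assumed for illustration, not measured:

```python
# Toy Bayesian update: which document type are we in, given that the
# prompt ends with ":"? Likelihood values are illustrative assumptions.
prior = {"quiz": 0.25, "answer key": 0.25, "travel blog": 0.25, "encyclopedia": 0.25}

# P(line ends with ":" | document type)
likelihood = {"quiz": 1.0, "answer key": 1.0, "travel blog": 0.0, "encyclopedia": 0.0}

evidence = sum(prior[d] * likelihood[d] for d in prior)
posterior = {d: prior[d] * likelihood[d] / evidence for d in prior}
print(posterior)  # quiz and answer key each at 0.5; the others drop to 0
```

One extra character of context (the colon) zeroes out half the hypotheses and doubles the probability of the rest — which is exactly what prompt engineering exploits.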

Out of Distribution

The capital of France is a city of contrasts. It is a city of history, culture, and art, but it is also a city of modern life. The city is home to some of the world’s most famous landmarks, including the Eiffel Tower, the Louvre, and the Arc de Triomphe. The city is also home to some of the world’s most famous museums, including the Musee d’Orsay and the Musee du Louvre. The city is also home to some of the world’s most famous monuments, including the Arc de Triomphe and the Eiffel Tower. The city is also home to some of the world’s most famous monuments, including the Arc de Triomphe and the Eiffel Tower.
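One way to see why the output locks into a loop: once a phrase repeats, greedy decoding keeps reinforcing it. A toy bigram model (a huge oversimplification of a real LM, built from made-up repetitive text) shows the same failure mode:

```python
from collections import Counter, defaultdict

# Toy demo of degenerate repetition: build a bigram table from a short
# repetitive text, then decode greedily (always pick the most common next
# word). The text is invented to echo the slide's looping output.
text = ("the city is also home to famous monuments "
        "the city is also home to famous museums "
        "the city is also home to famous monuments")

bigrams = defaultdict(Counter)
tokens = text.split()
for a, b in zip(tokens, tokens[1:]):
    bigrams[a][b] += 1

word = "the"
output = [word]
for _ in range(16):
    word = bigrams[word].most_common(1)[0][0]  # greedy: take the argmax
    output.append(word)
print(" ".join(output))  # cycles "the city is also home to famous monuments ..."
```

Greedy decoding never escapes the highest-probability cycle; real models mitigate this with sampling temperature or repetition penalties, but the underlying pull toward high-probability loops is the same.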

Prompt Engineering

prompt engineering (n): the process of designing prompts to elicit desired responses from a model

Example of prompt engineering: I typed (n) after “prompt engineering” to get Copilot to suggest a definition.

Works by narrowing down the range of possible contexts we might be in.

Few-Shot Learning

Explicitly give examples of the kind of thing you want.

  • Pro: often relatively easy
  • Con: the model may fixate on incidental details of the examples, and the examples themselves consume many prompt tokens
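A minimal sketch of few-shot prompt construction (the Q/A layout and example questions are illustrative; no API call is shown):

```python
# Sketch of a few-shot prompt: worked examples, then the new question.
examples = [
    ("Who was the first US president?", "George Washington"),
    ("What is the capital of France?", "Paris"),
]

def few_shot_prompt(examples, question):
    """Format each (question, answer) pair, then leave the last answer blank."""
    lines = [f"Q: {q}\nA: {a}" for q, a in examples]
    lines.append(f"Q: {question}\nA:")
    return "\n\n".join(lines)

prompt = few_shot_prompt(examples, "What is the capital of Canada?")
print(prompt)
```

The trailing “A:” is doing the prompt-engineering work: it narrows the context to “answer key,” so the model’s most likely continuation is a short factual answer in the same style as the examples.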

Personas

Frame the prompt as a conversation with a particular person (e.g., an expert, a child, …)

  • The training data contained many personas, including confused non-experts; the model had to be unsurprised by those inputs too
  • But: personas might not be authentic (e.g., it doesn’t really know what a child wouldn’t know); temptation to personify and stereotype the interaction
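A minimal sketch of a persona prompt (the wording is illustrative; no API call is shown):

```python
# Sketch of a persona-style prompt: frame the exchange as a conversation
# with a particular character. The persona text here is invented.
persona = "You are a patient librarian who explains things to first-year students."
question = "What is in-context learning?"

prompt = f"{persona}\n\nStudent: {question}\nLibrarian:"
print(prompt)
```

As with few-shot prompting, the persona narrows the space of plausible documents: continuations that fit a patient librarian’s voice become far more likely than terse or jargon-heavy ones.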

Other Prompt Engineering Techniques

  • Tool use
  • Chain-of-thought reasoning
  • Retrieval-Augmented Generation
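Chain-of-thought prompting is just few-shot prompting where the examples include their reasoning. A sketch (the word problems are invented for illustration):

```python
# Sketch of a chain-of-thought prompt: the worked example shows its
# intermediate arithmetic, nudging the model to reason step by step
# before answering the new question.
cot_prompt = (
    "Q: A class has 3 rows of 8 desks. 5 desks are empty. How many are used?\n"
    "A: There are 3 * 8 = 24 desks. 5 are empty, so 24 - 5 = 19 are used. "
    "The answer is 19.\n\n"
    "Q: A bus makes 4 trips carrying 12 riders each. How many riders in total?\n"
    "A:"
)
print(cot_prompt)
```

Because the example context contains step-by-step arithmetic, the model’s interpolation machinery is pulled toward producing the intermediate steps too — which, empirically, improves accuracy on multi-step problems.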

Hallucination (Confabulation)

  • Models were trained to mimic patterns
  • Human feedback encouraged models to be believable (but not necessarily accurate)