Model Deployment
Keith VanderLinden
Calvin University
Deployment of ML Systems
Most production ML systems:
- Include more than just one model.
- Operate in dynamic, evolving environments.
- Update their models regularly.
- Operate at scale.
We’ll implement limited variants of these realities.
Prediction Modes
The basic prediction modes are:
- Batch prediction
- Online prediction
- using batch features
- using streaming features
There are many variations and combinations.
Batch vs Streaming Features
Computation Architecture
Model computation can be done on either:
The first is the easiest and most common.