Model Deployment

Keith VanderLinden
Calvin University

Deployment of ML Systems

Most production ML systems:

  • Include more than just one model.
  • Operate in dynamic, evolving environments.
  • Update their models regularly.
  • Operate at scale.

We’ll implement limited variants of these realities.

Prediction Modes

The basic prediction modes are:

  • Batch prediction
  • Online prediction
    • using batch features
    • using streaming features

There are many variations and combinations.

Batch vs Streaming Features

DMLS Figure 7-8

Computation Architecture

Model computation can be done on either:

  • Cloud
  • Edge

The first is the easiest and most common.