Intro To Docker: Containerizing Models
"It works on my machine" is the death knell of ML projects. By mastering Docker, you ensure your Python scripts, heavy ML libraries, and hardware drivers are perfectly synchronized across every environment, from your laptop to the cloud.
The Core Problem in ML
Machine learning models are notoriously sensitive to their environments. A model trained on TensorFlow 2.10 might throw cryptic errors when run on TensorFlow 2.12. Python `virtualenvs` help, but they don't capture system-level dependencies like CUDA toolkits or C++ compilers.
Docker solves this by packaging the entire operating system environment along with your code into a standardized unit called a container.
Images vs. Containers
Understanding the distinction is critical for MLOps:
- Images (The Blueprint): A static, read-only file containing the source code, libraries, dependencies, tools, and other files needed for an application to run.
- Containers (The Execution): A running instance of an Image. You can spin up thousands of containers from a single ML model image to handle incoming API requests.
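The blueprint/execution split maps directly onto the CLI: `docker build` produces an Image, and each `docker run` launches an independent Container from it. A sketch (the image name `my-ml-model` is illustrative, and these commands require Docker to be installed):

```shell
# Build the Image once, from the Dockerfile in the current directory
docker build -t my-ml-model:latest .

# Spin up two independent Containers from the same Image,
# each mapped to a different host port
docker run -d -p 8000:8000 my-ml-model:latest
docker run -d -p 8001:8000 my-ml-model:latest

# List running containers -- both were created from the same Image
docker ps
```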
Anatomy of a Dockerfile
The Dockerfile is where you script the creation of your Image. Common commands include:
- FROM: Defines the starting point (e.g., `python:3.9-slim` or `nvidia/cuda:11.8.0-base-ubuntu22.04`).
- WORKDIR: Sets the directory inside the container where the following commands will run.
- COPY: Copies files from your local machine into the Docker image.
- RUN: Executes terminal commands during the build process (like `pip install`).
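Put together, a minimal Dockerfile for serving a Python model might look like this sketch (file names such as `requirements.txt` and `serve.py` are illustrative):

```dockerfile
# Start from a slim official Python base image
FROM python:3.9-slim

# All subsequent commands run relative to /app inside the image
WORKDIR /app

# Copy and install dependencies first, so this layer is cached
# when only the source code changes
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the application code into the image
COPY . .

# Command executed when a container starts
CMD ["python", "serve.py"]
```

Ordering the `COPY requirements.txt` and `RUN pip install` steps before copying the rest of the code lets Docker reuse the cached dependency layer on every rebuild where only your source changed.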
❓ Frequently Asked Questions (MLOps)
Why use Docker instead of just Python Virtual Environments (venv/conda)?
Virtual Environments only isolate Python packages. They assume the underlying operating system and system libraries (like `libc` or NVIDIA GPU drivers) are already correct on the host machine.
Docker isolates the entire file system. Because the container carries its own Linux user space, an ML model that works inside Docker on your Windows laptop will behave the same way on an AWS Linux server.
How do I use my GPU inside a Docker container?
To utilize GPUs for Deep Learning inside Docker, you need to use the NVIDIA Container Toolkit. You must base your Dockerfile on an NVIDIA CUDA image (e.g., `FROM nvidia/cuda...`) and run the container with the `--gpus all` flag.
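A sketch of a CUDA-enabled Dockerfile, assuming a PyTorch workload (the exact CUDA tag and wheel index must match the driver version on your host, and `serve.py` is illustrative):

```dockerfile
# CUDA runtime base image; the tag must be compatible with the host's NVIDIA driver
FROM nvidia/cuda:11.8.0-base-ubuntu22.04

# Install Python on top of the CUDA base image
RUN apt-get update && apt-get install -y --no-install-recommends \
        python3 python3-pip && \
    rm -rf /var/lib/apt/lists/*

# Install a CUDA 11.8 build of PyTorch from the official wheel index
RUN pip3 install --no-cache-dir torch --index-url https://download.pytorch.org/whl/cu118

WORKDIR /app
COPY . .
CMD ["python3", "serve.py"]
```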
docker run --gpus all -p 8000:8000 my-ml-model:latest
Why are my ML Docker images so large (3GB+)?
ML libraries like PyTorch and TensorFlow are massive. To optimize image size:
1. Use `-slim` base images (e.g., `python:3.9-slim`).
2. Add `--no-cache-dir` when running `pip install`.
3. Use multi-stage builds so training dependencies aren't shipped in the production serving image.
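A multi-stage build (point 3) can be sketched like this: dependencies are installed in a `builder` stage, and only the resulting packages are copied into the final image, leaving compilers and pip build artifacts behind (file names and the `/install` prefix are illustrative):

```dockerfile
# --- Stage 1: install dependencies ---
FROM python:3.9-slim AS builder
WORKDIR /app
COPY requirements.txt .
# Install into an isolated prefix so it can be copied wholesale
RUN pip install --no-cache-dir --prefix=/install -r requirements.txt

# --- Stage 2: lean runtime image ---
FROM python:3.9-slim
WORKDIR /app
# Bring over only the installed packages, not pip's build artifacts
COPY --from=builder /install /usr/local
COPY . .
CMD ["python", "serve.py"]
```

Only the final stage is shipped; everything created in the `builder` stage is discarded from the resulting image.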
