Session 3: Docker Fundamentals¶

Today, we're taking a significant leap into the world of Docker, a technology that's revolutionized how developers build, ship, and run applications.

Introduction to Docker 🐳¶

Docker is a platform that uses OS-level virtualization to deliver software in packages called containers. These containers are isolated, lightweight, and portable, allowing applications to run consistently across different environments.

Think of it like this: If your code is a recipe, and your computer is a kitchen, then Docker helps you package your recipe (code) and all its necessary ingredients (dependencies) into a perfectly organized meal prep kit (container). This kit can then be reliably cooked (run) in any kitchen, anywhere!

Containers vs. Virtual Machines (VMs)¶

Before diving deeper, it's crucial to understand why Docker containers are so powerful. Let's compare them to Virtual Machines (VMs), which you might be more familiar with.

Virtual Machines (VMs): VMs abstract the entire hardware, requiring a full guest operating system (OS) for each application. This makes them heavier and slower to start.

Containers: Containers abstract the operating system itself. They share the host OS's kernel and only package the application and its dependencies, making them extremely lightweight and fast.

Here's a visual comparison:

graph TD
    subgraph "Host Machine (Hardware)"
        HW(Hardware)
    end

    subgraph Virtual Machine Architecture
        HW --> HostOS_VM(Host OS)
        HostOS_VM --> Hypervisor(Hypervisor)
        Hypervisor --> GuestOS1(Guest OS 1)
        Hypervisor --> GuestOS2(Guest OS 2)
        GuestOS1 --> App1(App 1)
        GuestOS2 --> App2(App 2)
    end

    subgraph Container Architecture
        HW --> HostOS_Container(Host OS)
        HostOS_Container --> DockerEngine(Docker Engine)
        DockerEngine --> Container1(Container 1)
        DockerEngine --> Container2(Container 2)
        Container1 --> App1_C(App 1 + Dependencies)
        Container2 --> App2_C(App 2 + Dependencies)
    end

Key benefits of Docker:

Portability: Runs the same everywhere.
Isolation: Applications are isolated from each other and the host system.
Efficiency: Uses fewer resources than VMs.
Speed: Starts up much faster than VMs.

The Docker Workflow: From Code to Container 🧑‍💻¶

The magic of Docker lies in a straightforward workflow: you define your application's environment, build it into a reusable package, and then run it. Let's walk through this process with a simple Python application.

graph LR
    A[Your Code & Dependencies] --> B(Write Dockerfile);
    B --> C(Build Docker Image);
    C --> D(Run Docker Container);
    D -- Optional --> E(Push Image to Registry);
    E --> F[Share/Deploy Image];

Step 1: Write a `Dockerfile` (Defining Your Environment)¶

A Dockerfile is a text file that contains all the instructions needed to build a Docker image. An image is a read-only template that bundles your application code, runtime, libraries, and environment variables into a single, self-contained unit.

Let's set up a simple Python application in our DevContainer:

Open the Lesson Folder: Ensure you have the examples/docker-basic folder (from the class repository) open in your DevContainer.

Create a Python App:

app.py

import time

print("Hello from inside the Docker container!")
for i in range(3):
    print(f"Counting: {i+1}")
    time.sleep(1)
print("Application finished.")

Create a requirements.txt: (Even for simple apps, it's good practice)

requirements.txt

numpy
pandas

Create Your Dockerfile: In the same docker folder, create a file named Dockerfile (no extension) and add these instructions:

Dockerfile
# Use an official Python runtime as a parent image (1)
FROM python:3.9-slim-buster 

# Set the working directory in the container (2)
WORKDIR /app

# Copy the requirements file first to leverage Docker's build cache (3)
COPY requirements.txt .

# Install any needed packages specified in requirements.txt (4)
RUN pip install -r requirements.txt

# Copy your application code into the container (5)
# This copies app.py from your current (host) directory into '/app' in the image
COPY app.py .

# Command to run app.py when the container launches (6)
CMD ["python", "app.py"]

Starts with a base image that already has Python installed. slim-bookworm is a lightweight version.
Sets the default directory inside the container for subsequent commands.
Copies your requirements.txt file from your local machine (where you run docker build) into the /app directory in the image. This is a crucial step for getting your code and configuration into the container's filesystem.
Executes commands during the image build to install Python dependencies.
Copies your main application file (app.py) into the /app directory in the image.
Specifies the default command to run when a container starts from this image.

Step 2: Build the Docker Image¶

Once your Dockerfile is ready, you use the docker build command to create an image.

Open the Integrated Terminal in VS Code (ensure you're in the examples/docker-basic directory).

Run this command:

docker build -t my-python-app .

docker build: The command to initiate an image build.

-t my-python-app: Tags the image with a name (my-python-app). This makes it easy to refer to later.
.: Tells Docker to look for the Dockerfile in the current directory.

Dynamic Context

The ability to specifiy where the Dockerfile is allows you to specify "context". Meaning, I can point to a Dockerfile in a different folder, while using the context (ie relative pathing) from where you execute the command. You can combine this with the -f option.

You can verify the image was created and see its details:

docker images

Building Specific State

Note, when you build an image, this usually describes a specific state of the codebase. If you make changes to files which affect the build (ie Dockerfile, requirements, etc.) then you'll have to re-build the image.

Step 3: Run the Docker Container (and Volume Mounts)¶

An image is a template; a container is a running instance of an image. You use the docker run command to start a container.

From the same terminal, run:

docker run my-python-app

You should see your Python script (app.py) execute within the isolated Docker container.

Understanding Volume Mounts:¶

While COPY puts files into the image during build time, volume mounts allow you to share a directory from your host machine (your local computer) directly with the running container. This is incredibly useful for:

Development: You can edit code on your host machine, and the changes are immediately reflected in the running container without rebuilding the image.

Data Persistence: Store data generated by your container on your host machine, so it's not lost when the container stops or is removed.

Let's try running our app with a volume mount:

# Make sure your current terminal directory is 'examples/docker-basic'
docker run -v "$(pwd):/app" my-python-app

-v "$(pwd):/app": This is the volume mount.
$(pwd): This is a shell command that gets your current working directory on your host machine (e.g., /path/to/your/repo/examples/docker-basic).
:: Separates the host path from the container path.
/app: This is the directory inside the container that will be "mounted" to your host directory. Now, any changes you make to app.py on your host will instantly be seen by the container if you restart it, without a rebuild!

After building your image, you can optionally push it to a Docker registry like Docker Hub. This makes your image available for others to pull and run, or for deployment to cloud platforms.

To push an image, you'd first need to tag it with your Docker Hub username and then use docker push:

# Example: Tag your image with your Docker Hub username
docker tag my-python-app yourusername/my-python-app:latest
docker push yourusername/my-python-app:latest

(We won't do this live in this session, but it's an important part of the workflow).

Docker Hub: A Registry for Docker Images 🌐¶

Docker Hub is the world's largest library and community for container images. It's a central place where you can find, manage, and share Docker images.

Official Images: Most major software projects (e.g., Python, Node.js, Redis, PostgreSQL) publish official, pre-built images on Docker Hub. These are often the FROM images you'll use in your Dockerfiles.
Community Images: Developers worldwide share their own custom images.
Exploration: You can search for various images, inspect their details, and sometimes even view their Dockerfiles to understand how they were built.

Exploring Docker Hub: Jupyter Lab Example¶

Jupyter Lab is a popular environment for data science, and instead of installing everything locally, you can simply run a Docker container pre-configured with Jupyter Lab and many common data science libraries.

Go to hub.docker.com.
Search for jupyter/jupyterlab.
Notice the different tags (versions) available. This illustrates how complex, ready-to-use environments can be easily accessed via Docker Hub.

Docker Desktop: Your Container Control Panel 📊¶

While you'll use the terminal for most Docker commands, Docker Desktop (the application you installed) provides a graphical interface to manage your Docker environment. It's a convenient way to visualize and control your containers and images.

Containers: See all your running and stopped containers. You can easily start, stop, restart, and delete them.
Logs: View the real-time output (logs) from your running containers, which is critical for debugging.
Images: Manage the Docker images stored locally on your machine.
Volumes: Inspect and manage any volumes you've created for data persistence.

Live Demonstration: The Docker Workflow & Docker Desktop in Action 🎬¶

We'll perform these steps live together, so follow along in your DevContainer and with the Docker Desktop application.

Verify Docker Desktop is Running: Confirm the Docker Desktop application is active.
Navigate in DevContainer: Ensure the examples/docker-basic folder is open in VS Code, inside the DevContainer.
Review app.py and requirements.txt: We'll quickly look at the simple Python application and its requirements.
Create and Explain Dockerfile: We'll build the Dockerfile step-by-step, explaining each instruction, paying special attention to COPY.
Build Image: Execute docker build -t my-python-app . and analyze the build output.
List Images: Run docker images to see your new my-python-app image.
Run Container (Basic): Execute docker run my-python-app and observe the Python script running.
Run Container (with Volume Mount): Execute docker run -v "$(pwd):/app" my-python-app.
- Demo: While the app is running, quickly edit app.py on your host machine (e.g., change the print message). Restart the container to show the changes are reflected without rebuilding the image. This highlights the power of volume mounts for development.
Explore Docker Desktop:
- Open Docker Desktop.
- Navigate to the "Containers" tab to see my-python-app (and potentially other containers).
- Click on your running container to view its logs.
- Demonstrate how to stop and restart the container from the GUI.
Explore Docker Hub: Briefly navigate to hub.docker.com and explore pre-built images.

Recommended Exercises & Further Reading¶

By the end of this session and after completing these exercises, you should have a solid grasp of the Docker workflow: writing a Dockerfile, building images, running containers with volume mounts, and understanding Docker Hub and Docker Desktop.

Containerize a Python Script with Dependencies:¶

In a new subfolder within examples/docker-basic, create a Python script that uses a common library like requests to fetch data from a public API (e.g., a simple "hello world" API).

Ensure your requirements.txt file lists the requests library.

Write a Dockerfile to containerize this Python script.

Build the image (e.g., docker build -t my-script-app .).

Run the container using a volume mount for your code (e.g., docker run my-script-app).

Verify that the script executes correctly inside the container and prints the expected output.

Customize a Jupyter Lab Environment:¶

In a new subfolder within examples/docker-basic (e.g., jupyter), create a new Dockerfile.

Base Image: Start your Dockerfile from a Jupyter Docker Stack image.
Add Custom Packages: Create a requirements.txt file in the same folder as your Dockerfile. Add a couple of Python libraries that aren't typically pre-installed in Jupyter (e.g., folium for mapping, scikit-image for image processing).

requirements.txt

folium
scikit-image

Dockerfile Instructions: Add COPY requirements.txt . and RUN pip install -r requirements.txt to your Dockerfile to install these additional packages during the image build.
Build the Image: Build your custom Jupyter Lab image (e.g., docker build -t my-jupyter-env .).
Run the Container: Run a container from your image, mapping the port: docker run -p 8888:8888 my-jupyter-env

Ports

This is important for this exercise and anything which involves starting a web app, such as Jupyter notebook or lab. We will revisit this much later, but all you need to know is -p {port on host}:{port in container} is used to publish/expose a port. Since Jupyter normally runs on port 8888 we need to bind that port to a free port on the localhost (ie our local computer) - in this case the same port. Theoretically, we could bind to any free port ie -p 123456:8888.

Verify Installation:
- Open the URL provided in your terminal output (it usually starts with http://127.0.0.1:8888/?token=...) in your browser to access Jupyter Lab.
- Create a new Python notebook.
- In a code cell, try to import the libraries you added (e.g., import folium and #!python import skimage). If no errors occur, your custom packages were successfully installed!

Making notebooks/changes

Make changes within the container. Do you notice your files/changes appearing locally?

Extra challenge: volume mounts

Create a folder called notebooks/ in the same directory as the Dockerfile. Then establish a volume mount which maps ./notebooks/ (locally) to /app/notebooks (in container). Try again to make notebook edits. Notice anything different?

Explore Docker Desktop Further:¶

After running your containers from exercises 1 and 2, use Docker Desktop to:

View the running containers.
Explore the Logs, Exec, Files, and Stats tabs.
Stop and then start one of the containers from the GUI.
Remove a container (after stopping it).
View the images you've built under the "Images" tab.

Next Steps & Questions¶

Understanding Docker is fundamental for modern development, especially as we move into distributed systems and DevContainers. Experiment with building and running containers. If you encounter any issues or have questions, please reach out! Next, we'll build on this by exploring Docker Compose and DevContainers in more detail.