Articles: Production-ready Docker packaging for Python developers
Table of Contents
The basics of Docker packaging
- 
    Connection refused? Docker networking and how it impacts your image 
 Learn how to fix connection refused errors when trying to connect to a Docker container.
- 
    Faster or slower: the basics of Docker build caching 
 Docker’s layer caching can speed up your image build—if you write your Dockerfile correctly.
- 
    Where’s that log file? Debugging failed Docker builds 
 Your Docker build just failed, and the reason is buried a log file—which is somewhere inside the build process. How do you read that log file?
- 
    Debugging ImportError and ModuleNotFoundErrors in your Docker image 
 There are many reasons your Python code might fail to import in Docker. Here’s a quick series of checks you can do to figure out the problem.
- 
    A tableau of crimes and misfortunes: the ever-useful docker history
 Use thedocker historycommand to understand how a Docker image is constructed, why an image is too big, and how Dockerfile commands work.
Looking for more? Learn the fundamental concepts of Docker packaging in just one afternoon, by reading my book: Just Enough Docker Packaging.
Free ebook: "Introduction to Dockerizing for Production"
Learn a step-by-step iterative DevOps packaging process in this free mini-ebook. You'll learn what to prioritize, the decisions you need to make, and the ongoing organizational processes you need to start.
Plus, you'll join over 8000 people getting weekly emails covering practical tools and techniques, from Docker packaging to Python best practices.
Best practices for production
The broken status quo
- 
Broken by default: why you should avoid most Dockerfile examples
 Most Dockerfile examples for Python you’ll find on the Web are broken. And that’s a problem.
Base image and dependencies
- 
    The best Docker base image for your Python application (May 2024) 
 Ubuntu? Official Python images? Alpine Linux? Here’s how to choose a good base Docker image for your Python application container.
- 
    A deep dive into the “official” Docker image for Python 
 The “official” Python Docker image is useful, but to actually understand why, and to use it correctly, it’s worth understanding how exactly it’s constructed.
- 
    Why you should upgrade pip, and how to do it 
 Learn the problem with using old pip, and how to upgrade pip to fix those problems.
- 
    Using Alpine can make Python Docker builds 50× slower 
 Alpine Linux is often recommended as a smaller, faster Docker base image. But if you’re using Python, it will slow down your build and make your image larger.
- 
    When should you upgrade to Python 3.13? 
 Python 3.13 is out—when should you switch to using it?
- 
    Building on solid ground: reproducible Docker builds for Python 
 Learn how to get reproducible Docker builds for your Python application, including base image, system packages, and Python dependencies.
- 
    It’s time to stop using Python 3.8 
 Python 3.8 will stop getting security updates in November 2024. You really should upgrade!
- 
    CentOS 8 is dead: choosing a replacement Docker image 
 CentOS 8 is no longer being maintained as a drop-in replacement for RedHat Enterprise Linux. Here are your options for replacing it as a Docker image.
- 
    Push and pull: when and why to update your dependencies 
 When should you update your software project’s dependencies? There are two rhythms to updates: security and critical bug fixes, and broader updates.
- 
    “Externally managed environments”: when PEP 668 breaks pip 
 Getting a externally-managed-environment/PEP 668 error when you pip install? Here’s how to fix it.
- 
    Should you use uv’s managed Python in production? 
 The uv package manager can also install Python for you. Should you use this feature in production?
Security
- 
    Installing system packages in Docker with minimal bloat 
 Learn how to minimize your Docker image size while installing or updating system packages on on Debian, Ubuntu, and RHEL.
- 
    Avoiding insecure images from Docker build caching 
 Docker’s layer caching is great for speeding up builds—but you need to be careful or it’ll cause you to have insecure dependencies.
- 
    Less capabilities, more security: preventing Docker escalation attacks 
 Reduce the security risk from your Docker image by running as a non-root user and reducing capabilities.
- 
    Staying secure by breaking Docker caching 
 Even more ways you can ensure Linux distribution security updates in the face of Docker caching.
- 
    How to (not) use Docker to share your password with hackers 
 Docker images can leak runtime secrets, build secrets, and even just some secret files you have lying around. Learn how to leak them, and how to avoid it.
- 
    Don’t leak your Docker image’s build secrets 
 When you’re building Docker images you often need some secrets: a password, an SS Hkey. The secure mechanism is BuildKit; others might leak them.
- 
    Finding leaked secrets in your Docker image with a scanner 
 It’s easy to mistakenly embed a secret in your Docker image. Use a scanner to find these secrets before you leak them out potential attackers.
- 
    Security scanners for Python and Docker: from code to dependencies 
 How do you know your Python code is secure? How about your Docker image? Learn how to catch problems is using security scanners running in your CI setup.
- 
    The security scanner that cried wolf 
 If you’ve ever been alarmed by how many security vulnerabilities your Docker image has, even after you’ve installed security updates, here’s what’s going on.
Fast builds, small images
- 
    The high cost of slow Docker builds 
 A slow Docker build on the critical path for developer feedback is a lot more expensive than you think.
- 
    Elegantly activating a virtualenv in a Dockerfile 
 How to activate a Python virtualenv in a Dockerfile without repeating yourself—plus, you’ll learn what activating a virtualenv actually does.
- 
    Faster Docker builds with uv, pipenv, poetry, or pip-tools 
 Installing Python dependencies separately from your code speed ups Docker builds. Here’s how to do it with pipenv, poetry, or pip-tools.
- 
    Poetry vs. Docker caching: Fight! 
 Poetry’s versioning scheme for Python dependencies makes Docker caching harder, which means slower images rebuilds. Learn some workarounds.
- 
    Speed up pip downloads in Docker with BuildKit’s new caching 
 Every time you change your Python pip requirements and rebuild your Docker image, you’re going to redownload all your packages. You can fix this with BuildKit.
- 
    Speeding up Docker builds in CI with BuildKit 
 If your CI runners spin up an empty environment, your Docker builds will be slow. Speed up builds by warming the cache, plus BuildKit’s extra speedup.
- 
    Faster pip installs: caching, bytecode compilation, and uv 
 Installing packages with pip can be slow. Learn some ways to speed it up.
- 
    Shrinking your Python application’s Docker image: an overview 
 Learn the variety of techniques you can use to make your Python application’s Docker image a whole lot smaller.
- 
    Multi-stage builds #1: Smaller images for compiled code 
 Building Docker images with compiled code can lead to huge images. Learn how to shrink them with multi-stage builds.
- 
    Multi-stage builds #2: Python specifics 
 Once you understand generic Docker multi-stage builds, here’s how to implement them for Python applications, with virtualenvs or user installs.
- 
    Multi-stage builds #3: Speeding up your builds 
 Multi-stage Docker image builds give you small images and fast builds, but only if takes extra steps prevent slowness due to caching problems.
Conda
- 
    Pip vs Conda: an in-depth comparison of Python’s two packaging systems 
 Python has two packaging systems, pip and Conda. Learn the differences between them so you can pick the right one for you.
- 
    Activating a Conda environment in your Dockerfile 
 Learn how to activate a Conda environment in your Dockerfile.
- 
    Shrink your Conda Docker images with conda-pack 
 Docker images built for Conda tend to be quite large. Learn how to shrink them by using the conda-pack tool and multi-stage builds.
- 
    Scanning your Conda environment for security vulnerabilities 
 Learn how to check your Conda environment and packages for security vulnerabilities.
- 
    Reproducible and upgradable Conda environments with conda-lock 
 You want your packaging to be reproducible, and upgrade Conda dependencies without conflicts. Learn how to do it with a third-party tool: conda-lock.
- 
    Speed up your Conda installs with Mamba 
 Conda installs are very slow, but you can speed them with a much-faster Conda reimplementation called Mamba.
- 
    Using Conda? You might not need Docker 
 Conda has some overlap with Docker’s functionality; sometimes you might not need Docker at all.
Applications and runtime
- 
    Configuring Gunicorn for Docker 
 Running Gunicorn in a Docker container isn’t the same as running on a virtual machine or physical server. Learn what you need to do differently.
- 
    What’s running in production? Making your Docker images identifiable 
 It’s difficult to debug production problems if you don’t know what image is running in production.
- 
    Decoupling database migrations from server startup: why and how 
 Migrating your database schema when your application’s Docker container starts up? Here’s some reasons to rethink that choice.
- 
    A Python prompt into a running process: debugging with Manhole 
 Your Python process is acting strange—learn how to get a live Python interpreter prompt inside your running process for debugging.
- 
    Please stop writing shell scripts 
 It is quite difficult to write correct shell scripts; you’re much better off just using Python.
- 
    Why new Macs break your Docker build, and how to fix it 
 New Macs can break your Docker image build in unexpected ways; learn why, and how to fix it.
Packaging as a process
- 
    A thousand little details: developing software for ops 
 Software for ops suffers both from historical complexity and from problem space complexity. Some generic suggestions, with Docker packaging as an example.
- 
    Your Docker build needs a smoke test 
 If you don’t test your Docker image before you push it, you’ll waste time (and maybe break production).
- 
    “Let’s use Kubernetes!” Now you have 8 problems 
 For smaller teams, Kubernetes is usually the wrong solution: too complex, too complicated, and with too much work to keep it running.
Docker variants and alternatives
- 
    Docker BuildKit: faster builds, new features, and now it’s stable 
 BuildKit is Docker’s new system for building images. It’s faster, has previously missing security featuers, and it’s finally stable.
- 
    Options for Python packaging: Wheels, Conda, Docker, and more 
 Learn and compare the many ways to package your Python server for distribution: wheels, PEX, RPM/DEB, Conda, executables, Docker.
- 
    Docker vs. Singularity for data processing: UIDs and filesystem access 
 Containers allow for reproducibility of data processing applications. Docker is the most popular option, but Singularity is also well-suited to this use case.
Free ebook: "Introduction to Dockerizing for Production"
Learn a step-by-step iterative DevOps packaging process in this free mini-ebook. You'll learn what to prioritize, the decisions you need to make, and the ongoing organizational processes you need to start.
Plus, you'll join over 8000 people getting weekly emails covering practical tools and techniques, from Docker packaging to Python best practices.