The Dockerfile in this directory will build Docker images with all the dependencies and code needed to run example notebooks or unit tests included in this repository.
Multiple environments are supported through multi-stage builds. Building the images efficiently this way requires Docker BuildKit. The following examples show how to build and run the Docker image for the CPU, PySpark, and GPU environments.
Note: On some platforms, the DOCKER_BUILDKIT environment variable must be set manually for the build to work. For example, on a Windows machine, run the following PowerShell command before building the image:
$env:DOCKER_BUILDKIT=1
Warning: On some platforms, using Docker BuildKit interferes with the Anaconda environment installation. If the docker build hangs during the Anaconda environment setup stage, try building the container without BuildKit enabled.
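For Linux or macOS shells, the note and warning above can be handled with an exported variable instead of the per-command prefix used in the examples below. This is a minimal sketch, assuming a POSIX-compatible shell. Setting the variable to 0 selects the legacy builder only on Docker versions that still include it.

```shell
# Linux/macOS counterpart of the PowerShell command:
# enable BuildKit for every docker build in the current shell session.
export DOCKER_BUILDKIT=1

# To fall back to the legacy builder (for example, if the Anaconda
# setup stage hangs), set the variable to 0 instead:
# export DOCKER_BUILDKIT=0
```

With the variable exported, the DOCKER_BUILDKIT=1 prefix on the build commands is redundant but harmless.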
Once the container is running you can access Jupyter notebooks at http://localhost:8888.
CPU environment
DOCKER_BUILDKIT=1 docker build -t recommenders:cpu --build-arg ENV="cpu" .
docker run -p 8888:8888 -d recommenders:cpu
PySpark environment
DOCKER_BUILDKIT=1 docker build -t recommenders:pyspark --build-arg ENV="pyspark" .
docker run -p 8888:8888 -d recommenders:pyspark
GPU environment
DOCKER_BUILDKIT=1 docker build -t recommenders:gpu --build-arg ENV="gpu" .
docker run --runtime=nvidia -p 8888:8888 -d recommenders:gpu
GPU + PySpark environment
DOCKER_BUILDKIT=1 docker build -t recommenders:full --build-arg ENV="full" .
docker run --runtime=nvidia -p 8888:8888 -d recommenders:full
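The run commands above differ only in the image tag and in whether --runtime=nvidia is passed. A minimal sketch of that pattern follows, assuming a hypothetical helper named run_cmd (not part of this repository) that prints the command rather than executing it:

```shell
# Hypothetical helper: print the docker run command for one environment.
run_cmd() {
  case "$1" in
    gpu|full) runtime="--runtime=nvidia " ;;  # GPU images use the NVIDIA runtime
    cpu|pyspark) runtime="" ;;                # CPU-only images need no extra flag
    *) echo "unknown environment: $1" >&2; return 1 ;;
  esac
  echo "docker run ${runtime}-p 8888:8888 -d recommenders:$1"
}

# Print the command for the GPU + PySpark image
run_cmd full
```

Because the helper only prints, the output can be reviewed first and then executed, for example with eval "$(run_cmd cpu)".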
Several build arguments change how the image is built. Like the ENV build argument, they are passed to the docker build command with --build-arg.
Build Arg | Description |
---|---|
ENV | Environment to use, options: cpu, pyspark, gpu, full |
BRANCH | Git branch of the repo to use (defaults to main ) |
ANACONDA | Anaconda installation script (defaults to miniconda3 4.6.14) |
SPARK | Spark installation tarball (defaults to Spark 2.3.1) |
Example using the staging branch:
DOCKER_BUILDKIT=1 docker build -t recommenders:cpu --build-arg ENV="cpu" --build-arg BRANCH="staging" .
To see detailed build progress with BuildKit, add the --progress=plain flag to the docker build command.
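The sketch below shows how the ENV and BRANCH build arguments and the --progress=plain flag combine into a single command. The build_cmd function is a hypothetical helper, not part of this repository, and it only prints the command so that it can be inspected before running.

```shell
# Hypothetical helper: assemble the build command from an environment
# name ($1) and an optional Git branch ($2).
build_cmd() {
  cmd="DOCKER_BUILDKIT=1 docker build --progress=plain -t recommenders:$1 --build-arg ENV=\"$1\""
  # Add BRANCH only when a second argument is given; otherwise the
  # Dockerfile default (main) applies.
  if [ -n "${2:-}" ]; then
    cmd="$cmd --build-arg BRANCH=\"$2\""
  fi
  echo "$cmd ."
}

# Print the command for a CPU image built from the staging branch
build_cmd cpu staging
```

The ANACONDA and SPARK arguments could be appended in the same way with further --build-arg options.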
To run the unit tests in the CPU image, excluding tests marked spark, gpu, or notebooks:
docker run -it recommenders:cpu pytest tests/unit -m "not spark and not gpu and not notebooks"