Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

try nvidia-cuda docker img, should clone faster #332

Closed
wants to merge 12 commits into from
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
14 changes: 11 additions & 3 deletions .github/workflows/unit_test_4gpu.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -16,12 +16,20 @@ jobs:
runner: linux.g5.12xlarge.nvidia.gpu
gpu-arch-type: cuda
gpu-arch-version: "12.1"
# This image is faster to clone than the default, but it lacks CC needed by triton
# (1m25s vs 2m37s).
docker-image: "pytorch/pytorch:2.3.0-cuda12.1-cudnn8-runtime"
# Trying how much faster the nvidia-cuda image is
docker-image: "nvidia/cuda:12.4.1-runtime-ubuntu22.04"
repository: "pytorch/torchtitan"
upload-artifact: "outputs"
# ~/miniconda3/bin/conda init bash
script: |
apt update && apt install -y wget
mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm -rf ~/miniconda3/miniconda.sh
source ~/miniconda3/bin/activate
conda create -n "test" python=3.10
conda activate test
conda install -y -q git clang clangxx
export CC=clang
export CXX=clangxx
Expand Down
Loading