Change EFA Installer and AWS OFI nccl plugin versions. (#454)
mhuguesaws authored Oct 17, 2024
1 parent 86ce468 commit 5d28e92
Showing 3 changed files with 10 additions and 10 deletions.
12 changes: 6 additions & 6 deletions micro-benchmarks/nccl-tests/README.md
@@ -36,16 +36,16 @@ The NCCL tests are packaged in a container.
> | Variable | Default | Repository |
> |-----------------------|-------------|---------------------------------------------------------------------------------------------|
> |`GDRCOPY_VERSION` | `v2.4.1` | [link](https://github.com/NVIDIA/gdrcopy) |
- > |`EFA_INSTALLER_VERSION`| `1.34.0` | [link](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/efa-start.html#efa-start-enable) |
- > |`AWS_OFI_NCCL_VERSION` | `v1.11.0-aws`| [link](https://github.com/aws/aws-ofi-nccl) |
+ > |`EFA_INSTALLER_VERSION`| `1.35.0` | [link](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/efa-start.html#efa-start-enable) |
+ > |`AWS_OFI_NCCL_VERSION` | `v1.12.0-aws`| [link](https://github.com/aws/aws-ofi-nccl) |
> |`NCCL_VERSION` | `v2.23.4-1` | [link](https://github.com/NVIDIA/nccl) |
> |`NCCL_TESTS_VERSION` | `v2.13.10` | [link](https://github.com/NVIDIA/nccl-tests) |
### Build the container
1. Build the container image with the command below:
```bash
- EFA_INSTALLER_VERSION=1.34.0
- AWS_OFI_NCCL_VERSION=v1.11.0-aws
+ EFA_INSTALLER_VERSION=1.35.0
+ AWS_OFI_NCCL_VERSION=v1.12.0-aws
NCCL_VERSION=v2.23.4-1
NCCL_TESTS_VERSION=v2.13.10
docker build -f nccl-tests.Dockerfile \
@@ -81,8 +81,8 @@ To run the NCCL tests on EKS, you will need to build the container image, then p

1. Create the ECR repository if it does not exist
```bash
- EFA_INSTALLER_VERSION=1.34.0
- AWS_OFI_NCCL_VERSION=v1.11.0-aws
+ EFA_INSTALLER_VERSION=1.35.0
+ AWS_OFI_NCCL_VERSION=v1.12.0-aws
NCCL_VERSION=v2.23.4-1
NCCL_TESTS_VERSION=v2.13.10
ECR_REPOSITORY_NAME="nccl-tests"
4 changes: 2 additions & 2 deletions micro-benchmarks/nccl-tests/buildspec.yaml
@@ -3,8 +3,8 @@ version: 0.2
env:
variables:
GDRCOPY_VERSION: "v2.4.1"
- EFA_INSTALLER_VERSION: "1.34.0"
- AWS_OFI_NCCL_VERSION: "v1.11.0-aws"
+ EFA_INSTALLER_VERSION: "1.35.0"
+ AWS_OFI_NCCL_VERSION: "v1.12.0-aws"
NCCL_VERSION: "v2.23.4-1"
NCCL_TESTS_VERSION: "v2.13.10"
exported-variables:
4 changes: 2 additions & 2 deletions micro-benchmarks/nccl-tests/nccl-tests.Dockerfile
@@ -3,8 +3,8 @@
FROM nvidia/cuda:12.2.2-devel-ubuntu22.04

ARG GDRCOPY_VERSION=v2.4.1
- ARG EFA_INSTALLER_VERSION=1.34.0
- ARG AWS_OFI_NCCL_VERSION=v1.11.0-aws
+ ARG EFA_INSTALLER_VERSION=1.35.0
+ ARG AWS_OFI_NCCL_VERSION=v1.12.0-aws
ARG NCCL_VERSION=v2.23.4-1
ARG NCCL_TESTS_VERSION=v2.13.10

