Adding the script to build and run the rccl-tests for PTS #26

Open
wants to merge 33 commits into base: master

Commits on Oct 25, 2021

  1. Add MPI_IBM build option

    AddyLaddy committed Oct 25, 2021
    7130fa6

Commits on Nov 10, 2021

  1. Add option to statically link cudart

    Build with CUDARTLIB=cudart_static to remove dynamic linkage
    
    Also removed unused curand and nvToolsExt dependencies
    
    BUG 95
    AddyLaddy committed Nov 10, 2021
    de3ddbe

Commits on May 26, 2022

  1. Merge pull request #96 from NVIDIA/nersc-linkage-fix

    Add option to statically link cudart
    AddyLaddy authored May 26, 2022
    8274cb4

Commits on Aug 23, 2022

  1. Resync with NCCL 2.13

    * Added "verifiable", a suite of kernels for generating and verifying reduction
      input and output arrays in a bit-precise way.
    * Data corruption errors now reported in number of wrong elements instead of max
      deviation.
    * Use ncclGetLastError.
    * Don't run hypercube on non-powers of 2 ranks.
    * Fix to hypercube data verification.
    * Use "thread local" as the default CUDA capture mode.
    * Replaced pthread_yield -> sched_yield()
    * Bugfix to the cpu-side barrier/allreduce implementations.
    jbachan committed Aug 23, 2022
    51af557
  2. Changed top-level Makefile behavior so that BUILDDIR is interpreted

    as relative to the top-level directory. This is done by abspath'ing it before
    passing it to the subdirectory Makefiles.
    
    The old behavior had two cases: with and without BUILDDIR being set by
    the user. With BUILDDIR not set, the build dir would be named "build"
    in the top-level directory. If BUILDDIR was set, then the build dir
    would be placed at "src/${BUILDDIR}".
    
    The new behavior is simpler: if BUILDDIR is not set then it defaults
    to "build", and the directory holding the final build is always at just
    "${BUILDDIR}" in the top level.
    jbachan committed Aug 23, 2022
    bc5f7cf
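
The BUILDDIR change in the commit above can be illustrated with a small Python sketch. This is not the Makefile logic itself: the function names and the example paths are hypothetical, and the sketch only mirrors the old and new behaviors as the commit message describes them for a relative BUILDDIR.

```python
import posixpath


def old_build_dir(top, builddir=None):
    """Old behavior as described: 'build' in the top-level directory when
    BUILDDIR is unset, otherwise 'src/${BUILDDIR}'."""
    if builddir is None:
        return posixpath.join(top, "build")
    return posixpath.normpath(posixpath.join(top, "src", builddir))


def new_build_dir(top, builddir="build"):
    """New behavior as described: BUILDDIR defaults to 'build' and is always
    resolved against the top-level directory (the effect of abspath'ing it
    before handing it to the subdirectory Makefiles)."""
    return posixpath.normpath(posixpath.join(top, builddir))


if __name__ == "__main__":
    top = "/repo"  # hypothetical checkout location
    print(old_build_dir(top, "out"))
    print(new_build_dir(top, "out"))
```

With a relative value such as `out`, the old rule lands under `src/` while the new rule lands directly in the top-level directory, which is the inconsistency the commit removes.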

Commits on Sep 6, 2022

  1. Display N/A for error count in AlltoAll in-place test

    AlltoAll does not support in-place buffers
    AddyLaddy committed Sep 6, 2022
    a0a1491

Commits on Sep 7, 2022

  1. afa4c56
  2. Fix preprocessor version check for ncclGetLastError()

    ncclGetLastError() was added in NCCL 2.13.0
    AddyLaddy committed Sep 7, 2022
    749573f
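
The version gate mentioned in the commit above can be sketched as follows. The real check is a C preprocessor comparison in the test sources, so this Python model is only illustrative: the helper names are hypothetical, and the numeric encoding is an assumption based on how the `NCCL_VERSION` macro in `nccl.h` is commonly defined.

```python
def nccl_version_code(major, minor, patch):
    """Model of a NCCL version code.

    Assumption: releases up to 2.8 are encoded as major*1000 + minor*100 + patch,
    later releases as major*10000 + minor*100 + patch.
    """
    if major <= 2 and minor <= 8:
        return major * 1000 + minor * 100 + patch
    return major * 10000 + minor * 100 + patch


def has_get_last_error(major, minor, patch):
    """ncclGetLastError() was added in NCCL 2.13.0, so only call it from there on."""
    return nccl_version_code(major, minor, patch) >= nccl_version_code(2, 13, 0)


if __name__ == "__main__":
    for version in [(2, 8, 4), (2, 12, 12), (2, 13, 0), (2, 13, 4)]:
        print(version, has_get_last_error(*version))
```

Comparing against the full encoded version rather than the minor number alone keeps the check correct across the change in encoding between older and newer releases.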

Commits on Sep 23, 2022

  1. Update NCCL tests

    sjeaugey committed Sep 23, 2022
    d313d20

Commits on Oct 13, 2022

  1. d22281c

Commits on Oct 14, 2022

  1. 3ae371c

Commits on Oct 21, 2022

  1. make rccl-test compile again.

    all files compile now.
    mpi tests also pass
    edgargabriel committed Oct 21, 2022
    641e93e
  2. 9a89c30
  3. 4d7cd87
  4. 84e8be8

Commits on Oct 25, 2022

  1. fix a missing endif statement

    error introduced with the web merge-resolution tool :-(
    edgargabriel committed Oct 25, 2022
    8a754f1
  2. Merge pull request #22 from edgargabriel/pr/compile-fix

    fix a missing endif statement
    edgargabriel authored Oct 25, 2022
    fb0d339

Commits on Oct 31, 2022

  1. 9c97467
  2. Merge pull request #23 from edgargabriel/pr/link-fix

    add the rccl/lib directory to the link path
    edgargabriel authored Oct 31, 2022
    a80fbba
  3. make cmake stage also pass in CI

    the subdir entry is not actually required for the compilation.
    edgargabriel committed Oct 31, 2022
    377b28e

Commits on Nov 1, 2022

  1. Merge pull request #24 from edgargabriel/pr/cmake-fix

    make cmake stage also pass in CI
    edgargabriel authored Nov 1, 2022
    a8c920c
  2. 9d3a53d

Commits on Nov 24, 2022

  1. 08f8dc6

Commits on Nov 30, 2022

  1. fix algorithm assigning values in testsuite

    avoid a division by zero which seems to only occur for op=prod and
    datatype=half, since the maximum exponent is small (15) and can exceed
    the number of ranks.
    edgargabriel committed Nov 30, 2022
    e9f5be1

Commits on Dec 1, 2022

  1. Merge pull request #27 from edgargabriel/topic/half_prod_fix

    fix algorithm assigning values in testsuite
    edgargabriel authored Dec 1, 2022
    b3f0716

Commits on Feb 23, 2023

  1. 2b2f23f
  2. Merge pull request #28 from edgargabriel/topic/mpi-auto-compile

    auto-detect and enable MPI
    edgargabriel authored Feb 23, 2023
    453e729

Commits on Feb 25, 2023

  1. Adding -pthread flag for linking issues into src/Makefile (#30)

    * Adding -pthread flag for linking issues into src/Makefile
    
    * Adding -pthread flag for linking issues into CMakeLists.txt
    PedramAlizadeh authored Feb 25, 2023
    5275aa5

Commits on Mar 2, 2023

  1. revamp cmake MPI detection

    we honor user requested MPI installations using MPI_PATH first,
    and check afterwards for MPICH and Open MPI in the default
    Ubuntu and RHEL installation directories.
    edgargabriel committed Mar 2, 2023
    bdf58b1
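
The detection order described in the commit above (MPI_PATH first, then default install locations) can be sketched in Python. The actual logic lives in the CMake files, so this is only a model: `find_mpi`, its parameters, and the directory names used in the demo are hypothetical, and falling back to the defaults when MPI_PATH does not point at a directory is an assumption, not something the commit message states.

```python
import os


def find_mpi(mpi_path, default_dirs, is_dir=os.path.isdir):
    """Return the first usable MPI install directory, or None.

    mpi_path:     user-requested location (MPI_PATH), may be None or empty.
    default_dirs: distribution default locations to probe, in priority order.
    is_dir:       predicate used to probe a directory (injectable for testing).
    """
    # The user's explicit request always takes priority.
    if mpi_path and is_dir(mpi_path):
        return mpi_path
    # Assumption: otherwise fall back to the default locations in order.
    for candidate in default_dirs:
        if is_dir(candidate):
            return candidate
    return None


if __name__ == "__main__":
    # Hypothetical candidate directories, for illustration only.
    defaults = ["/opt/example/mpich", "/opt/example/openmpi"]
    print(find_mpi(os.environ.get("MPI_PATH"), defaults))
```

Passing the directory probe in as a parameter keeps the ordering logic separate from the filesystem, so the priority rule can be exercised without a real MPI installation.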

Commits on Mar 3, 2023

  1. Merge pull request #32 from edgargabriel/topic/mpi-auto-compile

    revamp cmake MPI detection
    edgargabriel authored Mar 3, 2023
    0fc25d5

Commits on May 23, 2023

  1. 83a89cb

Commits on Sep 28, 2023

  1. 948a664

Commits on Oct 2, 2023

  1. 491c33c