The change was an attempt to redesign the whole control flow lowering in the AMDGPU backend. Since then, I have found a simpler and cleaner way to solve the problem that motivated the change.
The culprit commit is going to be reverted and will not be part of the next release.
So it doesn't make sense to investigate the exact cause or make any changes to it now.
Thanks for the very clear repro steps, @Beanavil, and for the update, @alex-t. I reran this from ROCm/llvm-project amd-staging and I don't see any test failures, so I'm closing this issue.
Problem Description
One of rocThrust's tests for the scan algorithm is failing right after the ROCm 6.2.1 release. During debugging, it was noticed that modifying the addition operator of the FixedVector type used in the test like so:

    __host__ __device__ FixedVector operator+(const FixedVector& bs) const
    {
        FixedVector output;
    +   #pragma unroll 1
        for(unsigned int i = 0; i < N; i++)
            output.data[i] = data[i] + bs.data[i];
        return output;
    }

fixes the test.
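For readers without the rocThrust sources at hand, here is a minimal host-only sketch of what such a type and a reference scan over it might look like. It is not the actual test code: the template parameters, the constructors, and the helper `inclusive_scan_reference` are assumptions made for illustration, the `__host__ __device__` qualifiers are dropped so it builds with a plain C++ compiler, and the pragma is guarded so that compilers other than clang simply skip it.

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Hypothetical stand-in for the test's FixedVector type (host-only sketch).
template <typename T, unsigned int N>
struct FixedVector
{
    T data[N];

    // Zero-initialize every element.
    FixedVector()
    {
        for(unsigned int i = 0; i < N; i++)
            data[i] = T();
    }

    // Fill every element with the same value.
    explicit FixedVector(T init)
    {
        for(unsigned int i = 0; i < N; i++)
            data[i] = init;
    }

    // Element-wise addition, with the workaround from the issue applied.
    FixedVector operator+(const FixedVector& bs) const
    {
        FixedVector output;
#if defined(__clang__)
#pragma unroll 1 // ask clang not to unroll this loop
#endif
        for(unsigned int i = 0; i < N; i++)
            output.data[i] = data[i] + bs.data[i];
        return output;
    }
};

// Hypothetical helper: sequential inclusive scan on the host, usable as a
// reference result when checking a device-side scan.
template <typename T, unsigned int N>
std::vector<FixedVector<T, N>>
inclusive_scan_reference(const std::vector<FixedVector<T, N>>& input)
{
    std::vector<FixedVector<T, N>> output(input.size());
    std::partial_sum(input.begin(), input.end(), output.begin());
    return output;
}
```

Comparing a device scan against a host reference of this kind is one way to check whether a given compiler build produces the wrong result with and without the pragma.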
Further investigation tracked the origin of the issue to this particular commit. Manually building this repo checked out at said commit and compiling with the clang++ executable generated reproduces the issue, while checking out to a previous commit and repeating the same process yields a successful execution.
This test failure is confirmed to happen on Vega20 (gfx906), MI100 (gfx908), V620 (gfx1030), and rx7900xtx (gfx1100).
Operating System
Ubuntu 20.04.6 LTS
CPU
AMD EPYC 7713P 64-Core Processor
GPU
AMD Instinct MI100
ROCm Version
ROCm 6.2.0
ROCm Component
llvm-project
Steps to Reproduce
cmake -G Ninja -S llvm -B build -DLLVM_INSTALL_UTILS=ON -DCMAKE_INSTALL_PREFIX=/path/to/llvm/install -DCMAKE_BUILD_TYPE=Release -DLLVM_BUILD_TESTS=False -DLLVM_ENABLE_PROJECTS="clang;lld;compiler-rt"
./build/testing/test_thrust_scan # this should fail
cmake -G Ninja -S llvm -B build -DLLVM_INSTALL_UTILS=ON -DCMAKE_INSTALL_PREFIX=/path/to/llvm/install -DCMAKE_BUILD_TYPE=Release -DLLVM_BUILD_TESTS=False -DLLVM_ENABLE_PROJECTS="clang;lld;compiler-rt"
./build/testing/test_thrust_scan # this should pass
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response