Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecate cub::RegBoundScaling and cub::MemBoundScaling #3685

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Feb 4, 2025

Fixes #3663

Description

Deprecate and move into detail namespace cub::RegBoundScaling and cub::MemBoundScaling

  1. Deprecate cub::RegBoundScaling and cub::MemBoundScaling
  2. Forward them to cub::detail namespace implementation

To backport to 2.8

@fbusato fbusato added 3.0 Targeted for 3.0 release backport branch/2.8.x labels Feb 4, 2025
@fbusato fbusato self-assigned this Feb 4, 2025
@fbusato fbusato requested review from a team as code owners February 4, 2025 23:45
Copy link
Contributor

github-actions bot commented Feb 5, 2025

🟩 CI finished in 1h 50m: Pass: 100%/90 | Total: 2d 17h | Avg: 43m 43s | Max: 1h 16m | Hits: 146%/12730
  • 🟩 cub: Pass: 100%/44 | Total: 1d 16h | Avg: 55m 33s | Max: 1h 16m | Hits: 41%/3500

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 14h | Avg: 55m 00s | Max:  1h 16m | Hits:  41%/3500  
      🟩 arm64              Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 09m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 05m | Hits:  41%/875   
      🟩 12.5               Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m
      🟩 12.8               Pass: 100%/37  | Total:  1d 09h | Avg: 54m 03s | Max:  1h 16m | Hits:  41%/2625  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 07m | Avg:  1h 01m | Max:  1h 05m | Hits:  41%/875   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m
      🟩 nvcc12.8           Pass: 100%/35  | Total:  1d 07h | Avg: 53m 38s | Max:  1h 16m | Hits:  41%/2625  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 14h | Avg: 55m 17s | Max:  1h 16m | Hits:  41%/3500  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 57m | Avg: 59m 26s | Max:  1h 01m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 30s | Max: 58m 39s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 20s | Max: 58m 57s
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 02m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 54m | Avg: 50m 42s | Max:  1h 09m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 58m | Avg: 59m 13s | Max:  1h 01m
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m
      🟩 GCC9               Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 06m
      🟩 GCC10              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 40s | Max: 57m 55s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 09s | Max:  1h 01m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 04m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 38m | Avg: 39m 48s | Max:  1h 09m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 16m | Hits:  41%/1750  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m | Hits:  41%/1750  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 47m | Avg: 55m 43s | Max:  1h 09m
      🟩 GCC                Pass: 100%/21  | Total: 17h 45m | Avg: 50m 45s | Max:  1h 09m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 54m | Avg:  1h 13m | Max:  1h 16m | Hits:  41%/3500  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 52m 03s | Avg: 26m 01s | Max: 26m 16s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 11h | Avg:  1h 03m | Max:  1h 16m | Hits:  41%/3500  
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 09m | Avg: 31m 14s | Max:  1h 04m
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 01m | Max:  1h 16m | Hits:  41%/3500  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 33s | Avg: 22m 33s | Max: 22m 33s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 10s | Max: 25m 47s
      🟩 TestGPU            Pass: 100%/2   | Total: 41m 42s | Avg: 20m 51s | Max: 21m 22s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 52m 03s | Avg: 26m 01s | Max: 26m 16s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 56m | Avg:  1h 02m | Max:  1h 16m | Hits:  41%/2625  
      🟩 20                 Pass: 100%/24  | Total: 19h 48m | Avg: 49m 30s | Max:  1h 15m | Hits:  41%/875   
    
  • 🟩 thrust: Pass: 100%/43 | Total: 1d 00h | Avg: 33m 54s | Max: 1h 08m | Hits: 186%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 38m 49s | Avg: 19m 24s | Max: 27m 50s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 23h 11m | Avg: 33m 56s | Max:  1h 08m | Hits: 186%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 55s | Max: 33m 26s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 13m | Avg: 38m 38s | Max: 55m 12s | Hits: 141%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 06s | Max: 56m 12s
      🟩 12.8               Pass: 100%/36  | Total: 19h 14m | Avg: 32m 03s | Max:  1h 08m | Hits: 197%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 10s | Avg: 28m 05s | Max: 28m 30s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 13m | Avg: 38m 38s | Max: 55m 12s | Hits: 141%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 50m | Avg: 55m 06s | Max: 56m 12s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 18h 18m | Avg: 32m 17s | Max:  1h 08m | Hits: 197%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 10s | Avg: 28m 05s | Max: 28m 30s
      🟩 nvcc               Pass: 100%/41  | Total: 23h 21m | Avg: 34m 11s | Max:  1h 08m | Hits: 186%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 06m | Avg: 31m 34s | Max: 33m 15s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 34s | Max: 34m 45s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 52s | Max: 34m 35s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 54s | Max: 33m 04s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 52s | Max: 32m 24s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 04s | Max: 34m 55s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 36s | Avg: 35m 36s | Max: 35m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 52s | Max: 38m 40s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 06s | Max: 36m 01s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 33s | Max: 31m 39s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 40s | Max: 36m 22s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 16m | Avg: 24m 35s | Max: 37m 11s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 43s | Max:  1h 02m | Hits: 141%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 47m | Avg: 55m 42s | Max:  1h 08m | Hits: 215%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 06s | Max: 56m 12s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 10m | Avg: 28m 49s | Max: 34m 45s
      🟩 GCC                Pass: 100%/19  | Total:  9h 32m | Avg: 30m 09s | Max: 38m 40s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 54s | Max:  1h 08m | Hits: 186%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 50m | Avg: 55m 06s | Max: 56m 12s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 09m | Avg: 36m 39s | Max:  1h 02m | Hits: 141%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 08m | Avg: 24m 49s | Max:  1h 08m | Hits: 253%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 53m | Avg: 37m 06s | Max:  1h 08m | Hits: 141%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 42s | Avg: 17m 14s | Max: 35m 32s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 01s | Avg: 11m 00s | Max: 12m 17s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 38s | Avg: 33m 38s | Max: 33m 38s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 48m | Avg: 38m 24s | Max:  1h 02m | Hits: 141%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 50m | Avg: 30m 59s | Max:  1h 08m | Hits: 253%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 28s | Avg: 3m 44s | Max: 5m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  5m 11s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
      🟩 Test               Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.0 Targeted for 3.0 release backport branch/2.8.x
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

Deprecate cub::RegBoundScaling and cub::MemBoundScaling
1 participant