Commit
6280: Add TBB to HPX documentation in Migration Guide r=hkaiser a=dimitraka

6284: Add CUDA/HIP MultiGPU Event Polling r=hkaiser a=G-071

While the [cuda,hip]_executors within HPX support multi-GPU scenarios themselves (by using a device index and `cudaSetDevice`), the event polling does not, which forces users to fall back to the less performant callback version.

To fix this, the CUDA/HIP event pool must be aware of which device each reused event was created on, and it needs an index so that events are always pushed and popped on the correct device. This PR implements this by adding multiple event stacks (one per device) to the event pool singleton.

Furthermore, the PR adds an additional device_id parameter to the appropriate get_future calls that use events. To stay compatible with code written against previous HPX versions, this parameter defaults to 0 wherever it was added.

For each device, 128 events are always created up front by default. To avoid this overhead when only a single GPU is used, set the environment variable CUDA_VISIBLE_DEVICES=0.

Co-authored-by: dimitraka <[email protected]>
Co-authored-by: Gregor Daiss <[email protected]>
Co-authored-by: Gregor Daiß <[email protected]>
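The per-device event stacks described in the message for 6284 can be illustrated with a minimal, GPU-free sketch. This is not the HPX implementation: `fake_event` and `per_device_event_pool` are hypothetical names, `fake_event` merely stands in for a `cudaEvent_t`/`hipEvent_t`, and the locking strategy is an assumption. The sketch only shows the idea of one stack per device, an explicit device index on push and pop, and 0 as the default index.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical stand-in for cudaEvent_t / hipEvent_t. It remembers the device
// it was "created" on so that the sketch runs without a GPU.
struct fake_event
{
    int device_id;
    int serial;
};

// Sketch of an event pool that keeps one stack of reusable events per device.
class per_device_event_pool
{
public:
    // Pre-create `initial_events` events on each of `num_devices` devices.
    per_device_event_pool(std::size_t num_devices, std::size_t initial_events)
      : stacks_(num_devices)
    {
        for (std::size_t device = 0; device < num_devices; ++device)
            for (std::size_t i = 0; i < initial_events; ++i)
                stacks_[device].push_back(create_event(static_cast<int>(device)));
    }

    // Take an event that belongs to `device_id`. The default of 0 mirrors the
    // backward-compatible default mentioned in the commit message.
    fake_event pop(int device_id = 0)
    {
        std::lock_guard<std::mutex> lock(mutex_);
        auto& stack = stacks_.at(static_cast<std::size_t>(device_id));
        if (stack.empty())
            return create_event(device_id);    // grow on demand
        fake_event e = stack.back();
        stack.pop_back();
        return e;
    }

    // Return an event to the stack of the device it was created on.
    void push(fake_event e, int device_id = 0)
    {
        assert(e.device_id == device_id);    // never mix devices
        std::lock_guard<std::mutex> lock(mutex_);
        stacks_.at(static_cast<std::size_t>(device_id)).push_back(e);
    }

    // Number of events currently available for reuse on `device_id`.
    std::size_t available(int device_id = 0) const
    {
        std::lock_guard<std::mutex> lock(mutex_);
        return stacks_.at(static_cast<std::size_t>(device_id)).size();
    }

private:
    fake_event create_event(int device_id)
    {
        return fake_event{device_id, next_serial_++};
    }

    mutable std::mutex mutex_;
    std::vector<std::vector<fake_event>> stacks_;
    int next_serial_ = 0;
};
```

In this sketch, a pool constructed as `per_device_event_pool pool(2, 128);` holds 128 events for each of two devices; `pool.pop(1)` hands out an event tied to device 1 and leaves the stack for device 0 untouched, while `pool.pop()` falls back to device 0.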