Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor(gpu): remove max_shared_memory from pbs arguments #1439

Merged
merged 1 commit into from
Aug 1, 2024

Conversation

agnesLeroy
Copy link
Contributor

Always use max shared memory from device 0 to configure the kernels, to avoid bugs with multi-GPU configurations

closes: please link all relevant issues

PR content/description

Check-list:

  • Tests for the changes have been added (for bug fixes / features)
  • Docs have been added / updated (for bug fixes / features)
  • Relevant issues are marked as resolved/closed, related issues are linked in the description
  • Check for breaking changes (including serialization changes) and add them to commit message following the conventional commit specification

Copy link
Contributor

@pdroalves pdroalves left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A long time waiting PR. Thank you @agnesLeroy !
The only thing that came to my eyes was the replacement of cuda_get_max_shared_memory(gpu_indexes[0]) by cuda_get_max_shared_memory(0). If that is intentional, then we are good to go.

Always use max shared memory from device 0 to configure the
kernels, to avoid bugs with multi-GPU configurations
@zama-bot zama-bot removed the approved label Aug 1, 2024
@agnesLeroy agnesLeroy merged commit 5547d92 into main Aug 1, 2024
76 of 77 checks passed
@agnesLeroy agnesLeroy deleted the al/clean_max_shared_memory branch August 1, 2024 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants