-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FFT/generic : Replace explicit openacc directives with do concurrent #30
Comments
However, tests performed on the cluster Bede during the last GPU hackathon suggest that the nvidia compiler is not currently able to deal with this kind of do concurrent. Attached is a small example showing the issue.
|
The example attached refers to |
The objective would be to keep the
with
or alternatively a 3D shared buffer and
The example refers to |
Following commit ef6dfb9, explicit openacc directive allow the generic fft to run on GPU. Do concurrent should be used instead of explicit openacc directives.
The text was updated successfully, but these errors were encountered: