Skip to content

Commit

Permalink
Missing collapse statement in the omp code improves performance in th…
Browse files Browse the repository at this point in the history
…e ectrans benchmark
  • Loading branch information
PaulMullowney authored and wdeconinck committed Sep 1, 2023
1 parent 80f688c commit 9afc482
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/trans/gpu/internal/asre1b_mod.F90
Original file line number Diff line number Diff line change
Expand Up @@ -84,7 +84,7 @@ SUBROUTINE ASRE1B(KFIELD,PAOA,PSOA)
#ifdef OMPGPU
!WARNING: following line should be PRESENT,ALLOC but causes issues with AMD compiler!
!$OMP TARGET DATA MAP(ALLOC:PAOA,PSOA,D_MYMS,D_NPROCL,D_NSTAGT0B,D_NPNTGTB1,G_NDGLU,FOUBUF_IN)
!$OMP TARGET TEAMS DISTRIBUTE PARALLEL DO DEFAULT(NONE) PRIVATE(KM,ISL,IPROC,ISTAN,IGLS,IPROCS,ISTAS) &
!$OMP TARGET TEAMS DISTRIBUTE PARALLEL DO COLLAPSE(2) DEFAULT(NONE) PRIVATE(KM,ISL,IPROC,ISTAN,IGLS,IPROCS,ISTAS) &
!$OMP& SHARED(D_NUMP,D_MYMS,R_NDGNH,G_NDGLU,D_NPROCL,D_NSTAGT0B,D_NPNTGTB1,KFIELD,R_NDGL,FOUBUF_IN,PAOA,PSOA)
#endif
#ifdef ACCGPU
Expand Down

0 comments on commit 9afc482

Please sign in to comment.