pp00aa dynamic OpenMP scheduling improves load imbalance #207

missing-user · 2024-11-14T14:07:25Z

Since Poincaré tracing uses an adaptive integration routine, execution time is not equal between points.
Chaotic regions in particular take longer to integrate and are typically not evenly distributed along the nptrj range, which results in a significant load imbalance between threads. The default static scheduling divides the work into large, roughly equal blocks of iterations, one per thread. Dynamic scheduling distributes the work at a finer granularity (by default one loop iteration at a time) and hands each iteration to whichever thread is available at the moment.
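
To illustrate the effect, here is a toy model of the two scheduling policies. This is only a sketch in Python, not the pp00aa code or OpenMP itself: the function names and the per-trajectory costs are made up for illustration, and scheduling overhead is ignored. It compares the time at which the slowest thread finishes when a few expensive ("chaotic") trajectories are clustered at the end of the loop range.

```python
import heapq


def static_makespan(costs, n_threads):
    """Contiguous, near-equal blocks, one per thread
    (roughly what schedule(static) without a chunk size does)."""
    base, extra = divmod(len(costs), n_threads)
    finish_times = []
    start = 0
    for t in range(n_threads):
        size = base + (1 if t < extra else 0)
        finish_times.append(sum(costs[start:start + size]))
        start += size
    return max(finish_times)


def dynamic_makespan(costs, n_threads, chunk=1):
    """Each chunk goes to whichever thread becomes idle first
    (roughly what schedule(dynamic, chunk) does)."""
    free_at = [0.0] * n_threads  # min-heap of times at which threads go idle
    heapq.heapify(free_at)
    for i in range(0, len(costs), chunk):
        t = heapq.heappop(free_at)
        heapq.heappush(free_at, t + sum(costs[i:i + chunk]))
    return max(free_at)


# Hypothetical costs: 12 cheap trajectories followed by 4 expensive ones.
costs = [1] * 12 + [10] * 4
print(static_makespan(costs, 4), dynamic_makespan(costs, 4))  # 40 13.0
```

In this made-up case the static split gives all four expensive trajectories to the last thread, so the other three sit idle for most of the run, while the dynamic split reaches the ideal value of total work divided by thread count (52 / 4 = 13).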

This might cause a minor performance overhead in the edge case of very small nppts, with low and large nptrj, but this should be outweighed by the improved load balance. It improved wall clock time and CPU utilization in all my tested examples.

E.g. for a simple rotating ellipse with

 odetol      =   1.000000000000000E-07
 nPpts       =      1000
 nPtrj       =      8    80

this reduced the wall clock time by 34s (from 1m25s to 51s, a speedup of about 1.7x) on 4 threads, and a speedup was measurable even in the "worst case" of nppts = 1.

missing-user · 2024-11-14T19:44:37Z

Arguably related to #79, since it does speed up field line tracing, and to #55.

jonathanschilling

LGTM!

pp00aa use dynamic OpenMP scheduling due to load imbalance

fdc832f

jonathanschilling approved these changes Nov 14, 2024
