-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When using latest GEOS model in the DAS must use O-server #88
Comments
Ben: What's the GCM layout in this configuration (NX, NY)? |
No, I'm not talking about 5.27.1, I am just giving a heads up that when testing the GEOSadas with newer model versions that make use of MAPL2, use of the IO server is mandatory so the scripting will need to reflect this. That's all. |
Question on this: how does the total number of PEs about relate to the ncpus-per-nodes? So, we are on skylake w/ ncpus-per-node=40 but requesting a batch job w/ ncpus-per-node=36 would the math leading up to the number 5720 above change, that is, 5720 = 5400 + (8 * 40) in the case of 36 should I have the entry in the option -np change to 5400 + (36*8) = 5688 ? |
Yes, if you are only running 36 cores per node then you would change the -np to 5400 + (36*8) = 5688 |
More clarification, the infrastructure is detecting the node information and partitioning the MPI communicators between the nodes as appropriate. The npes_model does not have to be evenly divisible by the number of nodes and if there are few left over mpi tasks, those just aren't used by either the model or o-server. So just to makeup numbers, lets say you had 23 cores per node and needed 540 mpi tasks for the model and wanted 4 O-server nodes, you would use (24 * 23) (as you need 24 nodes to run the model) + (4 * 23) mpi tasks like so: -np 644, --npes_model 540, --nodes_output_server 4. You need 24 nodes to run the model (so 12 mpi tasks just don't get used by in the actual GEOSgcm.x run, they just wait for the rest to finish) |
Thanks Ben. |
Another issue related to updating the GEOSadas to use newer MAPL/Model versions
With changes to History to use the new O-Server and ESMF regridding, at the GEOSadas scales (c720, ~80 collections) it is necessary to use the o-server to achieve optimal performance. We have been benchmarking the DAS history with the latest version of MAPL that has the newest o-server options.
It was found that on the skylakes, 8 server nodes in this configuration provided good performance with limited gains beyond 8 nodes:
mpirun -np 5720 ./GEOSgcm.x --npes_model 5400 --nodes_output_server 8 --oserver_type multigroup --npes_backend_pernode 8 --fast_oclient true
The text was updated successfully, but these errors were encountered: