-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update to MAPL 2.23.1 and ESMA_env v4.2.0 #568
Update to MAPL 2.23.1 and ESMA_env v4.2.0 #568
Conversation
@mathomp4, thanks for keeping us current. I went ahead and ran the tests, in the hope that it would work out and we can just merge the PR. Alas, I'm getting somewhat odd results. All Intel "model" tests pass, but the Intel "assim" tests fail to run. The error messages are as follows: |
Interesting. This is due to a new fix in MAPL from @bena-nasa detailed in: based on tests with the ADAS: where Ben was trying to run the ADAS and found that the scripting wasn't cleaning up background files in a good way and History was essentially trying to write to a file that already existed. Ben is on leave until next week, but maybe you can read through the GEOSadas issue and see if you can figure out what's happening? He was pretty confident that if History ever hit this, something was wrong.
Ah. These will fail for sure as the nightly test scripts are still pointing to an older-Baselibs |
@mathomp4: Thanks for the info. I think I sort of understand the ADAS issue reported by @bena-nasa, but I don't quite see how it would apply to the GEOSldas test case. Here, we're just writing a regular, 3-hr avg output file. There's no 2-step dance between the predictor and corrector segments, so this particular time-stamped file is only written once. |
@gmao-rreichle
|
@bena-nasa, the problem occurs with the standard LDAS Nightly test suite. For reference, my most recent (failed) runs are still in |
@gmao-rreichle @biljanaorescanin
It had no problem writing this. Is the scripting maybe touching this file? I really don't think there is any logic problem with MAPL but it really seems like somehow something touched this file before GEOSldas.x executed. |
@bena-nasa : Thanks. What was the config of your model run? Single-member or ensemble? I'm asking because my only hunch is that the scripting and model execution differs between single-member, no perturbation simulations and ensemble simulations with perturbations. Here's my earlier comment on this:
|
Here are run scripts for tests: |
@gmao-rreichle @biljanaorescanin This was a stock GESOgcm.x run which can not run ensembles but I don't see how that would matter once the GEOSldas.x executes. When you run ensembles you still run a single executable that runs via MAPL_Cap? Does each ensemble have it's own MAPL_CapGridComp (so would have its own history instantiation) or is it an "ensemble" in the sense that below the single capgridcomp, under the root gridcoomp you have multiple children of somebody? I assume it is the latter. Still would bet 4 figures this is a scripting problem. @biljanaorescanin why don't you try putting an exit in the script right before (and by that I literally mean the line before the GEOSldas.x executes). Let's see where the problem lies definitively, either the file will be there or it won't in the scratch or temporary directory GEOSldas.x will be run from. |
Yes, it's the latter.
It's worth a try for @biljanaorescanin to do this, but if this doesn't point to the solution easily, it's probably best to wait for @weiyuan-jiang to be back in the office. Also, it's still only an assumption that I did not mess up somewhere. Perhaps the most important thing for @biljanaorescanin would be to confirm that she can reproduce my problem with the LDAS_GLOBAL/assim test case. |
@bena-nasa, @mathomp4, here's an update. Both @biljanaorescanin and I ran more tests using Matt's LDAS_Nightly_Cron.csh script and noticed the following:
So something in the new infrastructure doesn't seem to want to work with GEOSldas. Could still be a GEOSldas scripting error, or perhaps a problem with how MAPL is used in GEOSldas. Or it's possibly a problem with MAPL HISTORY that is uncovered by GEOSldas but not the GCM. I can only hope it's not too hard to figure out. |
@gmao-rreichle I can trigger the same error in the GCM. I have a theory I need to discuss with @bena-nasa, but hopefully we can figure out a solution (might be simple, might be complex). |
@biljanaorescanin can you try updating MAPL this this branch? |
I've pushed my |
Why is the CI failing here? It should not be doing so. |
Okay. I think I figured out the reason the CI was failing. The hotfix branch was not up to date with MAPL I've also made a useless comment to re-trigger the CI |
All except one assim test are now failing for the same error for both gnu and intel. One that passes is for All model tests passed now as you can see below. Error all failed tests have: |
@biljanaorescanin @gmao-rreichle this is baffling, well, I reverted all_pes to .true. and moved the file inquiring into the netcdf block of that if/else. Can you pull the hotfix branch and try again. |
@bena-nasa @mathomp4 @gmao-rreichle |
Thanks, @biljanaorescanin. Before we can merge this, we need a new release of MAPL that includes @bena-nasa's hotfix, and once we have that, we'll need to run the tests one more time to be safe. |
@biljanaorescanin @gmao-rreichle We have released MAPL 2.23.1 and I've updated this PR accordingly |
I've retested from Matt's systests, all tests except GNU passed as we expected. |
This is a PR to update GEOSldas to use MAPL
v2.23.0v2.23.1. Note that this update also requires a newer ESMA_env (aka Baselibs). This was due to needing some new interfaces in yaFyaml for use with ExtData2G.All testing with GEOSgcm is zero-diff, so I'm labeling as such.
The equivalent GEOSgcm PR is GEOS-ESM/GEOSgcm#428
NOTE: For now I'll keep this draft because I want to make sure the CI works. This is not anything crucial for the LDAS so it can wait until @biljanaorescanin is back from her leave so she can do full testing.