Grackle Tests #125

mabruzzo · 2021-08-04T16:06:13Z

Overview
This PR introduces 2 automated tests for EnzoMethodGrackle. These are the first tests to check that Grackle-related results remain consistent across commits. In both of these tests, the code sets up parallel one-zone models without hydrodynamics. More details are provided about the test below:

method_grackle_general: This uses the null method to set the simulation timestep (each timestep extends over multiple heating/cooling times). After a fixed amount of simulation time, the data is written to disk and the new tools/field_summary.py script is used to compute several summary statistics for relevant fields ("pressure", "temperature", "cooling_time") and compare them against previously computed reference values. The summary statistics include the mean, standard deviation, min, min-location, max, and max-location.
method_grackle_cooling_dt: This uses a similar setup to the first test, except that the timesteps are set to be a fixed fraction of the minimum magnitude of the heating/cooling time. This test checks the final simulation time, after a fixed number of cycles, against some reference value.

An advantage of not setting the timestep in the first test to be a fixed fraction of the minimum heating/cooling timescale is that it increases the chance that we might witness race-conditions (if any exist or get introduced) when the simulation is run in SMP-mode.

More Details

Unfortunately these tests may be a little fragile given that Grackle is an external dependency and incremental changes to Grackle may slightly change results. Hopefully, we can adjust the tolerances to be flexible enough that this isn't a major problem (but the tests remain meaningful). In any case, I think these tests are definitely an improvement over having nothing.

The enzo-dev style of regression/answer testing would certainly be a superior approach (it would facilitate comparisons between two revisions of Enzo-E on the same machine using a single build of Grackle). However, the machinery behind that testing system remains opaque to me. It would be great if we could move to that system sooner rather than later. I expect that my creation of tools/field_summary.py was reinventing the wheel to some degree, and it would be great to rely on existing machinery. As an aside, tools/field_summary.py could probably be reused in the short-term for the creation of other tests (e.g. the 1D Zel'dovich pancake tests mentioned in Issue #123)

An obstacle to creating these tests was that the path to the Grackle data files can't be hard-coded into the input files. To get around this issue, I have introduced tools/gen_grackle_testing_file.py which writes a new input file to disk that simply has an include-directive (i.e. to include the contents of input/method_grackle_general.in or input/method_grackle_cooling_dt.in and overwrites the value of the Method:grackle:data_file parameter. I choose to make this functionality and tools/field_summary.py into a command-line tool because of my uncertainty over the future format of the testing system (giving the pending transition of the build-system from Scons to CMake).

While implementing test/MethodGrackle/SConscript I broke parts of the test into individual build steps. I realize that may be somewhat difficult to follow. Let me know if you think I should refactor running the simulation and executing tools/field_summary.py into a single build step for the method_grackle_general test.

Removing -ffast-math from default build
This PR was delayed by issues I encountered with reproducibility. The tests ran fine on different machines with the same compiler version (e.g. on my personal laptop and the CI infrastructure). However, we got surprisingly large differences (the biggest relative difference was maybe ~0.1% for double-precision tests) when we ran the test using a binary with a different compiler version (e.g. my work machine). In particular I had encountered issues with versions 7.5 and 9.4 of the gcc compiler.

It turns out that these issues arose because of the value-unsafe floating-point optimizations enabled by the -ffast-math flag (which was enabled by default on linux-gnu builds). It took me a while to figure this out since Grackle itself is never compiled with this flag. As far as I can tell, most of the differences creep in during initialization of fields in EnzoInitialGrackleTest.

As a consequence of this issue, I have removed the -ffast-math flag from the default build configuration. I don't think this should be controversial since we were already moving towards this change in PR #139. Removing this flag required the following modifications to existing tests:

the expected values for the VL+CT's dual_energy_cloud test needed to be updated (there was already a conversation about needing this change in PR Replace scons build & test infrastructure with cmake #139).
the final time for the PPML tests had to be updated for binaries compiled with single precision
I needed to change the precision for the mass constant in the cosmology test from "double" to "default". Without this change, the test crashed for binaries compiled with single precision (without the -ffast-math). This problem was discussed in detail in Issue input/Cosmology/method_cosmology single-precision bugs #154.

While setting the tolerances for the double-precision version of the method_grackle_general test, I took some care to make sure they were loose enough to avoid issues when running the test on my 2 machines with different compiler versions. I didn't take as much care for the single-precision tests (I figure we can loosen the tolerances on those tests as it becomes necessary)

A section was also added to the website documentation to describe these tests.

…est is detected.

…that is necessary for the Grackle tests.

- fixed a bug in tools/field_summary.py that occurs for newer versions of yt - updated the expected restart values for double-precision

This commit consists of 3 main changesets that: 1. update the reference results for the single precision test 2. addressed some deprecation notices in tools/field_summary.py for modern versions of yt (while maintaining backwards compatability) - when passing a field to yt, we now a tuple including the fluid type (previously we just passed a single string, which caused some confusion) - the script now has yt compute the "weighted_standard_deviation" quantity instead of the "weighted_variance" quantity, when possible. In older versions of yt it falls back to having yt compute the "weighted_variance" quantity (which was incorrectly named - it always returned the weighted standard deviation). 3. tweaked the commands related to running the general_grackle test. Now the report is printed to stdout to make it obvious whether there where issues and exactly what they are

It turns out that this was the reason certain summary statistics were varying by more than 0.1% when I was using 2 different machines with different compiler versions (gcc versions 9.4.0 vs 7.5.0). This was not completely obvious because Grackle itself was never compiled with these flags (although it took me longer to determine than it should have). As far as I can tell most of the differences arose in EnzoInitialGrackleTest. We were already making this change in the CMake PR (enzo-project#139), anyways, so this shouldn't be controversial. As a result of this change, the following modifications were required for existing tests: - the expected values for the VL+CT dual_energy_cloud test needed to be updated. This change was previously discussed in the context of PR enzo-project#139. - the final time for the PPML tests had to be updated for binaries compiled with single precision - the cosmology test had to be disabled for binaries compiled with single precision. As is described in Issue enzo-project#154, it appears the `-ffast-math` had been suppressing an existing bug for this case (the test now crashes without the flag). Note that the test ran perfectly fine when compiled in double-precision. The double-precision results for the Grackle test also had to be updated (but the single-precision results were basically fine). Note that the tolerances for the double-precision results were calibrated so that the tests would pass if you used gcc versions 7.5.0 or 9.4.0 (they may need to be loosened more in the future). This commit also added the ability for the field_summary to accept different tolerances for different summary statistics (this is useful since we might want 0 tolerance for the location of a min/max while still applying a tolerance for the min/max itself). It also a runtime warnings raised by the script.

…ision binary (without the -ffast-math flag)

brittonsmith

This looks good to me. I had one very minor comment and a somewhat unrelated question.

input/Grackle/grackle.incl

gregbryan

Looks good to me (one minor correction and one minor docs request).

input/Grackle/grackle.incl

doc/source/tests/grackle.rst

@gregbryan

This comment addresses the following 3 PR comments: 1. We now comment out more fields required by Grackle to further test automatic field creation. 2. Fixed an issue with the length units definition. We had a comment stating that we were defining the length unit to be 1 pc, but were actually setting the length unit to 0.01 pc. I choose not to actually change the length unit definition (as was recommended by @gregbryan) since that would require us to update our reference answers. Instead, I updated the comment. 3. Better documented why we use such long timesteps (i.e. for increasing the chance of exposing SMP bugs) While modifying input/Grackle/grackle.incl, I deleted the "gamma" field. This was a holdover from a much earlier version of EnzoMethodGrackle. The current implementation does not support this field. This revealed that EnzoInitialGrackleTest was still accessing this field (it wasn't doing anything of substance with it). Therefore, I deleted the mentions of the "gamma" field from EnzoInitialGrackleTest (we can always add it back in the future).

…as accidentally overwritten while addressing a merge conflict.

peeples · 2022-06-10T17:52:11Z

@mabruzzo Please merge this once ready; looks like very little left to do!

…CTest infrastructure

mabruzzo · 2022-06-10T19:05:45Z

@peeples, I had to introduce 2 new scripts to get this to work properly. I'm not sure whether @gregbryan or @brittonsmith would want to look at this again.

I'm inclined to pull the trigger, since these new scripts effectively transcribe the logic that I had previously encoded in an SConscript file (and the logic should now be more readable). Plus, merging this in might help with wrapping up the multispecies cosmology PR (#143). But, I'll defer to your judgement.

peeples · 2022-06-10T19:06:38Z

Let's merge, and if more issues arise, they can be fixed in a small-and-fast PR. Thanks!

mabruzzo force-pushed the grackle_test branch from 507080c to 3e9425f Compare August 4, 2021 16:06

Introduced automated tests for EnzoMethodGrackle

c53c505

A section was also added to the website documentation to describe these tests.

mabruzzo force-pushed the grackle_test branch from 3e9425f to d643041 Compare August 4, 2021 17:17

Updated build.sh so that build.sh exits with failure when a crashed t…

d2a03c8

…est is detected.

mabruzzo force-pushed the grackle_test branch from d643041 to d2a03c8 Compare August 4, 2021 19:24

mabruzzo marked this pull request as draft November 8, 2021 17:41

mabruzzo changed the title ~~Grackle Tests~~ WIP: Grackle Tests Nov 20, 2021

stefanarridge mentioned this pull request Feb 25, 2022

Multispecies Cosmology #143

Merged

mabruzzo added 4 commits March 11, 2022 20:05

Merge branch 'main' into grackle_test

1665c28

Tweak to let Scons know when an interemediate dependency is created (…

60e696c

…that is necessary for the Grackle tests.

Made a series of tweaks

579b742

- fixed a bug in tools/field_summary.py that occurs for newer versions of yt - updated the expected restart values for double-precision

mabruzzo mentioned this pull request Mar 13, 2022

input/Cosmology/method_cosmology single-precision bugs #154

Closed

mabruzzo force-pushed the grackle_test branch from aaff67a to cfa2fd5 Compare March 13, 2022 17:54

mabruzzo marked this pull request as ready for review March 13, 2022 18:03

mabruzzo force-pushed the grackle_test branch from cfa2fd5 to 4c516a7 Compare March 13, 2022 18:09

mabruzzo changed the title ~~WIP: Grackle Tests~~ Grackle Tests Mar 13, 2022

Resolved the issue with running the Cosmology test with a single prec…

b0ba1a5

…ision binary (without the -ffast-math flag)

peeples requested review from brittonsmith and gregbryan March 14, 2022 16:39

mabruzzo mentioned this pull request Apr 30, 2022

Consolidate pressure routines, improve VL+CT & Grackle compatability #173

Merged

brittonsmith approved these changes May 9, 2022

View reviewed changes

input/Grackle/grackle.incl Outdated Show resolved Hide resolved

input/Grackle/grackle.incl Outdated Show resolved Hide resolved

gregbryan approved these changes May 9, 2022

View reviewed changes

input/Grackle/grackle.incl Outdated Show resolved Hide resolved

doc/source/tests/grackle.rst Outdated Show resolved Hide resolved

mabruzzo added 4 commits May 29, 2022 21:23

Merge branch 'main' into grackle_test

128d30c

reintroducing the fix to the single precision cosmology test - this w…

85e923b

…as accidentally overwritten while addressing a merge conflict.

Merge branch 'main' into grackle_test

e0d9ddf

Slightly refactored the Grackle tests to integrate them with the new …

fe5e530

…CTest infrastructure

peeples merged commit bac823a into enzo-project:main Jun 10, 2022

mabruzzo deleted the grackle_test branch June 10, 2022 19:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grackle Tests #125

Grackle Tests #125

mabruzzo commented Aug 4, 2021 •

edited

Loading

brittonsmith left a comment

gregbryan left a comment

peeples commented Jun 10, 2022

mabruzzo commented Jun 10, 2022

peeples commented Jun 10, 2022

Grackle Tests #125

Grackle Tests #125

Conversation

mabruzzo commented Aug 4, 2021 • edited Loading

brittonsmith left a comment

Choose a reason for hiding this comment

gregbryan left a comment

Choose a reason for hiding this comment

peeples commented Jun 10, 2022

mabruzzo commented Jun 10, 2022

peeples commented Jun 10, 2022

mabruzzo commented Aug 4, 2021 •

edited

Loading