Python script to load distributed snapshots and Cosmology Dark Matter Only 64^3 test #383

Open
wants to merge 7 commits into dev

Conversation

bvillasen (Collaborator)

Added tools to load a distributed snapshot without merging the snapshot files into a single file. This is especially useful when dealing with large snapshots, where duplicating the memory footprint becomes very inconvenient.
The script also has the option to load only a sub-volume of the grid, in which case only the set of files covering that sub-volume is read; this is useful, for example, when plotting a slice of a very large grid that wouldn't fit in memory if the full volume were loaded.
An example and some description can be found in python_scripts/load_cholla_snapshot_distributed.py.
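
As a rough illustration, a usage sketch for loading a sub-volume is shown below. The function name load_cholla_snapshot_distributed is taken from this PR, but the import path, argument names, and call signature here (data type, field list, subgrid ranges) are assumptions, not necessarily the script's actual interface:

# Hypothetical usage sketch; the import path and call signature are assumptions.
from io_tools import load_cholla_snapshot_distributed

snapshot_dir = 'snapshot_files/'   # directory containing the distributed snapshot files
n_snapshot = 10                    # snapshot number to load

# Index ranges of the sub-volume to load; only the files that overlap this
# region are read, so a thin slab fits in memory even for very large grids.
subgrid = [[0, 256], [0, 256], [0, 4]]

data = load_cholla_snapshot_distributed('hydro', ['density'], snapshot_dir,
                                        n_snapshot, subgrid=subgrid)
density_slice = data['density'][:, :, 0]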

Added a test for a cosmological dark-matter-only run (particles only) with 64^3 particles on a single GPU.
To run, go to tests/cosmology/dark_matter_only/64_N1
and run:
sh run_test.sh

The script will:

  1. compile Cholla with the correct configuration for the dark matter only run
  2. download the initial conditions at redshift 100
  3. run the simulation to redshift 0
  4. download the reference result (redshift = 0)
  5. run a Python script to compare the dark matter density from the simulation output against the reference snapshot (a rough sketch of such a comparison follows this list)
  6. the script returns 0 if the test passes and 1 if it fails.
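
A minimal sketch of what the comparison step (5-6) might look like is shown below; the file names, the HDF5 dataset name for the dark matter density, and the tolerance are placeholders for illustration, not the exact contents of the script in this PR:

import sys
import h5py
import numpy as np

# Placeholder file names; the test script points these at the simulation
# output and the downloaded reference snapshot.
file_test = 'simulation_output.h5'
file_ref = 'reference_snapshot.h5'

with h5py.File(file_test, 'r') as f_test, h5py.File(file_ref, 'r') as f_ref:
    dens_test = f_test['density'][...]
    dens_ref = f_ref['density'][...]

# Maximum fractional difference of the dark matter density field.
diff = np.max(np.abs(dens_test - dens_ref) / np.abs(dens_ref))
tolerance = 1.0e-6   # placeholder tolerance

# Exit code 0 if the test passes, 1 if it fails (matching step 6 above).
sys.exit(0 if diff < tolerance else 1)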

@evaneschneider, let me know if this format for testing is OK, and I can add other tests for the cosmological hydrodynamics simulations (adiabatic and with H+He chemistry).

@bvillasen (Collaborator, Author) commented Apr 15, 2024

I had a conversation with @bcaddy about closing this PR on the grounds that there might not be value in it. I don't agree with that, but I wanted to hear other opinions from the team.

Besides the test, I think some of the tools could be useful to others. I have added scripts to load snapshots without having to write the snapshot into a single file, which is a big problem when dealing with large simulations. The script also has the option to load a sub-volume without loading the entire grid. From what I see, the concatenation scripts only paste all the snapshot parts into a single file. Is that right? @mabruzzo

The test script offers a simple way to add tests, one that can be more flexible than the current testing system, which seems to have some limitations that, according to Bob, would require a big change to the code to accommodate.

If @evaneschneider and others don't think it's appropriate to merge this PR, then I'll close it, but I wanted to hear more voices before doing so.

@bcaddy (Collaborator) commented Apr 15, 2024

I haven't completed my code review yet, but to add some clarification:

I think being able to load sub-volumes in a performant way is great. I know that's something that @mabruzzo and @evaneschneider have been working on/thinking about, so looping you in seemed like a good idea. I'm not sure what the best method is, but integrating with the current concatenation scripts seems like a more maintainable option to me than adding new scripts to do a similar thing.

The only new feature the testing scripts in this PR provide is the potential ability to run tests with non-default builds. Currently they can only do that with a single, hard-coded build, and generalizing and integrating them would require significant reworking of the scripts, the testing framework, and the CI pipelines. They also rely on unversioned data hosted on Dropbox instead of versioned data in our testing data repo. IMO, if we want to test a specific build with the current system, we can instead just add it to the list of defaults with a new build type file for it.

The floating point comparison in our current system is also more robust, stable, and accurate, and it offers multiple comparison methods depending on what works best for a specific test. Also, these scripts duplicate much of the code in run_tests.sh and the SystemTestRunner class, and they make our testing scheme less clear for newcomers by adding additional code that may not be used.
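
For context, a combined absolute/relative tolerance check is the general kind of comparison being referred to; this is a generic sketch, not the actual implementation in the SystemTestRunner class:

import numpy as np

# Generic example: fields agree if |a - b| <= abs_tol + rel_tol * |b| element-wise.
def fields_match(a, b, abs_tol=1.0e-14, rel_tol=1.0e-7):
    return np.allclose(a, b, atol=abs_tol, rtol=rel_tol)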

The build files for Lockhart are great, always happy to see native support for more machines.

@evaneschneider (Collaborator)

Thank you both for working on this! I really appreciate the effort on both your parts.

To respond to Bruno's questions - I think there is significant value in this PR (after all, I am the one who asked for a system test of the cosmology build, and this one seems like it will work great!), and I asked @bcaddy to take a look at it to figure out what exactly we need to change to integrate it with our current testing system. I do think this test will be more effective in the long run if we can incorporate it into our automated build / test system, and the work that remains to be done in this PR is sorting out how to do that.

My understanding is that adding the build would be simple (we just have to add it to the matrix), and the main thing that needs to be changed is to put the initial conditions and reference data into our GitHub tests subdirectory, and add the cosmology system test to the list of automated tests. I have asked Bob to help with that, since he set up the testing system and as far as I know this would be the first test that requires reading in the initial conditions, so there is probably not a bread-and-butter example to follow. If I am off base about how difficult it will be to incorporate this kind of test into our existing system, then we need to take a look at accommodating testing some other way, perhaps through a script like the one included in this PR, because we do need that kind of flexibility.

I am also fine with the additional python scripts, since adding those is consistent with our past practice. That said, I think there is significantly more likelihood that others will use them if the wiki is updated to document them.

If it would be helpful to set up a Zoom call to discuss strategy once Bob has finished his code review, I am happy to do that. Again, I very much appreciate the effort on everyone's part.

@bvillasen (Collaborator, Author)

I'm always happy to join a call to talk about Cholla ;)
I'm normally free at 8 am Pacific time (except for Thursdays); some weekday afternoons would also work in case 8 am PT doesn't.
Happy to talk about this further.

@bcaddy (Collaborator) commented Apr 15, 2024

The test itself is a separate topic that will be in an upcoming PR. Bruno is working on one that works with the default cosmology build; I think that's a better choice since it will presumably also test the coupling of the particles to the fluid rather than just the particles.

Collaborator

I think this was added to help with the testing scripts. I don't see a clear use case for it on its own that isn't met by the existing testing tools or h5diff. Unless you have a strong case for this, I would like it to be removed.

Collaborator

Do we currently have a single-file Python script that can be used to compare snapshots like this? If so, I am not aware of it, and this seems easy to use. I see no downside to adding it to this directory, although I agree it would be nice if it had some documentation.

Collaborator

Ok, I'm not terribly opposed to it. I don't see an immediate use case but we can always change/remove it later if needed.

Collaborator

This looks like its primary purpose is to be documentation for the contents of io_tools.py. I'd prefer to see that documentation alongside the code it documents, in io_tools.py and in the wiki, and this file removed.

Collaborator

Same here...

Collaborator

I disagree. To me it looks like a thin wrapper around load_cholla_snapshot_distributed with all the choices hard-coded, and it doesn't actually do anything besides call load_cholla_snapshot_distributed and provide some documentation for the contents of io_tools.py.

If it had all the choices as CLI arguments and was importable into another script, that might be different, but then it would just be a second, identical interface to load_cholla_snapshot_distributed.
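
For reference, the kind of interface being described here (CLI arguments plus an importable entry point) might look something like the sketch below; the argument names and the call signature of load_cholla_snapshot_distributed are assumptions for illustration:

import argparse
from io_tools import load_cholla_snapshot_distributed  # assumed import path

def main(argv=None):
    # Expose the hard-coded choices as CLI arguments instead.
    parser = argparse.ArgumentParser(description='Load a distributed Cholla snapshot.')
    parser.add_argument('snapshot_dir', help='directory containing the snapshot files')
    parser.add_argument('n_snapshot', type=int, help='snapshot number to load')
    parser.add_argument('--field', default='density', help='field to load')
    args = parser.parse_args(argv)
    # The call signature below is assumed for illustration.
    return load_cholla_snapshot_distributed('hydro', [args.field],
                                            args.snapshot_dir, args.n_snapshot)

if __name__ == '__main__':
    main()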

Collaborator

Right, but some of us LIKE having example files to work off of versus having everything be a CLI choice with documentation elsewhere. In a sense, this script provides the usage documentation that you are requesting; it's just a different style. I think it's good for us to accommodate different styles, so I'd like this to stay.

Collaborator

Ok, including it is fine with me then.

Collaborator

This needs to be documented in the wiki, ideally along with some documentation in the code as well.

Collaborator

Same here - I think documentation will make this much more useful, but right now I do not think it is duplicating functionality that we have...

Collaborator

Superseded by PR #390, please remove.

Collaborator

Superseded by PR #390, please remove.

Collaborator

Superseded by PR #390, please remove.

Collaborator

Superseded by PR #390, please remove.

@bvillasen (Collaborator, Author)

Closing this for now. I guess I'll resubmit at some point with some documentation... we'll see how that goes.

bvillasen closed this Apr 16, 2024
@evaneschneider (Collaborator)

Apologies that I was not able to get to this immediately after Bob's code review, but I think there is still a lot of value in this PR, even if not everything is documented yet. In particular, right now there is no simple way in the public repo to load distributed data and compare it, something that the python scripts in this PR would allow. They look easy to use, and it was my intention to keep the bar for adding scripts to this directory low, so that people can share tools quickly even if they are not as polished as code entering the main source directory. I think if we summarily dismiss PRs like this, it will only lead to people sharing scripts as files over email/slack instead, which I do not think is preferable.
