Fixed issues related to checkpointing and profiling #13

Merged: 8 commits into argonne-lcf:main on Dec 1, 2022


zhenghh04
Member

In this PR, I fixed the following issues:

  • Fixed the checkpointing issue raised in #12 (Checkpointing feature doesn't work). I also changed the default values of steps_between_checkpoints and epochs_between_checkpoints.
  • Fixed the environment setup for the Darshan profiler to reflect recent changes in Darshan. I removed the darshan_preload config option, since the Darshan runtime library has to be loaded through LD_PRELOAD before Python starts.
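
To illustrate the LD_PRELOAD point above, here is a minimal sketch of why the preload cannot be handled from inside the benchmark itself. The dynamic loader reads LD_PRELOAD when a process starts, so the variable has to be in the environment of a new interpreter. The helper names `build_darshan_env` and `run_with_darshan` and any library path passed to them are hypothetical and not part of this repository; the actual Darshan library location depends on the local install.

```python
import os
import subprocess
import sys


def build_darshan_env(darshan_lib, base_env=None):
    """Return a copy of the environment with darshan_lib first in LD_PRELOAD.

    darshan_lib is a caller-supplied path (hypothetical here); nothing in
    this sketch checks that the file exists.
    """
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("LD_PRELOAD", "")
    # Keep any libraries that were already preloaded, after the Darshan one.
    env["LD_PRELOAD"] = f"{darshan_lib}:{existing}" if existing else darshan_lib
    return env


def run_with_darshan(script_args, darshan_lib):
    """Start a fresh Python interpreter with the Darshan library preloaded.

    Changing os.environ["LD_PRELOAD"] in the current process would not
    instrument it, because the loader has already run for this process.
    """
    return subprocess.run(
        [sys.executable, *script_args],
        env=build_darshan_env(darshan_lib),
        check=False,
    )
```

In a shell, the equivalent is to set LD_PRELOAD on the command line that launches Python, which is why a config option read after startup cannot do the job.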

In addition, there are the following minor fixes:

  • Removed logdir from the config options, since it is no longer needed.
  • Fixed bugs in the PNG generator caused by a dtype issue.
  • Fixed bugs in the HDF5 generator.
  • Fixed issues in requirement.txt (the *+cpu packages could not be found).

zhenghh04 self-assigned this on Dec 1, 2022
zhenghh04 merged commit db7a39d into argonne-lcf:main on Dec 1, 2022