Written train and validation files lose info from the original file #242
Comments
Thanks for using IPSuite and letting us know about this issue. Personally, I think the two ways of storing this information in ASE, either through e.g., the
I'll add support for magmoms, initial_magmoms, and qe_forces through https://github.com/zincware/ZnH5MD, which we use to save / load H5MD. I'll try to further support all properties in
For a quick solution, I added
IPSuite/ipsuite/data_loading/add_data_ase.py, lines 66 to 68 in bbeed50
You should be able to test it via
Thanks a lot @PythonFZ! Problem solved :-) I think having both options is really desirable. I understand the motivation for using the H5MD format for large MD data files, but for ML training codes those benefits are not really applicable, and the ASE extxyz format does provide scope to include different properties that are not restricted to calc.results. So the way you have implemented it now provides a good basis to serve both needs. On a different note
Our focus will remain on storing everything in H5MD, but I do agree that we should avoid unnecessary data duplication and overuse of the H5 files when they are not required.
Thanks for pointing out the changes made to MACE (new issue #246).
Hello again,
I really like the code you built and am trying to see if it fits our needs. Thanks a lot in advance for your help.
I would like to keep all the info from the user-supplied extxyz (in the info field as well as extra arrays) in the train and validation file split written during MACE training. Going over the code, I see this should be the case, but it is not happening. I suspect this is because of the internal data representation of the atoms.h5 format that gets written in the data add step, which cannot store this info. The train-data.extxyz and test-data.extxyz files in the nodes/MLModel directory only have the minimum info:
Lattice="7.725411453296575 -0.2019109857113853 -0.23392998344550006 0.15426298908328637 7.725411453296575 -0.26775798105159754 -0.20378498557876815 -0.23229898356092088 7.725411453296575" Properties=species:S:1:pos:R:3:forces:R:3 energy=-30889.876898123603 free_energy=-30889.876898123603 pbc="T T T"
The original file has more info in the info field and more arrays. You can easily test the issue by making a traj from the atom config below as an example. Any suggestions on how to solve this?
Thanks!
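One way to check which fields get dropped, without loading any library, is to compare the extxyz comment lines of the original and the written file. The sketch below is only an illustration: the helper names `parse_extxyz_header` and `lost_fields` are made up here, the lattice is shortened, and the "original" header is hypothetical (its `config_type`, `magmoms` and `qe_forces` entries are assumed examples, not taken from the actual input file).

```python
import shlex


def parse_extxyz_header(line):
    """Split an extxyz comment line into info keys and per-atom array names."""
    info = {}
    for token in shlex.split(line):  # shlex keeps quoted values together
        key, sep, value = token.partition("=")
        info[key] = value if sep else True  # bare keys act as boolean flags
    # Properties is a flat name:type:count list, e.g. species:S:1:pos:R:3
    fields = info.pop("Properties", "").split(":")
    arrays = {
        fields[i]: (fields[i + 1], int(fields[i + 2]))
        for i in range(0, len(fields) - 2, 3)
    }
    return info, arrays


def lost_fields(original_line, written_line):
    """Return info keys and array names found only in the original header."""
    info_a, arrays_a = parse_extxyz_header(original_line)
    info_b, arrays_b = parse_extxyz_header(written_line)
    return sorted(set(info_a) - set(info_b)), sorted(set(arrays_a) - set(arrays_b))


# Header as written to train-data.extxyz (lattice shortened for readability).
written = (
    'Lattice="7.7 0.0 0.0 0.0 7.7 0.0 0.0 0.0 7.7" '
    "Properties=species:S:1:pos:R:3:forces:R:3 "
    'energy=-30889.876898123603 free_energy=-30889.876898123603 pbc="T T T"'
)

# HYPOTHETICAL original header: the extra keys and arrays are assumptions.
original = (
    'Lattice="7.7 0.0 0.0 0.0 7.7 0.0 0.0 0.0 7.7" '
    "Properties=species:S:1:pos:R:3:forces:R:3:magmoms:R:1:qe_forces:R:3 "
    'energy=-30889.876898123603 config_type=bulk pbc="T T T"'
)

missing_info, missing_arrays = lost_fields(original, written)
print("info keys lost:", missing_info)
print("arrays lost:", missing_arrays)
```

Running the same comparison on the first frame of the real input file and of train-data.extxyz should show exactly which info entries and arrays do not survive the data add step.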