Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the training data #105

Open
Qmi3 opened this issue Apr 17, 2024 · 1 comment
Open

About the training data #105

Qmi3 opened this issue Apr 17, 2024 · 1 comment

Comments

@Qmi3
Copy link

Qmi3 commented Apr 17, 2024

Great Work! It seems that ModelAngelo organizes a training set with good quality. Do you have plans to release them? I think it's a great resources for the computational CryoEM community.

@jamaliki
Copy link
Collaborator

Hi @Qmi3 ,

Sorry for the late reply! The original dataset with all the processing already done etc is too large for us to be able to release, due to the size of the maps associated with it.

What I can offer instead is the following list of PDB ids used for training. I can also (if there is interest) provide some cleaned scripts for the PDB parsing, some constant shifts required to register cryo-EM maps with these models, and some pointers for processing the maps.

Please find the attached list of PDB ids:
model_angelo_train_pdbs.txt

Best,
Kiarash.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants