-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issues with Training the C.Origami Model Using Only Sequence Data and Integrating Multi-Species Data #48
Comments
Hi @hanshandong2024 there are all doable and make sense.
|
Thank you for your quick reply. When using the model I trained with my data for other tasks such as Prediction and Editing/Perturbation, do I only need to input sequence data? Below is the help documentation for the Prediction task.
|
Yes you only need the seq data. Also since you change a lot of things, I would suggest you to edit and run the prediction file directly instead of using the CLI. https://github.com/tanjimin/C.Origami/blob/main/src/corigami/inference/prediction.py |
Thank you very much for your guidance. |
Dear Author,
Thank you for developing such a perfect model like C.Origami. It's a great work, but I have encountered some difficulties.
First, I have DNA sequence information from other species, but there is no corresponding ATAC data and ChIP-seq data. I would like to try training the model and making predictions using only the sequence data. Could you please advise me on how to modify the code to retrain the model?
Second, the sequences of my target species are relatively short and may differ by orders of magnitude compared to those of humans and mice. This might lead to poor training results due to insufficient training data. I would like to expand the training data by using sequences from multiple species corresponding to multiple three-dimensional structures. I noticed that our training data is input by chromosome. Is it possible to input sequence information from multiple species corresponding to multiple three-dimensional structures?
Thank you for your assistance.
The text was updated successfully, but these errors were encountered: