Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Polish a large genome with Pilon #170

Open
enriquepola1996 opened this issue May 16, 2024 · 2 comments
Open

Polish a large genome with Pilon #170

enriquepola1996 opened this issue May 16, 2024 · 2 comments

Comments

@enriquepola1996
Copy link

Hello everyone,

I'm trying to polish a large genome (3Gb) with Pilon but I'm having problems with RAM. I read that some people choose to split the genome to deal with RAM, so I would like to try this alternative. However, I have a question about how I can separate the genome and finally join the outputs of each independent polish. Does anyone have experience with this?. My genome is somewhat fragmented (10,000 scaffolds).

I would appreciate any comments.

@SergeWielhouwer
Copy link

You would probably want to try out the --targets argument and run Pilon multiple times (e.g. 500 pilon jobs in parallel) by providing each scaffold name to --targets. Afterwards you can concatenate all the polished scaffolds together. It is likely not needed to first split the input BAM files.

I haven't tried it out myself, so hopefully someone from the Pilon team can share some thoughts on this.

@enriquepola1996
Copy link
Author

Hello @SergeWielhouwer

Thank you very much for the help, I'm going to try it. At the moment I'm trying Hapo-G and it seems to be going well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants