parameters for sparse data #63

Open
brentp opened this issue May 26, 2023 · 1 comment

Comments

@brentp
Contributor

brentp commented May 26, 2023

Hi Seth, do you have any recommendations for par_granges values for sparse data? I have some regions with very deep coverage that span only about 500 kb of the genome, so it seems that much of the run time is spent with low CPU usage.
Thanks!

@sstadick
Owner

sstadick commented Jun 8, 2023

Nothing off the top of my head. I'd guess, though, that a larger chunksize would be better for sparse data if you aren't using an intervals file of some sort to restrict the regions. My confidence in that guess is low. Looking at the code, the main thing you want to minimize is calls to process_regions that have no reads in that region, so a larger chunksize should decrease the number of misses on that front.
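
To make that reasoning concrete, here is a toy model in Python. It is not the library's actual scheduling code: `count_chunks` is a hypothetical helper, and the contig length and covered interval are made-up numbers chosen to resemble the roughly 500 kb of coverage described above. It only counts how many fixed-size chunks of a contig would contain no reads at a given chunksize.

```python
def count_chunks(contig_len, covered, chunksize):
    """Return (total_chunks, empty_chunks) for a contig cut into fixed-size chunks.

    covered: list of half-open (start, end) intervals that contain reads.
    Hypothetical helper for illustration only; not part of any library.
    """
    total = 0
    empty = 0
    for start in range(0, contig_len, chunksize):
        end = min(start + chunksize, contig_len)
        total += 1
        # A chunk is a "miss" when no covered interval overlaps [start, end).
        if not any(s < end and e > start for s, e in covered):
            empty += 1
    return total, empty


# Assumed example: a 10 Mb contig with reads only in one 500 kb window.
covered = [(2_000_000, 2_500_000)]
print(count_chunks(10_000_000, covered, 100_000))    # (100, 95)
print(count_chunks(10_000_000, covered, 1_000_000))  # (10, 9)
```

In this model, going from 100 kb to 1 Mb chunks cuts the empty chunks from 95 to 9. It says nothing about per-chunk cost or memory use, so treat it as a way to think about the miss count rather than as a tuning recommendation.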
