Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel processing of paired-end fastq files #195

Open
PramodRaoB opened this issue Oct 14, 2024 · 2 comments
Open

Parallel processing of paired-end fastq files #195

PramodRaoB opened this issue Oct 14, 2024 · 2 comments

Comments

@PramodRaoB
Copy link

From my understanding of the code, in paired-end processing, both fastq input files could be processed parallelly, but they are currently sequential. Is this correct? If so, I could take this feature up.

@FelixKrueger
Copy link
Owner

Yes, it this is correct. If the trimming could occur in parallel it might indeed speed up the run-time, which sill allowing the the same flexibility further downstream. It might have to be taken care of for resource allocation later on, e.g. on nf-core, but this would be a subsequent step. If you want to have a go at it - that could be nice!

@PramodRaoB
Copy link
Author

I was thinking of a different workflow that would reduce the execution time by removing some I/O overhead.

Process both of the paired-end files parallelly by cutadapt and have both the outputs streamed. Then, combine the trimming and validation phase and write the final output to disk directly. Since Cutadapt scales well for thread count less than 8, processing them in parallel (with half the original thread-count) would still give the same effective runtime. But, this would avoid writing the intermediary trimmed outputs (which are potentially compressed leading to even higher runtimes) thereby giving a lower overall runtime.

Let me know if this sounds good @FelixKrueger ! Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants