You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe
It's frustrating to have the pipeline run through completely, only to tell me at the end that there are file name clashes based on me (as a new user of the pipeline) having set up my TSV file incorrectly.
Describe the solution you'd like
To help with this, it would be useful for the pipeline to do some basic error checking on the TSV file before starting any other steps; this means checking things such as: are values in columns where each row needs to be unique actually unique? Do the columns provided (at least) seem to match what should go into them based on the start point given (for example, does a file name look like a fastq / bam / bai file? Does the file actually exist?)? It would also be helpful if the pipeline supported a header row in the TSV file, as I suspect part of the reason I messed up was due to not having a header row to refer to.
Describe alternatives you've considered
Not sure if there would be any good alternatives to this; it's mostly a convenience feature to get the pipeline to throw an error before it runs all night and then needs to be restarted from scratch in the morning.
The text was updated successfully, but these errors were encountered:
maxulysse
changed the title
[FEATURE] Add basic error checking for input TSV file based
Add basic error checking for input csv samplesheet
Jun 20, 2022
EDIT by @maxulysse to add some checkbox to track the progress:
we should have proper check to see if data is correct for step:
nextflow run . -profile test,docker --step mapping
nextflow run . -profile test,docker --step markduplicates
nextflow run . -profile test,docker --step prepare_recalibration
nextflow run . -profile test,docker --step recalibrate
nextflow run . -profile test,docker --step variant_calling
nextflow run . -profile test,docker --step annotate
Some additional ideas from slack (edited by @FriederikeHanssen )
--step variant_calling
or--step_annotation
, then--tools
should not be emptyIs your feature request related to a problem? Please describe
It's frustrating to have the pipeline run through completely, only to tell me at the end that there are file name clashes based on me (as a new user of the pipeline) having set up my TSV file incorrectly.
Describe the solution you'd like
To help with this, it would be useful for the pipeline to do some basic error checking on the TSV file before starting any other steps; this means checking things such as: are values in columns where each row needs to be unique actually unique? Do the columns provided (at least) seem to match what should go into them based on the start point given (for example, does a file name look like a fastq / bam / bai file? Does the file actually exist?)? It would also be helpful if the pipeline supported a header row in the TSV file, as I suspect part of the reason I messed up was due to not having a header row to refer to.
Describe alternatives you've considered
Not sure if there would be any good alternatives to this; it's mostly a convenience feature to get the pipeline to throw an error before it runs all night and then needs to be restarted from scratch in the morning.
The text was updated successfully, but these errors were encountered: