-
Notifications
You must be signed in to change notification settings - Fork 172
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PR for Release 2.0.0 #105
PR for Release 2.0.0 #105
Conversation
DSL2 version with working alevin workflow
Use filtered gtf
The problem saying that genomic coordinates are not available in the DNA fasta is yet persisting using new files. I was able to overcome this problem when I used the I've been able to test already:
|
Also, when I try it with -[nf-core/scrnaseq] Pipeline completed with errors-
Error executing process > 'NFCORE_SCRNASEQ:SCRNASEQ:INPUT_CHECK:SAMPLESHEET_CHECK (samplesheet_2.0_full.csv)'
Caused by:
Process `NFCORE_SCRNASEQ:SCRNASEQ:INPUT_CHECK:SAMPLESHEET_CHECK (samplesheet_2.0_full.csv)` terminated with an error exit status (255)
Command executed:
check_samplesheet.py \
samplesheet_2.0_full.csv \
samplesheet.valid.csv
cat <<-END_VERSIONS > versions.yml
"NFCORE_SCRNASEQ:SCRNASEQ:INPUT_CHECK:SAMPLESHEET_CHECK":
python: $(python --version | sed 's/Python //g')
END_VERSIONS
Command exit status:
255
Command output:
(empty)
Command error:
INFO: Converting SIF file to temporary sandbox...
INFO: Cleaning up image...
FATAL: container creation failed: failed to add as session directory: path . is not an absolute path
Work dir:
/opt/projects/1357_BIP/2022-09_gODMSinglecell/scratch.NOBACKUP/work/56/cb96a32ae75de25046f368e1fe5f10
Tip: you can replicate the issue by changing to the process work dir and entering the command `bash .command.run` Any idea on what would be raising it? |
I tested the workflow on one of my datasets (8 samples of unpublished 10x v3 data from mouse).
Starsolo fails with Command output:
Jun 14 11:04:54 ..... started STAR run
Jun 14 11:04:56 ..... loading genome
Jun 14 11:05:09 ..... processing annotations GTF
Jun 14 11:05:26 ..... inserting junctions into the genome indices
Jun 14 11:06:55 ..... started mapping
Command error:
ReadAlignChunk_processChunks.cpp:202:processChunks EXITING because of FATAL ERROR in input reads: unknown file format: the read ID should start with @ or >
Jun 14 11:06:57 ...... FATAL ERROR, exiting I suspect that it might not handle the gzipped fastq files ( |
no idea about the singularity issue, but it seems like a problem with your system rather than the pipeline. |
Also (re-)discovered this issue: |
It seems that my custom configuration (though not for |
Yeah, I am thinking that is the issue as well. I am not being able to start the image manually as well. |
Regarding the error you faced using your data with And now that this has passed as well, I was able to execute Cheers. |
Hi, I can add this reference on the s3 bucket. Just following the docs by cellranger on the references though (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/using/tutorial_mr) and they recommend using Ensembl references. Did you check if the Ensembl reference as they recommend works? If not, does using this |
Hi @ggabernet, But at least with these As soon as I check with
|
The problem with STARsolo and the compressed files was a problem with my configuration (and, ultimately, a bug in nextflow, that should be resolved soon). However #60 is still an issue, I think. |
removing repeated definitions of parameters
Hi! Back from quick leave - can we align on which reference files to use for running the full tests? I was unable to use the ones that 10X said we should use in general, always getting weird failures no matter if I run them from what they provide nor if I create these with the script they provide (same type of error). (2020-A from here https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest) - @grst which ones did you use? |
Hi Alex, I am using for now the Ensembl ones as Gisela commented to, and so far it is working. |
I'm also doing some tests at the moment, I am also uploading already those 2 to the s3 bucket (Ensembl fasta and GTF) and we can directly do the tests on s3. I think the reason why it was not working for you @apeltzer, might be that when using the |
Updates in this PR: #112 |
update CI and full size test
Update nextflow.config
Guess just #60 needs a bugfix, will try to find out what is wrong there and fix it |
FWIW, I tried with both
But it seems we have a working solution now anyway? |
Hi @ggabernet, As promised previously I am just updating that using the :) |
yes, the tests passed on all tools as well on AWS with the Ensembl references 🎉 |
Fix for STAR chemistry issue #60
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks all good to me 🥳
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I admit that I haven't checked all 110 files again, but I'd say we are good to go!
This version is definitely already better than the 1.1.0 release.
Here as discussed with Alex, I open a PR from
dev
tomaster
so we can have our last checkings and runfull_test
to make sure it works before first release.We will probably still have to fix some tests that run on PRs to master. It seems to be running some tests that are not available or relevant anymore (e.g.
-profile test_kallisto
).