Fix #418 (iGenomes GRCm38 paths) #419

apeltzer · 2019-10-10T10:42:44Z

This should fix the issue @drpatelh found!

PR checklist

This comment contains a description of changes (with reason)
If you've fixed a bug or added code that should be tested, add tests!
Documentation in docs is updated
CHANGELOG.md is updated
README.md is updated

drpatelh

Nice! Thanks. Im going to submit another PR adding in some of the UCSC genomes too 👍 and other standard fields used in the chipseq and atacseq pipelines like macs_gsize. Do you want to add anything else?

apeltzer · 2019-10-10T10:48:06Z

No, I think that makes sense - we should probably have almost everything in there that is existing in iGenomes at some point as many pipelines will use that stuff?

drpatelh · 2019-10-10T10:54:02Z

The only other thing I can see that would be worth adding is the BismarkIndex that is used in methylseq. Ill add that in too:
https://github.com/nf-core/methylseq/blob/master/conf/igenomes.config

Will have to check all of the paths exists after Ive added everything in but it looks pretty well organised and consistent.

ewels · 2019-10-14T05:44:19Z

I don’t think that we need to add the Bismark refs to the template. I’d be surprised if any pipelines other than the methylseq pipeline need them, so it’s just something extra to clean up for each new pipeline.

GTF / Fasta etc is more fair game :)

drpatelh · 2019-10-14T06:19:28Z

Morning! I've gone through all of the organisms that were in the current igenomes.config and added all of these files already. I've also added the latest version of the UCSC genomes and checked that the file paths exist against the AWS iGenomes listing. I guess if you don't use the parameter for a specific index in the pipeline itself then it doesn't matter? Probably easier to have these paths in as a full listing then to try and add/remove them later?
https://github.com/nf-core/chipseq/blob/dev/conf/igenomes.config

The great thing is that I can just copy the file for any of the pipelines using iGenomes AWS now. Couple of things that need to be resolved:

Adding blacklists to AWS iGenomes (:sweat_smile:)
For bwa we refer to the full path to the index i.e. index_dir/genome.fa because it gives you more flexibility with genome indices that are named differently. However, for all of the other indices we just refer to index_dir. Haven't explicitly checked how other pipelines are using the latter but may be good to be consistent.

Fix nf-core#418

514dac6

apeltzer requested a review from drpatelh October 10, 2019 10:42

apeltzer mentioned this pull request Oct 10, 2019

iGenomes paths wrong for GRCm38 #418

Closed

drpatelh approved these changes Oct 10, 2019

View reviewed changes

drpatelh merged commit 2358cdb into nf-core:dev Oct 10, 2019

apeltzer deleted the fix-igenomes branch October 10, 2019 10:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #418 (iGenomes GRCm38 paths) #419

Fix #418 (iGenomes GRCm38 paths) #419

apeltzer commented Oct 10, 2019

drpatelh left a comment

apeltzer commented Oct 10, 2019

drpatelh commented Oct 10, 2019

ewels commented Oct 14, 2019

drpatelh commented Oct 14, 2019

Fix #418 (iGenomes GRCm38 paths) #419

Fix #418 (iGenomes GRCm38 paths) #419

Conversation

apeltzer commented Oct 10, 2019

PR checklist

drpatelh left a comment

Choose a reason for hiding this comment

apeltzer commented Oct 10, 2019

drpatelh commented Oct 10, 2019

ewels commented Oct 14, 2019

drpatelh commented Oct 14, 2019