Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

1.1 release #94

Merged
merged 101 commits into from
Jan 4, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
101 commits
Select commit Hold shift + click to select a range
7450071
Merge pull request #67 from sanger-tol/main
priyanka-surana May 18, 2023
3bef26c
nextflow version update
priyanka-surana May 18, 2023
c9f460f
pipeline version update
priyanka-surana May 19, 2023
542e1e8
Added the CI workflows to test the pipeline on the Sanger farm via Tower
muffato Aug 4, 2023
d715cbf
Pin the version to 2.8
muffato Aug 4, 2023
2ea0ec1
The file needs to be listed as having been modified from the template
muffato Aug 4, 2023
2c77818
Fixed this leftover from a wrong nf-core template
muffato Aug 4, 2023
39ed78e
Our new convention is use per-pipeline directories under pipeline_info/
muffato Aug 4, 2023
ab49082
updated ncbi container
priyanka-surana Aug 9, 2023
731f4fb
Merge pull request #76 from sanger-tol/ncbi
priyanka-surana Aug 10, 2023
861b93f
Latest nomenclature
muffato Aug 10, 2023
fd5abba
Merge pull request #74 from sanger-tol/sanger_ci
muffato Aug 11, 2023
54503e2
Faster tests
muffato Aug 10, 2023
05a3e42
Adding the CI back to see how long it takes
muffato Aug 10, 2023
043cc52
Just need smaller BAM files
muffato Aug 10, 2023
3dea427
Just a hunch
muffato Aug 11, 2023
c1f069e
Bumped the minimum version down
muffato Aug 11, 2023
c4a9235
Need a slightly larger k to get a non-zero mapping
muffato Aug 13, 2023
1cf45cb
Updated subset that has a non-zero kmer score
muffato Aug 13, 2023
16e8b66
More version changes
muffato Aug 13, 2023
f90e9c8
Use the latest version of the Tower action
muffato Aug 14, 2023
dfc63a8
BUSCO creates a lot of files and can overwhelm Lustre
muffato Aug 14, 2023
31f0239
Update sanger_test.yml
gq1 Aug 16, 2023
2603e51
Update sanger_test_full.yml
gq1 Aug 16, 2023
77fb145
Update sanger_test.yml
gq1 Aug 16, 2023
a044b4a
New tolsoft details
muffato Aug 18, 2023
d6439ef
Fixed the repository name
muffato Aug 18, 2023
c792b14
Merge pull request #77 from sanger-tol/quick_github_ci
muffato Aug 18, 2023
4f336ee
Get the BUSCO ODB name from the BUSCO definition file rather than GoaT
muffato Oct 9, 2023
5f8b88b
Check that the JSON contains an assembly
muffato Oct 10, 2023
7a6f3ff
Applied black
muffato Oct 10, 2023
1939254
Applied black
muffato Oct 10, 2023
f06f8cf
typo
muffato Oct 10, 2023
bdc772c
Merge pull request #83 from sanger-tol/busco_tax_ids
muffato Oct 18, 2023
33a8b4e
Merge pull request #84 from sanger-tol/check_ncbidatasets
muffato Oct 18, 2023
1684abd
Update maxRetries
priyanka-surana Oct 19, 2023
ebf0747
Update cooler base settings
priyanka-surana Oct 19, 2023
3a60381
Merge pull request #86 from sanger-tol/maxRetries
priyanka-surana Oct 19, 2023
f8d5965
Don't need a complete process for that
muffato Nov 9, 2023
57df724
Take 100M out for the various overheads
muffato Nov 9, 2023
0ca1e3e
Make sure the temporary files are created locally rather than in /tmp
muffato Nov 10, 2023
f85341f
First stab at defining optimised resource requirements for the pipeli…
muffato Nov 10, 2023
6dc2da7
Indentation should be a multiple of 4
muffato Nov 10, 2023
ec978ee
typo
muffato Nov 10, 2023
3e40be1
Another typo
muffato Nov 10, 2023
7372b6e
Removed trailing whitespace
muffato Nov 10, 2023
381536e
Another round of resource updates
muffato Nov 13, 2023
ca96b15
Everything should be increasing the number of CPUs with a log function
muffato Nov 14, 2023
e8e093f
Wrongly placed ifEmpty
muffato Nov 15, 2023
0878456
Comment
muffato Nov 16, 2023
00168f5
Both FastK and MerquryFK need `-P.` to avoid polluting /tmp
muffato Nov 16, 2023
0c13d22
typo
muffato Nov 16, 2023
7063fb4
Increased a few more memory requirements that were too low
muffato Nov 17, 2023
1c561eb
Reduced a few CPU requirements that were unnecessary high
muffato Nov 17, 2023
84ae909
Decreased some time requirements
muffato Nov 17, 2023
b5b11f0
More trace fields to be able to optimise the parameters later
muffato Nov 20, 2023
eea16f5
Take 100M out for the various overheads
muffato Nov 9, 2023
be05858
Make sure the temporary files are created locally rather than in /tmp
muffato Nov 10, 2023
7a64c68
Both FastK and MerquryFK need `-P.` to avoid polluting /tmp
muffato Nov 16, 2023
71803b6
CHANGELOG update
muffato Nov 20, 2023
b94eab5
Fixed the URLs since we don't need to use absolute URLs any more
muffato Nov 20, 2023
56d9e90
Added the CITATION.cff file
muffato Nov 20, 2023
ce79c39
[prettier]
muffato Nov 20, 2023
470a18b
Added more changes
muffato Nov 20, 2023
8ea65bc
Both EditorConfig and Prettier want to lint CITATION.cff differently.…
muffato Nov 20, 2023
a3b139a
Added Guoying to the author list
muffato Nov 20, 2023
35eddb2
Removed duplicated "affilication" line
muffato Nov 20, 2023
25cd843
Introduced the "use_work_dir_as_temp" parameter
muffato Nov 21, 2023
628976e
Added use_work_dir_as_temp to the documentation
muffato Nov 21, 2023
65133a6
Properly list the new parameter section in the schema
muffato Nov 21, 2023
273bdf5
Merge pull request #93 from sanger-tol/hotfix
muffato Nov 21, 2023
ac668ea
#92 got merged in
muffato Nov 23, 2023
a5d21f4
Merge branch 'dev' into resource_optimisation
muffato Nov 23, 2023
48f7738
Merge pull request #92 from sanger-tol/resource_optimisation
muffato Nov 24, 2023
60a2dd5
This shouldn't have been decreased
muffato Nov 24, 2023
6826416
bugfix: Need to use collect() to make it a list
muffato Nov 24, 2023
cceb42a
Merge pull request #96 from sanger-tol/fix_findall
muffato Nov 27, 2023
fd067d6
Need to factor the attempt number in as well
muffato Nov 27, 2023
86d1e9b
Merge pull request #97 from sanger-tol/busco_runtime
muffato Nov 27, 2023
ce03ebc
Easier to maintain memory assignment for BUSCO
muffato Nov 28, 2023
588be11
Updated the FastK containers to allow large genomes
muffato Nov 28, 2023
f04a524
Found a way to have the test config in the same place
muffato Dec 5, 2023
6c20bb7
"sample" is a sample name, not a path
muffato Dec 5, 2023
509e264
os.path.basename is meant for that
muffato Dec 5, 2023
ab946cb
bugfix: the extension has to be removed even if there is no _T1 suffix
muffato Dec 5, 2023
9f95411
Make it clear what we remove and why
muffato Dec 5, 2023
9436c23
Defensive programming: make sure all the variables match
muffato Dec 5, 2023
6ab85f7
Merge pull request #95 from sanger-tol/new_release
muffato Dec 5, 2023
6585615
Merge pull request #99 from sanger-tol/busco_fastk_completeness
muffato Dec 8, 2023
f264f49
Support Hi-C files in BAM format too
muffato Dec 19, 2023
7192736
Updated the changelog
muffato Dec 19, 2023
7aab720
Allow running the pipeline without Hi-C data
priyanka-surana Oct 20, 2023
e08b8e9
Allow using 10x reads to compute the QV/completeness
priyanka-surana Oct 20, 2023
3ab0602
Documentation update
muffato Dec 20, 2023
e6cc6be
Updated the text to indicate that the read data are optional
muffato Dec 20, 2023
2349ff1
Squeezing in a missing citation
muffato Dec 20, 2023
d5f0a56
Merge pull request #102 from sanger-tol/hic_bam
muffato Dec 20, 2023
9549f86
Merge pull request #105 from sanger-tol/10x
priyanka-surana Dec 22, 2023
df9902f
Updated the release date
muffato Jan 2, 2024
41b37d0
Added a release name
muffato Jan 3, 2024
e5164d5
Updated the release date
muffato Jan 4, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 7 additions & 2 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
@@ -1,7 +1,12 @@
name: nf-core CI
# This workflow runs the pipeline with the minimal test dataset to check that it completes without any syntax errors
on:
workflow_dispatch:
push:
branches:
- dev
pull_request:
release:
types: [published]

env:
NXF_ANSI_LOG: false
Expand All @@ -19,7 +24,7 @@ jobs:
strategy:
matrix:
NXF_VER:
- "23.04.0"
- "22.10.1"
- "latest-everything"
steps:
- name: Check out pipeline code
Expand Down
12 changes: 6 additions & 6 deletions .github/workflows/fix-linting.yml
Original file line number Diff line number Diff line change
Expand Up @@ -8,21 +8,21 @@ jobs:
# Only run if comment is on a PR with the main repo, and if it contains the magic keywords
if: >
contains(github.event.comment.html_url, '/pull/') &&
contains(github.event.comment.body, '@nf-core-bot fix linting') &&
contains(github.event.comment.body, '@sanger-tolsoft fix linting') &&
github.repository == 'sanger-tol/genomenote'
runs-on: ubuntu-latest
steps:
# Use the @nf-core-bot token to check out so we can push later
# Use the @sanger-tolsoft token to check out so we can push later
- uses: actions/checkout@v3
with:
token: ${{ secrets.nf_core_bot_auth_token }}
token: ${{ secrets.sangertolsoft_access_token }}

# Action runs on the issue comment, so we don't get the PR by default
# Use the gh cli to check out the PR
- name: Checkout Pull Request
run: gh pr checkout ${{ github.event.issue.number }}
env:
GITHUB_TOKEN: ${{ secrets.nf_core_bot_auth_token }}
GITHUB_TOKEN: ${{ secrets.sangertolsoft_access_token }}

- uses: actions/setup-node@v3

Expand All @@ -46,8 +46,8 @@ jobs:
- name: Commit & push changes
if: steps.prettier_status.outputs.result == 'fail'
run: |
git config user.email "core@nf-co.re"
git config user.name "nf-core-bot"
git config user.email "105875386+sanger-tolsoft@users.noreply.github.com"
git config user.name "sanger-tolsoft"
git config push.default upstream
git add .
git status
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/linting.yml
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ jobs:
run: npm install -g editorconfig-checker

- name: Run ECLint check
run: editorconfig-checker -exclude README.md $(find .* -type f | grep -v '.git\|.py\|.md\|json\|yml\|yaml\|html\|css\|work\|.nextflow\|build\|nf_core.egg-info\|log.txt\|Makefile')
run: editorconfig-checker -exclude README.md $(find .* -type f | grep -v '.git\|.py\|.md\|cff\|json\|yml\|yaml\|html\|css\|work\|.nextflow\|build\|nf_core.egg-info\|log.txt\|Makefile')

Prettier:
runs-on: ubuntu-latest
Expand Down Expand Up @@ -84,7 +84,7 @@ jobs:
- name: Install dependencies
run: |
python -m pip install --upgrade pip
pip install nf-core
pip install nf-core==2.8

- name: Run nf-core lint
env:
Expand Down
29 changes: 29 additions & 0 deletions .github/workflows/sanger_test.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
name: sanger-tol LSF tests

on:
workflow_dispatch:
jobs:
run-tower:
name: Run LSF tests
runs-on: ubuntu-latest
steps:
- name: Launch workflow via tower
uses: seqeralabs/action-tower-launch@v2
with:
workspace_id: ${{ secrets.TOWER_WORKSPACE_ID }}
access_token: ${{ secrets.TOWER_ACCESS_TOKEN }}
compute_env: ${{ secrets.TOWER_COMPUTE_ENV }}
revision: ${{ github.sha }}
workdir: ${{ secrets.TOWER_WORKDIR_PARENT }}/work/${{ github.repository }}/work-${{ github.sha }}
parameters: |
{
"outdir": "${{ secrets.TOWER_WORKDIR_PARENT }}/results/${{ github.repository }}/results-${{ github.sha }}",
}
profiles: test,sanger,singularity,cleanup

- uses: actions/upload-artifact@v3
with:
name: Tower debug log file
path: |
tower_action_*.log
tower_action_*.json
43 changes: 43 additions & 0 deletions .github/workflows/sanger_test_full.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
name: sanger-tol LSF full size tests

on:
push:
branches:
- main
- dev
workflow_dispatch:
jobs:
run-tower:
name: Run LSF full size tests
runs-on: ubuntu-latest
steps:
- name: Sets env vars for push
run: |
echo "REVISION=${{ github.sha }}" >> $GITHUB_ENV
if: github.event_name == 'push'

- name: Sets env vars for workflow_dispatch
run: |
echo "REVISION=${{ github.sha }}" >> $GITHUB_ENV
if: github.event_name == 'workflow_dispatch'

- name: Launch workflow via tower
uses: seqeralabs/action-tower-launch@v2
with:
workspace_id: ${{ secrets.TOWER_WORKSPACE_ID }}
access_token: ${{ secrets.TOWER_ACCESS_TOKEN }}
compute_env: ${{ secrets.TOWER_COMPUTE_ENV }}
revision: ${{ env.REVISION }}
workdir: ${{ secrets.TOWER_WORKDIR_PARENT }}/work/${{ github.repository }}/work-${{ env.REVISION }}
parameters: |
{
"outdir": "${{ secrets.TOWER_WORKDIR_PARENT }}/results/${{ github.repository }}/results-${{ env.REVISION }}",
}
profiles: test_full,sanger,singularity,cleanup

- uses: actions/upload-artifact@v3
with:
name: Tower debug log file
path: |
tower_action_*.log
tower_action_*.json
1 change: 1 addition & 0 deletions .nf-core.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ lint:
- .github/ISSUE_TEMPLATE/bug_report.yml
- lib/NfcoreTemplate.groovy
- .github/CONTRIBUTING.md
- .github/workflows/linting.yml
nextflow_config:
- manifest.name
- manifest.homePage
Expand Down
53 changes: 53 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,59 @@
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [[1.1.0](https://github.com/sanger-tol/genomenote/releases/tag/1.1.0)] - Golden Retriever - [2024-01-04]

### Enhancements & fixes

- The pipeline now queries the NCBI Taxonomy API rather than
[GoaT](https://goat.genomehubs.org/api) to establish the list of lineages on
which to run BUSCO. The possible lineages are now defined [in the pipeline
configuration](assets/mapping_taxids-busco_dataset_name.eukaryota_odb10.2019-12-16.txt)
but can be overridden with the `--lineage_tax_ids` parameter.
- The pipeline will now immediately fail if the assembly can't be retrieve by
the [datasets](https://www.ncbi.nlm.nih.gov/datasets/docs/v2/download-and-install/)
command-line tool.
- Pipeline information is now outputted in `pipeline_info/genomenote/` instead
of `genomenote_info/`.
- `maxRetries` increased to 5 to cope with large datasets.
- BUSCO now runs in "scratch" mode, i.e. off a temporary directory, as the
number of files it creates could otherwise overwhelm a network filesystem.
- `SORT`, `FASTK`, and `MERQURYFK`, can now put their temporary files in the
work directory rather than `/tmp`. Turn that on with the `--use_work_dir_as_temp`
flag.
- The memory requirement of `SORT` is adjusted to account for some overheads
and avoid the job to be killed.
- All resource requirements (memory, time, CPUs) now fit the actual usage. This
is achieved by automatically adjusting to the size of the input whenever
possible.
- Genomes with sequences longer than 2 Gbp are now supported thanks to
upgrading FastK and MerquryFK.
- Fixed a bug that was causing the Completeness to be reported as 0 in the
statistics CSV file, when the k-mer database was constructed from BAM files.
- QV/Completeness can now be computed off 10X sequencing data.
- Minimal version of Nextflow downgraded to 23.04 to 22.10. 22.10 is tested as
part of our continuous integration (CI) pipeline.
- The "test" profile now runs faster, thanks to tuning some Busco/Metaeuk
parameters.
- The "test_full" profile is now tested automatically when updating the `dev`
and `main` branches.
- The pipelines now support Hi-C alignment files in the BAM format.

### Parameters

| Old parameter | New parameter |
| ------------- | ---------------------- |
| | --lineage_tax_ids |
| | --use_work_dir_as_temp |

### Software dependencies

| Dependency | Old version | New version |
| ----------- | ------------------------------------------ | ------------------------------------------ |
| `datasets` | 14.2 | 15.12 |
| `FastK` | `f18a4e6d2207539f7b84461daebc54530a9559b0` | `427104ea91c78c3b8b8b49f1a7d6bbeaa869ba1c` |
| `MerquryFK` | `8ae344092df5dcaf83cfb7f90f662597a9b1fc61` | `d00d98157618f4e8d1a9190026b19b471055b22e` |

## [[1.0.0](https://github.com/sanger-tol/genomenote/releases/tag/1.0.0)] - Czechoslovakian Wolfdog - [2023-05-18]

Initial release of sanger-tol/genomenote, created with the [nf-core](https://nf-co.re/) template.
Expand Down
38 changes: 38 additions & 0 deletions CITATION.cff
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: sanger-tol/genomenote v1.1.0
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- given-names: Cibin
family-names: Sadasivan Baby
affiliation: Wellcome Sanger Institute
orcid: "https://orcid.org/0000-0002-8538-276X"
- given-names: Matthieu
family-names: Muffato
affiliation: Wellcome Sanger Institute
orcid: "https://orcid.org/0000-0002-7860-3560"
- given-names: Guoying
family-names: Qi
affiliation: Wellcome Sanger Institute
orcid: "https://orcid.org/0000-0003-1262-8973"
- given-names: Priyanka
family-names: Surana
affiliation: Wellcome Sanger Institute
orcid: "https://orcid.org/0000-0002-7167-0875"
- given-names: Yates
family-names: Bethan
affiliation: Wellcome Sanger Institute
orcid: "https://orcid.org/0000-0003-1658-1762"
identifiers:
- type: doi
value: 10.5281/zenodo.7949384
repository-code: "https://github.com/sanger-tol/genomenote"
license: MIT
commit: TODO
version: 1.1.0
date-released: "2022-10-07"
4 changes: 4 additions & 0 deletions CITATIONS.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,6 +24,10 @@

> Abdennur, Nezar, and Leonid A Mirny. “Cooler: Scalable Storage for Hi-C Data and Other Genomically Labeled Arrays.” Bioinformatics, vol. 36, no. 1, 2019, pp. 311–316., https://doi.org/10.1093/bioinformatics/btz540.

- [Crumble](https://github.com/jkbonfield/crumble)

> James K Bonfield, Shane A McCarthy, and Richard Durbin. "Crumble: reference free lossy compression of sequence quality values" Bioinformatics, Volume 35, Issue 2, January 2019, Pages 337–339, https://doi.org/10.1093/bioinformatics/bty608

- [FastK](https://github.com/thegenemyers/FASTK)

- [MerquryFK](https://github.com/thegenemyers/MERQURY.FK)
Expand Down
12 changes: 6 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
[![GitHub Actions Linting Status](https://github.com/sanger-tol/genomenote/workflows/nf-core%20linting/badge.svg)](https://github.com/sanger-tol/genomenote/actions?query=workflow%3A%22nf-core+linting%22)
[![Cite with Zenodo](http://img.shields.io/badge/DOI-10.5281/zenodo.7949384-1073c8?labelColor=000000)](https://doi.org/10.5281/zenodo.7949384)

[![Nextflow](https://img.shields.io/badge/nextflow%20DSL2-%E2%89%A523.04.0-23aa62.svg)](https://www.nextflow.io/)
[![Nextflow](https://img.shields.io/badge/nextflow%20DSL2-%E2%89%A522.10.1-23aa62.svg)](https://www.nextflow.io/)
[![run with conda](http://img.shields.io/badge/run%20with-conda-3EB049?labelColor=000000&logo=anaconda)](https://docs.conda.io/en/latest/)
[![run with docker](https://img.shields.io/badge/run%20with-docker-0db7ed?labelColor=000000&logo=docker)](https://www.docker.com/)
[![run with singularity](https://img.shields.io/badge/run%20with-singularity-1d355c.svg?labelColor=000000)](https://sylabs.io/docs/)
Expand All @@ -22,9 +22,9 @@
3. Filter BED ([`GNU sort`](https://www.gnu.org/software/coreutils/manual/html_node/sort-invocation.html), [`filter bed`](https://raw.githubusercontent.com/sanger-tol/genomenote/main/bin/filter_bed.sh))
4. Contact maps ([`Cooler cload`](https://cooler.readthedocs.io/en/latest/cli.html#cooler-cload-pairs), [`Cooler zoomify`](https://cooler.readthedocs.io/en/latest/cli.html#cooler-zoomify), [`Cooler dump`](https://cooler.readthedocs.io/en/latest/cli.html#cooler-dump))
5. Summary statistics ([`NCBI datasets summary genome accession`](https://www.ncbi.nlm.nih.gov/datasets/docs/v2/reference-docs/command-line/datasets/summary/genome/datasets_summary_genome_accession/))
6. Genome completeness ([`GoaT API`](https://goat.genomehubs.org/api-docs/), [`BUSCO`](https://busco.ezlab.org))
6. Genome completeness ([`NCBI API`](https://www.ncbi.nlm.nih.gov/datasets/docs/v1/reference-docs/rest-api/), [`BUSCO`](https://busco.ezlab.org))
7. Consensus quality and k-mer completeness ([`FASTK`](https://github.com/thegenemyers/FASTK), [`MERQURY.FK`](https://github.com/thegenemyers/MERQURY.FK))
8. Collated summary table ([`createtable`](https://raw.githubusercontent.com/sanger-tol/genomenote/main/bin/create_table.py))
8. Collated summary table ([`createtable`](bin/create_table.py))
9. Present results and visualisations ([`MultiQC`](http://multiqc.info/), [`R`](https://www.r-project.org/))

## Usage
Expand All @@ -44,7 +44,7 @@ mMelMel3,hic,/analysis/mMelMel3.2_paternal_haplotype/read_mapping/hic/GCA_922984
mMelMel3,pacbio,/genomic_data/mMelMel3/pacbio/kmer/k31
```

Each row represents an aligned HiC reads file or an unaligned PacBio reads file or a PacBio k-mer database.
Each row represents an aligned HiC reads file, an unaligned PacBio/10X reads file, or a PacBio/10X k-mer database.

Now, you can run the pipeline using:

Expand Down Expand Up @@ -75,13 +75,13 @@ We thank the following people for their assistance in the development of this pi

## Contributions and Support

If you would like to contribute to this pipeline, please see the [contributing guidelines](https://raw.githubusercontent.com/sanger-tol/genomenote/main/.github/CONTRIBUTING.md).
If you would like to contribute to this pipeline, please see the [contributing guidelines](.github/CONTRIBUTING.md).

## Citations

If you use sanger-tol/genomenote for your analysis, please cite it using the following doi: [10.5281/zenodo.7949384](https://doi.org/10.5281/zenodo.7949384)

An extensive list of references for the tools used by the pipeline can be found in the [`CITATIONS.md`](https://raw.githubusercontent.com/sanger-tol/genomenote/main/CITATIONS.md) file.
An extensive list of references for the tools used by the pipeline can be found in the [`CITATIONS.md`](CITATIONS.md) file.

This pipeline uses code and infrastructure developed and maintained by the [nf-core](https://nf-co.re) community, reused here under the [MIT license](https://github.com/nf-core/tools/blob/master/LICENSE).

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,67 @@
422676 aconoidasida
7898 actinopterygii
5338 agaricales
155619 agaricomycetes
33630 alveolata
5794 apicomplexa
6854 arachnida
6656 arthropoda
4890 ascomycota
8782 aves
5204 basidiomycota
68889 boletales
3699 brassicales
134362 capnodiales
33554 carnivora
91561 cetartiodactyla
34395 chaetothyriales
3041 chlorophyta
5796 coccidia
28738 cyprinodontiformes
7147 diptera
147541 dothideomycetes
3193 embryophyta
33392 endopterygota
314146 euarchontoglires
33682 euglenozoa
2759 eukaryota
5042 eurotiales
147545 eurotiomycetes
9347 eutheria
72025 fabales
4751 fungi
314147 glires
1028384 glomerellales
5178 helotiales
7524 hemiptera
7399 hymenoptera
5125 hypocreales
50557 insecta
314145 laurasiatheria
147548 leotiomycetes
7088 lepidoptera
4447 liliopsida
40674 mammalia
33208 metazoa
6029 microsporidia
6447 mollusca
4827 mucorales
1913637 mucoromycota
6231 nematoda
33183 onygenales
9126 passeriformes
5820 plasmodium
92860 pleosporales
38820 poales
5303 polyporales
9443 primates
4891 saccharomycetes
8457 sauropsida
4069 solanales
147550 sordariomycetes
33634 stramenopiles
32523 tetrapoda
155616 tremellomycetes
7742 vertebrata
33090 viridiplantae
71240 eudicots
4 changes: 2 additions & 2 deletions assets/samplesheet.csv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
sample,datatype,datafile
uoEpiScrs1,pacbio,https://tolit.cog.sanger.ac.uk/test-data/Epithemia_sp._CRS-2021b/genomic_data/uoEpiScrs1/pacbio/m64228e_220617_134154.ccs.bc1015_BAK8B_OA--bc1015_BAK8B_OA.rmdup.bam
uoEpiScrs1,pacbio,https://tolit.cog.sanger.ac.uk/test-data/Epithemia_sp._CRS-2021b/genomic_data/uoEpiScrs1/pacbio/m64016e_220621_193126.ccs.bc1008_BAK8A_OA--bc1008_BAK8A_OA.rmdup.bam
uoEpiScrs1,pacbio,https://tolit.cog.sanger.ac.uk/test-data/Epithemia_sp._CRS-2021b/genomic_data/uoEpiScrs1/pacbio/m64228e_220617_134154.ccs.bc1015_BAK8B_OA--bc1015_BAK8B_OA.rmdup.subset.bam
uoEpiScrs1,pacbio,https://tolit.cog.sanger.ac.uk/test-data/Epithemia_sp._CRS-2021b/genomic_data/uoEpiScrs1/pacbio/m64016e_220621_193126.ccs.bc1008_BAK8A_OA--bc1008_BAK8A_OA.rmdup.subset.bam
uoEpiScrs1,hic,https://tolit.cog.sanger.ac.uk/test-data/Epithemia_sp._CRS-2021b/analysis/uoEpiScrs1.1/read_mapping/hic/GCA_946965045.1.unmasked.hic.uoEpiScrs1.subsampled.cram
2 changes: 1 addition & 1 deletion assets/slackreport.json
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
{
"fallback": "Plain-text summary of the attachment.",
"color": "<% if (success) { %>good<% } else { %>danger<%} %>",
"author_name": "sanger-tol/readmapping v${version} - ${runName}",
"author_name": "sanger-tol/genomenote v${version} - ${runName}",
"author_icon": "https://www.nextflow.io/docs/latest/_static/favicon.ico",
"text": "<% if (success) { %>Pipeline completed successfully!<% } else { %>Pipeline completed with errors<% } %>",
"fields": [
Expand Down
Loading