Skip to content

Commit e6e5d80

Browse files
authored
2021.01 (#3065)
1 parent 30d7f5e commit e6e5d80

File tree

11 files changed

+1913
-32
lines changed

11 files changed

+1913
-32
lines changed

CHANGELOG.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,18 @@
11
# Qiita changelog
22

33

4+
Version 2021.01
5+
---------------
6+
7+
* Moved the qiita repo from biocore to [qiita-spots](https://github.com/qiita-spots/qiita/).
8+
* Created the [Qiita portal for the Cancer Microbiome](https://qiita.ucsd.edu/cancer/).
9+
* The EBI-ENA code now verifies that the sample information file has a description column; this wasn't previously required because it was automatically prefilled by the QIIME 1 mapping file.
10+
* Now it is possible to download the per preparation sample information file and the sample-preparation summary.
11+
* Added a faster metagenomic/metatranscriptomic adaptor and host removal step based on fastp and minimap2. The previous version, using atropos and bowtie2 for QC host filtering, is now deprecated.
12+
* Added qiime2.2020.11 to the system; which updated these plugins: qp-qiime2, qtp-biom, qtp-diversity, qtp-visualization.
13+
* Added [WoL](https://biocore.github.io/wol/) tree for phylogenetic analyses (/projects/wol/release/databases/qiime2/phylogeny.qza) with per-genome WoL artifacts.
14+
* Fixed the following issues: [#3060](https://github.com/qiita-spots/qiita/issues/3060), [#3049](https://github.com/qiita-spots/qiita/issues/3049), and [#2751](https://github.com/qiita-spots/qiita/issues/2751).
15+
416
Version 2020.11
517
---------------
618

README.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
Qiita (canonically pronounced *cheetah*)
22
========================================
33

4-
|Build Status| |Coverage Status| |Gitter|
4+
|Build Status| |Coverage Status|
55

66
Advances in sequencing, proteomics, transcriptomics and metabolomics are giving
77
us new insights into the microbial world and dramatically improving our ability

logos/qiita_cancer.ai

Lines changed: 1865 additions & 0 deletions
Large diffs are not rendered by default.

qiita_core/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@
66
# The full license is in the file LICENSE, distributed with this software.
77
# -----------------------------------------------------------------------------
88

9-
__version__ = "2020.11"
9+
__version__ = "2021.01"

qiita_db/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@
2727
from . import user
2828
from . import processing_job
2929

30-
__version__ = "2020.11"
30+
__version__ = "2021.01"
3131

3232
__all__ = ["analysis", "artifact", "archive", "base", "commands",
3333
"environment_manager", "exceptions", "investigation", "logger",

qiita_pet/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@
66
# The full license is in the file LICENSE, distributed with this software.
77
# -----------------------------------------------------------------------------
88

9-
__version__ = "2020.11"
9+
__version__ = "2021.01"

qiita_pet/handlers/api_proxy/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@
3838
from .user import (user_jobs_get_req)
3939
from .util import check_access, check_fp
4040

41-
__version__ = "2020.11"
41+
__version__ = "2021.01"
4242

4343
__all__ = ['prep_template_summary_get_req', 'data_types_get_req',
4444
'study_get_req', 'sample_template_filepaths_get_req',

qiita_pet/support_files/doc/source/processingdata/processing-recommendations.rst

Lines changed: 28 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -25,9 +25,11 @@ gene data: sequence clustering and sequence deblur.
2525
Sequencing deblur (preferred)
2626
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2727

28-
For this we use `deblur <https://github.com/biocore/deblur>`_. Here 2 BIOM tables are generated by default: fina.biom and final.only-16s.biom. The former is the full biom table, which can be used with any target gene and wetlab work;
29-
the latter is the trimmed version to those sequences that match Greengenes at 80% similarity, a really basic and naive filtering. Each of those BIOM tables, is accompanied by a FASTA that contains
30-
the representative sequences. The OTU IDs are given by the unique sequence.
28+
For this we use `deblur <https://github.com/biocore/deblur>`_. Here 2 BIOM tables are generated by default:
29+
`deblur final table` and `deblur reference hit table`. The former is the full biom table, which can be used with any
30+
target gene and wetlab work; the latter is the trimmed version to those sequences that match Greengenes at 80% similarity, a
31+
really basic and naive filtering. Each of those BIOM tables, is accompanied by a FASTA that contains the representative sequences.
32+
The OTU IDs are given by the unique sequence.
3133

3234
Note that deblur needs all sequences to be trimmed at the same length, thus the recommended pipeline is to trim everything at 150bp and the deblur.
3335

@@ -49,25 +51,28 @@ Below you will find more information about each of these options.
4951

5052
The current workflow is as follows:
5153

52-
#. Removal of adapter sequence and quality control: `Atropos <https://github.com/jdidion/atropos/>`_
53-
#. Removal of host contamination using `Bowtie2 <http://bowtie-bio.sourceforge.net/bowtie2/index.shtml>`_
54+
#. A single step per sample adapter removal (via `fastp <https://academic.oup.com/bioinformatics/article/34/17/i884/5093234>`_) and host filtering (via `minimap2 <https://academic.oup.com/bioinformatics/article/34/18/3094/4994778>`_); more information below.
5455
#. Taxonomy profiling using bowtie2 as an aligner and two different reference databases; see sections below
5556

5657
Note that we recommend only uploading sequences that have already been through QC and human sequence removal. However, we
57-
recommend that all sequence files go through adapter and quality control within the system to ensure they are ready for
58-
subsequent analyses. Currently, the command removes adaptor sequences (only KAPA HyperPlus with iTru, which are compatible
59-
with Illumina TruSeq).
60-
61-
Sequences generated with an instrument that relies on two-color chemistry (NextSeq, NovaSeq), need to undergo an additional
62-
quality control step. This step removes trailing G nucleotides which signify that the instrument has finished capturing new
63-
information. Per Illumina's specification, NovaSeq instruments have 3 quality levels (11, 25 and 37), and
64-
high-quality trailing Gs need to be removed. Typically this can be done in conjunction with adapter removal, with Atropos
65-
we recommend using the `--nextseq-trim 30` parameter.
66-
67-
For host removal we currently support *Danio Rerio* (zebrafish), *Drosophila Melanogaster* (fruit fly), *Mus Musculus* (mouse),
68-
*Rattus Norvegicus* (rat), and Enterobacteria phage phiX174 (the Illumina spike-in control).
58+
recommend that all sequence files go through adapter and host filtering within the system to ensure they are ready for
59+
subsequent meta-analyses. Currently, the `fastp` command is set to autodetect adaptors so this command is available for all different
60+
wetlab processing and we provide the following host references for your convenience:
61+
62+
- auto-detect adapters and artifacts + phix filtering: This is a `deblur artifacts <https://github.com/biocore/deblur/blob/master/deblur/support_files/artifacts.fa>`_ reference, mainly for debugging and testing
63+
- auto-detect adapters and cheetah + phix filtering
64+
- auto-detect adapters and cow + phix filtering
65+
- auto-detect adapters and hamster + phix filtering
66+
- auto-detect adapters and horse + phix filtering
67+
- auto-detect adapters and merge_genomes + phix filtering : is the combined genomes of a cheetah, cow, hamster, horse, human, mouse, pig, rabbit, and rat
68+
- auto-detect adapters and mouse + phix filtering
69+
- auto-detect adapters and pig + phix filtering
70+
- auto-detect adapters and rabbit + phix filtering
71+
- auto-detect adapters and rat + phix filtering
72+
- auto-detect adapters only filtering [not recommended]
6973

7074
Note that the command produces up to 6 output artifacts based on the aligner and database selected:
75+
7176
- Alignment Profile: contains the raw alignment file and the no rank classification BIOM table
7277
- Taxonomic Prediction - phylum: contains the phylum level taxonomic predictions BIOM table
7378
- Taxonomic Prediction - genus: contains the genus level taxonomic predictions BIOM table
@@ -186,19 +191,18 @@ Note that some of these are legacy option but not available for new processing.
186191
Metatranscriptome processing
187192
----------------------------
188193

194+
Qiita currently has one active Metatranscriptome data analysis pipeline, as follows:
195+
196+
#. Ribosomal read filtering via `SortMeRNA <https://pubmed.ncbi.nlm.nih.gov/23071270/>`_; details below. This produces a `Ribosomal reads` and a `Non-ribosomal reads` artifact/
197+
#. Taxonomic profiling via Woltka; for more information see details above.
198+
189199
Sample processing guidelines for metatranscriptomic data
190200
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
191201

192-
Total community RNA extracted from samples contain both coding and non-coding RNA. Typically, ribosomal RNA make up
193-
>90% of the library if not depleted prior to library construction. Ribosomal depletion allows for mRNA enrichment. Even if
194-
you are dealing with ribosomal RNA subtracted cDNA libraries, there will be some
195-
residual ribosomal RNA in the libraries that you want to remove/separate from the non ribosomal RNA sequences.
196-
197202
Ribosomal read filtering
198203
^^^^^^^^^^^^^^^^^^^^^^^^
199204

200-
`SortMeRNA <https://bioinfo.lifl.fr/RNA/sortmerna/>`_
201-
is used for removal of ribosomal reads from quality filtered Metatranscriptome data
205+
`SortMeRNA <https://pubmed.ncbi.nlm.nih.gov/23071270/>`_ is used for removal of ribosomal reads from quality filtered Metatranscriptome data
202206

203207
Latest SortMeRNA version: v2.1
204208

qiita_ware/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@
66
# The full license is in the file LICENSE, distributed with this software.
77
# -----------------------------------------------------------------------------
88

9-
__version__ = "2020.11"
9+
__version__ = "2021.01"

scripts/qiita-auto-processing

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ full_pipelines = [
4646
'steps': [
4747
{'previous-step': None,
4848
'plugin': 'qp-meta',
49-
'version': '2020.11',
49+
'version': '2021.01',
5050
'cmd_name': 'Atropos v1.1.24',
5151
'input_name': 'input',
5252
'ignore_parameters': ['Number of threads used'],

setup.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
from setuptools import setup
1111
from glob import glob
1212

13-
__version__ = "2020.11"
13+
__version__ = "2021.01"
1414

1515

1616
classes = """

0 commit comments

Comments
 (0)