CHANGELOG.md: 12 additions & 0 deletions
@@ -1,6 +1,18 @@
1
1
# Qiita changelog
2
2
3
3
4
+
Version 2021.01
5
+
---------------
6
+
7
+
* Moved the qiita repo from biocore to [qiita-spots](https://github.com/qiita-spots/qiita/).
8
+
* Created the [Qiita portal for the Cancer Microbiome](https://qiita.ucsd.edu/cancer/).
9
+
* The EBI-ENA code now verifies that the sample information file has a description column; this wasn't previously required because it was automatically prefilled by the QIIME 1 mapping file.
10
+
* It is now possible to download the per-preparation sample information file and the sample-preparation summary.
11
+
* Added a faster metagenomic/metatranscriptomic adaptor and host removal step based on fastp and minimap2. The previous version, using atropos and bowtie2 for QC host filtering, is now deprecated.
12
+
* Added qiime2.2020.11 to the system, which updated these plugins: qp-qiime2, qtp-biom, qtp-diversity, qtp-visualization.
13
+
* Added [WoL](https://biocore.github.io/wol/) tree for phylogenetic analyses (/projects/wol/release/databases/qiime2/phylogeny.qza) with per-genome WoL artifacts.
14
+
* Fixed the following issues: [#3060](https://github.com/qiita-spots/qiita/issues/3060), [#3049](https://github.com/qiita-spots/qiita/issues/3049), and [#2751](https://github.com/qiita-spots/qiita/issues/2751).
For this we use `deblur <https://github.com/biocore/deblur>`_. Here 2 BIOM tables are generated by default: final.biom and final.only-16s.biom. The former is the full biom table, which can be used with any target gene and wetlab work;
29
-
the latter is the trimmed version to those sequences that match Greengenes at 80% similarity, a really basic and naive filtering. Each of those BIOM tables, is accompanied by a FASTA that contains
30
-
the representative sequences. The OTU IDs are given by the unique sequence.
28
+
For this we use `deblur <https://github.com/biocore/deblur>`_. Here 2 BIOM tables are generated by default:
29
+
`deblur final table` and `deblur reference hit table`. The former is the full biom table, which can be used with any
30
+
target gene and wetlab work; the latter is trimmed to those sequences that match Greengenes at 80% similarity, a
31
+
really basic and naive filtering. Each of those BIOM tables is accompanied by a FASTA file that contains the representative sequences.
32
+
The OTU IDs are given by the unique sequence.
31
33
32
34
Note that deblur needs all sequences to be trimmed to the same length, so the recommended pipeline is to trim everything to 150 bp and then run deblur.
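To illustrate the equal-length requirement, here is a minimal Python sketch. It is not Qiita's or deblur's actual code: the function name `trim_to_length` is hypothetical, and dropping reads shorter than the target length is an assumption made for the example.

```python
from typing import Iterable, List, Tuple


def trim_to_length(
    reads: Iterable[Tuple[str, str]], length: int = 150
) -> List[Tuple[str, str]]:
    """Cut every read to `length` bases.

    Assumption (not from the Qiita docs): reads shorter than `length`
    are dropped, so every returned sequence has exactly `length` bases.
    """
    trimmed = []
    for read_id, seq in reads:
        if len(seq) < length:
            # Too short to be trimmed to the common length; skip it.
            continue
        trimmed.append((read_id, seq[:length]))
    return trimmed


# Toy reads of 160, 150 and 120 bases (hypothetical data).
example_reads = [("r1", "A" * 160), ("r2", "C" * 150), ("r3", "G" * 120)]
result = trim_to_length(example_reads)
```

In this toy input, `r3` is discarded and the two remaining reads are both 150 bases long, which is the uniform-length input the note above describes.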
33
35
@@ -49,25 +51,28 @@ Below you will find more information about each of these options.
49
51
50
52
The current workflow is as follows:
51
53
52
-
#. Removal of adapter sequence and quality control: `Atropos <https://github.com/jdidion/atropos/>`_
53
-
#. Removal of host contamination using `Bowtie2 <http://bowtie-bio.sourceforge.net/bowtie2/index.shtml>`_
54
+
#. A single per-sample step that performs adapter removal (via `fastp <https://academic.oup.com/bioinformatics/article/34/17/i884/5093234>`_) and host filtering (via `minimap2 <https://academic.oup.com/bioinformatics/article/34/18/3094/4994778>`_); more information below.
54
55
#. Taxonomy profiling using bowtie2 as an aligner and two different reference databases; see sections below
55
56
56
57
Note that we recommend only uploading sequences that have already been through QC and human sequence removal. However, we
57
-
recommend that all sequence files go through adapter and quality control within the system to ensure they are ready for
58
-
subsequent analyses. Currently, the command removes adaptor sequences (only KAPA HyperPlus with iTru, which are compatible
59
-
with Illumina TruSeq).
60
-
61
-
Sequences generated with an instrument that relies on two-color chemistry (NextSeq, NovaSeq), need to undergo an additional
62
-
quality control step. This step removes trailing G nucleotides which signify that the instrument has finished capturing new
63
-
information. Per Illumina's specification, NovaSeq instruments have 3 quality levels (11, 25 and 37), and
64
-
high-quality trailing Gs need to be removed. Typically this can be done in conjunction with adapter removal, with Atropos
65
-
we recommend using the `--nextseq-trim 30` parameter.
66
-
67
-
For host removal we currently support *Danio Rerio* (zebrafish), *Drosophila Melanogaster* (fruit fly), *Mus Musculus* (mouse),
68
-
*Rattus Norvegicus* (rat), and Enterobacteria phage phiX174 (the Illumina spike-in control).
58
+
recommend that all sequence files go through adapter and host filtering within the system to ensure they are ready for
59
+
subsequent meta-analyses. Currently, the `fastp` command is set to auto-detect adapters, so it can be used with any
60
+
wetlab processing protocol. We provide the following host references for your convenience:
61
+
62
+
- auto-detect adapters and artifacts + phix filtering: This is a `deblur artifacts <https://github.com/biocore/deblur/blob/master/deblur/support_files/artifacts.fa>`_ reference, mainly for debugging and testing
63
+
- auto-detect adapters and cheetah + phix filtering
64
+
- auto-detect adapters and cow + phix filtering
65
+
- auto-detect adapters and hamster + phix filtering
66
+
- auto-detect adapters and horse + phix filtering
67
+
- auto-detect adapters and merge_genomes + phix filtering: the combined genomes of cheetah, cow, hamster, horse, human, mouse, pig, rabbit, and rat
68
+
- auto-detect adapters and mouse + phix filtering
69
+
- auto-detect adapters and pig + phix filtering
70
+
- auto-detect adapters and rabbit + phix filtering
71
+
- auto-detect adapters and rat + phix filtering
72
+
- auto-detect adapters only filtering [not recommended]
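The options above combine adapter removal and host filtering in one step. The Python sketch below shows how such a step could be chained for single-end reads; it is an illustration, not the command Qiita actually runs. The function name `build_qc_pipeline`, the index file `cow_phix.mmi`, the file names, and the use of `samtools fastq -f 4` to keep unmapped (non-host) reads are all assumptions. The code only builds the command line and does not execute anything.

```python
import shlex
from typing import List, Tuple


def build_qc_pipeline(
    fastq_in: str, host_index: str, fastq_out: str, threads: int = 4
) -> Tuple[List[List[str]], str]:
    """Build (but do not run) a fastp -> minimap2 -> samtools pipeline.

    Hypothetical sketch: the exact parameters used by Qiita are not
    given in the text above.
    """
    commands = [
        # fastp auto-detects adapters for single-end input and streams
        # the reads that pass filtering to stdout.
        ["fastp", "-i", fastq_in, "--stdout", "-w", str(threads)],
        # minimap2 aligns the short reads from stdin ("-") against the
        # host (+ phiX) reference and emits SAM.
        ["minimap2", "-a", "-x", "sr", "-t", str(threads), host_index, "-"],
        # Keep only unmapped reads (SAM flag 4), i.e. non-host reads.
        ["samtools", "fastq", "-f", "4", "-"],
    ]
    pipeline = " | ".join(shlex.join(cmd) for cmd in commands)
    pipeline += " > " + shlex.quote(fastq_out)
    return commands, pipeline


commands, pipeline = build_qc_pipeline(
    "sample.fastq.gz", "cow_phix.mmi", "sample.filtered.fastq"
)
print(pipeline)
```

Returning both the argument lists and the joined string keeps the sketch easy to inspect before handing it to a shell; a real deployment would also need paired-end handling and error checking.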
69
73
70
74
Note that the command produces up to 6 output artifacts based on the aligner and database selected:
75
+
71
76
- Alignment Profile: contains the raw alignment file and the no rank classification BIOM table
- Taxonomic Prediction - genus: contains the genus level taxonomic predictions BIOM table
@@ -186,19 +191,18 @@ Note that some of these are legacy option but not available for new processing.
186
191
Metatranscriptome processing
187
192
----------------------------
188
193
194
+
Qiita currently has one active Metatranscriptome data analysis pipeline, as follows:
195
+
196
+
#. Ribosomal read filtering via `SortMeRNA <https://pubmed.ncbi.nlm.nih.gov/23071270/>`_; details below. This produces a `Ribosomal reads` and a `Non-ribosomal reads` artifact.
197
+
#. Taxonomic profiling via Woltka; for more information see details above.
198
+
189
199
Sample processing guidelines for metatranscriptomic data