-
Notifications
You must be signed in to change notification settings - Fork 7
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(docs): transfer ambiguous symbols explanation #551
- Loading branch information
1 parent
5cea36d
commit 066f5b6
Showing
10 changed files
with
161 additions
and
142 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
5 changes: 5 additions & 0 deletions
5
lapis2-docs/src/components/MutationFilters/MaybeAminoAcidMutationExample.astro
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
--- | ||
import AminoAcidMutationExample from './AminoAcidMutationExample.astro'; | ||
--- | ||
|
||
<code>MAYBE(<AminoAcidMutationExample />)</code> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
53 changes: 53 additions & 0 deletions
53
lapis2-docs/src/content/docs/concepts/ambiguous-symbols.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,53 @@ | ||
--- | ||
title: Ambiguous symbols | ||
description: Explanation how ambiguous reads are handled in the data | ||
--- | ||
|
||
The underlying sequence files in `.FASTA` format can contain any of the following symbols: | ||
|
||
| Symbol | Meaning | | ||
| ------ | ----------------- | | ||
| A | Adenine | | ||
| C | Cytosine | | ||
| G | Guanine | | ||
| T | Thymine | | ||
| - | Deletion | | ||
| N | failed read / any | | ||
| R | A or G | | ||
| Y | C or T | | ||
| S | C or G | | ||
| W | A or T | | ||
| K | G or T | | ||
| M | A or C | | ||
| B | not A | | ||
| D | not C | | ||
| H | not G | | ||
| V | not T | | ||
|
||
The ambiguous symbols arise from imperfect reads in the sequencer. | ||
|
||
While one mostly queries for the symbols `A`, `C`, `G`, `T` and `-` to look for specific features and mutations of a sequence, | ||
or `N` for quality control of the underlying data, | ||
the ambiguous symbols `R` through `V` are often too cumbersome to consider in analyses. | ||
|
||
LAPIS supports the flexible consideration of these ambiguous symbols | ||
through an extension of the boolean logic syntax in the variant queries. | ||
|
||
Here we introduce a new expression `MAYBE` to consider sequences that have an ambiguous code which **maybe** matches the queried value. | ||
|
||
#### Example | ||
|
||
Consider the following sequences: | ||
|
||
``` | ||
12345 | ||
AAACG | ||
AARCG | ||
AANCG | ||
AAGCG | ||
AAACG | ||
``` | ||
|
||
A filter for the mutation `3G` returns only the sequence `AAGCG`, as it is the only sequence with the symbol `G` at position 3. | ||
The filter `MAYBE(3G)` however also considers that the sequences `AARCG` and `AANCG` **may** have the symbol `G` at position 3, | ||
because the symbols `R` and `N` can represent Guanine. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.