Quality control for the target decoy approach for peptide identification

The compagnon website of this repo can be found on https://statomics.github.io/TargetDecoyPaper/

Abstract

Reliable peptide identification is key in mass spectrometry (MS) based proteomics. To this end, the target-decoy approach (TDA) has become the cornerstone for extracting a set of reliable peptide-to-spectrum matches (PSMs) that will be used in downstream analysis. Indeed, TDA is now the default method to estimate the false discovery rate (FDR) for a given set of PSMs, and users typically view it as a universal solution for assessing the FDR in the peptide identification step. However, the TDA also relies on a minimal set of assumptions, which are typically never verified in practice. We argue that a violation of these assumptions can lead to poor FDR control, which can be detrimental to any downstream data analysis. We here therefore first clearly spell out these TDA assumptions, and introduce TargetDecoy, a Bioconductor package with all the necessary functionality to control the TDA quality and its underlying assumptions for a given set of PSMs.

Availability of data

The datasets required to reproduce all results that are displayed in this publication (including supplementary materials) are available at Zenodo. This includes both the raw data and intermediate results. Note that at the top of each analysis script it is indicated which dataset is required as input for the script; it may thus not be necessary to download all datasets from Zenodo. For easily reproducing our analyses, place the downloaded data in the Data folder of your local clone of this repository.

Analyses & Scripts

To reproduce the results that are displayed in this publication, proceed as follows:

Make a local clone of this Github repository
Open the selected R scripts (.Rmd) in the root of this repository.
Run the analyses - the figures will automatically be stored in the figs folder .

License

The website and code are licensed under CC BY-SA 4.0

The main paper is licensed under CC-BY-NC-ND 4.0

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github		.github
figs		figs
renv		renv
.Rprofile		.Rprofile
.gitignore		.gitignore
README.md		README.md
_site.yml		_site.yml
authors.tex		authors.tex
biblio.bib		biblio.bib
db-gpm-crap-entries.txt		db-gpm-crap-entries.txt
index.Rmd		index.Rmd
myTemp.tex		myTemp.tex
paper_figures.Rmd		paper_figures.Rmd
paper_main.pdf		paper_main.pdf
paper_supplementary.Rmd		paper_supplementary.Rmd
renv.lock		renv.lock
supplementaryBackMatter.tex		supplementaryBackMatter.tex

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quality control for the target decoy approach for peptide identification

Abstract

Availability of data

Analyses & Scripts

License

About

Releases

Packages

Languages

statOmics/TargetDecoyPaper

Folders and files

Latest commit

History

Repository files navigation

Quality control for the target decoy approach for peptide identification

Abstract

Availability of data

Analyses & Scripts

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages