-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Somatic cnv checking #426
Conversation
fix: added vcf indexation & md5 checksums feat: remove possibility of running script without CNV bed file (should be done in Snakefile/workflow) refactor: make black happy
…esence of multiple tumors
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See comments
excluded_regions: "" # Bed file of regions to be excluded | ||
max_depth: 10000 # Max depth for pileups | ||
min_cov: 20 # Minimum depth for reference and alternative alleles to consider variant | ||
min_baf: 0.4 # Maximum BAF to consider variant as heterozygous (between 0 & 1/2) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
variable name and comment not consistent. Please adjust.
y <- read.table(cnv, sep="\t", header=0) | ||
colnames(y) <- c("CHROM", "start", "stop", "name", "LFC", "strand") | ||
y <- y |> | ||
dplyr::mutate(LFC=replace(LFC, strand == "-", LFC[strand=="-"])) |> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is not changing anything, Add "-" in front of replacement.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
At the moment, implementation of steps
somatic_wgs_cnv_calling
&somatic_targeted_seq_cnv_calling
are based on coverage depth only. From that coverage, the log ratio of tumor to normal samples depth is computed, and after segmentation, the expected number of alleles in the tumor is computed.To check quality of the log fold change of depths and the allele number call, the
somatic_cnv_checking
step is organized in the following way:Remains to be done: