chrombpnet-workflow

This repo contains a Krews workflow for running ChromBPNet on ATAC-seq and DNase-seq datasets. It currently supports human data aligned to the hg38 genome. It consists of the following steps:

Training: trains a bias and ChromBPNet model given filtered BAMs
Prediction: predicts bias-corrected signal profiles at input regions
Shap: computes base-resolution importance scores for predicting profiles and counts

The Shap step consists of three substeps. It first splits input regions into equally-sized smaller files for performance, computes shap scores on those, then merges the output.

The Training step can be skipped if a pre-trained model is available.

Input is a list of BAM files and a BED file containing regions on which to run predictions and shap score analysis. Output is a bias model, a ChromBPNet model, a predicted profile bigWig, a counts importance score bigWig, and a profile importance score bigWig. The bigWigs will only contain values in kilobase-sized windows around the input regions.

See the sample-configs directory for example configurations. To run, do scripts/run-workflow.sh --on google --config /path/to/config.conf.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
gradle/wrapper		gradle/wrapper
sample-configs		sample-configs
scripts		scripts
src/main/kotlin		src/main/kotlin
tasks		tasks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

chrombpnet-workflow

About

Releases

Packages

Languages

License

weng-lab/chrombpnet-workflow

Folders and files

Latest commit

History

Repository files navigation

chrombpnet-workflow

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages