GitHub - PSI-Lab/TF-DNA

Training one model per TF family:

python cross_validate.py

Plots will be saved to report/.

Training one model for all TF families:

python cross_validate_one_model.py

Plots will be saved to report/one_model/.

TODOs:

other low throughput experimental data from the paper as test data?
Nested CV, actual test data
TF DNA-binding site sequence as input, instead of multi-head output
filter visualization
concentration as input to the model, or train as different output?
make sure there is no duplicated sequence across TF families, if there are, merge the data instead of concatenating (duplicating training example)

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
raw_data		raw_data
.gitignore		.gitignore
README.md		README.md
config.py		config.py
cross_validate.py		cross_validate.py
cross_validate_one_model.py		cross_validate_one_model.py
preprocessing.py		preprocessing.py
run.sh		run.sh

Provide feedback