BBCAF ML Pipeline

Author: Victor Roth Cardoso - V.RothCardoso@bham.ac.uk

E-mail me suggestions, comments and issues.

Introduction

This is the repository for the ML scripts used in Winnie Chua, Yanish Purmah, Victor R Cardoso, Georgios V Gkoutos, Samantha P Tull, Georgiana Neculau, Mark R Thomas, Dipak Kotecha, Gregory Y H Lip, Paulus Kirchhof, Larissa Fabritz, Data-driven discovery and validation of circulating blood-based biomarkers associated with prevalent atrial fibrillation, European Heart Journal, Volume 40, Issue 16, 21 April 2019, Pages 1268–1276, https://doi.org/10.1093/eurheartj/ehy815.

There are many packages that were installed with our setup. You won't need all of them and they may be installed as required. The recommended packages are in "install_packages.R". The version of R used is 3.4.1.

Instructions

The script works in 3 main steps:

Import the settings and the pipe script
Change settings as required
Run the pipe

You'll require a function that loads your dataset. This functions must return a list with data. This dataset output (dependent) variable should be in a column named "ResultVariable"!

Example:

dataset <- function() {
    return(list(data=my_df))
}

Follow the example in test/sample.R or test/minimal_sample.R

Output and generated files

Some output files might be generated depending on the settings. These files will be located in:

output/env: saved environment files
output/models: created models
output/run_datasets: the datasets which the model was run
output/img: AUC or other images

Code organization

The remaining of the tool is organized in the following manner:

util: scripts needed by pipe.R
datasets: scripts to load datasets (you may save your own here to your branch)
output: output files (check above)
test: sample scripts to check functionality
tools: other auxiliary scripts

Issues

If there is a column called Bio_Panel it will expect to split the dataset into train ("CVD1" and test "CVD2")

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
output		output
test		test
tools		tools
util		util
.gitignore		.gitignore
CHANGELOG		CHANGELOG
LICENSE		LICENSE
README.md		README.md
install_packages.R		install_packages.R
pipe.R		pipe.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BBCAF ML Pipeline

Introduction

Instructions

Output and generated files

Code organization

Issues

About

Releases

Packages

Languages

License

gkoutos-group/bbcaf_pipeline

Folders and files

Latest commit

History

Repository files navigation

BBCAF ML Pipeline

Introduction

Instructions

Output and generated files

Code organization

Issues

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages