This repository contains the code and trained pipelines from the abstract https://www.jto.org/article/S1556-0864(23)00466-5/fulltext and the submitted paper "Development of an explainable clinical decision support tool for advanced lung cancer patients" (not yet accepted or published).
This repository contains 10 trained pipelines, each including the pre-processing steps and a binary classification model (see pipelines/). Pre-processing consists of one-hot encoding for the categorical features and robust scaling for the continuous features. Class imbalance is handled with the Synthetic Minority Over-sampling Technique for Nominal and Continuous features (SMOTE-NC). All models are soft-voting classifiers (ensembles) with three base estimators: a logistic regression, a random forest classifier and an XGBoost classifier. Feature selection was performed with the minimum Redundancy Maximum Relevance (mRMR) algorithm.
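For illustration only, the sketch below shows how a pipeline with this structure could be assembled with scikit-learn, imbalanced-learn and XGBoost. The column names, hyperparameters and step ordering are placeholders, not the configuration used for the published pipelines; see code/training.ipynb for the actual setup.

```python
# Illustrative sketch of the pipeline structure described above; not the exact
# configuration of the published pipelines. Column names and parameters are placeholders.
from imblearn.pipeline import Pipeline
from imblearn.over_sampling import SMOTENC
from sklearn.calibration import CalibratedClassifierCV
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import OneHotEncoder, RobustScaler
from xgboost import XGBClassifier

categorical_cols = ["sex", "who_status"]   # placeholder categorical features
continuous_cols = ["age", "albumin"]       # placeholder continuous features
feature_cols = categorical_cols + continuous_cols

# One-hot encode the categorical features, robust-scale the continuous ones.
preprocess = ColumnTransformer([
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
    ("num", RobustScaler(), continuous_cols),
])

# Soft-voting ensemble of the three base estimators.
ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("rf", RandomForestClassifier()),
        ("xgb", XGBClassifier(eval_metric="logloss")),
    ],
    voting="soft",
)

pipeline = Pipeline([
    # SMOTE-NC oversamples the minority class during fit only.
    ("smote", SMOTENC(
        categorical_features=[feature_cols.index(c) for c in categorical_cols],
        random_state=0,
    )),
    ("preprocess", preprocess),
    # Probability calibration around the ensemble (the "_calibrated" pipelines).
    ("model", CalibratedClassifierCV(ensemble, method="sigmoid", cv=5)),
])

# pipeline.fit(X_train[feature_cols], y_train)
```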
The following questions are addressed by the 10 pipelines:
- Will the patient die within six weeks after baseline? (no/yes)
  M_6weeks_ENS_model_allfeatures_calibrated.pkl, M_6weeks_ENS_model_selectedfeatures_calibrated.pkl
- Will the patient die between six weeks and three months after baseline? (no/yes)
  M_3months_ENS_model_allfeatures_calibrated.pkl, M_3months_ENS_model_selectedfeatures_calibrated.pkl
- Will the patient die between three months and six months after baseline? (no/yes)
  M_6months_ENS_model_allfeatures_calibrated.pkl, M_6months_ENS_model_selectedfeatures_calibrated.pkl
- Will the patient die between six months and one year after baseline? (no/yes)
  M_1year_ENS_model_allfeatures_calibrated.pkl, M_1year_ENS_model_selectedfeatures_calibrated.pkl
- How will the alopecia of the patient evolve within five to nine weeks after baseline? (no change/negatively)
  alopecia_6weeks_ENS_model_selectedfeatures_calibrated.pkl
- How will the dysphagia of the patient evolve within five to nine weeks after baseline? (no change/negatively)
  dysphagia_6weeks_ENS_model_selectedfeatures_calibrated.pkl
Pre-requisites:
- Python 3.10
- Jupyter Notebook

Install the correct versions of the packages, either with pip:

```
pip install -r requirements.txt
```

or with pipenv:

```
pipenv install
```
The code folder contains four notebooks: feature_selection.ipynb, training.ipynb, explainability.ipynb and inference.ipynb. The first demonstrates how we performed feature selection, the second guides you through training a pipeline with the pre-processing steps and models discussed in the paper, the third produces several Shapley plots, and the fourth runs inference with the pipelines on your dataset(s). The notebooks demonstrate the methods for one cohort only (mortality six weeks after baseline); the method is the same for each research question.
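As a rough illustration of the feature-selection step, the sketch below uses the open-source mrmr_selection package; whether the notebook uses this package, as well as the target column name and the number of selected features K, are assumptions for demonstration only (see code/feature_selection.ipynb for the actual procedure).

```python
# Illustrative mRMR feature-selection sketch; the target column and K are placeholders.
import pandas as pd
from mrmr import mrmr_classif  # from the mrmr_selection package (assumption)

df = pd.read_csv("data/dummy_data.csv")            # expected data format
y = df["mortality_6weeks"]                         # placeholder target name
# For simplicity this sketch ranks numeric columns only; categorical features
# would need encoding first.
X = df.drop(columns=["mortality_6weeks"]).select_dtypes("number")

# Rank features and keep the K most relevant, least redundant ones.
selected_features = mrmr_classif(X=X, y=y, K=10)
print(selected_features)
```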
You can use these notebooks with your own data; training results will then be saved in the results folder. A dummy patient is included in data/dummy_data.csv to show how the data should look. Make sure that you use the correct columns for each model; you can see which columns were used by the trained pipelines with:
```python
# Feature names produced by all pipeline steps except the final classifier.
columns = pipeline[:-1].get_feature_names_out()
print(columns)
```
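A minimal inference sketch follows, assuming the .pkl pipelines can be loaded with joblib (or pickle) and that your CSV contains the columns the chosen pipeline expects; the file paths come from this repository, the loader choice is an assumption (see code/inference.ipynb for the full workflow).

```python
# Minimal inference sketch; see code/inference.ipynb for the full workflow.
import joblib
import pandas as pd

# Load one of the trained pipelines (assumes joblib/pickle-compatible files).
pipeline = joblib.load("pipelines/M_6weeks_ENS_model_selectedfeatures_calibrated.pkl")

# Your data must contain the columns the pipeline was trained on
# (data/dummy_data.csv shows the expected format).
data = pd.read_csv("data/dummy_data.csv")

# Calibrated probability of the positive class (e.g. death within six weeks).
probabilities = pipeline.predict_proba(data)[:, 1]
print(probabilities)
```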
The pipelines folder contains the pipelines (pre-processing + voting classifier) whose results are reported in the paper. If you use these pipelines to run inference on your own data, please let us know your results.
For questions or issues, use the issues section in the repository or contact louise.berteloot@azdelta.be.
Copyright © 2022, RADar-AZDelta.
Released under the GNU General Public License v3.0.