Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add RedCap export splitting script #78

Merged
merged 2 commits into from
Sep 5, 2024

Conversation

mshadbolt
Copy link
Contributor

@mshadbolt mshadbolt commented Sep 5, 2024

  • Adds a python script that takes a redcap export and splits it into individual csv files.
  • Minor refactoring so naming is clearer

Helps fix #75

@mshadbolt mshadbolt requested a review from daisieh September 5, 2024 03:29
@mshadbolt mshadbolt marked this pull request as ready for review September 5, 2024 03:31
@mshadbolt mshadbolt mentioned this pull request Sep 5, 2024
Copy link
Member

@daisieh daisieh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@mshadbolt mshadbolt merged commit 94434d2 into develop Sep 5, 2024
2 checks passed
@mshadbolt mshadbolt deleted the mshadbolt/add-redcap-csv-split-script branch September 5, 2024 19:46
daisieh pushed a commit that referenced this pull request Sep 25, 2024
* add splitting script

* minor refactoring
mshadbolt added a commit that referenced this pull request Sep 30, 2024
* adding date format to date intervals

* addded date format

* updated docs

* updated test date formats

* make exceptions more readable

* make exception specific

* cleanup

* cleanup

* Update src/clinical_etl/CSVConvert.py

Co-authored-by: Marion <mshadbolt@users.noreply.github.com>

* exception when parser fails

* update reference date docs

* improved error catching in manifest

* better error msg

* fix month intervals

* remove import

* minor fixes

* add sample redcap files (#67)

* add sample redcap files

* restructure suggestion

* mv sample files

* update sample inputs links and info

* Update README.md

---------

Co-authored-by: Marion <mshadbolt@users.noreply.github.com>

* create mohschemav3 class

* validate v3 donors

* validate primary diagnoses

* validate specimens

* validation logic fix

* validate treatments

* treatment validation fix

* validate systemic & radiation therapies

* validate followups

* replace match with if, less indentation

* validate exposures & comorbidities

* rename v3 to lowercase

* sample registration validation (required)

* validate surgeries

* added missing argument

* fix nested schemas

* update references to v3

* fix test

* fix test, using v2 by default

* update default templates

* update test data files

* manual mapping function changes to csv templates

* update test yamls

* update tests

* add biomarker validation, edit resolution to work

* update templates

* fix typo

* fix missing comma

* remove extra staging validation

* rename v2 template

* added multisheet line for testing

* stage group validation fix

* validate systemic therapy dates

* test systemic therapy date validation

* add sample redcap files (#67)

* add sample redcap files

* restructure suggestion

* mv sample files

* update sample inputs links and info

* Update README.md

---------

Co-authored-by: Marion <mshadbolt@users.noreply.github.com>

* Fix up some of the validations (#69)

* fix for loop

* fix keyerror

* fix if elses

* another fix

* update tests

* revert elif

* update to version 3 (#70)

* Add RedCap export splitting script (#78)

* add splitting script

* minor refactoring

* Update validation and test data for model 3.1 (#80)

* fix deceased validation

* change bools to single_vals

* fix more cause_of_death validations

* update test data, fix validations, update template/manifest

* update schema urls

* more schema changes in v3 templates

* DIG-1772 & DIG-1782: Handle -99 and 'Not available' as missing (#83)

* numeric not availabe and schema template update

* update default url

* improve template generation, add not_available to missingness

* update v3 template

* methods for -99 mappng

* update test template and test data

* update script and moh_diffs

* fix typo in validation

* updated mapping functions doc

* updates for latest version

---------

Co-authored-by: Javier Castillo-Arnemann <javiercastilloar@gmail.com>
Co-authored-by: Javi <30913679+yavyx@users.noreply.github.com>
Co-authored-by: Karen Cranston <karen.cranston@gmail.com>
Co-authored-by: Daisie Huang <daisieh@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

A question about
2 participants