-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A question about #75
Comments
Primary_site is under Donor in the sample redcap template, but in the v3 template is under primary_diagnoses. |
Hi Ray! The MoHCCN recently transitioned to the v3 model and is now available on their website: https://www.marathonofhopecancercentres.ca/researcher-hub/policies-and-guidelines As you have noticed, this included a few major changes such as the addition of systemic therapies, removal of chemotherapy/immunotherapy/hormone therapy objects, and moving primary site to primary diagnosis. The sample redcap template is from a v2 model export, we are yet to do a v3 model export at this stage. Sorry for the confusion there. If you are running the latest develop stack, you would need to have a clinical ingest json that is valid against the v3 data model schema. Was the redcap data you are working with curated to the v2 or v3 version of the data model? Happy to take a look at your csv template and manifest file to see if I can spot anything that might be causing issues. We are planning on making a stable release including these latest data model updates in about a month or so. There are also a few more minor changes coming in data model v3.1, then we are hoping the data model stays stable for a while... |
Hi Marion, First of all, thank you so much for getting back to me. I appreciate it.
We are running CanDIG v4.1.0.
I took the template from
I will attach the template and manifest file to this thread. I appreciate it for your help.
Thank you and the team for all these work. |
Here is the code of
|
Hi Ray, thanks for sharing the files. If you want to ingest the data into the stack running v4.1.0, the data will need to be compatible with data model v2. So the schema in the manifest will need to be the one on the stable branch of katsu. Can you try adjusting your manifest to:
Let me know if this works! |
Hi Marion, I will try that out and report back. Thank you very much, |
Hi Ray,
When we worked with a redcap export, we needed to do some preprocessing of the redcap csv to split it up into the different csvs that correspond to the various schemas before running it through clinical_etl. Perhaps this is a missing step for your data currently? |
Thank you for clearing them up for me, Marion.
This is what I missed. I thought I could just run the redcap sample mappings against the Do you happen to have those preprocessing split csvs files from that
At this moment, I just have one single exported csv file. As you mentioned, I need to split it up into different csv files to correspond to the various schemas.
Got it.
Here I was trying to run the sample. I don't have those |
Hi Ray, Ok! I think I understand. I can work on sharing the python script that will split the file into csvs in the same folder. These files are a bit out of date since the redcap export format we were working with changed. I am not sure whether or not it will be relevant for you when you export from your own redcap database. For now, are you just trying to see how things work using this as an example or are you trying to use the same methods to convert your own real data? Does your own data follow a similar format to the We provided these files as something that worked for us previously but I am not sure how much they need to be customised for different redcap databases and what options there are when exporting out of redcap that would affect how they run so would be great to understand your experience so far. |
Hi Marion, The REDCap data file I have is very similar to the All the best, |
Hi Ray, I have made a PR that adds the splitting script. It will be on the develop branch when it gets approved and merged but in the meantime you can also grab the script from here: Hope it works for your export! |
This is great. Thank you so much Marion. Big help! |
Hi folks,
In the
REDCap
sample inputs folder:sample_inputs/redcap_example/manifest.yml
, the schema line is (it does not designate the schema_class):schema: https://raw.githubusercontent.com/CanDIG/katsu/develop/chord_metadata_service/mohpackets/docs/schema.yml
In the generic folder,
manifest.yml
has:In the ETL_code root, there are moh_v3_template.csv and moh_v2_template.csv, generated using different schemas.
I am working on a
REDCap data
mapping with the most recent CanDIG and ETL_code ofdevelop
branch, which schema should I use? I am also using the template from theredcap
folder, what do I need to pay attention to?Thanks a lot,
Ray
The text was updated successfully, but these errors were encountered: