-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
265 metadata validation required fields #271
base: dev
Are you sure you want to change the base?
Conversation
t (s) flags, set valid MM DD dates
…the new yaml file
…adata-validation-required-fields
Ran the following test:
Created the following error message:
|
Ran the following test:
Meta data validation ran without error. It should have failed because organism is a required field. Additional information: contents of
|
It's failing when I test it - but missing geo_loc_name instead of the expected "missing organism". Geo_loc_name is created in the script, after the main validation. So possibly it's "failing to appropriately fail" and then failing later because it doesn't reach the geo_loc_name creation step? I'm working on it. Error: validate_metadata.py --meta_path mpxv_test_metadata_update.xlsx --output_dir . --custom_fields_file /scicomp/home-pure/ick4/01.scripts/tostadas/assets/custom_meta_fields/example_custom_fields.json --validate_custom_fields false --date_format_flag s --config_file /scicomp/home-pure/ick4/02.scratch/submission_config.yaml --biosample_fields_key /scicomp/home-pure/ick4/01.scripts/tostadas/assets/biosample_fields_key.yaml Command exit status: Command output: Metadata Validation Failed Please Consult : ./mpxv_test_metadata_update/errors/full_error.txt for a Detailed List Command error: Metadata Validation Failed Please Consult : ./mpxv_test_metadata_update/errors/full_error.txt for a Detailed List Work dir: Command: |
Ok I fixed the geo_loc_name problem, and now I'm trying to figure out why it returns "files don't exist or have insufficient permissions" errors on the test fastqs under assets. os.path.exists() finds them when I run python interactively so I'm not sure what's up. Bottom line: metadata validation is still failing, and that's the only error listed in the errors txt file. |
This error is almost resolved. The first issue resulted from a change I had previously made - to make metadata validation fail if any one sample fails (as it should). Then I turned to addressing the problem of "file not found" for SRA file paths. This is happening because relative paths are given in the metadata file, but they are relative to projectDir not workDir. The metadata validation is quite convoluted unfortunately, so making the change in location didn't fix the problem. The current error is that the filepaths aren't being resolved everywhere in the script. See DEBUG statements below: Command error: Metadata Validation Failed Please Consult : ./mpxv_test_metadata_update/errors/full_error.txt for a Detailed List Work dir: Tip: you can replicate the issue by changing to the process work dir and entering the command -- Check '.nextflow.log' file for details |
Ready for testing! Everything has been fixed. |
Description
Checklist
Go Through Checklist Below and Place A ✔️ (X Inside the Box) if Completed
General Checks
Have you run appropriate tests (unit/integration/end-to-end) to check logic across run environments (Conda/Docker/Singularity on Scicomp/AWS/NF Tower/Local)?
singularity on scicomp
For each relevant configuration:
Have you conducted proper linting procedures?
[] Have you updated existing documentation (README.md, etc.) or created new ones within docs?
None needed
CDC Checks
Are additional approvals needed for this change? If so, please mention them below:
Are there potential vulnerabilities or licensing issues with any new dependencies introduced? If so, please mention them below: