-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow control over index sample designation #434
Comments
I also though of an alternative solution that might work without having to change biomedsheets & cubi-tk: |
Let's implement it and call it "AI-driven sample sheet processing". |
okay I looked through the code some more and I think snappy does not do index selection it all, it fully relies on biomedsheets for this. The biomedsheet code controlling Pedigree defintions seems to located in biomedsheets/shortcuts/germline.py, which is then called by the io functions snappy uses the read the tsv-sheets. Do we need to move this ticket to biomedsheets? To solve this we probably either need to give biodmedsheet (io?) a new option to set the index based on on the 'custom pedigree field' or make this way of selecting index the default is that field is used anyway. |
Do we have a Index column in Biomedsheet? Seems to be the only sane option, otherwise just send everything to ChatGPT |
Is your feature request related to a problem? Please describe.
If a family has more than one affected sample (with the same parents) then snappy will - without informing about this - chose the alphabetically first sample as index, however in some cases this in unwanted behavior as i.e. some vcf files may only be available for a different sample that is the pre-designated index.
Describe the solution you'd like
It should be possible to set samples in a given Family as index. To so a new column (i.e. is_index) needs to be introduced to biomedsheets and also be supported by cubi-tk. This column/information should be optional, as to not disrupt all existing projects
Describe alternatives you've considered
Without this feature one has to change external circumstances to make snappy work as intended. Biomedsheets currently provides the only input for sample information to snappy, so other solutions would require more work.
The text was updated successfully, but these errors were encountered: