GSE164690 - CellInteractionsHNSCC #1319

Open
11 tasks
idazucchi opened this issue Oct 31, 2024 · 3 comments
Assignees
Labels
  • dataset: All dataset tickets should have this label, only one ticket per dataset
  • ORCF v1: Dataset selected for the ORCF atlas v1.0
  • Release 45: DP Data Release @ 2/12

Comments

@idazucchi
Collaborator

Project short name:

CellInteractionsHNSCC

Primary Wrangler:

Ida

Secondary Wrangler:

Associated files

Published study links

ingest

Key Events

  • Convert published metadata to HCA spreadsheet
  • Manually curate dataset to meet HCA metadata standard
  • Collect any matrix and cell-type annotation files
  • Are the analysis files suitable for CellxGene? If something is missing get in touch with the authors to request it
  • Upload sheet to validate metadata
  • Transfer raw files to ingest to validate data files
  • Check linking using ingest graph validator
  • Ask the Secondary Wrangler for an end-to-end review of the project. Ask the Expertise Wrangler to review specific tabs if needed
  • Submit dataset to Production
  • Complete the Export SOP
  • Convert project data to SCEA format following the SCEA conversion SOP if appropriate
@idazucchi added the dataset, ORCF v1, and Release 45 labels Oct 31, 2024
@idazucchi self-assigned this Oct 31, 2024
@idazucchi
Collaborator Author

idazucchi commented Nov 5, 2024

Smoking metadata is just Y/N with no context. The options for recording it could be:

  1. Map yes -> active and no -> never
  2. Skip this piece of metadata
  3. Ask the contributor, although they most likely won't know and wouldn't reach out to patients anyway

I want to go with 2 because I don't want to create wrong data. If we get this metadata with tier 2 collection we will amend the dataset.
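To make the trade-off between options 1 and 2 concrete, here is a minimal Python sketch. The function names and the `Y`/`N` input encoding are hypothetical and only for illustration; `active` and `never` are the target values mentioned above. This is not the ingest tooling, just one way to picture the choice.

```python
from typing import Optional

# Hypothetical lookup for option 1. "active" and "never" are the target
# values named in the comment above; everything else is illustrative.
YES_NO_TO_STATUS = {"Y": "active", "N": "never"}


def smoking_status_option_1(raw: str) -> str:
    """Option 1: force the published Y/N flag into a smoking status.

    Risk: without context, "Y" might describe a former smoker and "N"
    might only mean "not currently smoking", so the result can be wrong.
    """
    return YES_NO_TO_STATUS[raw.strip().upper()]


def smoking_status_option_2(raw: str) -> Optional[str]:
    """Option 2: leave the field empty instead of guessing."""
    return None


if __name__ == "__main__":
    for flag in ("Y", "N"):
        print(flag, smoking_status_option_1(flag), smoking_status_option_2(flag))
```

With option 2 the field stays empty, so it can be filled in later if tier 2 collection provides the real status, without first having to undo a guessed value.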

@arschat
Collaborator

arschat commented Nov 20, 2024

Perfect wrangling, Ida! Only a single comment on the donor description:

  • In Table 1 there is a footnote for the T/N/M stage of donors 5 & 8: "†Pathologically staged". Our description includes only the † symbol, with no explanation of what it means. We could either remove the symbol and skip the extra info, or add "Pathologically staged" to the description of those fields.

@idazucchi
Collaborator Author

Fixed, exported, and filled in the form.

@arschat closed this as completed Nov 27, 2024
@arschat reopened this Nov 27, 2024