Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NRES & bionetwork backfill #1301

Open
arschat opened this issue Sep 20, 2024 · 10 comments
Open

NRES & bionetwork backfill #1301

arschat opened this issue Sep 20, 2024 · 10 comments
Labels
HCA metadata backfill operations This issue is an operational task Submissions Submission WS tasks

Comments

@arschat
Copy link
Collaborator

arschat commented Sep 20, 2024

There are two bulk metadata updates on the project level, that we'd like to do.

Reasoning

  1. NRES addition in all open access datasets
    After the introduction of managed access datasets in the portal, we would like to add the data_use_restriction field in the metadata of all open access projects i.e. all projects of the portal that this update was not done in the previous bulk update in Data Portal tracker - Data Repository tasks #1270. This would require bumping the project schema version to version 19.0.0 and add the field "data_use_restriction": "NRES" in the project metadata.
  2. Bionetwork backfilling
    Dave asked us to add the bionetwork information in the schema, since portal started showing the biological network on the front page by default. There are a couple of open questions here.
    a. what is the true list for bionetworks? Is it tracker?
    b. what is the true list for atlas names? In tracker some atlas names are initials (i.e. MSK 1.0, or ORCF 1.0). Do we want to add these names?
    c. Projects in portal with no bionetwork: would we like to show None instead of unspecified?

Plan

Since both metadata exist in the project level, we would like to update using @idazucchi 's script which exports only project metadata (don't have to update the state to graph valid, just return to exported). The steps would be:

  1. Select projects (uuids) that need update for NRES
  2. Select projects (uuids) that need bionetwork update & appropriate bionetwork(s)
  3. Select projects (uuids) that need atlas name & version update & appropriate atlas name(s) & version(s)
  4. Write script that via api calls to ingest, will update these informations
  5. Export project metadata via Ida's script
  6. Bulk import form sent to Travis

1,2,3 tasks can be done via the Task tracker spreadsheet
4 script is almost ready for previous bulk update in #1270 (see comments for script) a few modifications might be needed
5 if we provide uuids to script it runs quickly
6 we can also extract project title in order to populate the import form easily
Estimated time needed ~2 days

Risks

  1. information on tracker is not up to date
    • we will update project or re-run this script for bulk updates in a next release
  2. old project gets error in import validation
    • drop project from current release & investigate how we can re-export to avoid errors
    • ask from import team to re-populate staging area with reverse-import script & try again
@arschat arschat added metadata backfill operations This issue is an operational task Submissions Submission WS tasks labels Sep 20, 2024
@idazucchi idazucchi added the HCA label Sep 20, 2024
@arschat
Copy link
Collaborator Author

arschat commented Sep 24, 2024

@arschat
Copy link
Collaborator Author

arschat commented Oct 7, 2024

Lucia replied specifying that we should use the tracker names.

Here is a list of the tracker's atlases as of today:

Adipose v1.0
Breast v1.0
Development v1.0
Eye - Other v1.0
GDN v1.0
Gut v1.0
Heart v1.0
Immune v1.0
Kidney v1.0
Liver v1.0
Lung v1.0
Lung v2.0
MSK v1.0
ORCF v1.0
Organoid-Endoderm v1.0
Organoid-Neural v1.0
Pancreas v1.0
Retina v1.0
Retina v2.0
Skin v1.0

A patch in the hca_bionetworks module needs to be made to add those names, before we proceed with updating projects with atlas names, bionetworks and bump project schema version and add data_use_restriction.

@arschat
Copy link
Collaborator Author

arschat commented Oct 14, 2024

summarised the number of projects to update here (spreadsheet, folder)

If we want to update all data_use_restriction & bionetwork & atlas info: 431/477 projects
If we want to update only bionetwork and atlas info: 143/477 projects

@arschat
Copy link
Collaborator Author

arschat commented Oct 18, 2024

We decided to update only the projects that needed bionetwork & atlas update, but add also data_use_restriction field for that as well.

Out of the 143, 3 projects had no submission (probably wrangled by lattice).
Out of the 140 remaining, 139 all were edited successfully and I've add a new tab in the spreadsheet to specify the uuids and what has been updated, as well as their title if need for the import form.

list of ingest updates
uuid updated
027c51c6-0719-469f-a7f5-640fe57cbece schema_version,bionetworks,atlas
065e6c13-ad6b-46a3-8075-c3137eb03068 schema_version,bionetworks,atlas
07d5987e-7f9e-4f34-b0fb-a185a35504f5 schema_version,data_use_restriction,bionetworks,atlas
08fb10df-32e5-456c-9882-e33fcd49077a schema_version,data_use_restriction,atlas
0cc58d0b-1734-4e1d-9113-b32e52f75e36 schema_version,bionetworks,atlas
10201832-7c73-4033-9b65-3ef13d81656a schema_version,bionetworks,atlas
10a845f7-0361-46fa-92a3-2a36483136b1 schema_version,data_use_restriction,bionetworks,atlas
111d272b-c25a-49ac-9b25-e062b70d66e0 schema_version,data_use_restriction,atlas
12f32054-8f18-4dae-8959-bfce7e3108e7 schema_version,data_use_restriction
135f7f5c-4a85-4bcf-9f7c-4f035ff1e428 schema_version,bionetworks,atlas
1538d572-bcb7-426b-8d2c-84f3a7f87bb0 schema_version,bionetworks,atlas,publication title
17cf943b-e247-454f-908b-da58665fcc56 schema_version,data_use_restriction,bionetworks,atlas
1c4cbdd4-33e3-4ded-ab43-5958de817123 schema_version,data_use_restriction
1c6a960d-52ac-44ea-b728-a59c7ab9dc8e schema_version,bionetworks,atlas
1dd552a5-eb4f-4b92-8088-7224bcbd0629 schema_version,data_use_restriction,bionetworks,atlas
1dddae6e-3753-48af-b20e-fa22abad125d schema_version,data_use_restriction,bionetworks,atlas
1eba4d0b-2d15-4ba7-bb3c-d4654dd94519 schema_version,data_use_restriction,bionetworks,atlas
2184e63d-82d8-4ab2-839e-e93f8395f568 schema_version,data_use_restriction,bionetworks,atlas
222a92d5-277b-489c-aad8-a680d1fd2b12 schema_version,data_use_restriction,bionetworks,atlas
23509202-1e3c-4959-8a45-9c5b642a1066 schema_version,data_use_restriction
24d0dbbc-54eb-4904-8141-934d26f1c936 schema_version,data_use_restriction,bionetworks,atlas
27e2e0ae-5971-4927-aac1-19e81804097b schema_version,data_use_restriction,bionetworks,atlas
28dd1438-8f40-40d0-8e53-ee3301b66218 schema_version,data_use_restriction,bionetworks,atlas
29b54165-34ee-4da5-b257-b4c1f7343656 schema_version,data_use_restriction
2caedc30-c816-4b99-a237-b9f3b458c8e5 schema_version,data_use_restriction,bionetworks,atlas
2d4d89f2-ebeb-467c-ae60-a3efc5e8d4ba schema_version,atlas
2ef3655a-973d-4d69-9b41-21fa4041eed7 schema_version,project title,publication title
2fe3c60b-ac1a-4c61-9b59-f6556c0fce63 schema_version,data_use_restriction,bionetworks,atlas
3089d311-f9ed-44dd-bb10-397059bad4dc schema_version,data_use_restriction,bionetworks,atlas,project title
30dc3964-1135-4b56-b393-ce2dcbc6e379 schema_version,data_use_restriction,bionetworks,atlas
31887183-a72c-4308-9eac-c6140313f39c schema_version,bionetworks,atlas
35d5b057-3daf-4ccd-8112-196194598893 schema_version,data_use_restriction,bionetworks,atlas
377c35d1-93bf-470c-8067-08f954b269bd schema_version,data_use_restriction,bionetworks,atlas
3ce9ae94-c469-419a-9637-5d138a4e642f schema_version,data_use_restriction,atlas
40272c3b-4697-4bd4-ba3f-82fa96b9bf71 schema_version,bionetworks,atlas,publication title
415eb773-cadb-43d1-ab89-7d160d5cfc7d schema_version,data_use_restriction,bionetworks,atlas
425c2759-db66-4c93-a358-a562c069b1f1 schema_version,bionetworks,atlas
453d7ee2-319f-496c-9862-99d397870b63 schema_version,data_use_restriction,atlas
457d0bfe-79e4-43f1-be5d-83bf080d809e schema_version,data_use_restriction
46a7e4bf-0474-4a8f-8d18-43afcde90491 schema_version,bionetworks,atlas
4bec484d-ca7a-47b4-8d48-8830e06ad6db schema_version,bionetworks,atlas
4d6f6c96-2a83-43d8-8fe1-0f53bffd4674 schema_version,bionetworks,atlas
4f4f0193-ede8-4a82-8cb0-7a0a22f06e63 schema_version,data_use_restriction,bionetworks,atlas
50151324-f3ed-4358-98af-ec352a940a61 schema_version,bionetworks,atlas
50154d1e-2308-44bf-9608-10c7afaa560b schema_version,data_use_restriction
51f02950-ee25-4f4b-8d07-59aa99bb3498 schema_version,bionetworks,atlas
58028aa8-0ed2-49ca-b60f-15e2ed5989d5 schema_version,data_use_restriction,bionetworks,atlas
5a54c617-0eed-486e-8c1a-8a8041fc1729 schema_version,data_use_restriction
5f1a1aee-6c48-4dd4-a2c4-eb4ca6aadf74 schema_version,bionetworks,atlas
5f607e50-ba22-4598-b1e9-f3d9d7a35dcc schema_version,data_use_restriction,bionetworks,atlas
60109425-a6e6-4be1-a3bc-15de680317d4 schema_version,bionetworks,atlas
60ea42e1-af49-42f5-8164-d641fdb696bc schema_version,data_use_restriction,bionetworks,atlas
61515820-5bb8-45d0-8d12-f0850222ecf0 schema_version,data_use_restriction
645b20c9-5ed0-4500-86b5-7aef770d010a schema_version,data_use_restriction,bionetworks,atlas
660fc8b5-8fb8-4050-8c57-e6313195bc81 schema_version,bionetworks,atlas
66d7d92a-d6c5-492c-815b-f81c7c93c984 schema_version,data_use_restriction,bionetworks,atlas
6836c1e4-906b-4c34-a11c-cb025167896d schema_version,bionetworks,atlas
6874b7eb-3445-47ec-8773-75141430e169 schema_version,data_use_restriction,atlas
69324a96-a68a-4514-bbb4-f8f3ea4bd0f1 schema_version,data_use_restriction,bionetworks,atlas
6936da41-3692-46bb-bca1-cd0f507991e9 schema_version,data_use_restriction,atlas
6ac8e777-f9a0-4288-b5b0-446e8eba3078 schema_version,data_use_restriction,bionetworks,atlas
6e522b93-9b70-4f0c-9990-b9cff721251b schema_version,data_use_restriction
6f89a7f3-8d4a-4344-aa4f-eccfe7e91076 schema_version,data_use_restriction,bionetworks,atlas
73769e0a-5fcd-41f4-9083-41ae08bfa4c1 schema_version,data_use_restriction
750b455a-e3cf-4721-9581-8609a6c9d561 schema_version,bionetworks,atlas
769a08d1-b8a4-4f1e-95f7-6071a9827555 schema_version,data_use_restriction,bionetworks,atlas
77c13c40-a598-4036-807f-be09209ec2dd schema_version,data_use_restriction
783c9952-a4ae-4106-a6ce-56f20ce27f88 schema_version,data_use_restriction,bionetworks,atlas
7a8d45f1-353b-4508-8e89-65a96785b167 schema_version,data_use_restriction,bionetworks,atlas
7ac8822c-4ef0-4194-adf0-74290611b1c6 schema_version,data_use_restriction,bionetworks,atlas
7adede6a-0ab7-45e6-9b67-ffe7466bec1f schema_version,data_use_restriction,bionetworks,atlas
7bc1f14b-5e64-4c7f-86b0-23596b97e2aa schema_version,data_use_restriction,bionetworks,atlas
7c599029-7a3c-4b5c-8e79-e72c9a9a65fe schema_version,data_use_restriction
7f351a4c-d24c-4fcd-9040-f79071b097d0 schema_version,atlas
8185730f-4113-40d3-9cc3-929271784c2b schema_version,data_use_restriction,bionetworks,atlas
85c0d6fa-f117-4d76-b01a-5d5e8f5f9188 schema_version,data_use_restriction,bionetworks,atlas
86fe0a0c-88b3-4a3e-94a1-6f9feadc401e schema_version,data_use_restriction,bionetworks,atlas
87f519b4-8862-41f9-acff-75e823e0e430 schema_version,data_use_restriction,bionetworks,atlas
888f1766-4c84-43bb-8717-b5f9d2046097 schema_version,bionetworks,atlas,project title
894ae6ac-5b48-41a8-a72f-315a9b60a62e schema_version,data_use_restriction,bionetworks,atlas
8b9cb6ae-6a43-4e47-b9fb-3df7aeec941f schema_version,data_use_restriction,bionetworks,atlas
902dc043-7091-445c-9442-d72e163b9879 schema_version,data_use_restriction,bionetworks,atlas
91674dcf-8641-40e6-978d-c1706feffba8 schema_version,data_use_restriction,bionetworks,atlas
923d3231-7295-4184-b3f6-c3082766a8c7 schema_version,data_use_restriction,bionetworks,atlas
925f9a4c-cac0-444a-ad2c-612656ab3a85 schema_version,data_use_restriction,bionetworks,atlas
92afaa56-d501-481e-a027-dddd72212ba8 schema_version,data_use_restriction,bionetworks,atlas
9483c664-d546-4b30-9ba3-efbdbf9290b4 schema_version,bionetworks,atlas
94e4ee09-9b4b-410a-84dc-a751ad36d0df schema_version,data_use_restriction,bionetworks,atlas
957261f7-2bd6-4358-a6ed-24ee080d5cfc schema_version,data_use_restriction,bionetworks,atlas
95d058bc-9cec-4c88-8d2c-05b4a45bf24f schema_version,data_use_restriction
95f07e6e-6a73-4e1b-a880-c83996b3aa5c schema_version,data_use_restriction,atlas
990d251f-6dab-4a98-a2b6-6cfe7e4708b9 schema_version,data_use_restriction,bionetworks,atlas
99101928-d9b1-4aaf-b759-e97958ac7403 schema_version,bionetworks,atlas
9b876d31-0739-4e96-9846-f76e6a427279 schema_version,data_use_restriction,bionetworks,atlas
9dd91b6e-7c62-49d3-a3d4-74f603deffdb schema_version,bionetworks,atlas,project title
a4f154f8-5cc9-40b5-b8d7-af90afce8a8f schema_version,data_use_restriction,bionetworks,atlas
ad04c8e7-9b7d-4cce-b8e9-01e31da10b94 schema_version,bionetworks,atlas
ae62bb31-55ca-4127-b0fb-b1771a604645 schema_version,data_use_restriction,atlas
ae71be1d-ddd8-4feb-9bed-24c3ddb6e1ad schema_version,bionetworks,atlas
aebc99a3-3151-482a-9709-da6802617763 schema_version,data_use_restriction,bionetworks,atlas
b176d756-62d8-4933-83a4-8b026380262f schema_version,bionetworks,atlas
b208466a-6fb0-4385-8cfb-8e03ff6b939e schema_version,data_use_restriction,bionetworks,atlas
b91c623b-1945-4727-b167-0a93027b0d3f schema_version,data_use_restriction
b963bd4b-4bc1-4404-8425-69d74bc636b8 schema_version,atlas
bc5512cc-9544-4aa4-8b75-8af445ee2257 schema_version,data_use_restriction
bcdf233f-9246-4c0c-9843-0514120b7e3a schema_version,data_use_restriction,bionetworks,atlas
bd400331-54b9-4fcc-bff6-6bb8b079ee1f schema_version,data_use_restriction,bionetworks,atlas
c0fecf0b-af86-41b8-ba82-d5fd81b7542a schema_version,data_use_restriction,bionetworks,atlas
c211fd49-d980-4ba1-8c6a-c24254a3cb52 schema_version,bionetworks,atlas
c4e11369-78d4-4d29-ba8e-b67907c4c65c schema_version,data_use_restriction,bionetworks,atlas
c5ca43aa-3b2b-4216-8eb3-f57adcbc99a1 schema_version,bionetworks,atlas
c6ad8f9b-d26a-4811-b2ba-93d487978446 schema_version,bionetworks,atlas
c6ef0270-eafc-43bd-8097-c10020a03cfc schema_version,data_use_restriction,bionetworks,atlas
c893cb57-5c9f-4f26-9312-21b85be84313 schema_version,data_use_restriction,bionetworks,atlas
c9e83418-a9f0-4ed1-ab4f-56d9513417bf schema_version,data_use_restriction,bionetworks,atlas
cae461de-ecbd-482f-a5d4-11d607fc12ba schema_version,data_use_restriction
cbd2911f-252b-4428-abde-69e270aefdfc schema_version,data_use_restriction,bionetworks,atlas
cbd3d276-9f24-4af9-8381-b11f6cdbdc4b schema_version,data_use_restriction,bionetworks,atlas
ccc3b786-1da0-427f-a45f-76306d6143b6 schema_version,bionetworks,atlas
cd61771b-661a-4e19-b269-6e5d95350de6 schema_version,data_use_restriction
cdabcf0b-7602-4abf-9afb-3b410e545703 schema_version,atlas
cddab57b-6868-4be4-806f-395ed9dd635a schema_version,bionetworks,atlas
d3446f0c-30f3-4a12-b7c3-6af877c7bb2d schema_version,bionetworks,atlas
d8ae869c-39c2-4cdd-b3fc-2d0d8f60e7b8 schema_version,data_use_restriction,bionetworks,atlas
da77bd06-43ae-4012-a774-e4d62797df51 schema_version,data_use_restriction,bionetworks,atlas
daa371e8-1ec3-43ef-924f-896d901eab6f schema_version,data_use_restriction,bionetworks,atlas
daef3fda-2620-45ae-a3f7-1613814a35bf schema_version,data_use_restriction,bionetworks,atlas
dcbb50d1-9acf-4f70-9fda-b1f63a948c49 schema_version,data_use_restriction,bionetworks,atlas
e090445c-6971-4212-bc5f-ae4ec3914102 schema_version,data_use_restriction,bionetworks,atlas
e1fda217-7ee1-4c1a-adfa-648279dafac6 schema_version,data_use_restriction,bionetworks,atlas
e456c042-f6b6-4cec-a338-1a8ef80bd779 schema_version,data_use_restriction,atlas
e5fe8274-3769-4d7d-aa35-6d33c226ab43 schema_version,data_use_restriction,bionetworks,atlas
e77fed30-959d-4fad-bc15-a0a5a85c21d2 schema_version,data_use_restriction,bionetworks,atlas
e870ab56-3537-4b6d-a66f-534fbf8cc57f schema_version,bionetworks,atlas
e956e66a-ac8e-483a-963a-0f92c7e5abfb schema_version,data_use_restriction,bionetworks,atlas
e9f36305-d857-44a3-93f0-df4e6007dc97 schema_version,data_use_restriction,bionetworks,atlas
f2078d5f-2e7d-4844-8552-f7c41a231e52 schema_version,bionetworks,atlas
f86f1ab4-1fbb-4510-ae35-3ffd752d4dfc schema_version,data_use_restriction,bionetworks,atlas
fae72d89-4ac4-4aab-9b93-574775e168d4 schema_version,data_use_restriction,bionetworks,atlas
fcaa53cd-ba57-4bfe-af9c-eaa958f95c1a schema_version,data_use_restriction,bionetworks,atlas

It might be good to notify Dave and indexing team, that for projects with more than one atlas from same bionetwork, we had to duplicate the bionetwork for modelling reasons. However, we wouldn't like to show twice the same bionetwork in the browser.

Also, before we publish, we might need to ask execs to verify this publication since some bionetworks might not be ready to make (part of) their list public (see tracker confidentiality).

@idazucchi
Copy link
Collaborator

I'm only exporting project from the Lung v2.0 list as we wait for confirmation from Ellen that we can publish this information

exported projects
10201832-7c73-4033-9b65-3ef13d81656a
5f1a1aee-6c48-4dd4-a2c4-eb4ca6aadf74
5f607e50-ba22-4598-b1e9-f3d9d7a35dcc
769a08d1-b8a4-4f1e-95f7-6071a9827555
7ac8822c-4ef0-4194-adf0-74290611b1c6
7bc1f14b-5e64-4c7f-86b0-23596b97e2aa
92afaa56-d501-481e-a027-dddd72212ba8
957261f7-2bd6-4358-a6ed-24ee080d5cfc
ad04c8e7-9b7d-4cce-b8e9-01e31da10b94
b208466a-6fb0-4385-8cfb-8e03ff6b939e
e870ab56-3537-4b6d-a66f-534fbf8cc57f
fae72d89-4ac4-4aab-9b93-574775e168d4

@arschat
Copy link
Collaborator Author

arschat commented Oct 30, 2024

We exported the following uuids for R44:

exported & in manifest for R44

d8ae869c-39c2-4cdd-b3fc-2d0d8f60e7b8
027c51c6-0719-469f-a7f5-640fe57cbece
065e6c13-ad6b-46a3-8075-c3137eb03068
07d5987e-7f9e-4f34-b0fb-a185a35504f5
08fb10df-32e5-456c-9882-e33fcd49077a
0cc58d0b-1734-4e1d-9113-b32e52f75e36
10a845f7-0361-46fa-92a3-2a36483136b1
111d272b-c25a-49ac-9b25-e062b70d66e0
12f32054-8f18-4dae-8959-bfce7e3108e7
135f7f5c-4a85-4bcf-9f7c-4f035ff1e428
1538d572-bcb7-426b-8d2c-84f3a7f87bb0
17cf943b-e247-454f-908b-da58665fcc56
1c4cbdd4-33e3-4ded-ab43-5958de817123
1c6a960d-52ac-44ea-b728-a59c7ab9dc8e
1dd552a5-eb4f-4b92-8088-7224bcbd0629
1dddae6e-3753-48af-b20e-fa22abad125d
2184e63d-82d8-4ab2-839e-e93f8395f568
222a92d5-277b-489c-aad8-a680d1fd2b12
23509202-1e3c-4959-8a45-9c5b642a1066
24d0dbbc-54eb-4904-8141-934d26f1c936
27e2e0ae-5971-4927-aac1-19e81804097b
28dd1438-8f40-40d0-8e53-ee3301b66218
29b54165-34ee-4da5-b257-b4c1f7343656
2d4d89f2-ebeb-467c-ae60-a3efc5e8d4ba
2ef3655a-973d-4d69-9b41-21fa4041eed7
2fe3c60b-ac1a-4c61-9b59-f6556c0fce63
3089d311-f9ed-44dd-bb10-397059bad4dc
30dc3964-1135-4b56-b393-ce2dcbc6e379
31887183-a72c-4308-9eac-c6140313f39c
377c35d1-93bf-470c-8067-08f954b269bd
3ce9ae94-c469-419a-9637-5d138a4e642f
40272c3b-4697-4bd4-ba3f-82fa96b9bf71
415eb773-cadb-43d1-ab89-7d160d5cfc7d
425c2759-db66-4c93-a358-a562c069b1f1
453d7ee2-319f-496c-9862-99d397870b63
457d0bfe-79e4-43f1-be5d-83bf080d809e
46a7e4bf-0474-4a8f-8d18-43afcde90491
4bec484d-ca7a-47b4-8d48-8830e06ad6db
4d6f6c96-2a83-43d8-8fe1-0f53bffd4674
50151324-f3ed-4358-98af-ec352a940a61
50154d1e-2308-44bf-9608-10c7afaa560b
51f02950-ee25-4f4b-8d07-59aa99bb3498
5a54c617-0eed-486e-8c1a-8a8041fc1729
60109425-a6e6-4be1-a3bc-15de680317d4
60ea42e1-af49-42f5-8164-d641fdb696bc
61515820-5bb8-45d0-8d12-f0850222ecf0
645b20c9-5ed0-4500-86b5-7aef770d010a
660fc8b5-8fb8-4050-8c57-e6313195bc81
66d7d92a-d6c5-492c-815b-f81c7c93c984
6836c1e4-906b-4c34-a11c-cb025167896d
6874b7eb-3445-47ec-8773-75141430e169
69324a96-a68a-4514-bbb4-f8f3ea4bd0f1
6936da41-3692-46bb-bca1-cd0f507991e9
6ac8e777-f9a0-4288-b5b0-446e8eba3078
6e522b93-9b70-4f0c-9990-b9cff721251b
6f89a7f3-8d4a-4344-aa4f-eccfe7e91076
73769e0a-5fcd-41f4-9083-41ae08bfa4c1
750b455a-e3cf-4721-9581-8609a6c9d561
77c13c40-a598-4036-807f-be09209ec2dd
783c9952-a4ae-4106-a6ce-56f20ce27f88
7a8d45f1-353b-4508-8e89-65a96785b167
7adede6a-0ab7-45e6-9b67-ffe7466bec1f
7c599029-7a3c-4b5c-8e79-e72c9a9a65fe
7f351a4c-d24c-4fcd-9040-f79071b097d0
8185730f-4113-40d3-9cc3-929271784c2b
85c0d6fa-f117-4d76-b01a-5d5e8f5f9188
86fe0a0c-88b3-4a3e-94a1-6f9feadc401e
87f519b4-8862-41f9-acff-75e823e0e430
888f1766-4c84-43bb-8717-b5f9d2046097
894ae6ac-5b48-41a8-a72f-315a9b60a62e
8b9cb6ae-6a43-4e47-b9fb-3df7aeec941f
902dc043-7091-445c-9442-d72e163b9879
91674dcf-8641-40e6-978d-c1706feffba8
923d3231-7295-4184-b3f6-c3082766a8c7
925f9a4c-cac0-444a-ad2c-612656ab3a85
9483c664-d546-4b30-9ba3-efbdbf9290b4
94e4ee09-9b4b-410a-84dc-a751ad36d0df
95d058bc-9cec-4c88-8d2c-05b4a45bf24f
95f07e6e-6a73-4e1b-a880-c83996b3aa5c
990d251f-6dab-4a98-a2b6-6cfe7e4708b9
99101928-d9b1-4aaf-b759-e97958ac7403
9b876d31-0739-4e96-9846-f76e6a427279
9dd91b6e-7c62-49d3-a3d4-74f603deffdb
a4f154f8-5cc9-40b5-b8d7-af90afce8a8f
ae62bb31-55ca-4127-b0fb-b1771a604645
ae71be1d-ddd8-4feb-9bed-24c3ddb6e1ad
aebc99a3-3151-482a-9709-da6802617763
b176d756-62d8-4933-83a4-8b026380262f
b91c623b-1945-4727-b167-0a93027b0d3f
b963bd4b-4bc1-4404-8425-69d74bc636b8
bc5512cc-9544-4aa4-8b75-8af445ee2257
bcdf233f-9246-4c0c-9843-0514120b7e3a
bd400331-54b9-4fcc-bff6-6bb8b079ee1f
c0fecf0b-af86-41b8-ba82-d5fd81b7542a
c211fd49-d980-4ba1-8c6a-c24254a3cb52
c4e11369-78d4-4d29-ba8e-b67907c4c65c
c5ca43aa-3b2b-4216-8eb3-f57adcbc99a1
c6ad8f9b-d26a-4811-b2ba-93d487978446
c6ef0270-eafc-43bd-8097-c10020a03cfc
c9e83418-a9f0-4ed1-ab4f-56d9513417bf
cae461de-ecbd-482f-a5d4-11d607fc12ba
cbd2911f-252b-4428-abde-69e270aefdfc
cbd3d276-9f24-4af9-8381-b11f6cdbdc4b
ccc3b786-1da0-427f-a45f-76306d6143b6
cd61771b-661a-4e19-b269-6e5d95350de6
cddab57b-6868-4be4-806f-395ed9dd635a
d3446f0c-30f3-4a12-b7c3-6af877c7bb2d
da77bd06-43ae-4012-a774-e4d62797df51
daa371e8-1ec3-43ef-924f-896d901eab6f
daef3fda-2620-45ae-a3f7-1613814a35bf
dcbb50d1-9acf-4f70-9fda-b1f63a948c49
e090445c-6971-4212-bc5f-ae4ec3914102
e1fda217-7ee1-4c1a-adfa-648279dafac6
e456c042-f6b6-4cec-a338-1a8ef80bd779
e5fe8274-3769-4d7d-aa35-6d33c226ab43
e77fed30-959d-4fad-bc15-a0a5a85c21d2
e956e66a-ac8e-483a-963a-0f92c7e5abfb
e9f36305-d857-44a3-93f0-df4e6007dc97
f2078d5f-2e7d-4844-8552-f7c41a231e52
f86f1ab4-1fbb-4510-ae35-3ffd752d4dfc
fae72d89-4ac4-4aab-9b93-574775e168d4
fcaa53cd-ba57-4bfe-af9c-eaa958f95c1a
1eba4d0b-2d15-4ba7-bb3c-d4654dd94519
2caedc30-c816-4b99-a237-b9f3b458c8e5
4f4f0193-ede8-4a82-8cb0-7a0a22f06e63
58028aa8-0ed2-49ca-b60f-15e2ed5989d5
c893cb57-5c9f-4f26-9312-21b85be84313
10201832-7c73-4033-9b65-3ef13d81656a
5f1a1aee-6c48-4dd4-a2c4-eb4ca6aadf74
5f607e50-ba22-4598-b1e9-f3d9d7a35dcc
769a08d1-b8a4-4f1e-95f7-6071a9827555
7ac8822c-4ef0-4194-adf0-74290611b1c6
7bc1f14b-5e64-4c7f-86b0-23596b97e2aa
92afaa56-d501-481e-a027-dddd72212ba8
957261f7-2bd6-4358-a6ed-24ee080d5cfc
ad04c8e7-9b7d-4cce-b8e9-01e31da10b94
b208466a-6fb0-4385-8cfb-8e03ff6b939e
e870ab56-3537-4b6d-a66f-534fbf8cc57f
fae72d89-4ac4-4aab-9b93-574775e168d4

There were 3 uuids that were eligible but were missed out from the process:

005d611a-14d5-4fbf-846e-571a1f874f70
01aacb68-4076-4fd9-9eb9-aba0f48c1b5a
9c20a245-f2c0-43ae-82c9-2232ec6b594f

@arschat
Copy link
Collaborator Author

arschat commented Nov 6, 2024

Update done in ingest for 005d611a-14d5-4fbf-846e-571a1f874f70 and 01aacb68-4076-4fd9-9eb9-aba0f48c1b5a.
Updated bionetwork & atlas information, bumped project schema version to 19.0.1 and added data_use_restriction field.

Project 9c20a245-f2c0-43ae-82c9-2232ec6b594f has been wrangled by lattice.

@idazucchi
Copy link
Collaborator

projects exported and filled import form

@arschat
Copy link
Collaborator Author

arschat commented Nov 26, 2024

Verified all bionetwork updates in separate tab in spreadsheet using azul api

for uuid in uuids:
	proj = requests.get("https://service.azul.data.humancellatlas.org/index/projects/"+uuid+"?catalog=dcp44").json()
	tissue_atlas = ", ".join([t['atlas'] + " " + t['version'] for t in proj['projects'][0]['tissueAtlas']]) if proj['projects'][0]['tissueAtlas'] else ''
	bionetworks = ", ".join(proj['projects'][0]['bionetworkName']) if proj['projects'][0]['bionetworkName'][0] else ''
	data_use = proj['projects'][0]['dataUseRestriction']
	print('\t'.join([uuid, tissue_atlas, bionetworks, data_use]))

@arschat
Copy link
Collaborator Author

arschat commented Nov 26, 2024

Need to re-export #1307 to include bionetwork & atlas info
3373e59c-525f-4a83-8c9c-d8b280454697

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
HCA metadata backfill operations This issue is an operational task Submissions Submission WS tasks
Projects
None yet
Development

No branches or pull requests

2 participants