Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

report on usage of groupedValue #4244

Open
ndushay opened this issue Sep 22, 2022 · 8 comments
Open

report on usage of groupedValue #4244

ndushay opened this issue Sep 22, 2022 · 8 comments
Assignees
Labels
cocina reports request for report on metadata content

Comments

@ndushay
Copy link
Contributor

ndushay commented Sep 22, 2022

Arcadia:

for dros and collections

run a report on groupedValue? That was a MODSism.

@ndushay ndushay added the cocina reports request for report on metadata content label Sep 22, 2022
@ndushay ndushay self-assigned this Sep 22, 2022
@ndushay
Copy link
Contributor Author

ndushay commented Sep 22, 2022

I ran using property_existence*.rb reports.

DROs - no occurrences.
Collections - 10 occurrences.

Also here:

collection_druid,catkey,collection_name
druid:cx804cx9497,13161135,"al-  Shāmil fī-al-ṣināʻah al-ṭibbiyah, 13th cent"
druid:fd953pg7906,7913308,"Zhongguo wu ci xian ji ren kou pu cha shu ju"
druid:gy247sz1687,2922639,"[Collection of anti-Napoleon broadsides and broadsides on the proposed French invasion of England in 1803]"
druid:nh800mm2361,7649813,"China 2000 township population census data with maps"
druid:mq015fg9108,6640186,"Jing ji ban yue kan"
druid:sf111ch8577,11415872,"Guo ji mao yi dao bao"
druid:sh973sy2698,4235167,"Sea-floor images and data from multibeam surveys in San Francisco Bay, Southern California, Hawaii, the Gulf of Mexico, and Lake Tahoe, California--Nevada"
druid:sk296jm6766,411101,"Who's who in East Africa"
druid:wy489qg9472,11668039,"Jean and Juliette Logerais papers, 1885-1910s"

@ndushay
Copy link
Contributor Author

ndushay commented Sep 23, 2022

This DRO druid says I got it wrong:

https://argo.stanford.edu/items/druid:jc764sf5117.json

image

@ndushay
Copy link
Contributor Author

ndushay commented Sep 26, 2022

Okay, now showing 44K occurrences in DROs:

drosGroupedValue.csv

$ wc -l drosGroupedValue.csv 
44389 drosGroupedValue.csv

$ head drosGroupedValue.csv 
item_druid,catkey,collection_druid,collection_name
druid:wf711dr8563,3139502,,""
druid:wf697mc3225,3421337,,""
druid:wf707db3686,409884,,""
druid:wf835rv6544,372841,,""
druid:wf742hv8540,1040631,,""
druid:wf756wk1315,3974080,,""
druid:wf825mr3467,3352168,,""
druid:wf828pf4428,190789,,""
druid:wf714gm0334,372497,,""

$ tail drosGroupedValue.csv 
druid:wf586rk5853,1663237,,""
druid:wf593xm4029,353788,,""
druid:wf602pw7864,3747764,,""
druid:wf605ty9067,4698244,,""
druid:wf607dv8805,2854335,,""
druid:wf609hp1935,9751459,,""
druid:wf624pf3032,9381970,,""
druid:wf656gb0171,6712616,,""
druid:wf670gc1060,5056647,,""
druid:wf684fn0436,1610706,,""

none of them have a collection, some don't have catkeys

@arcadiafalcone
Copy link
Collaborator

I looked at the first three and the last three and all but one would lose the groupedValue if refreshed from source.

@ndushay
Copy link
Contributor Author

ndushay commented Sep 26, 2022

Maybe refresh "all" from source and rerun reports?

@ndushay ndushay removed their assignment Sep 26, 2022
@arcadiafalcone
Copy link
Collaborator

Something is wrong with getting the collection name - wf742hv8540 e.g. is in Google Books (as I suspect many of these are).

Working on the refresh.

@ndushay
Copy link
Contributor Author

ndushay commented Sep 28, 2022

Something is wrong with getting the collection name - wf742hv8540 e.g. is in Google Books (as I suspect many of these are).

yes - Mike found that in his PR review; I'm running the report again. Sorry about that.

@ndushay
Copy link
Contributor Author

ndushay commented Sep 28, 2022

updated dros report - now with collection info!

40,975 lines with header.

drosGroupedValue.csv

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cocina reports request for report on metadata content
Projects
None yet
Development

No branches or pull requests

2 participants