Sanitize incoming DOIs from classic and bugfixes #51

Open
wants to merge 37 commits into
base: master

tjacovich
Contributor

@tjacovich tjacovich commented Apr 25, 2022

  • Sanitize DOIs that have trailing characters or are merged duplicates.
  • Modify maintenance_reevaluate to check for bad DOIs in the database.
  • Add a function that converts any alternate bibcodes in parsed_metadata to uppercase.
  • Modify populate_bibcode_column to respect curated_metadata without overwriting parsed_metadata.
  • Fix alembic revisions to respect column contents on downgrade.
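
For readers unfamiliar with what DOI sanitization involves, here is a minimal sketch of the idea behind the first bullet. It is not the code in this PR: the function name `sanitize_doi`, the regular expressions, and the set of trailing characters are all hypothetical, and mapping `zenodo/123456` to the Zenodo DOI prefix `10.5281` is an assumption about how such records would be repaired.

```python
import re

# Hypothetical pattern marking where a DOI starts ("10.", registrant code, "/").
# The zero-width lookahead lets us split merged DOIs without consuming text.
DOI_START = re.compile(r"(?=10\.\d{4,9}/)")

# Hypothetical pattern for malformed records of the form "zenodo/123456".
ZENODO_SHORT = re.compile(r"^zenodo/(\d+)$", re.IGNORECASE)

# Assumed set of characters that should never end a DOI.
TRAILING_JUNK = " \t.,;:)]}>'\""


def sanitize_doi(raw):
    """Return the distinct, cleaned DOIs found in raw (possibly an empty list)."""
    text = raw.strip()

    # Case 1: prefix-less Zenodo identifier. Rebuilding it under 10.5281
    # is an assumption made for this sketch.
    short = ZENODO_SHORT.match(text)
    if short:
        return ["10.5281/zenodo." + short.group(1)]

    # Case 2: one or more DOIs, possibly merged together or followed by junk.
    cleaned = []
    for piece in DOI_START.split(text):
        piece = piece.rstrip(TRAILING_JUNK)
        # Keep only pieces that look like DOIs, and drop repeats.
        if piece.startswith("10.") and piece not in cleaned:
            cleaned.append(piece)
    return cleaned
```

Returning a list rather than a single string keeps the merged case explicit: one incoming record may yield zero, one, or several sanitized targets, and the caller decides how to create citations for each.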

…odes are handled for maintenance_curation and maintenance_metadata.
…and then create new citations to the sanitized records.
…cond regex to catch malformed records of the form zenodo/123456.
…data. Added check on populate_bibcode_column to make sure it is not run on a db with curated metadata already present.
@tjacovich tjacovich marked this pull request as ready for review April 27, 2022 13:59
@tjacovich
Contributor Author

Removed alt_bibcode fix for alembic from this PR. Modified alembic revisions that change 'status' column so they preserve column values.

tjacovich added 9 commits May 19, 2022 20:16
… applied to software with versions but none that exist in ADS.
…s_expanded_raw_data() to look at raw_content column instead of content column for reconstruction.
…anitize new dois and to only mark registered citations as sanitized. Added sanitized targets to db in order to facilitate downgrade. Modified reevaluate to handle sanitizing records in a way consistent with new citation processing.