Add issues #260

Open
26 of 60 tasks
joeroe opened this issue Mar 14, 2023 · 1 comment
Comments

@joeroe
Contributor

joeroe commented Mar 14, 2023

Flagging potential data quality problems:

  • c14_labs
  • c14s
    • Missing bp
    • bp > 50000
    • Missing std
    • std > ?
    • Missing delta_c13
    • Missing delta_c13_std
    • Missing method
    • Unknown method
    • Missing c14_lab_id
    • Missing sample_id (maybe should be a validation?)
    • Missing lab_identifier
    • lab_identifier doesn't match expected format
    • lab_identifier doesn't match associated c14_lab
    • Potential duplicate
    • Duplicate lab number with different measurement data
    • delta_c13 = 0
    • Negative bp or std? (the former is probably not an error)
  • citations
  • contexts
    • Missing site_id (maybe should be a validation?)
    • No samples (maybe auto-delete?)
  • materials
  • references
    • Missing short_ref (maybe should be a validation?)
    • short_ref may contain multiple values (i.e. contains a common delimiter character)
    • short_ref is not actually short
    • Missing bibtex
    • Invalid bibtex
    • bibtex contains more than one entry
    • "pers. comm." etc. (not usable references, should be deleted)
  • samples
    • Missing material
    • Missing taxon
    • Material doesn't match taxon (e.g. bone <-> a plant)
    • Coordinates, but no CRS
    • position_crs is not a valid EPSG code (or proj string?)
    • No xrons (maybe auto-delete?)
  • sites
    • Missing lat
    • Missing lng
    • lat outside the valid range (-90 ≤ lat ≤ 90)
    • lng outside the valid range (-180 ≤ lng ≤ 180)
    • lat/lng is in the sea
    • Missing country_code
    • Invalid country_code
    • lat/lng doesn't match country_code
    • Excessive coordinate precision (e.g. 20.178000000000001 or 11.542999999999999)
    • Coordinates on Null Island (0°N 0°E)
  • taxons
    • Missing taxon
    • Unknown taxon
    • GBIF considers taxon a synonym (resolve to canonical usage)
    • Taxon with long name (likely free text entered by mistake)
    • Taxons that are recorded only once (or another small number of times)
  • typos
    • Missing name (should probably be deleted)
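
To make the coordinate items under sites concrete, here is a rough Python sketch of how a few of them might be checked. It is illustration only, not how the project implements its issues: the function names, the returned labels, and the `MAX_DECIMAL_PLACES` threshold are all assumptions. Coordinates are taken as decimal strings, because a value like 20.178000000000001 collapses to 20.178 once parsed as a float and the excess precision would no longer be visible.

```python
from decimal import Decimal

# Assumed threshold: the checklist gives examples but no cut-off value.
MAX_DECIMAL_PLACES = 6


def decimal_places(text):
    """Count digits after the decimal point in a numeric string."""
    exponent = Decimal(text).as_tuple().exponent
    return max(0, -exponent)


def site_coordinate_issues(lat, lng):
    """Return labels for coordinate problems (hypothetical helper).

    lat and lng are decimal strings as stored, or None when missing.
    """
    issues = []
    if lat is None:
        issues.append("missing lat")
    if lng is None:
        issues.append("missing lng")
    if lat is None or lng is None:
        return issues  # the remaining checks need both values

    lat_d, lng_d = Decimal(lat), Decimal(lng)
    if not (-90 <= lat_d <= 90):
        issues.append("lat out of range")
    if not (-180 <= lng_d <= 180):
        issues.append("lng out of range")
    if lat_d == 0 and lng_d == 0:
        issues.append("coordinates on Null Island")
    if max(decimal_places(lat), decimal_places(lng)) > MAX_DECIMAL_PLACES:
        issues.append("excessive coordinate precision")
    return issues
```

Checks such as "lat/lng is in the sea" or "lat/lng doesn't match country_code" would need external boundary data and are left out of this sketch.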

Please feel free to add!

@joeroe joeroe pinned this issue Mar 14, 2023
joeroe added a commit that referenced this issue Mar 22, 2023
The low-hanging fruit: MISSING_C14_AGE, VERY_OLD_C14, MISSING_C14_ERROR,
MISSING_D13C, MISSING_D13C_ERROR, MISSING_C14_METHOD,
MISSING_C14_LAB_ID, and MISSING_C14_LAB
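
For readers who want to see what these issue names might amount to, here is a minimal Python sketch that flags them on a single c14 record held as a plain dict. The issue names come from the commit message and the field names from the checklist above; which name goes with which field is a guess (in particular the pairing of MISSING_C14_LAB_ID with `lab_identifier` and MISSING_C14_LAB with `c14_lab_id`), and the helper names are hypothetical.

```python
# Threshold taken from the "bp > 50000" item in the checklist.
VERY_OLD_BP = 50000

# Assumed pairing of issue names to record fields (not confirmed by the thread).
MISSING_CHECKS = [
    ("MISSING_C14_ERROR", "std"),
    ("MISSING_D13C", "delta_c13"),
    ("MISSING_D13C_ERROR", "delta_c13_std"),
    ("MISSING_C14_METHOD", "method"),
    ("MISSING_C14_LAB_ID", "lab_identifier"),
    ("MISSING_C14_LAB", "c14_lab_id"),
]


def is_missing(value):
    """Treat None and blank strings as missing."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def c14_issues(c14):
    """Return the issue names that apply to one c14 record (a dict)."""
    issues = []
    bp = c14.get("bp")
    if is_missing(bp):
        issues.append("MISSING_C14_AGE")
    elif bp > VERY_OLD_BP:
        issues.append("VERY_OLD_C14")
    for name, field in MISSING_CHECKS:
        if is_missing(c14.get(field)):
            issues.append(name)
    return issues
```

Keeping the missing-field checks in a table means a new "Missing X" item from the checklist would be a one-line addition.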
@joeroe
Contributor Author

joeroe commented Mar 23, 2023

Added more issues in:

547bcb3
14026a9
a07d589
06360eb
506e355

@joeroe joeroe added this to the Improved issues milestone Mar 23, 2023
@joeroe joeroe mentioned this issue Mar 30, 2023
joeroe pushed a commit that referenced this issue Dec 16, 2024