Removes labels validated as incorrect from the API #3007

Merged
misaugstad merged 1 commit into develop from 3003-api-remove-incorrect-labels on Aug 25, 2022

misaugstad
Member

Resolves #3003

Filters out labels that have been validated as incorrect, even if they come from users marked as "high quality".
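For readers unfamiliar with the filter, here is a minimal sketch of the rule described above, not the actual code in this PR. The `Label` class, its `user_high_quality` and `correct` fields, and the `labels_for_api` function are all hypothetical names made up for illustration. It also assumes that labels that have not been validated yet are still kept.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Label:
    label_id: int
    # Hypothetical flag: the labeling user is marked "high quality".
    user_high_quality: bool
    # Hypothetical field: True/False once validated, None if not yet validated.
    correct: Optional[bool]


def labels_for_api(labels: List[Label]) -> List[Label]:
    """Keep labels from high-quality users unless they were validated as incorrect."""
    return [
        label for label in labels
        if label.user_high_quality and label.correct is not False
    ]


labels = [
    Label(1, user_high_quality=True, correct=True),
    Label(2, user_high_quality=True, correct=None),   # unvalidated: kept (assumption)
    Label(3, user_high_quality=True, correct=False),  # validated incorrect: dropped
    Label(4, user_high_quality=False, correct=True),  # not a high-quality user: dropped
]
print([label.label_id for label in labels_for_api(labels)])  # prints [1, 2]
```

The point of the sketch is that the "validated as incorrect" check applies on top of the user-quality check, so a label from a trusted user is still dropped once it has been validated as incorrect.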

On my local dev environment, which uses a Seattle database from 2021, here's how clustering changed:

  • Labels into clustering: 107,108 (before) -> 96,750 (after)
  • Clusters created from single-user clustering: 86,008 -> 78,452
  • Time for single-user clustering: 19.2 minutes -> 17.9 minutes
  • Final number of clustered attributes from multi-user clustering: 74,802 -> 68,020
  • Time for multi-user clustering: 22.9 minutes -> 21.2 minutes
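
To put the before/after numbers above on a common scale, the relative reduction for each metric can be computed with a short script. This is only a convenience sketch; the `percent_reduction` helper and the dictionary keys are names chosen here, while the values are copied from the list above.

```python
def percent_reduction(before: float, after: float) -> float:
    """Return how much smaller `after` is than `before`, as a percentage of `before`."""
    return (before - after) / before * 100


# (before, after) pairs taken from the list above.
changes = {
    "labels into clustering": (107_108, 96_750),
    "single-user clusters": (86_008, 78_452),
    "single-user clustering time (min)": (19.2, 17.9),
    "multi-user clustered attributes": (74_802, 68_020),
    "multi-user clustering time (min)": (22.9, 21.2),
}

for name, (before, after) in changes.items():
    print(f"{name}: {percent_reduction(before, after):.1f}% lower")
```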

@misaugstad misaugstad self-assigned this Aug 25, 2022
@misaugstad misaugstad merged commit aacc85a into develop Aug 25, 2022
@misaugstad misaugstad deleted the 3003-api-remove-incorrect-labels branch August 25, 2022 02:50
@misaugstad misaugstad mentioned this pull request Aug 27, 2022
Development

Successfully merging this pull request may close these issues.

Remove labels validated as "incorrect" from our API