-
Notifications
You must be signed in to change notification settings - Fork 95
-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Non-Federal Data Harvesting #4870
Comments
We have a non-federal schema, but the JSON Schema version is super old. @rshewitt went through the process of upgrading the federal-v1.1 version and it is in the datagov-harvester here. I examined the differences between the federal and non-federal on the old version; they mostly consist of allowing REDACTED in the federal version, and some namespace convention changes that are mostly unnecessary. |
We need to enhance the current data harvesting system to support non-federal data sources. This involves updating the harvest_source table to include additional fields that differentiate between federal and non-federal data sources, or alternatively, using the existing schema_type field to manage this distinction.
How to reproduce
When harvesting non-federal data sources, such as NYC Data.json, validation errors occur, preventing all records from being harvested.
Expected behavior
The bureauCode field should not be validated when processing non-federal data.
Actual behavior
validation error:
<ValidationError: "'bureauCode' is a required property">
Sketch
The text was updated successfully, but these errors were encountered: