-
Notifications
You must be signed in to change notification settings - Fork 178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lack of understandable documentation for Custom Data Validation #246
Comments
@sachn1, Referencing to the examples for better understanding of custom data validation below.
PR #241 already addresses the broken links, I will bump this up internally to make sure the docs are up to date. |
@singhniraj08 Thank you! I managed to get the custom validations to work with custom configurations thanks to your help. In the meantime, I figured out how I can fully use Python API to do the same.
Just in case anyone is interested! I still have a few questions though!
Unfortunately I didn't manage to go through it, but I can certainly open another ticket if I have problems with it in the future. |
#144 (comment) talks about how we can update/edit schema using the standard protocol-buffer API. It is strongly advised to review the inferred schema and refine it as needed. TFDV also provides a few utility methods to make these updates easier. A short example of updating schema is shown in [tutorial]. Once you have updated your schema, you can apply same process for custom validation.
Thank you! |
Thanks again for the quick response.
I think you misunderstood my question. I am aware of editing/updating the schema from the docs. But inorder to validate something, tfdv first needs to have the statistics generated, right? My question is, could we create custom statistics other than the one automatically created by tfdv using
So basically |
@sachn1, tfdv.StatsOptions provides options for generating statistics by passing it in
|
Thanks for clarifying. Closing the issue as the main objective of the raising the issue is answered. |
URL(s) with the issue:
Description of issue (what needs changing):
Clear description:
custom_validation_config.proto
. Need more understanding about:custom_validation_config
argument intfdv.validate_statistics()
? I saw in some places that the protobuf file has to be converted to a python file and then import ValidationConfig from it. But saw no documentation related to it intensorflow
/tfdv
/tfx
.custom_validation_config
but no documentation how a custom config can be loaded and passed as an arg.tensorflow-data-validation
andtfx.ExampleValidator
?Correct links
The Link to
custom_validate_statistics
in https://www.tensorflow.org/tfx/data_validation/custom_data_validation is incorrect - Gets a404 - page not found
error.The text was updated successfully, but these errors were encountered: