-
Notifications
You must be signed in to change notification settings - Fork 135
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cvat_format/extractor type casting value to float, causing issues #395
Comments
@kirill-sizov , could you please look at the issue? |
@nmanovic Yes, sure |
Datumaro does not have strong attribute data model yet, where value ranges and data types could be described, it should be resolved with #144. So the way is how it is implemented now is our best effort in preserving the attribute data types - logical values, numbers and strings. We tried to have everything as string values, but it didn't work very well.
I think yes, especially in case of CVAT format. It has attribute type information, which can be utilized even without big changes. BTW, could you share any details about your dataset checks? Are you using some custom code for this, which maybe can be shared? Is it possible that Datumaro does it for you with |
We are doing simple checks on user input like string length (we know this attribute should have 7 digits). Which is failing because because of leading zeros being removed. All failed checks are exported in CSV format, which we use as a reference to make changes in CVAT. CVAT has 2 attribute types:
We have not used
We are iterating through each dataset_element->label->attribute to carry out checks.
The above snippet will print an error if the attribute is defined as:
and value is defined as |
@zhiltsov-max , Datumaro should not convert text to a number. |
We have a text type attribute for a bounding box label in CVAT. The values for this attribute are all digits some of which start with zeros. Ex: 0012345, 0123456, 1234567, etc.
We are using datumaro to automatically do some checks on the annotations imported from CVAT. It looks like the extractor for CVAT is trying to typecast all imported numeric attribute values to float. This is causing an issue for us since this removes the leading zeros automatically from the value, changing user input.
Is there any reason behind typecasting the the attribute value to float? Can this typecasting step be deleted. If required a user can always do the typecasting later.
The text was updated successfully, but these errors were encountered: