Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset splitting & negative samples annotation type importer does not work correctly #89

Open
BigQszak opened this issue Feb 21, 2023 · 1 comment

Comments

@BigQszak
Copy link

I have followed provided notebook for dataset splitting with one notable change - instead of downloading the dataset I have imported my own using importer.Import... . I have tested it in a couple of ways:
a. Importing COCO and exporting YOLO
b. Importing YOLO, and exporting YOLO
In each case, I have tested both types of splitters.

  1. Stratified split seems to not work without some value for validation split
  2. They never correctly split the data for both of them - in terms of pure % wise division, the smaller the val or (val & test) part where the poorer the split was. For example, 80,10,10 split for Stratified split resulted in 92,4,4 samples per split.
    Is that because it prioritizes equal class split (I have two) over the number of samples?

As for the negative samples in COCO format - importer throws an error if I try to load a bunch of images with empty annotations - meaning no objects of interest are in those images.
It works for YOLO .txt empty labels file though

@alexheat
Copy link
Contributor

Hello. Can you share the images and sample notebooks that you can created so that I can repro the issue?

All of the the issues that you found related to samples that have no annotations (aka negative samples)? The splitting part was not tested for those scenarios so it is not a total surprise that it doesn't work. But please confirm so I know what to investigate.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants