Image data selection for optimal performance in YOLO #7754
Replies: 1 comment 2 replies
-
If you only want to track the noses close up, and never want to track the nose from far away, then why train with the far away images? However -- what about halfway in between? What if they're walking up to the camera? Or they're leaning back? From that point of view it makes sense to include all the images. To consider: are all the noses your nose? Or very diverse noses from hundreds of people? What if they have glasses on? What if they have "Halloween" style makeup on with lots of different colours? What if they have a "clown" nose? What about all the various skin tones? What about people who turn their head? Have you considered negative samples and other scenarios? For example, you should also have the following pictures:
All of this applies to both yolo and yolo-tiny. And there is absolutely zero reason to use anything other than yolov4-tiny for the scenario you describe.
How so? Are we not talking about a webcam type of image where the face take up a large part of the image? If so, then this does not qualify as "small object" unless you tell me you annotated the two nostrils. If you have 1 class ("nose"), and you are using yolov4-tiny, and you've annotated your images correctly, I doubt you'll even need the full 6000 minimum for max batches. This should be a relatively simple neural network to train, and you should see some decent results relatively quickly into the training. |
Beta Was this translation helpful? Give feedback.
-
I'm training a custom YOLOv4 and YOLOv4 tiny model. The object that I'm trying to track is human nose and have images of it at 2 distances -
Though, the model will be used for detection at closer range,
I had a couple of questions regarding data selection for YOLO, for optimal results:
(Does is it work better if fed more images compared to YOLOv4 regular?, etc.)
Lastly, the object in-focus in this thread is a small object, but are these concepts similar for medium and large size objects too?
(Or there are some other points to keep in mind with those type of detectors)
P.s. - My priority is to understand the first two questions, but please pitch-in if you have anything to contribute to the discussion.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions