Browse > Computer Vision > Object Detection - [Link]
COCO is an image dataset designed to spur object detection research with a focus on detecting objects in context. The annotations include instance segmentations for object belonging to 80 categories, stuff segmentations for 91 categories, keypoint annotations for person instances, and five image captions per image.
COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:
Object segmentation
Recognition in context
Superpixel stuff segmentation
330K images (>200K labeled)
1.5 million object instances
80 object categories
91 stuff categories
5 captions per image
250,000 people with keypoints
- Download the 2017 Train images [118K/18GB]
- Download the 2017 Val images [5K/1GB]
- Download the 2017 Test images [41K/6GB]
- COCO API/PythonAPI
The goal of this challenge is to recognize objects from a number of visual object classes in realistic scenes (i.e. not pre-segmented objects). It is fundamentally a supervised learning learning problem in that a training set of labelled images is provided. The twenty object classes that have been selected are:
Statistics
20 classes:
Person: person
Animal: bird, cat, cow, dog, horse, sheep
Vehicle: aeroplane, bicycle, boat, bus, car, motorbike, train
Indoor: bottle, chair, dining table, potted plant, sofa, tv/monitor
Train/validation/test: 9,963 images containing 24,640 annotated objects.
New developments
Number of classes increased from 10 to 20
Segmentation taster introduced
Person layout taster introduced
Truncation flag added to annotations
Evaluation measure for the classification challenge changed to Average Precision. Previously it had been ROC-AUC.
The annotated test data for the VOC challenge 2007 is now available:
- Download the training/validation data (450MB tar file)
- Download the development kit code and documentation (250KB tar file)
- Download the PDF documentation (120KB PDF)
- View the guidelines used for annotating the database
- Download the annotated test data (430MB tar file)
- Download the annotation only (12MB tar file, no images)
The development kit is now available:
- Download the training/validation data (2GB tar file)
- Download the development kit code and documentation (500KB tar file)
- Download the PDF documentation (500KB PDF)
- Browse the HTML documentation
- View the guidelines used for annotating the database (VOC2011)
- View the action guidelines used for annotating the action task images
ECCV 2018 Joint COCO and Mapillary Recognition - [Link]
The PASCAL Visual Object Classes Challenge 2007 - [Link]
Object detection: speed and accuracy comparison - [Link]