COCO and Pascal VOC

Browse > Computer Vision > Object Detection - [Link]

COCO

Introduction

COCO is an image dataset designed to spur object detection research with a focus on detecting objects in context. The annotations include instance segmentations for object belonging to 80 categories, stuff segmentations for 91 categories, keypoint annotations for person instances, and five image captions per image.

COCO is a large-scale object detection, segmentation, and captioning dataset. COCO has several features:

Object segmentation
Recognition in context
Superpixel stuff segmentation
330K images (>200K labeled)
1.5 million object instances
80 object categories
91 stuff categories
5 captions per image
250,000 people with keypoints

COCO2017 Data

Download the 2017 Train images [118K/18GB]
Download the 2017 Val images [5K/1GB]
Download the 2017 Test images [41K/6GB]
COCO API/PythonAPI

VOC2007

Introduction

The goal of this challenge is to recognize objects from a number of visual object classes in realistic scenes (i.e. not pre-segmented objects). It is fundamentally a supervised learning learning problem in that a training set of labelled images is provided. The twenty object classes that have been selected are:

Statistics
20 classes:
Person: person
Animal: bird, cat, cow, dog, horse, sheep
Vehicle: aeroplane, bicycle, boat, bus, car, motorbike, train
Indoor: bottle, chair, dining table, potted plant, sofa, tv/monitor
Train/validation/test: 9,963 images containing 24,640 annotated objects.
New developments 
Number of classes increased from 10 to 20
Segmentation taster introduced
Person layout taster introduced
Truncation flag added to annotations
Evaluation measure for the classification challenge changed to Average Precision. Previously it had been ROC-AUC.

VOC2007 Data

The annotated test data for the VOC challenge 2007 is now available:

Download the training/validation data (450MB tar file)
Download the development kit code and documentation (250KB tar file)
Download the PDF documentation (120KB PDF)
View the guidelines used for annotating the database
Download the annotated test data (430MB tar file)
Download the annotation only (12MB tar file, no images)

VOC2012 Data

The development kit is now available:

Download the training/validation data (2GB tar file)
Download the development kit code and documentation (500KB tar file)
Download the PDF documentation (500KB PDF)
Browse the HTML documentation
View the guidelines used for annotating the database (VOC2011)
View the action guidelines used for annotating the action task images

References

ECCV 2018 Joint COCO and Mapillary Recognition - [Link]

The PASCAL Visual Object Classes Challenge 2007 - [Link]

Object detection: speed and accuracy comparison - [Link]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

COCO and Pascal VOC.md

COCO and Pascal VOC.md

COCO and Pascal VOC

Browse > Computer Vision > Object Detection - [Link]

COCO

Introduction

COCO2017 Data

VOC2007

Introduction

VOC2007 Data

VOC2012 Data

References

Files

COCO and Pascal VOC.md

Latest commit

History

COCO and Pascal VOC.md

File metadata and controls

COCO and Pascal VOC

Browse > Computer Vision > Object Detection - [Link]

COCO

Introduction

COCO2017 Data

VOC2007

Introduction

VOC2007 Data

VOC2012 Data

References