Easily import your videos with annotations in the Supervisely format. The Supervisely json-based annotation format supports such figures: rectangle
, line (polyline)
, polygon
, point
, bitmap
(mask
), graph
(keypoints
). It is a universal format that supports various types of annotations and is used in the Supervisely platform.
All information about the Supervisely JSON format can be found here
Supported video formats: .avi
, .mp4
, .3gp
, .flv
, .webm
, .wmv
, .mov
, and .mkv
With annotations: yes
Supported annotation format: .json
.
Data structure: Information is provided below.
Example data: download ⬇️.
Both directory and archive are supported.
Recommended directory structure:
📦input_folder
┣ 📂dataset_name_01
┃ ┣ 📂ann
┃ ┃ ┣ 📄vid_0748.mp4.json
┃ ┃ ┗ 📄vid_8144.mp4.json
┃ ┣ 📂video
┃ ┃ ┣ 🎬vid_0748.mp4
┃ ┃ ┗ 🎬vid_8144.mp4
┃ ┗ 📂meta (optional)
┃ ┣ 📄vid_0748.mp4.json
┃ ┗ 📄vid_8144.mp4.json
┣ 📄meta.json
┗ 📄key_id_map.json file (optional)
Struggled with the structure? No worries!
All videos will be uploaded to a single dataset, so you don't have to worry about the full project structure in Supervisely format. All you need is to prepare videos with annotations and meta.json
file (recommended).
Items even can be placed in any subdirectories or the root directory. Just make sure that the annotation file names match the video file names (e.g. annotaion file video_1.jpg.json
is for the video video_1.jpg
) and that the annotation file format is correct (we will provide an example in the next section). The application will do the rest.
Project meta file meta.json
is recommended to be present in the project directory. It contains classes and tags definitions for the project. If it is not present, it will try to create it from the annotations. Learn more about the meta.json
file here.
For each video, we store the annotations in a separate json
file named video_name.video_format.json
with the following file structure:
{
"size": {
"height": 1080,
"width": 1920
},
"description": "",
"key": "c8168b43ae1b45c38930f456df9d0f2b",
"tags": [],
"objects": [
{
"key": "198f727d40c749eebcacc4aed299b39a",
"classTitle": "rect",
"tags": [],
"labelerLogin": "alexxx",
"updatedAt": "2020-08-23T12:06:11.963Z",
"createdAt": "2020-08-23T12:06:11.963Z"
}
],
"frames": [
{
"index": 0,
"figures": [
{
"key": "65f21690780e43b49863c3cbd07eab3a",
"objectKey": "198f727d40c749eebcacc4aed299b39a",
"geometryType": "rectangle",
"geometry": {
"points": {
"exterior": [
[266, 420],
[847, 845]
],
"interior": []
}
},
"labelerLogin": "alexxx",
"updatedAt": "2020-08-23T12:06:13.544Z",
"createdAt": "2020-08-23T12:06:13.544Z"
}
]
}
],
"framesCount": 375
}
Fields definitions:
size
- string - is equal to image(frame) sizedescription
- string - (optional) - this field is used to store the text we want to assign to the video. In the labeling intrface it corresponds to the 'data' filed.tags
- list of strings that will be interpreted as video tagskey
- string, unique key for a given video (used in key_id_map.json to get the video ID)objects
- list of objects on the videoframes
- list of frames of which the video consists. List contains only frames with an object from the 'objects' fieldindex
- integer - number of the current framefigures
- integer - list of objects which the current frame contains
framesCount
- integer - total number of frames in the videoobjectKey
- string - unique key for a given object (used in key_id_map.json)labelerLogin
- string - the name of a user who created the current figuregeometryType
- "cuboid_3d" - class shapegeometry
- a dictionary containing indicators of location, rotation and dimensions of cuboids
Fields definitions for objects field:
key
- string, a unique key for the given object (used in key_id_map.json to get the object ID)classTitle
- string - the title of a class. It's used to identify the class shape from themeta.json
filetags
- list of strings that will be interpreted as object tagslabelerLogin
- string - the name of the user that added this figure to the project
Fields description for figures field:
key
- string, a unique key for the given figure (used in key_id_map.json to get the figure ID)objectKey
- string, a unique key for the given object (used in key_id_map.json to get the object ID).geometryType
- "rectangle" -class shapegeometry
- geometry of the objectclassTitle
- string - the title of a class. It's used to identify the class shape from themeta.json
filelabelerLogin
- string - the name of the user that added this figure to the current frame
Key_id_map.json file is optional. It is created when annotating the video inside Supervisely interface and sets the correspondence between the unique identifiers of the video, object and the frame on which the object is located. If you annotate manually, you do not need to create this file. This will not affect the work being done.
Json format of key_id_map.json:
{
"tags": {},
"objects": {
"198f727d40c749eebcacc4aed299b39a": 20520
},
"figures": {
"65f21690780e43b49863c3cbd07eab3a": 503130811
},
"videos": {
"c8168b43ae1b45c38930f456df9d0f2b": 157876296
}
}
Fields definitions:
objects
- dictionary, where the key is a unique string, generated inside Supervisely environment to set correspondence of current object in annotation, and values are unique integer ID corresponding to the current objectfigures
- dictionary, where the key is a unique string, generated inside Supervisely environment to set correspondence of object on current frame in annotation, and values are unique integer ID corresponding to the current framevideos
- dictionary, where the key is unique string, generated inside Supervisely environment to set correspondence of video in annotation, and value is a unique integer ID corresponding to the current videotags
- dictionary, where the keys are unique strings, generated inside Supervisely environment to set correspondence of tag on current frame in annotation, and values are a unique integer ID corresponding to the current tag