Data Commander is a user-friendly tool designed to simplify data organization. It helps you efficiently manage your projects, datasets, and items through an intuitive interface, allowing for easy movement, copying, and processing of data.
Data Commander as a Service offers two main functionalities:
-
Move/Copy: Effortlessly move or copy projects, datasets, and items across different hierarchy levels, while preserving the structure and annotations. Destination project definitions are updated automatically.
-
Transfer: All annotated entities from
completed
Labeling Jobs with anaccepted
status are included in the transfer. During the process, entities are copied to the destination, and a new backup version of the destination project is created. The hierarchy is maintained where possible, making it easy to manage your data across various structures. The destination project definitions are also updated to reflect the changes.
- A separate interface in the style of a commander.
- Additional action settings that expand data interaction capabilities.
- Hotkeys similar to Norton or other similar commanders.
- A dedicated taskbar - everything is here and now.
- Data structure is preserved.
- Familiar Supervisely interface.
- Works with data: projects, datasets, and items.
- Simple copy and move actions without lengthy decision-making:
- Data and their annotations move together.
- If entities have matching names, they are not moved to avoid conflicts. For example, if an image with the same name already exists, neither it nor its annotation in the source project will overwrite the entity in the destination project.
- Data structure is preserved.
- Familiar Supervisely interface.
- Copies items and their annotations from one location to another while maintaining structure.
- Copying can be initiated from: projects, datasets, labeling jobs, and queues.
- Copies only items that meet the following conditions:
- Were annotated within a Labeling Job, and these jobs are marked as completed.
- Were marked as accepted during review.
- Have no remaining labeling jobs with this items in an unfinished status.
- Automatically resolves conflicts by renaming conflicting objects with a sequential index.
- Automatically creates a backup version for the destination project, allowing you to create a copy of the project at any time with the same state it had after the transfer and run experiments on the exact same data.
Сurrently only for images
Hierarchy of Transfers and Result:
-
When initiating the process from the following instances, the system searches for related Labeling Jobs and exports items based on their results:
- Project → Workspace → A new project is created, preserving the structure.
- Project → Project → A dataset with the source project's name is created in the destination project, preserving nesting.
- Project → Dataset → Similar to the previous point, but the dataset is nested inside the destination dataset.
- Dataset → Project → A top-level dataset with the same name is created, containing only the items from that dataset (excluding nested datasets).
- Dataset → Dataset → Similar to the previous point, but the dataset is nested inside the destination dataset.
-
When initiating the process from the following instances, processing is limited to the specified Labeling Jobs, and items are exported based on their results:
- Job follows the same logic as Dataset. This means that if some items in an unrelated job have already been annotated and are in the required status, they will not be included.
- Queue follows the same logic as Job, but considers all its jobs as the only recognized completed ones.
1. Go to the Data Commander
2. Navigate through the structures and select items
3. Click on the buttons to perform actions and select additional settings
4. Track the task progress in the progress bar
1. In the Projects
management section, select the projects, datasets or items and tick the required checkboxes.
2. A new button will appear in the header: With N selected
.
3. Choose the Move selected
action or Copy selected
4. A modal window will appear, allowing you to select the destination for the items.
You can create a new project or dataset during this process.
Info tip will help you to understand the process. Select the desired destination and click the Move
or Copy
button.
5. You will be redirected to the Tasks page to track the task progress.
1. In the Projects
management section, select the projects or datasets and tick the required checkboxes.
2. A new button will appear in the header: With N selected
.
3. Choose the Transfer annotated items
action.
4. A modal window will appear, allowing you to select the destination for the items.
You can create a new project or dataset during this process.
Select the desired destination and click the Transfer
button.
The image shows that it is not possible to move annotated items to the workspace level, and the Transfer
button is disabled. To see the available options, check the tooltip. For more details, follow the link from the tooltip.
5. Once you select the correct directory and click the Transfer button, you will be redirected to the Tasks page to track the task progress.
Transferring Process Explained
Once the processing of datasets selected in Step 1 begins, the system will search for related Labeling Jobs, and items marked as accepted
only in completed
jobs will be transferred.
In our case, we have job #1
, which is completed, and 30 items have been marked as accepted
. However, there is also a second job #2
for the same dataset, which is still in the on review
stage. Since both jobs contain the same set of items, no items will be transferred for this dataset until job #2
is completed
, even though items in job #1
are already accepted
.
Once job #2
is completed
and these items are no longer associated with any other job, a repeat transferring operation will export all items marked as accepted
across all completed
jobs for this dataset, even if some of items were marked as rejected
in other completed jobs.
As a result, we will obtain annotated items with approved annotations. 40 items in the result indicate that, out of 13 accepted items from job #2
, 10 were unique, while 3 were also among the accepted items in job #1
.
Additionally, since the example project is an image-based project, a backup version is automatically created when new images are transferred into it.