Skip to content
/ crab Public

A FOSS stack for ML dataset management and annotation

License

Notifications You must be signed in to change notification settings

NOC-OI/crab

Repository files navigation

Centralised Repository for Annotations and BLOBs

About

Note

This repository is still in the testing phase. Some features may be broken or unstable.

CRAB is a FOSS stack for ML dataset management and annotation. It includes preconfigured profiles for importing IFCB and LISST-Holo data. If you have another data format you would like added, please open an issue on GitHub or send me an email. CRAB manages ingesting data, providing all files in a uniform TIFF format on an S3 compatible object store. Metadata is searchable using queries to the integrated CouchDB server, allowing you to easily access data stored on the platform.

Prerequisites

CRAB requires Docker Compose to run, but otherwise comes with everything you need for a minimal install out of the box.

Installation

For a local install, run flask/build.sh first. Then run generate-env.sh to auto-generate a .env file. If you're using externally managed S3 compatible storage, edit the .env file now and skip running init-db.sh. You'll need to configure databases manually. Run init-db.sh to prepare your databases automatically. Finally, run launch.sh to start your instance. CRAB should now be avaliable on http://localhost:8080

Documentation

For futher notes on installation, using the software, and connecting to the CRAB API, head to the docs folder. These markdown documents are also avaliable on an installed instance under the "Help" tab.

License

CRAB is free software under the AGPL-3.0 license. The CRAB Logo (© 2024, Alex Baldwin) is licensed under CC BY-SA 4.0.

About

A FOSS stack for ML dataset management and annotation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published