DingoDB is a distributed multi-modal vector database. It combines the features of a data lake and a vector database, allowing for the storage of any type of data (key-value, PDF, audio, video, etc.) regardless of its size. Utilizing DingoDB, you can construct your own Vector Ocean (the next-generation data architecture following data warehouse and data lake, as introduced by DataCanvas). This enables the analysis of both structured and unstructured data through a singular SQL with exceptionally low latency in real time.
Welcome to visit DingoDB. The documentation of DingoDB is located on the website: https://dingodb.readthedocs.io. The main projects about DingoDB are as follows:
- DingoDB: A Unified SQL Engine to parse and compute for both structured and unstructured data.
- Dingo-Store: A strongly consistent distributed storage system based on the Raft protocol.
- Dingo-Deploy: The deployment project of compute nodes and storage nodes.
- Provides comprehensive data storage solutions, accommodating a wide range of data types including but not limited to embeddings, audio files, text, videos, images, PDFs, and annotations.
- Facilitates efficient querying and vector searching with minimal latency using a singular SQL approach.
- Employs a hybrid search mechanism that caters to both structured and unstructured data, supporting operations like metadata querying and vector querying.
- Possesses the ability to dynamically ingest data and construct corresponding indexes in real time, promoting operational efficiency.
- MySQL Compatibility Built upon the acclaimed Apache Calcite SQL engine, DingoDB is capable of parsing, optimizing, and executing standard SQL statements, and can handle parts of TPC-H and TPC-DS(See TPC) queries. Compliant with MySQL Shell and MySQL-JDBC-Driver Client, it offers seamless integration with web services, BI tools, and more.
- Supports High Frequency Write Operations: With the use of RAFT and the log-structured key-value storage RocksDB., DingoDB can handle high-frequency INSERT, UPDATE, DELETE, and short-QUERY operations while maintaining strong data consistency.
- Facilitates Point Queries and Multi-dimensional Analysis Simultaneously: DingoDB can push down expressions to accelerate queries and quickly carry out multi-dimensional analysis with low latency.
- Distributed Storage Capabilities As a distributed storage engine, DingoDB has the capacity to store vast amounts of data. It allows for easy horizontal scaling operations on clusters as data scale increases.
- High Data Reliability and Recovery: Designed based on Raft, DingoDB provides a multi-replicated management mechanism, ensuring extraordinarily high data reliability. It can maintain high data consistency even in the event of disk or machine failures and offers a swift automatic recovery mechanism.
The documentation of DingoDB is located on the website: https://dingodb.readthedocs.io
or in the docs/
directory of the source code.
DingoDB is Sponsored by DataCanvas, a new platform to do data science and data process in real-time.