LumoSQL is a combination of two embedded data storage C language libraries: SQLite and LMDB. LumoSQL is an updated version of Howard Chu's 2013 proof of concept combining the codebases. Howard's LMDB library has become an ubiquitous replacement for bdb on the basis of performance, reliability, and license so the 2013 claims of it greatly increasing the performance of SQLite seemed credible. D Richard Hipp's SQLite is used in thousands of software projects, and since three of them are Google's Android, Mozilla's Firefox and Apple's iOS, an improved version of SQLite will benefit billions of people.
The original 2013 code changes btree.c
in SQLite 3.7.17 to use LMDB 0.9.9 . It
takes some work to replicate the original results because not only has much
changed since, but as a proof of concept there was no project established to
package code or make it accessible, and plenty of barriers to trying it out.
In January 2020 we established:
- Howard's 2013 performance work is reproducible
- SQLite's key-value store improved in performance since 2013, getting close to parity with LMDB by some measures
- SQLite on-disk corruption seems to be reported by more users than LMDB on-disk corruption, and there seem to be logical reasons for that
- SQLite can be readily modified to have multiple storage backends and still pass all of its own tests
- SQLite has a lack of error handling since it doesn't expect there to be multiple backends, and this is a bug
In addition there are several disclaimers made by the SQLite project including not being intended for high concurrency. Since high concurrency is a design feature of LMDB and a common use case for SQLite, this is important for LumoSQL.
LumoSQL was started in December 2019 by Dan Shearer, who did the original source tree archaeology, patching and test builds. Keith Maxwell joined shortly after and contributed version management to the Makefile and the benchmarking tools.
The goal of the LumoSQL Project is to create and maintain an improved version of SQLite.
LumoSQL is supported by the NLNet Foundation.
If you are interesting in contributing to LumoSQL please see </CONTRIBUTING.md>.
LumoSQL provides a Makefile and benchmarking subsystem which:
- Re-creates many combinations of SQLite and LMDB trees and versions, as specified by the user
- Creates a testing matrix of versions and results. These can be limited by the user because a full suite takes hours to run.
In the process, we noticed things that need to be fixed:
- Regressions. Some SQLite functionality doesn't work out of the box with the 2013 code, even some basic things like ".schema" . This fails silently, which is an SQLite bug.
- Close coupling in SQLite. Higher levels of SQLite makes assumptions about how the btree code is implemented, e.g. knowledge about page sizes.
- Close coupling in the port, e.g. the 2013
btree.c
#includes
LMDB'smdb.c
(notmdb.h
) and directly modifies the internal LMDB structMDB_page
. This was fixed by using LMDB as a separate library. - Test suite. Even the speed test portion of the SQLite test suite only partly works. This was fixed by using LMDB as a separate library. Anyone interested can see this in git history, but basically this is completely gone.
- Bit-rot. The upstream trees stopped working together in 2015. This was fixed by using LMDB as a separate library.
- Regressions, test failures, opportunities and more are tracked as issues
- Short term planning and progress reporting is tracked on a project board
- The
master
branch is the currently completed work, this should build on supported systems and pass the relevant tests (see below). - Development typically happens in branches beginning with
feature/
. - For further details see
./doc/using-git-and-github.md
In order to build LumoSQL and SQLite and to used different versions of the LMDB library, we use the following directory layout:
.
├── bld-LMDB_?.?.? Build artifacts for LumoSQL (src and src-lmdb)
├── bld-SQLite-?.?.? Build artifacts for sqlite (src-sqlite)
├── LICENSES License files, in line with https://reuse.software/spec/
├── lmdb-backend C source code to use SQLite with an LMDB backend
├── src-lmdb Clone of LMDB source code
├── src-sqlite Clone of sqlite.org git mirror
└── tool Cut down version of speedtest.tcl
On Ubuntu 18.0.4 LTS, Debian Stable (buster), and on any reasonably recent Debian or Ubuntu-derived distribution, you need only:
sudo apt install git build-essential tcl
sudo apt build-dep sqlite3
(apt build-dep
requires deb-src
lines uncommented in /etc/apt/sources.list).
On Fedora 30, and on any reasonably recent Fedora-derived distribution:
sudo dnf install --assumeyes \
git make gcc ncurses-devel readline-devel glibc-devel autoconf tcl-devel
The maintainers test building LumoSQL on Debian, Fedora, Gentoo and Ubuntu. Container images with the dependencies installed are available at https://quay.io/repository/keith_maxwell/lumosql-build and the build steps are in https://github.com/maxwell-k/containers.
Start with a clone of this repository as the current directory.
To build either (a) specific versions of SQLite or (b) sqlightning using different versions of LMDB, use commands like those below changing the version numbers to suit. A list of tested version numbers is in the table below.
make bld-SQLite-3.7.17
make bld-LMDB_0.9.9
To benchmark a single binary takes approximately 4 minutes to complete depending on hardware.
The instructions in this section explain how to benchmark four different versions:
V. | SQLite | LMDB | Repository | Report filename |
---|---|---|---|---|
A. | 3.7.17 | - | SQLite | SQLite-3.7.17.html |
B. | 3.30.1 | - | SQLite | SQLite-3.30.1.html |
C. | 3.7.17 | 0.9.9 | LumoSQL | LMDB_0.9.9.html |
D. | 3.7.17 | 0.9.16 | LumoSQL | LMDB_0.9.16.html |
To benchmark the four versions above use:
make benchmark
The "Repository" column means:
- SQLite
- LumoSQL
-
https://github.com/LumoSQL/LumoSQL (this repository)
mc_orig
was removed and mc_backup
added to mdb.c
in
https://github.com/LMDB/lmdb/commit/be47ca766713f55e5b3abd18120514fdad7d90f2
first released in LMDB_0.9.7
on 14 August 2013. LMDB_0.9.8
was 9 September
2013 and LMDB_0.9.9
was 24 October 2013.
https://github.com/LMDB/sqlightning/commit/58b473f3d5570fca94b88398e0e4314208a077cd
adapted sqlightning
to this change on 12 September 2013. So first try
LMDB_0.9.8
, but this fails with:
sqlite3.c:38156:2: error: unknown type name ‘mdb_hash_t’
.
Likely need this commit, found through a GitHub comparison.
Tag | Date | Compiles | Speed test | Files | Ins. | De. |
---|---|---|---|---|---|---|
LMDB_0.9.8 | 2013-09-09 | ✗ | - | - | - | - |
LMDB_0.9.9 | 2013-10-24 | ✓ | ✓ | 6 | 577 | 540 |
LMDB_0.9.10 | 2013-11-12 | ✓ | ✓ | 5 | 216 | 121 |
LMDB_0.9.11 | 2014-01-15 | ✓ | ✓ | 6 | 443 | 273 |
LMDB_0.9.12 | 2014-06-18 | ✓ | ✓ | 12 | 516 | 333 |
LMDB_0.9.13 | 2014-06-18 | ✓ | ✓ | 3 | 28 | 22 |
LMDB_0.9.14 | 2014-09-20 | ✓ | ✓ | 23 | 2331 | 441 |
LMDB_0.9.15 | 2015-06-19 | ✓ | ✓ | 24 | 388 | 187 |
LMDB_0.9.16 | 2015-08-14 | ✓ | ✓ | 5 | 44 | 19 |
LMDB_0.9.17 | 2015-11-30 | ✓ | ✗ | 10 | 1072 | 565 |
LMDB_0.9.18 | 2016-02-05 | ✓ | ✗ | 24 | 303 | 57 |
LMDB_0.9.19 | 2016-12-28 | ✓ | ✗ | 6 | 684 | 447 |
LMDB_0.9.21 | 2017-06-01 | ✓ | ✗ | 23 | 81 | 50 |
LMDB_0.9.22 | 2018-03-22 | ✓ | ✗ | 23 | 74 | 58 |
LMDB_0.9.23 | 2018-12-19 | ✓ | ✗ | 4 | 52 | 9 |
LMDB_0.9.24 | 2019-07-19 | ✓ | ✗ | 6 | 16 | 11 |
The GitHub LMDB mirror does not include
a release LMDB_0.9.20
, releases before 0.9.8 are not shown.
- Compiles
- ✓ means the process documented above completes successfully.
- Speed test
- ✓ means the cut down version of speed test passes in `./tool/speedtest.tcl` passes.
- Files
- The number of files changed between the previous release and this one, as
reported by
git diff --shortstat
. - Ins.
- The number of insertions as for the "Files" column.
- De.
- The number of deletions as for the "Files" column.
A ? means that this has not been tested, and a - means that it is not applicable at present.
- The Fedora Spec file for "sqlite3" lists dependencies.
- The documentation linking to the official SQLite GitHub mirror
- "sqlightning" repository
- Early benchmarking by Howard Chu of https://pastebin.com/B5SfEieL of 3.7.17
- Benchmarking https://github.com/google/leveldb/blob/master/benchmarks/db_bench_sqlite3.cc