NATS KV Corruption

This repository contains a minimal reproducible example of a NATS JetStream KeyValue corruption error.

JetStream sometimes fails to recover automatically after a forced termination. The original error was found after an OOM kill. Restarting the corrupted instance does not fix the issue; only removing the persistent volume and then restarting the affected instance does.

Dependencies

docker and bash.

Reproduction steps

Execute the bash script run.sh¹

Note: The run.sh script has a non-deterministic runtime. The average measured runtime of 5 executions was ~1m30s.

Repro explanation

First, docker compose up deploys a three-node JetStream cluster ($N=3$) and creates a KV bucket with three replicas ($R=3$).

This reproduction consists of two loops: the Producer loop and the Killer loop. The Producer loop removes (nats kv del) and writes (nats kv update or nats kv create) to random keys. The Killer loop kills the current leader of the stream with SIGKILL and restarts it after a short delay.
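As a rough illustration of the two loops, one iteration of each could look like the sketch below. This is not the code from run.sh. The bucket name, key space size, restart delay, the use of jq, and the assumption that each container is named after its NATS server name are all hypothetical. The sketch also assumes the nats CLI is available on the host.

```bash
#!/usr/bin/env bash
# Sketch only: every name and default below is an assumption, not taken from run.sh.
BUCKET="${BUCKET:-demo}"              # hypothetical bucket name
KEY_COUNT="${KEY_COUNT:-10}"          # hypothetical number of distinct keys
RESTART_DELAY="${RESTART_DELAY:-2}"   # hypothetical delay in seconds

# Producer: delete one random key, then try to create another random key.
# The create fails if the key already exists; nats kv update covers that case.
producer_step() {
  nats kv del "$BUCKET" "key-$((RANDOM % KEY_COUNT))" --force
  nats kv create "$BUCKET" "key-$((RANDOM % KEY_COUNT))" "value-$RANDOM"
}

# Print the server name of the current leader of the bucket's backing stream.
# A KV bucket named X is stored in the stream KV_X.
current_leader() {
  nats stream info "KV_${BUCKET}" --json | jq -r '.cluster.leader'
}

# Killer: SIGKILL the leader, wait, then start it again.
# Assumes the container name equals the NATS server name.
killer_step() {
  local leader
  leader="$(current_leader)" || return 1
  [ -n "$leader" ] && [ "$leader" != "null" ] || return 1
  docker kill --signal=SIGKILL "$leader"
  sleep "$RESTART_DELAY"
  docker start "$leader"
}
```

Each function would be called from its own background loop, for example `while true; do killer_step; done &`, so that writes and leader kills overlap.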

The Producer loop eventually generates wrong last sequence errors when executing nats kv update calls. After observing SEQ_ERR_COUNT_TARGET errors of this nature, both loops are halted. Then, the script checks if the KeyValue bucket is in an inconsistent state.
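The error counting described above could be sketched as follows. The function names, the counter variable, and the default target value are hypothetical. The only detail taken from this document is that the failing nats kv update calls report a wrong last sequence error.

```bash
#!/usr/bin/env bash
# Sketch only: names and the default target are assumptions, not taken from run.sh.
SEQ_ERR_COUNT_TARGET="${SEQ_ERR_COUNT_TARGET:-5}"
seq_err_count=0

# Run one update guarded by the expected revision and record whether it
# failed with a "wrong last sequence" error.
# Usage: try_update <bucket> <key> <value> <revision>
try_update() {
  local output
  if output="$(nats kv update "$1" "$2" "$3" "$4" 2>&1)"; then
    return 0
  fi
  case "$output" in
    *"wrong last sequence"*) seq_err_count=$((seq_err_count + 1)) ;;
  esac
  return 1
}

# Succeeds once enough sequence errors have been seen to halt both loops.
target_reached() {
  [ "$seq_err_count" -ge "$SEQ_ERR_COUNT_TARGET" ]
}
```

Other failures, such as timeouts while the leader is down, make try_update return non-zero without touching the counter, so only sequence errors count towards the target.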

An inconsistent state is observed by counting the number of keys returned by nats kv ls. Repeated executions give different results when the KV has been corrupted.
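A minimal sketch of such a check is shown below. It assumes that nats kv ls prints one key per line. The function names and the default number of attempts are hypothetical and not taken from run.sh.

```bash
#!/usr/bin/env bash
# Sketch only: detect an inconsistent bucket by listing its keys several times.

# Print the number of non-empty lines returned by nats kv ls.
# "|| true" keeps the function from failing when grep counts zero lines.
count_keys() {
  nats kv ls "$1" | grep -c . || true
}

# Succeeds (exit 0) when repeated listings disagree on the key count.
# Usage: is_inconsistent <bucket> [attempts]
is_inconsistent() {
  local bucket="$1" attempts="${2:-10}" first current i
  first="$(count_keys "$bucket")"
  i=1
  while [ "$i" -lt "$attempts" ]; do
    current="$(count_keys "$bucket")"
    [ "$current" = "$first" ] || return 0
    i=$((i + 1))
  done
  return 1
}
```

A non-zero result only means that no disagreement was seen within the given number of attempts, so a larger attempt count lowers the chance of missing a corrupted bucket.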

An example execution log is available in example.log.

¹ The script may occasionally get stuck in a failed state, in which logs with nats: error: nats: bucket not found and [KV] <...> - fail are printed indefinitely. If this happens, stop the execution and try again.


jrovira-kumori/NATS-KV-Corruption
