Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unresolved: NATS orphaned streams deleted after Snapshot corrupt, too short error #6188

Closed
Zetanova opened this issue Nov 29, 2024 · 2 comments
Labels
defect Suspected defect such as a bug or regression

Comments

@Zetanova
Copy link

Observed behavior

The defect of issue still persist.

Version: 2.9.25

I had it now 2 time, after 2 of 3 nodes restarted at the same time all mem and disk streams got "Detected orphaned stream"
including KV streams

What happend:

Two nodes got killed and restarting
survivor node 1 "Healthcheck failed: "JetStream has not established contact with a meta leader"
Node 2 and 3 started again and recovered the streams with messages including KV
"Restored 15 messages for stream 'ACY...XUH2I > KV_myconfig'
followed by "RAFT [yrzKKRBu - meta] Snapshot corrupt, too short" on both restarting nodes
node 2 and 3 cleared the streams in the "JetStream cluster recovering state" and after "Server is ready" with
"Detected orphaned stream 'ACY...XUH2I> KV_myconfig', will cleanup"

Expected behavior

No data loss should occur.

Server and client version

nats official container version: 2.9.25

Host environment

kubernetes

Steps to reproduce

kill 2 of 3 nodes.

@Zetanova Zetanova added the defect Suspected defect such as a bug or regression label Nov 29, 2024
@neilalexander
Copy link
Member

There have been a ton of clustering improvements since 2.9.25, with many improvements already in GA in 2.10.22 and even more due to land in 2.10.23 next week (although release candidates are already available). Can you please upgrade to either the 2.10.22 GA or (preferably) 2.10.23-RC.7 and let us know how things look?

@wallyqs
Copy link
Member

wallyqs commented Nov 29, 2024

This would have been fixed by #5767 in v2.10.19

@wallyqs wallyqs closed this as completed Nov 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
defect Suspected defect such as a bug or regression
Projects
None yet
Development

No branches or pull requests

3 participants