kvserver: make poisoned latch errors less confusing #105798
Comments
cc @cockroachdb/replication
I think this is because of cockroach/pkg/kv/kvserver/batcheval/cmd_end_transaction.go, lines 98 to 103 in 0480fa9.
That seems pretty excessive, and likely causes some additional overhead. I'll try to limit them to the range's spans.
We saw something similar over on #104588, where we reported a poisoned latch on r548 as coming from r1. This was probably because intent resolution on r1 tried to access the abort span of a txn anchored on r548, whose latches were poisoned. I started out by trying to reduce the [...]. I think the better solution here is to improve the error handling of poisoned latches to include the originating range ID where the latch was poisoned, instead of the range where the error is processed. Repurposing this issue.
In #104709 (comment), we see a tripped circuit breaker (the one described in #105797) poisoning a meta2 latch, seemingly outside of the replica's range bounds:
Jira issue: CRDB-29203