-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] AssertionError: pre-6.0 shard copy ___ unexpected to send valid local checkpoint -1 #39914
Comments
Pinging @elastic/es-distributed |
Same thing in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.7+multijob-unix-compatibility/os=centos-7&&immutable/66/console:
is in |
Same thing in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.7+intake/333/console That was built off commit 63587f7 so the problem pre-dates the most recent 6.7 commits. I don't recall seeing it last Friday though, so it may have been introduced last Friday or else was masked by other failures that day. |
I am on it now. This failure relates to #39482. |
Thanks @dnhatn. It makes sense, as the earliest failure I can find is in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.7+intake/329/console, which unfortunately happened just after I finished test triage on Friday. And that was built from a24a1f5. Do you think this is a blocker for 6.7.0? |
@droberts195 Yes, this should be a blocker since it can prevent rolling upgrade from 5.6.x to 6.7. |
I opened #39922. |
If a primary on 6.7 and a replica on 5.6 are running more than 5 minutes (retention leases background sync interval), the retention leases background sync will be triggered, and it will trip 6.7 node due to the illegal checkpoint value. We can fix the problem by making the returned checkpoint depends on the node version. This PR, however, chooses to enforce retention leases require soft deletes, and make retention leases sync noop if soft deletes is disabled instead. Closes #39914
If a primary on 6.7 and a replica on 5.6 are running more than 5 minutes (retention leases background sync interval), the retention leases background sync will be triggered, and it will trip 6.7 node due to the illegal checkpoint value. We can fix the problem by making the returned checkpoint depends on the node version. This PR, however, chooses to enforce retention leases require soft deletes, and make retention leases sync noop if soft deletes is disabled instead. Closes #39914
If a primary on 6.7 and a replica on 5.6 are running more than 5 minutes (retention leases background sync interval), the retention leases background sync will be triggered, and it will trip 6.7 node due to the illegal checkpoint value. We can fix the problem by making the returned checkpoint depends on the node version. This PR, however, chooses to enforce retention leases require soft deletes, and make retention leases sync noop if soft deletes is disabled instead. Closes #39914
If a primary on 6.7 and a replica on 5.6 are running more than 5 minutes (retention leases background sync interval), the retention leases background sync will be triggered, and it will trip 6.7 node due to the illegal checkpoint value. We can fix the problem by making the returned checkpoint depends on the node version. This PR, however, chooses to enforce retention leases require soft deletes, and make retention leases sync noop if soft deletes is disabled instead. Closes #39914
Several tests failed in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.7+multijob-unix-compatibility/os=oraclelinux-7/66/console but the root cause is an assertion failure in one of the nodes in the test cluster:
The full set of rolling upgrade cluster logs for the suite that failed is cluster_logs.zip
(Note: there have been several failures of BWC tests for 5.6 upgrading to 6.7 today. They may or may not all be due to this problem. I will update the issue once I've looked, but I'm raising this issue early in case the problem is a blocker for 6.7.)
The text was updated successfully, but these errors were encountered: