Fix erasure coded reads when a block is missing #86

Merged on Apr 5, 2024 (9 commits)

@Kimahriman (Owner) commented Mar 24, 2024

Resolves #74

We only handled the case where a read from a shard in an erasure coded block failed, not the case where the shard was missing entirely. This cropped up because HDFS considers a file "complete" once a minimum number of shards have been written successfully, not once all shards are available. So when reading immediately after writing, some shards may not have synced to the NameNode yet. The same situation arises when a shard is actually unavailable because a DataNode is down.
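
To illustrate the idea, here is a minimal sketch of shard selection that treats a missing shard the same way as a failed read. It is not the code from this PR: the function names, the `locations` list, and the RS-6-3 layout (6 data shards followed by 3 parity shards) are assumptions made for illustration only.

```python
from typing import FrozenSet, List, Optional, Sequence


class UnrecoverableStripe(Exception):
    """Raised when fewer than `data_units` shards are readable."""


def choose_shards(
    locations: Sequence[Optional[str]],
    data_units: int,
    failed: FrozenSet[int] = frozenset(),
) -> List[int]:
    """Pick the shard indices to read for one erasure coded block.

    `locations[i]` is None when shard i has no known location (hypothetical
    representation of a missing shard). `failed` holds indices whose reads
    already failed. Both cases are excluded in the same way.
    """
    usable = [
        i
        for i, loc in enumerate(locations)
        if loc is not None and i not in failed
    ]
    if len(usable) < data_units:
        raise UnrecoverableStripe(
            f"need {data_units} shards, only {len(usable)} usable"
        )
    # Prefer the lowest indices so data shards are used before parity shards.
    return usable[:data_units]


def needs_decode(chosen: Sequence[int], data_units: int) -> bool:
    """True when any parity shard (index >= data_units) had to be chosen."""
    return any(i >= data_units for i in chosen)
```

With 9 locations where index 2 is `None`, `choose_shards(locations, 6)` falls back to the first parity shard, and `needs_decode` reports that the data must be reconstructed instead of read directly.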

@Kimahriman changed the title from "[WIP] Run just EC tests and show java logs" to "Fix erasure coded reads when a block is missing" on Apr 5, 2024
@Kimahriman Kimahriman merged commit f8183c3 into master Apr 5, 2024
7 checks passed
Development

Successfully merging this pull request may close these issues.

Figure out flakiness with erasure code writing