implement blob batching to improve EH sender throughput #275

Merged
merged 3 commits into from
Jul 6, 2023

sebastianburckhardt
Member

This PR implements a new mechanism for improving Event Hubs (EH) sender throughput.

We previously observed several cases in which the limited throughput of sending data to a single partition became an application bottleneck (#268, #240).

With this new mechanism, whenever a partition sender needs to send a large amount of data (large messages, or many small ones), it does not send the messages directly via EH. Instead, it writes them to a blob batch and then sends a single message containing the blob name. The receiver then loads the data from the blob batch.
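For readers unfamiliar with the pattern, here is a minimal sketch of the idea in Python. It is not the code in this PR: the in-memory `InMemoryBlobStore` and `InMemoryPartition` classes, the `BATCH_THRESHOLD_BYTES` cutoff, and the JSON envelope format are all assumptions made for illustration, and the actual threshold and wire format used by the implementation may differ.

```python
import json
import uuid
from dataclasses import dataclass, field

# Hypothetical cutoff; the PR description does not state the real one.
BATCH_THRESHOLD_BYTES = 1024


@dataclass
class InMemoryBlobStore:
    """Illustrative stand-in for blob storage (not a real storage API)."""
    blobs: dict = field(default_factory=dict)

    def write(self, data: bytes) -> str:
        name = f"batch-{uuid.uuid4().hex}"
        self.blobs[name] = data
        return name

    def read(self, name: str) -> bytes:
        return self.blobs[name]


@dataclass
class InMemoryPartition:
    """Illustrative stand-in for one EH partition: an ordered event list."""
    events: list = field(default_factory=list)

    def send(self, event: bytes) -> None:
        self.events.append(event)


def send_messages(messages, partition, store):
    """Send small payloads inline; divert large ones through a blob batch."""
    total = sum(len(m.encode()) for m in messages)
    if total <= BATCH_THRESHOLD_BYTES:
        for m in messages:
            envelope = {"kind": "inline", "body": m}
            partition.send(json.dumps(envelope).encode())
        return
    # Large payload: write one blob, then send only its name via the partition.
    name = store.write(json.dumps(messages).encode())
    partition.send(json.dumps({"kind": "blob", "name": name}).encode())


def receive_messages(partition, store):
    """Rebuild the original message sequence, loading blob batches on demand."""
    received = []
    for event in partition.events:
        envelope = json.loads(event)
        if envelope["kind"] == "inline":
            received.append(envelope["body"])
        else:
            received.extend(json.loads(store.read(envelope["name"])))
    return received
```

In this sketch, two small messages produce two partition events, while two 600-byte messages produce a single small event that points at one blob. Message order is preserved because the pointer event occupies the position the batch would have had in the partition.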

This may also resolve #272, since it replaces some of the code that was identified as the likely source of the error.
