-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
optimize maintenance rebalance/re-replicate with direct asd-to-asd communication #716
Comments
Quite some impact from maintenance trying to rebalance an asymmetric backend (1 disk extra in 1 node in this setup; but extra empty disks/nodes would have simular behaviour) Throughput from dd in a vm via edge: maintenance off:
maintenance on:
maintenance off:
maintenance on:
Network without maintenance:
network with maintenance:
|
Maybe the rebalancing should not be enabled by default, given the impact on the network (and disks) that gets lost for ingest? Is the time/work done for moving old data around indeed worth the effort? Probably this also depends on the use case and for a constant ingest things might be different than for a bursty one... Maybe the decision when to move data around plus from where to where to move is also something that needs more thoughtful insight (policies used / capacity planning / ...) than the maintenance process itself has? ps/ rebalancing can be turned off via
|
Isn't there a way to limit the impact of rebalancing (lowering its priority) so there still is some rebalancing going on? |
waiting on QA effort. |
Rebalance can be optimized by having the too full asd send the fragment data to the not-yet-full-enough asd directly.
Similarly for repair in case of a replication policy it should be possible to send the fragment data directly between the asds.
The text was updated successfully, but these errors were encountered: