opt: optimize cluster identification #3309

tediou5 · 2024-12-11T08:21:29Z

Second attempt to close #2900

The first commit is meaningless; it simply renames cache_id to piece_cache_id

For the cache, everything is straightforward; it's just a matter of recording the corresponding relationships in the controller. However, things are a litter bit more complicated for the farmer. First, we check the identify message to see if the farmer is newly discovered and whether the fingerprint has changed. Based on the results, we decide whether to use the stream to fetch details.

The final commit is to ensure compatibility with previous approaches.

(I’m really sorry, actually, I finished it a long time ago, but I forgot about it and left it in a corner.)

Code contributor checklist:

I have read, understood and followed contributing guide

tediou5 · 2024-12-16T03:15:01Z

@nazar-pc additionally, during my actual development, this part of the code is not easy to test, and some scenarios are hard to cover(like farm FingerprintUpdated). At the very least, I need to start 3 components: nats, controller, and farmer/cache to do so. I was thinking maybe I could first submit a PR to extract the update logic for caches and farms and cover enough test cases?

nazar-pc

Thanks for contribution and sorry it took this long to read into it. This is certainly the right direction, but it will cause issues for the way maintenance of caches and farms is done (prevent loop with select! from actually looping quickly, which was carefully avoided before).

I only left comments on cache, but similar comments apply to farmer side as well.

I also don't fully understand why the thing that we try to address here was sort of added back at the end, I'm confused.

And please rebase changes after further updates (if any) and squash changes to the same part of the codebase, it'll be easier to review that way.

crates/subspace-farmer/src/cluster/cache.rs

crates/subspace-farmer/src/farm.rs

crates/subspace-farmer/src/cluster/controller/caches.rs

tediou5

Yes, let me adjust the code. Once I've completed my modifications, I'll take care of those annoying merges.

crates/subspace-farmer/src/cluster/controller/caches.rs

teor2345

This looks good to me, once Nazar's comments have been addressed

tediou5 · 2025-01-15T09:18:55Z

No changes were made, just squash changes through rebase. Additionally, I removed ClusterCacheIdentifyPieceCacheBroadcast (also for the Farmer), they were reintroduced in a separate commit, and I simply dropped that commit. Nazar's comments will be addressed in subsequent commits.

tediou5 · 2025-01-20T05:52:54Z

I’ve rearranged the commit order to make squashing easier later.

@nazar-pc I finished the cache implementation (it’s relatively straightforward), so you can review it for any potential issues.

91b3dd4: When a new cache appears, the system will collect the stream in the background and update KnownCaches once it’s done.

Before making changes to the farmer, perhaps I could submit a separate PR to parallelly add or remove farms? It doesn’t look too complex right now (and may even simplify the implementation).

tediou5 · 2025-01-20T10:18:32Z

The farmer's work turned out to be simpler than I imagined, and it's also done.

1704ed5 is refactoring and moving code, with no actual changes.

75ddfe6 is the actual modification, but the logic after refactoring hasn’t changed much—it’s just split into two parts, with no other changes.

teor2345

These all seem fine to me, but Nazar knows this area much better than I do.

tediou5 · 2025-02-20T07:35:37Z

Just rebase #3354

tediou5 requested review from nazar-pc, shamil-gadelshin and rg3l3dr as code owners December 11, 2024 08:21

tediou5 changed the title ~~Tmp/opt/optimize cluster identification~~ opt: optimize cluster identification Dec 11, 2024

nazar-pc requested changes Jan 14, 2025

View reviewed changes

nazar-pc requested review from teor2345 and removed request for shamil-gadelshin January 14, 2025 02:07

tediou5 commented Jan 15, 2025

View reviewed changes

crates/subspace-farmer/src/cluster/controller/caches.rs Outdated Show resolved Hide resolved

crates/subspace-farmer/src/cluster/controller/caches.rs Outdated Show resolved Hide resolved

teor2345 reviewed Jan 15, 2025

View reviewed changes

tediou5 force-pushed the tmp/opt/optimize-cluster-identification branch from b633d33 to beb798c Compare January 15, 2025 09:18

tediou5 force-pushed the tmp/opt/optimize-cluster-identification branch 3 times, most recently from 4246b58 to cda20e5 Compare January 20, 2025 05:49

tediou5 requested a review from nazar-pc January 20, 2025 05:59

tediou5 mentioned this pull request Jan 20, 2025

opt: improve farms maintenance performance via parallelization #3354

Merged

1 task

tediou5 force-pushed the tmp/opt/optimize-cluster-identification branch from cda20e5 to 75b0755 Compare January 20, 2025 10:13

tediou5 requested a review from teor2345 January 20, 2025 10:18

teor2345 reviewed Jan 20, 2025

View reviewed changes

tediou5 added 8 commits February 20, 2025 14:50

chore: rename cache_id => piece_cache_id

60dfeb4

feat: optimize cache identification

fd44a36

opt: collect piece caches details stream request in the background

5a86107

feat: optimize farmer identification

42f5ff0

chore: refactor and moving for cluster::controller::farms

033ef8e

opt: collect farms details stream request in the background

f0ce542

chore: remove unnecessary checks when sending identification broadcast

ac3f2eb

chore: improve comments on cache and farmer's SUBJECT

3078439

tediou5 force-pushed the tmp/opt/optimize-cluster-identification branch from 75b0755 to 3078439 Compare February 20, 2025 07:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

opt: optimize cluster identification #3309

opt: optimize cluster identification #3309

tediou5 commented Dec 11, 2024

tediou5 commented Dec 16, 2024 •

edited

Loading

nazar-pc left a comment •

edited

Loading

tediou5 left a comment

teor2345 left a comment •

edited

Loading

tediou5 commented Jan 15, 2025

tediou5 commented Jan 20, 2025

tediou5 commented Jan 20, 2025 •

edited

Loading

teor2345 left a comment

tediou5 commented Feb 20, 2025

opt: optimize cluster identification #3309

Are you sure you want to change the base?

opt: optimize cluster identification #3309

Conversation

tediou5 commented Dec 11, 2024

Code contributor checklist:

tediou5 commented Dec 16, 2024 • edited Loading

nazar-pc left a comment • edited Loading

Choose a reason for hiding this comment

tediou5 left a comment

Choose a reason for hiding this comment

teor2345 left a comment • edited Loading

Choose a reason for hiding this comment

tediou5 commented Jan 15, 2025

tediou5 commented Jan 20, 2025

tediou5 commented Jan 20, 2025 • edited Loading

teor2345 left a comment

Choose a reason for hiding this comment

tediou5 commented Feb 20, 2025

tediou5 commented Dec 16, 2024 •

edited

Loading

nazar-pc left a comment •

edited

Loading

teor2345 left a comment •

edited

Loading

tediou5 commented Jan 20, 2025 •

edited

Loading