feat(1-1-restore): validates if source and target clusters nodes are equal #4230
base: 1-1-restore-feature-branch
Conversation
Force-pushed 37409c0 to 7e9f877
Force-pushed 022de10 to d7ad144
The validation process looks way cleaner now:)
A few comments.
One is about changing the logic: iterate over target nodes instead of locations. This is the most important one for me.
pkg/service/one2onerestore/model.go (outdated)

```diff
@@ -28,10 +28,17 @@ type nodeMapping struct {
 type node struct {
 	DC   string `json:"dc"`
-	Rack string `json:"rack"`
+	Rack string `json:"rack_id"`
```
It's a rack name, not an ID.
Example: https://github.com/scylladb/scylladb/blob/master/conf/cassandra-rackdc.properties
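For illustration, the linked file uses plain human-readable names; a minimal excerpt (values are made up):

```
# cassandra-rackdc.properties (illustrative values)
dc=dc1
rack=rack1
```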
But now it shows that this PR is actually changing it from rack_id to rack (instead of not touching it at all).
I guess that's because of having multiple PRs rebased on one another.
It only shows changes in pkg/command/one2onerestore, because I've renamed rack_id to rack in the scope of this PR.
pkg/service/one2onerestore/worker.go (outdated)

```go
	logger log.Logger
}

// getManifestsAndHosts checks that each host in target cluster should have an access to at least one target location and fetches manifest info.
```
This comment is a bit confusing.
Each node must have access to the location where its SSTables are stored.
1-1 restore provides a mapping between the source node and the target node.
The location defines the DC. If the DC is empty, then it means (or should mean) that all DCs are there.
See https://manager.docs.scylladb.com/stable/backup#meta
The method must check that all nodes that are expected to restore a particular DC have access to the location which keeps the SSTables of this DC.
The nodes mapping for 1-1 restore defines which node is going to restore data from which DC.
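A minimal sketch of that check, with purely illustrative types and callbacks (not the PR's actual API): it walks the user-provided node mappings and verifies that each target node can reach the location holding its source DC's SSTables.

```go
// Sketch only: names and signatures are illustrative, not the PR's code.
package sketch

import "fmt"

type location struct{ Provider, Bucket string }

type node struct {
	DC     string
	HostID string
}

type nodeMapping struct {
	Source node
	Target node
}

// checkTargetNodesLocationAccess verifies, for every target node, that it can
// access the location holding the SSTables of its mapped source node's DC.
func checkTargetNodesLocationAccess(
	mappings []nodeMapping,
	dcToLocation map[string]location, // which location keeps a given source DC
	hostHasAccess func(targetHostID string, loc location) error, // e.g. a list probe via the agent
) error {
	for _, m := range mappings {
		loc, ok := dcToLocation[m.Source.DC]
		if !ok {
			return fmt.Errorf("no backup location found for source DC %q", m.Source.DC)
		}
		if err := hostHasAccess(m.Target.HostID, loc); err != nil {
			return fmt.Errorf("target node %s cannot access location %+v: %w", m.Target.HostID, loc, err)
		}
	}
	return nil
}
```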
Do you mean that we should require the user to specify the full location <-> DC mapping with the --location flag and not calculate it on our own? I was under a different impression when writing the comment about ignoring location DC (resolved).
Calculating it on our own is a little bit more complicated, but it's more convenient for the user (similar to providing the nodes mapping, which can also be calculated by SM). At the end of the day, somebody needs to calculate it, and SM has all the necessary information.
I think that making 1-1-restore easy to use is important in general, but it does not need to happen in the first iteration targeting mainly Scylla Cloud as a user.
On the other hand, the code matching locations to DCs on the SM side is already part of this PR, so does it make sense to delete it? What do you guys think?
The 1-1 restore includes nodes-mapping. It explicitly tells which target node is expected to download what set of SSTables. The goal is to find which location keeps the SSTables of the source node and ensure that the target node can access this location.
The current comment says: each host in target cluster should have an access to at least one target location
This is not true; we must ensure that each target host has access to the exact location that stores the SSTables for the mapped source node.

Do you mean that we should require the user to specify the full location <-> DC mapping with the --location flag and not calculate it on our own? I was under a different impression when writing the #4230 (comment) about ignoring location DC (resolved).

We ARE requiring the user to specify nodes-mapping. A node is part of one of the DCs. Scylla Manager must understand which location (from the given locations) keeps the DC that owns the source node.
To understand the full location <-> DC mapping, or rather the location <-> nodes mapping, it's enough to check the meta directory https://manager.docs.scylladb.com/stable/backup#meta from all backup locations.
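As a rough illustration of that idea, assuming the documented meta layout where each manifest path embeds a dc/<name> segment (path parsing here is deliberately simplified and not the PR's code):

```go
// Sketch only: assumes manifest paths under the meta dir embed "dc/<name>",
// as in the layout described in the backup docs linked above.
package sketch

import "strings"

// dcsInLocation extracts the DCs whose manifests live under one backup
// location, given the manifest paths listed from its meta directory.
func dcsInLocation(manifestPaths []string) map[string]struct{} {
	dcs := make(map[string]struct{})
	for _, p := range manifestPaths {
		parts := strings.Split(p, "/")
		for i := 0; i+1 < len(parts); i++ {
			if parts[i] == "dc" {
				dcs[parts[i+1]] = struct{}{}
				break
			}
		}
	}
	return dcs
}

// buildDCToLocation inverts per-location listings into a DC -> location map.
func buildDCToLocation(listings map[string][]string) map[string]string {
	dcToLoc := make(map[string]string)
	for loc, paths := range listings {
		for dc := range dcsInLocation(paths) {
			dcToLoc[dc] = loc
		}
	}
	return dcToLoc
}
```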
I think that making 1-1-restore easy to use is important in general, but it does not need to happen in the first iteration targeting mainly Scylla Cloud as a user.
On the other hand, the code matching locations to DCs on the SM side is already part of this PR, so does it make sense to delete it? What do you guys think?

We don't care much about the ease of use for the end user. There is still automation required to deliver the cloned cluster. This automation creates the output which must involve the mapping -> .
We care about the compatibility with the automation tool.
OK, this method is just expected to get the list of hosts from the target cluster and the list of manifests from the snapshot.
It adds a simple validation to compare whether the number of target hosts is equal to the number of manifests.
It's fine.
But still... I think the comment is misleading.
getManifestsAndHosts -> getAllSnapshotManifestsAndAllTargetHosts would be more meaningful for me.
pkg/service/one2onerestore/worker.go (outdated)

```go
	if len(allManifests) != nodesWithAccessCount || len(allManifests) != len(nodeStatus) {
		return nil, nil, fmt.Errorf("manifest count (%d) != target nodes (%d)", len(allManifests), len(nodesCountSet.List()))
	}

	return allManifests, nodesToHosts(nodeStatus), nil
}
```
This logic will be completely different when you iterate over nodes instead of locations.
After ensuring that the node has access to the location, you just download the corresponding manifest to check details like the number of shards and the token ring.
After ensuring that the node has access to the location, you just download the corresponding manifest to check details like the number of shards and the token ring.

That's exactly what is happening inside the validateCluster method.
A few words in general about my implementation and the reasoning behind it:
In a nutshell, we have a source cluster (backup) whose nodes are represented by manifest files in the backup location(s), and a target cluster whose nodes are the actual live nodes of the cluster we want to restore data to.
- Get source cluster nodes and target cluster nodes (getManifestsAndHosts).
- Collect the information needed to compare source and target cluster nodes (collectNodeValidationInfo) (here I iterate over nodes).
- Compare them using the node mapping as the rule for how to match a source cluster node with a target cluster node (checkOne2OneRestoreCompatibility).
If validation has passed successfully, then I can use the node info from step 1 further in the code, as I know it's valid for 1-1-restore and each node has an exact match.
Keeping the logic this way gives us the ability to keep the validation logic in one place, without spreading it across other parts of the code.
This logic will be completely different when you iterate over nodes instead of locations.

Here is how I see this logic:
- Find the node's DC by looking at the node mappings.
- Find the corresponding location by checking location.DC against the DC from step 1.
- Download the manifest content. Two steps actually: list the dir and then download, because the manifest path contains the taskID which we don't know (or do we?).
- Collect additional node info (token ring, etc.).
- Compare the node info with the manifest content.
Without getting into the details, the main difference between the two solutions is that in the first we collect all the info and then do the validation, while in the second we validate nodes one by one (a rough sketch of the per-node flow is below).
For me it's more or less the same, but if you prefer the second over the first, let me know and I'll change this PR.
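A rough sketch of that per-node flow, reusing the illustrative nodeMapping/location types from the earlier sketch; the callbacks stand in for listing/downloading manifests and querying the Scylla API and are not the PR's actual helpers.

```go
// Sketch only: a per-node validation loop, not the PR's implementation.
package sketch

import (
	"context"
	"fmt"
)

type nodeInfo struct {
	ShardCount int
	Tokens     []int64
}

// validateNodesOneByOne iterates over target nodes (via the mappings) instead
// of locations: resolve the source DC, pick its location, fetch the manifest,
// collect live node info, and compare the two, failing on the first mismatch.
func validateNodesOneByOne(
	ctx context.Context,
	mappings []nodeMapping,
	locationForDC func(dc string) (location, error),
	downloadManifest func(ctx context.Context, loc location, sourceHostID string) (nodeInfo, error),
	collectTargetInfo func(ctx context.Context, targetHostID string) (nodeInfo, error),
) error {
	for _, m := range mappings {
		loc, err := locationForDC(m.Source.DC)
		if err != nil {
			return err
		}
		want, err := downloadManifest(ctx, loc, m.Source.HostID)
		if err != nil {
			return err
		}
		got, err := collectTargetInfo(ctx, m.Target.HostID)
		if err != nil {
			return err
		}
		if want.ShardCount != got.ShardCount || len(want.Tokens) != len(got.Tokens) {
			return fmt.Errorf("target node %s is not compatible with source node %s", m.Target.HostID, m.Source.HostID)
		}
	}
	return nil
}
```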
Two steps actually: list the dir and then download, because the manifest path contains the taskID which we don't know (or do we?).

We don't know it from the start, but it can be established just once at the beginning (by listing manifests) and then reused. However, we can't escape listing all manifest dirs, as it's needed to verify that the manifest count equals the node count.
In general I like the idea of creating a per-node worker, as it provides a more concurrent design.
In the context of the validation, such a worker could be initialized with all the basic information about the backup and target nodes, like:
- target and backup host ID
- target IP
- target and backup cluster ID
- DC
- backup task ID
- snapshot tag
- backup location
If we assume that we get the full location <-> DC mapping from the user, the only missing piece is the task ID, but it can be established just once by the main workflow and passed to worker initialization (the main workflow would also need to validate the manifest count, but it wouldn't need to read any manifest).
But even if we don't get such a mapping, we can first calculate it, and then move to the per-node worker.
The only concern is that it might be more difficult to support this approach if we would like to make the nodes mapping optional for the sake of the user's convenience. Then, we couldn't provide the backup DC and host ID from the start, but they would need to be filled in some other way.
With this design, there are two benefits (a parallelization sketch follows below):
- easier parallelization of the validation workflow - all steps like reading the manifest, fetching target info, validating, etc. could be sequential in the context of a single node, but we could just use parallel.Run on all of them without any additional effort. The current implementation also does the time-consuming things in parallel, but it requires more code handling the parallelism.
- easier body functions - there would be no need to handle slices or maps of some validation info and think about whether we can be sure that they are in sync at a given point. Validation functions would work just on the per-node context with a single manifest and a single target node info.
EDIT: Note that such a design would probably fit very well into the later restore data stages as well.
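A minimal sketch of the per-node fan-out, reusing the illustrative nodeMapping type from the earlier sketches. errgroup is used purely for illustration; the actual code would more likely use the repo's parallel.Run helper, whose exact signature isn't shown here.

```go
// Sketch only: per-node validation fan-out; validateOne carries the whole
// per-node context (manifest, target info), so no shared slices/maps are needed.
package sketch

import (
	"context"

	"golang.org/x/sync/errgroup"
)

func validateNodesInParallel(
	ctx context.Context,
	mappings []nodeMapping,
	validateOne func(ctx context.Context, m nodeMapping) error,
) error {
	g, ctx := errgroup.WithContext(ctx)
	for _, m := range mappings {
		m := m // capture loop variable (needed before Go 1.22 loop-var semantics)
		g.Go(func() error { return validateOne(ctx, m) })
	}
	return g.Wait()
}
```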
But I also agree that the current implementation does its job, so this is more about personal preference. If the points from the previous comment didn't convince you, I'm not against the current state of the implementation.
@VAveryanov8 are you checking somewhere in the code if the target node has access to the expected location? I mean the location that keeps the SSTables that are expected to be downloaded to this node?
It should be part of the validation stage.
Otherwise, it may happen that you go to the copy stage, but one of the target nodes is not able to get its SSTables.
In general I like the idea of creating a per-node worker, as it provides a more concurrent design. [...]
EDIT: Note that such a design would probably fit very well into the later restore data stages as well.
That's the idea of 1-1 restore: do it per node in parallel. After all SSTables are copied to the corresponding node, it's then a matter of calling refresh for all tables that are included in the backup location in this snapshot for the current node.
It should/must be done in parallel per node.
@VAveryanov8 @Michal-Leszczynski let's sync on Monday. It may be a faster way of discussing it.
I put it in the design doc -> There must be one worker / goroutine per destination node copying and refreshing SSTables from the corresponding node.
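As a sketch of that per-destination-node worker, again reusing the illustrative nodeMapping type: copySSTables and refreshTable are hypothetical placeholders for the agent copy step and the Scylla "load new SSTables" (refresh) call; this is not the design doc's code.

```go
// Sketch only: one goroutine per destination node, copy then refresh.
package sketch

import (
	"context"

	"golang.org/x/sync/errgroup"
)

type table struct{ Keyspace, Name string }

// restorePerNode spawns one worker per destination node: copy all of the mapped
// source node's SSTables, then refresh every restored table on that node.
func restorePerNode(
	ctx context.Context,
	mappings []nodeMapping,
	tables []table, // tables included in the snapshot
	copySSTables func(ctx context.Context, m nodeMapping) error,
	refreshTable func(ctx context.Context, targetHostID string, t table) error,
) error {
	g, ctx := errgroup.WithContext(ctx)
	for _, m := range mappings {
		m := m
		g.Go(func() error {
			if err := copySSTables(ctx, m); err != nil {
				return err
			}
			for _, t := range tables {
				if err := refreshTable(ctx, m.Target.HostID, t); err != nil {
					return err
				}
			}
			return nil
		})
	}
	return g.Wait()
}
```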
```go
func TestMapTargetHostToSource(t *testing.T) {
	testCases := []struct {
		name string

		nodeMappings []nodeMapping
		targetHosts  []Host
		expected     map[string]Host
		expectedErr  bool
	}{
		{
			name:         "All hosts have mappings",
			nodeMappings: []nodeMapping{
```
These tests are OK.
But you should verify it in a bit broader way.
Please keep validateClusters as the method you test. Most likely it will require using integration tests instead of these ones.
This introduces a new CLI command for the fast vnode restore procedure. Fixes: #4200
This is the result of running `make docs`.
This renames fastrestore to 1-1 restore and also makes it a subcommand of the restore cmd.
…s uuid This removes 1-1-restore options that are not needed at the moment (we can add them later if needed). Also changes the source-cluster-id type to uuid.UUID.
…equal This adds a validation stage for 1-1-restore. The logic is as follows: - Collect node information for the source cluster from backup manifests - Collect node information for the target cluster from the Scylla API - Apply node mappings to the source cluster nodes - Compare each source node with its corresponding target node. Fixes: #4201
This changes the following parts of the validation process: - Moves path.Join("backup", string(MetaDirKind)) to the `backupspec` pkg - Moves getManifestContext to worker_manifest - Adds SourceClusterID validation to getManifestInfo - Simplifies how node info is collected by leveraging node mappings (maps manifests to nodes by host ID) - Replaces the LocationInfo struct with manifests and hosts - Sorts node tokens
This replaces CPUCount with ShardCount for cluster comparison. Fixes a typo in a function name.
This adds integration tests for the 1-1-restore validation stage.
This adds 1-1-restore integration tests to GitHub Actions.
Force-pushed 45298e2 to 9139cf1
Somehow lost a few files during rebase :D
These are the only concerns I have right now:
Other than that, it's fine.
👍
This introduces the following changes: 1. Uses location.DC for deciding which node to use for the initial location access 2. Changes the validation stage to check nodes one by one in parallel (instead of collecting slices and then comparing them) 3. Updates unit and integration tests.
Force-pushed 5983f40 to 647111d
@Michal-Leszczynski @karol-kokoszka I've made changes to the validation stage according to your comments, so please take a look one more time - it should be easier now :) Additionally, I updated the unit and integration tests.
This adds a validation stage for 1-1-restore. The logic is as follows:
- Collect node information for the source cluster from backup manifests
- Collect node information for the target cluster from the Scylla API
- Apply node mappings to the source cluster nodes
- Compare each source node with its corresponding target node
Fixes: #4201