feat: migrations for annotations & notebooks resource types on operator token #21840
Conversation
bolt/kv.go
Outdated
```go
// swapAndOpenRestored is used while restoring a database to close the existing
// database, replace the existing database with the restored database at the
// temp path, and then re-open the restored database.
func (s *KVStore) swapAndOpenRestored() error {
```
This function was refactored out of existing code into a separate function to make the mutex locking and unlocking cleaner to manage.
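The refactoring described here can be sketched as follows. This is a minimal illustration, not the PR's actual code: the struct, fields, and bodies are hypothetical stand-ins, showing only how isolating the close/swap/re-open sequence in one helper lets a single `Lock`/`defer Unlock` pair cover the whole critical section on every return path.

```go
package main

import (
	"fmt"
	"sync"
)

// store is a hypothetical stand-in for bolt's KVStore.
type store struct {
	mu   sync.Mutex
	path string
	open bool
}

// swapAndOpenRestored sketches the extracted helper: because the whole
// close/swap/re-open sequence lives in one function, defer guarantees the
// mutex is released on every return path, error or not.
func (s *store) swapAndOpenRestored(tempPath string) error {
	s.mu.Lock()
	defer s.mu.Unlock()

	s.open = false    // close the existing database
	s.path = tempPath // swap in the restored file (stand-in for a file rename)
	s.open = true     // re-open the restored database
	return nil
}

func main() {
	s := &store{path: "live.db", open: true}
	if err := s.swapAndOpenRestored("restored.db"); err != nil {
		fmt.Println("restore failed:", err)
		return
	}
	fmt.Println(s.path, s.open)
}
```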
sqlite/sqlite.go
Outdated
```diff
@@ -230,14 +231,38 @@ func (s *SqlStore) RestoreSqlStore(ctx context.Context, r io.Reader) error {
 		return err
 	}
 
-	// Close the current DB.
+	// Close the current DB and run the restore while under lock.
+	s.Mu.Lock()
```
There were no locks here before, but it seems like we really should be taking a lock before doing the sqlite database restoration as well, so I adapted the locking and unlocking logic from the bolt kv restore code.
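The rationale for taking the lock can be sketched with a toy read/write-lock example (all names here are illustrative; only the `Mu` field name comes from the snippet above). Queries take the shared read lock, while the restore takes the exclusive write lock so no reader can observe the database mid-swap.

```go
package main

import (
	"fmt"
	"sync"
)

// sqlStore is a hypothetical stand-in for SqlStore; only the Mu field name
// mirrors the diff above.
type sqlStore struct {
	Mu   sync.RWMutex
	data string
}

// query takes the shared read lock, as concurrent readers would.
func (s *sqlStore) query() string {
	s.Mu.RLock()
	defer s.Mu.RUnlock()
	return s.data
}

// restore takes the exclusive write lock so queries never see the database
// mid-swap; this is the gap the added locking is meant to close.
func (s *sqlStore) restore(restored string) {
	s.Mu.Lock()
	defer s.Mu.Unlock()
	s.data = restored
}

func main() {
	s := &sqlStore{data: "live"}
	s.restore("restored")
	fmt.Println(s.query())
}
```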
sqlite/sqlite.go
Outdated
```go
// runSqlRestore is used while restoring a database to close the existing
// database, restore the backed-up database, and then re-open the restored
// database.
func (s *SqlStore) runSqlRestore(ctx context.Context, tempFileName string) error {
```
As with the bolt kv restore file above, this function was adapted from existing code to make the lock easier to manage.
bolt/kv.go
Outdated
```go
	// Atomically swap temporary file with current DB file.
	if err := fs.RenameFileWithReplacement(s.tempPath(), s.path); err != nil {

	// Now that the DB has been re-opened, release the lock so that the
	// migrations can be run.
```
This feels like a potential opening for race conditions? Unless there's some higher-level lock protecting this whole thing.
If the migrator is currently coded to take a lock, would it be possible/better to refactor it to remove that step?
There's definitely some potential for a race condition here, and no higher-level lock. My initial thought was that it wouldn't be much worse than the potential race conditions between the multiple restore endpoints that have to be used to do a full restore: there's the restore KV endpoint, the restore SQL endpoint, and the endpoints for all of the shards, which the CLI orchestrates without any server-side locks.
I think what might be fairly easy to do would be to run the migrations on the restored database before swapping it in. I'll look into that.
It wasn't too bad to run the migrations on the restored database file(s) prior to swapping out the running database. I think that should take care of any race conditions related to the restore KV / restore SQL endpoints. It might be nice someday to consolidate the restore endpoints under a single lock, similar to how the single metadata backup endpoint works, to prevent anything funny from happening in between CLI calls to the restore endpoints.
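The migrate-before-swap ordering described here can be sketched as below. Everything in this snippet is hypothetical; the point is only the ordering: migrations run against the restored file while it is still private, so the live database keeps serving and no lock has to be released mid-restore.

```go
package main

import "fmt"

// db is an illustrative stand-in for a database file with a schema version.
type db struct {
	file    string
	version int
}

const currentVersion = 2

// migrate upgrades a database to the current schema version, one step at a time.
func migrate(d *db) {
	for d.version < currentVersion {
		d.version++ // apply one migration step
	}
}

// restore migrates the restored copy first, then swaps it in. The restored
// file is private until the swap, so readers never see a half-migrated state.
func restore(live *db, restored *db) *db {
	migrate(restored)
	return restored // swap: from now on, serve the restored database
}

func main() {
	live := &db{file: "live.db", version: currentVersion}
	fromBackup := &db{file: "restored.db", version: 1} // e.g. a 2.0.x backup
	live = restore(live, fromBackup)
	fmt.Println(live.file, live.version)
}
```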
```go
	// operator token for annotations and notebooks would have matches the
	// full list of operator permissions for a 2.1 operator token, we must
	// be dealing with a 2.0.x operator token, so add it to the list.
	if permListsMatch(oprPerms, append(t.Permissions, extraPerms()...)) {
```
Annoying as it is, I think it'd be better to hard-code the expected final list here. Assuming we add more resource types in versions after 2.1, the contents of `oprPerms` will continue to grow as defined here. Once that happens, this conditional will no longer match for any users upgrading from 2.0 -> 2.N, so the notebooks & annotations resources won't get added to their tokens.
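The fix being suggested can be sketched as an order-insensitive comparison against a frozen snapshot of the 2.0.x permission list. The helper name `permListsMatch` comes from the diff above, but this implementation and the permission strings are illustrative guesses, not the PR's real types.

```go
package main

import (
	"fmt"
	"sort"
)

// permListsMatch reports whether two permission lists contain the same
// entries regardless of order; an illustrative version of the helper named
// in the diff above.
func permListsMatch(a, b []string) bool {
	if len(a) != len(b) {
		return false
	}
	as := append([]string(nil), a...)
	bs := append([]string(nil), b...)
	sort.Strings(as)
	sort.Strings(bs)
	for i := range as {
		if as[i] != bs[i] {
			return false
		}
	}
	return true
}

// oldOprPerms is a hard-coded snapshot of the 2.0.x operator permission list
// (abbreviated, hypothetical entries). Because it is frozen rather than
// derived from the current version's growing list, the check keeps matching
// 2.0.x tokens even after later versions add more resource types.
var oldOprPerms = []string{"read:orgs", "write:orgs", "read:buckets", "write:buckets"}

func main() {
	token := []string{"write:orgs", "read:orgs", "write:buckets", "read:buckets"}
	fmt.Println(permListsMatch(token, oldOprPerms))
}
```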
Good call, I hadn't thought about the case where there'd be more than one generation of additional resource types to migrate into. Made that change!
sqlite/sqlite.go
Outdated
```go
	// Now that the DB has been re-opened, release the lock so that the migrations
	// can be run.
	s.Mu.Unlock()
```
Same question as in the bolt code: are we open to a race condition here? And if so, could we refactor the migrator to avoid it?
Addressed in the same way as in the bolt code: run the migrations on the restored database before switching the active database over to it. Refactoring the sqlite migrator to not use a lock of its own would have been easier than refactoring the bolt one, but this was easier still.
Nice!
Closes #21833

This PR creates a new KV migration that adds read/write operator-level access to the annotations and notebooks resource types on the operator token.

It also adds code to run the migrations on restored bolt & sql databases. This ensures that restored databases are in the correct state for the version they are restored into, making it possible to do a full restore from a `2.0.x` version of the `influxd` server into a `2.1.x` version successfully, with the new resource types added to the restored operator token.

The included unit test covers the migration. I also manually verified the following backup/restore scenarios with the latest `influx-cli` work:

- `2.0.7`, partial restore into `2.1`
- `2.0.7`, full restore into `2.1`
- `2.1`, partial restore into `2.1`
- `2.1`, full restore into `2.1`
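At a high level, the migration described in this PR can be sketched as below. The `permission` type, `extraPerms`, and `migrateOperatorToken` here are hypothetical stand-ins (only `extraPerms` appears by name in the diff above); the sketch just shows the idea of appending any missing annotations/notebooks permissions to an operator token, idempotently.

```go
package main

import "fmt"

// permission is an illustrative stand-in for the real permission type.
type permission struct {
	Action   string // "read" or "write"
	Resource string // e.g. "annotations", "notebooks"
}

// has reports whether p is already present in perms.
func has(perms []permission, p permission) bool {
	for _, q := range perms {
		if q == p {
			return true
		}
	}
	return false
}

// extraPerms returns the permissions the migration adds for the new
// resource types.
func extraPerms() []permission {
	return []permission{
		{"read", "annotations"}, {"write", "annotations"},
		{"read", "notebooks"}, {"write", "notebooks"},
	}
}

// migrateOperatorToken appends any missing new permissions to the token's
// list; running it twice adds nothing the second time.
func migrateOperatorToken(perms []permission) []permission {
	for _, p := range extraPerms() {
		if !has(perms, p) {
			perms = append(perms, p)
		}
	}
	return perms
}

func main() {
	tok := []permission{{"read", "orgs"}, {"write", "orgs"}}
	tok = migrateOperatorToken(tok)
	fmt.Println(len(tok)) // 2 original + 4 added
}
```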