
compactor changes for building per user index files in boltdb shipper #5026

Merged

Conversation

@sandeepsukhani (Contributor) commented Jan 3, 2022

What this PR does / why we need it:
This is the first PR for adding support for per-user index files in boltdb shipper. It includes only the changes related to the compactor.
The image below shows the various formats that would be in use.
[image: diagram of the index file layouts Format1, Format2, and Format3]

Roughly, here is how it would work:

  • Currently, we build the index in Format1. Switching to the per-user index, i.e. Format3, would be controlled by a feature flag.
  • When the feature is enabled, ingesters would build the index in Format2, which avoids building too many files in a large cluster with thousands of tenants.
  • The compactor would do compaction as follows (see the sketch after this list):
    • If the bucket name is index, it would write the index in Format1.
    • If the bucket name is not index, it would write the index in Format3.
  • Queriers/IndexGateway would support all the formats for reads.
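For illustration, here is a minimal sketch of the per-bucket decision described above; the function and constant names are hypothetical, not the PR's actual code:

```go
package main

import "fmt"

// multiTenantBucket is the bucket name the PR description singles out;
// everything else is assumed to be a per-tenant bucket.
const multiTenantBucket = "index"

// targetFormat is a hypothetical helper mirroring the compactor's choice:
// the shared "index" bucket keeps the existing multi-tenant layout (Format1),
// any other bucket gets a per-user index file (Format3).
func targetFormat(bucketName string) string {
	if bucketName == multiTenantBucket {
		return "Format1"
	}
	return "Format3"
}

func main() {
	fmt.Println(targetFormat("index")) // Format1
	fmt.Println(targetFormat("fake"))  // Format3
}
```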

Special notes for your reviewer:
I am splitting this work into smaller PRs to avoid a single huge PR that is difficult to review at once.

Checklist

  • Tests updated

@sandeepsukhani sandeepsukhani requested a review from a team as a code owner January 3, 2022 12:08
@@ -113,27 +120,33 @@ Outer:
return globalRetention
}

-func findSmallestRetentionPeriod(limits Limits) time.Duration {
+func findLatestRetentionStartTime(now model.Time, limits Limits) (model.Time, map[string]model.Time) {
// find the smallest retention period from default limits
Contributor:
Seems like it could be a function comment instead.

Contributor Author:
I wanted to explain the block below that comment. I will add a function comment as well.
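For context on the hunk above, a rough sketch of what the renamed function computes, based only on its new signature and the comment: find the smallest retention period across the default and per-tenant limits, and derive the latest retention start time from it. The Limits accessors used here are assumptions, not Loki's actual API:

```go
// Sketch only: DefaultRetentionPeriod and RetentionPeriodOverrides are
// hypothetical accessors standing in for whatever Limits actually exposes.
func findLatestRetentionStartTime(now model.Time, limits Limits) (model.Time, map[string]model.Time) {
	// find the smallest retention period from default limits
	smallestRetentionPeriod := limits.DefaultRetentionPeriod()

	// compute per-tenant retention start times from the overrides, tracking
	// the smallest retention period seen overall
	latestRetentionStartTimePerTenant := make(map[string]model.Time)
	for tenant, period := range limits.RetentionPeriodOverrides() {
		latestRetentionStartTimePerTenant[tenant] = now.Add(-period)
		if period < smallestRetentionPeriod {
			smallestRetentionPeriod = period
		}
	}

	// the smallest retention period yields the latest (most recent) start time
	return now.Add(-smallestRetentionPeriod), latestRetentionStartTimePerTenant
}
```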

@@ -14,11 +14,17 @@ const delimiter = "/"

// Client is used to manage boltdb index files in object storage, when using boltdb-shipper.
type Client interface {
ListTables(ctx context.Context) ([]string, error)
-ListFiles(ctx context.Context, tableName string) ([]IndexFile, error)
+ListFiles(ctx context.Context, tableName string) ([]IndexFile, []string, error)
Contributor:
Could you maybe compose this interface with the two interfaces at the end, and also move them to the top?

Contributor Author:
Hah, that was the plan and I missed doing that. Thanks for pointing it out!
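For reference, a sketch of the composition being suggested; the sub-interface names here are illustrative, and the IndexFile stub exists only so the snippet compiles on its own:

```go
package storage

import "context"

// IndexFile stands in for the real type, included only to keep the sketch self-contained.
type IndexFile struct{ Name string }

// Hypothetical split of the Client interface into smaller pieces.
type tableLister interface {
	ListTables(ctx context.Context) ([]string, error)
}

type fileLister interface {
	ListFiles(ctx context.Context, tableName string) ([]IndexFile, []string, error)
}

// Client is then composed from the smaller interfaces declared above it.
type Client interface {
	tableLister
	fileLister
}
```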


type indexSet struct {
client Client
userIndex bool
Contributor:
userBasedIndex?

if len(is.sourceObjects) > 0 {
// we would only have compacted files in user index folder, so it is not expected to have -1 for seedFileIdx but
// let's still check it as a safety mechanism to avoid panics.
if seedFileIdx != -1 {
Contributor:
Shouldn't this be == ?

return
}

// mustRecreateCompactedDB returns true if the compacted db should be recreated
Contributor:
This seems unused.

Contributor Author:
Yeah, it should be used by the index set instance as you rightly pointed out below.

}

// recreateCompactedDB just copies the old db to the new one using bbolt.Compact for following reasons:
// 1. When files are deleted, boltdb leaves free pages in the file. The only way to drop those free pages is to re-create the file.
Contributor:
Suggested change:
-// 1. When files are deleted, boltdb leaves free pages in the file. The only way to drop those free pages is to re-create the file.
+// 1. When index entries are deleted, boltdb leaves free pages in the file. The only way to drop those free pages is to re-create the file.
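For background on recreateCompactedDB: bbolt ships a Compact helper that copies live pages from one DB file into a fresh one, dropping the free pages left behind by deletions. A minimal, simplified sketch of recreating a DB with it (file paths and error handling are illustrative, not the PR's exact code):

```go
package main

import (
	"fmt"
	"os"

	"go.etcd.io/bbolt"
)

// recreateDB copies srcPath into a brand-new file at dstPath using
// bbolt.Compact, which drops free pages by rewriting only live data.
func recreateDB(srcPath, dstPath string) error {
	src, err := bbolt.Open(srcPath, 0o666, nil)
	if err != nil {
		return err
	}
	defer src.Close()

	dst, err := bbolt.Open(dstPath, 0o666, nil)
	if err != nil {
		return err
	}
	defer dst.Close()

	// 0 lets bbolt use its default transaction size while copying.
	return bbolt.Compact(dst, src, 0)
}

func main() {
	if err := recreateDB("index.db", "index-recreated.db"); err != nil {
		fmt.Fprintln(os.Stderr, err)
	}
}
```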

// - upload the compacted db if required.
// - remove the source objects from storage if required.
func (is *indexSet) done() error {
if !is.uploadCompactedDB && !is.removeSourceObjects && mustRecreateCompactedDB(is.sourceObjects) {
Contributor:
Maybe it was supposed to be is.mustRecreateCompactedDB?

Contributor Author:
The usage is correct here; I was supposed to drop the mustRecreateCompactedDB method on the index set.

if err != nil {
return err
}

defer func() {
if err := readCloser.Close(); err != nil {
-level.Error(util_log.Logger)
+level.Error(util_log.Logger).Log("msg", "failed to close read closer", "err", err)
Contributor:
Suggested change:
-level.Error(util_log.Logger).Log("msg", "failed to close read closer", "err", err)
+level.Error(logger).Log("msg", "failed to close read closer", "err", err)

Contributor:
We should also use logger in the defer below.

return strings.HasSuffix(filename, ".gz")
}

func DoConcurrentWork(ctx context.Context, maxConcurrency, workCount int, logger log.Logger, do func(workNum int) error) error {
Contributor:
You can use https://github.com/grafana/dskit/blob/main/concurrency/runner.go#L65 instead.

jobs []interface{} = [0,1,2,3,....,workCount]int
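A sketch of that suggestion, building the job list of indices and handing it to dskit's concurrency.ForEach (the exact helper available depends on the dskit version; newer versions offer ForEachJob, which skips the []interface{} conversion):

```go
package main

import (
	"context"
	"fmt"

	"github.com/grafana/dskit/concurrency"
)

func main() {
	workCount, maxConcurrency := 10, 3

	// Build the job list the reviewer sketches: the indices 0..workCount-1.
	jobs := make([]interface{}, workCount)
	for i := range jobs {
		jobs[i] = i
	}

	// ForEach fans the jobs out over maxConcurrency goroutines and returns
	// the first error, replacing a hand-rolled DoConcurrentWork.
	err := concurrency.ForEach(context.Background(), jobs, maxConcurrency,
		func(ctx context.Context, job interface{}) error {
			workNum := job.(int)
			fmt.Println("processing work item", workNum)
			return nil
		})
	if err != nil {
		fmt.Println("error:", err)
	}
}
```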

}
table.logger = log.With(util_log.Logger, "table-name", table.name)

return &table, nil
}

-func (t *table) compact(tableHasExpiredStreams bool) error {
-	indexFiles, err := t.indexStorageClient.ListFiles(t.ctx, t.name)
+func (t *table) compact(applyRetention bool) error {
Contributor:
Can we separate the logic for the multi-tenant index? This function is quite hard to understand.

Contributor Author:
I did give it a try, but it looks hard. I feel the complexity comes from the support for backwards compatibility and multiple index formats. I will see if I can do something, or maybe we can discuss this in a call face to face.

Contributor Author:
> LGTM, it's a bit difficult to follow.
>
> With the PR description I was able to understand everything though. Although we might want to think about refactoring to make the code cleaner, single code path for each case and separate concerns. Talking specifically about the compaction/retention code.

I did some thinking, and it does not seem easy to build a separate execution path without making it complex. I am going to merge this PR and open a separate PR if I can come up with something.

Thanks for quickly reviewing this PR!

@cyriltovena (Contributor) left a comment

LGTM, it's a bit difficult to follow.

With the PR description I was able to understand everything though. Although we might want to think about refactoring to make the code cleaner, single code path for each case and separate concerns. Talking specifically about the compaction/retention code.
