Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/ingest cmd #1864

Merged
merged 10 commits into from
May 4, 2021
Merged

Feature/ingest cmd #1864

merged 10 commits into from
May 4, 2021

Conversation

ozkatz
Copy link
Collaborator

@ozkatz ozkatz commented May 2, 2021

Closes #1848

@ozkatz ozkatz requested review from nopcoder and guy-har May 2, 2021 17:38
@ozkatz ozkatz self-assigned this May 2, 2021
cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
pkg/api/controller.go Show resolved Hide resolved
cmd/lakectl/cmd/store/factory.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/store/factory.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/store/factory.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/store/gcs.go Outdated Show resolved Hide resolved
Comment on lines 44 to 54
resp, err := client.StageObjectWithResponse(ctx,
lakefsURI.Repository,
lakefsURI.Ref, &api.StageObjectParams{
Path: key,
},
api.StageObjectJSONRequestBody{
Checksum: e.ETag,
PhysicalAddress: e.Address,
SizeBytes: e.Size,
},
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Optimization: parallelize the calls instead of using a single worker that perform the calls one after the other.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As discussed, let's open a seperate issue

var staged int64
var stagedBytes int64
client := getClient()
err := store.Walk(ctx, from, func(e store.ObjectStoreEntry) error {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A simple progress bar will help the user understand that things are progressing and in which rate. See import progress bars as an example.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

currently possible with --verbose which will print each individual iobject being staged. Progress bars are nice for things that are done interactively, I feel the use case here is more "do this before running my job in airflow" in which case the output will appear in a log file, rendering the progress bar unreadable.. Happy to hear objections or suggestions though..

cmd/lakectl/cmd/ingest.go Outdated Show resolved Hide resolved
cmd/lakectl/cmd/store/factory.go Outdated Show resolved Hide resolved
@ozkatz ozkatz requested review from nopcoder and itaiad200 May 3, 2021 11:42
cmd/lakectl/cmd/fs.go Outdated Show resolved Hide resolved
Copy link
Contributor

@guy-har guy-har left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

docs/reference/import.md Show resolved Hide resolved
@ozkatz ozkatz requested review from nopcoder and guy-har May 4, 2021 14:15
@ozkatz ozkatz merged commit 2e2e005 into master May 4, 2021
@ozkatz ozkatz deleted the feature/ingest-cmd branch May 4, 2021 16:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Allow mapping object store paths without going through inventory
4 participants