-
Notifications
You must be signed in to change notification settings - Fork 66
Restore from S3 compatible API? #69
Comments
There are several S3 Fuse projects. I don't think it should be terribly difficult to make a VFS adapter (probably depends on error handling complexity). There are already apache VFS adapters. |
We are starting to use go-cloud for cloud support. It also supports the filesystem as a backend. |
Nice! We could use the
I've seen at least one customer giving the table a Chinese name, and mydumper will not escape them in the filename, making this implementation not usable. |
Yeah, looks like fileblob is just meant for testing purposes anyways, so two pathways (cloud or file) would still be needed. |
Is there any update on this? To be more cloud-native, we need to support restoring from S3 storage. Though we have a workaround in tidb-operator to use |
For the AWS Aurora scenario, Aurora exports data in CSV format, and it is partitioned into multiple files. It's worth taking into consideration. https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/AuroraMySQL.Integrating.SaveIntoS3.html#AuroraMySQL.Integrating.SaveIntoS3.Grant |
🤔 Partitioning into multiple files isn't a problem (it is actually desired). The problem is the file name does not end in
|
Can lightning choose file format based on its content? Also, Aurora can export data into TEXT format, and the name is the same as the CSV format. |
it can but i don't trust Lightning to do so 🙂. perhaps we need RFC 5 anyway. |
Seems |
Shall we provide an option to allow use explicitly set the input files format to |
Seems if we want to support partitioned files in s3 buckets or partitioned compression files, RFC 5 needs to be updated. And I afraid if the route rule is complex, it will be hard to teach user to use this feature |
For Aurora partition dump, Lightning could read Aurora dump manifest directly. |
That is a large departure from the existing model ( |
A feature request for your roadmap:
Can it be possible to restore directly from a mydumper backup stored in S3? In most cloud deployments this is where user backups will be stored (the S3 API is implemented by many other object stores).
Value
Value description
Support restore to TiDB via S3.
Value score
Workload estimation
Time
GanttStart: 2020-07-27
GanttDue: 2020-09-04
GanttProgress: 100%
The text was updated successfully, but these errors were encountered: