Documentation, examples in README for how to import from S3 to Scylla #145

Closed
wpaven opened this issue May 28, 2024 · 3 comments · Fixed by #166
@wpaven

wpaven commented May 28, 2024

This capability was added in #136, but what needs to be done to make it work? Can we have some documentation, please? A sample config.yaml, a sample spark-submit command, any considerations for the S3 bucket, etc. would be very useful. Are there any limitations? Import only works with JSON exports, correct?

@julienrf
Collaborator

For now we have documentation in the file config.yaml.example:

# Example for loading from a DynamoDB S3 export (see https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/S3DataExport.Output.html)
# source:
#   type: dynamodb-s3-export
#   bucket: <bucket-name>
#   # Key of the `manifest-summary.json` object in the bucket
#   manifestKey: <manifest-summary-key>
#   # Key schema and attribute definitions, see https://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_TableCreationParameters.html
#   tableDescription:
#     attributeDefinitions:
#       - name: <attribute-name>
#         type: <attribute-type> (see https://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_AttributeDefinition.html)
#       - ...
#     keySchema:
#       - name: <key-name>
#         type: <key-type> (see https://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_KeySchemaElement.html)
#       - ...
#
#   # Optional - load from a custom endpoint:
#   endpoint:
#     # Specify the hostname without a protocol
#     host: <host>
#     port: <port>
#   # Optional - specify the region:
#   region: <region>
#
#   # Optional - static credentials:
#   credentials:
#     accessKey: <user>
#     secretKey: <pass>
#
#   # Optional - use path-style access in S3 (see https://docs.aws.amazon.com/AmazonS3/latest/userguide/VirtualHosting.html)
#   usePathStyleAccess: true

I will publish the docs on the website soon (related: #116)
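
For illustration only, here is a sketch of how the `source` section of the template above might look once filled in. Every value is hypothetical: the bucket name, export ID, attribute names, and region are made up. The sketch also assumes that `bucket` takes the plain bucket name rather than an ARN, and that the export sits under the default `AWSDynamoDB/<export-id>/` layout that DynamoDB uses for S3 exports. This thread confirms neither assumption. The rest of the configuration file (for example the target section) is not shown.

```yaml
# Hypothetical example values; replace every one of them with your own.
source:
  type: dynamodb-s3-export
  # Assumption: the plain bucket name, not the bucket ARN
  bucket: my-dynamodb-exports
  # Assumption: default export layout, AWSDynamoDB/<export-id>/manifest-summary.json
  manifestKey: AWSDynamoDB/01234567890123-abcdef12/manifest-summary.json
  tableDescription:
    attributeDefinitions:
      # Attribute types follow the DynamoDB AttributeDefinition API: S, N, or B
      - name: id
        type: S
      - name: createdAt
        type: N
    keySchema:
      # Key types follow the DynamoDB KeySchemaElement API: HASH or RANGE
      - name: id
        type: HASH
      - name: createdAt
        type: RANGE
  # Optional: region of the bucket
  region: us-east-1
```

The optional `endpoint`, `credentials`, and `usePathStyleAccess` fields are left out here. According to the template they are only needed for a custom S3 endpoint, static credentials, or path-style access.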

@julienrf julienrf self-assigned this May 28, 2024
@wpaven
Author

wpaven commented Jun 5, 2024

Do you have a sample config.yaml that you can share with me until the docs are published, please? I have questions, such as whether the bucket name is the ARN of the bucket. Seeing an example of the format of these attributes would be very helpful.

@wpaven
Author

wpaven commented Jun 6, 2024
