Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[#5557] improvement(CI): Add some docs and tests about how to use Azure Blob Storage(ADLS) in Hive #5558

Open
wants to merge 15 commits into
base: main
Choose a base branch
from

Conversation

yuqi1129
Copy link
Contributor

What changes were proposed in this pull request?

Add some tests to demonstrate how to use ADLS in Hive.

Why are the changes needed?

To verify if we can use ADLS in Hive.
Fix: #5557

Does this PR introduce any user-facing change?

N/A

How was this patch tested?

Test manually.

@yuqi1129
Copy link
Contributor Author

This PR depends on #5553

@yuqi1129 yuqi1129 closed this Nov 12, 2024
@yuqi1129 yuqi1129 reopened this Nov 12, 2024
@yuqi1129 yuqi1129 changed the title [#5557] improvement(CI): Add some tests for ADLS in Hive [#5557] improvement(CI): Add some tests about how to use ADLS in Hive Nov 12, 2024
@@ -11,6 +11,8 @@ license: "This software is licensed under the Apache License version 2."

Since Hive 2.x, Hive has supported S3 as a storage backend, enabling users to store and manage data in Amazon S3 directly through Hive. Gravitino enhances this capability by supporting the Hive catalog with S3, allowing users to efficiently manage the storage locations of files located in S3. This integration simplifies data operations and enables seamless access to S3 data from Hive queries.

For ADLS, the integration is similar to S3. The only difference is the configuration properties for ADLS. The following sections will guide you through the necessary steps to configure the Hive catalog to utilize S3 as a storage backend, including configuration details and examples for creating databases and tables.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shall we use ADLS, or ABS, or ADLS2, I think we should clarify here, it is so confusing. I think we should explain and clarify them in the doc, otherwise users may also be confused.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

@yuqi1129 yuqi1129 self-assigned this Nov 20, 2024
@yuqi1129 yuqi1129 changed the title [#5557] improvement(CI): Add some tests about how to use ADLS in Hive [#5557] improvement(CI): Add some docs about how to use ADLS in Hive Nov 21, 2024
@yuqi1129 yuqi1129 changed the title [#5557] improvement(CI): Add some docs about how to use ADLS in Hive [#5557] improvement(CI): Add some docs and tests about how to use ADLS in Hive Nov 21, 2024
@jerryshao
Copy link
Contributor

Do you need to update the PR based on #5630 ?

@yuqi1129
Copy link
Contributor Author

Do you need to update the PR based on #5630 ?

No, #5630 need to be polished and more effect.

@yuqi1129 yuqi1129 changed the title [#5557] improvement(CI): Add some docs and tests about how to use ADLS in Hive [#5557] improvement(CI): Add some docs and tests about how to use Azure Blob Storage(ADLS) in Hive Nov 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Improvement] Add tests about how to use ADLS in Hive
2 participants