branch-3.0: [Fix](oss-hdfs)Fix insert failure on Hive partitioned table with oss-hdfs #55565
run buildall
TPC-H: Total hot run time: 39647 ms
TPC-DS: Total hot run time: 192289 ms
ClickBench: Total hot run time: 29.75 s
FE UT Coverage Report: Increment line coverage
run buildall
TPC-H: Total hot run time: 39958 ms
TPC-DS: Total hot run time: 191277 ms
ClickBench: Total hot run time: 30.13 s
FE UT Coverage Report: Increment line coverage
dataroaring
left a comment
LGTM
PR approved by at least one committer and no changes requested.
PR approved by anyone and no changes requested.
…
What problem does this PR solve?
When inserting into a Hive partitioned table stored on oss-hdfs, the following issue occurs:
First insert succeeds: Since the partition does not exist yet, HiveTableSink#setPartitionValues does not set storage-related information for the partition.
Subsequent inserts fail: Once the partition exists, the system tries to resolve the partition’s storage information. At this stage, oss-hdfs is incorrectly treated as s3 instead of being recognized as hdfs, leading to insert failure.
This PR fixes the storage type handling logic so that oss-hdfs partitions are correctly recognized as hdfs.
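The check described above can be sketched roughly as follows. This is an illustration only, not the code from this PR: the class `StorageTypeSketch`, the enum `StorageType`, and the method `resolve` are hypothetical names, and the sketch assumes that an oss-hdfs location can be told apart from a plain OSS location by an `oss-dls` marker in the URI authority.

```java
import java.net.URI;
import java.util.Locale;

// Hypothetical sketch; none of these names are taken from the Doris codebase.
final class StorageTypeSketch {

    enum StorageType { HDFS, S3, UNKNOWN }

    // Assumption: oss-hdfs endpoints carry this marker in the URI authority.
    private static final String OSS_HDFS_MARKER = "oss-dls";

    // Maps a partition location to the storage type used to write into it.
    static StorageType resolve(String location) {
        URI uri = URI.create(location);
        String scheme = uri.getScheme() == null
                ? "" : uri.getScheme().toLowerCase(Locale.ROOT);
        String authority = uri.getAuthority() == null
                ? "" : uri.getAuthority().toLowerCase(Locale.ROOT);

        switch (scheme) {
            case "hdfs":
                return StorageType.HDFS;
            case "oss":
                // The oss:// scheme alone is ambiguous: check for oss-hdfs
                // first, and only then fall back to S3-compatible handling.
                return authority.contains(OSS_HDFS_MARKER)
                        ? StorageType.HDFS
                        : StorageType.S3;
            case "s3":
            case "s3a":
                return StorageType.S3;
            default:
                return StorageType.UNKNOWN;
        }
    }

    public static void main(String[] args) {
        // Example locations are made up for illustration.
        System.out.println(resolve("oss://bucket.cn-hangzhou.oss-dls.aliyuncs.com/warehouse/t/dt=2025-01-01"));
        System.out.println(resolve("oss://bucket/warehouse/t/dt=2025-01-01"));
    }
}
```

The point of the sketch is the ordering: the oss-hdfs test has to run before any generic `oss://` handling, otherwise an existing partition's location falls through to the S3 branch, which is the failure mode described above.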
How to Reproduce
Release note
None