forked from qubole/spark-acid
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Issue-83 Fix the repartitioning logic to handle statement IDs (qubole#84
) For UPDATE/DELETE, we were repartitioning based on encoded bucketIds so that all rows with same bucket are processed by the same task. However, rows can have same bucket but different encoded bucketIds as encoded bucketIds are composed of both bucket+statementId. Hence, row with same bucket end up going to different tasks which can cause conflict as different task will be writing to the same delete delta bucket file. Approved-by: Sourabh Goyal <sourabhg@qubole.com>
- Loading branch information
1 parent
6abfb63
commit b0a5dce
Showing
2 changed files
with
75 additions
and
7 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters