[Spark][Backport 3.2] Fix the semantic of shouldRewriteToBeIcebergCompatible
in REORG UPGRADE UNIFORM
#3474
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Which Delta project/connector is this regarding?
Description
currently we utilize the helper function
shouldRewriteToBeIcebergCompatible
to filter the portion of parquet files that need to be rewritten when runningREORG UPGRADE UNIFORM
based on the tags in theAddFile
.however, the
DeltaUpgradeUniformOperation.icebergCompatVersion
is accidentally shadowed, which will makeshouldRewriteToBeIcebergCompatible
always returnfalse
if theAddFile.tags
is notnull
- this is not the expected semantic of this function.this PR introduces the fix for this problem and add unit tests to ensure the correctness.
How was this patch tested?
through unit tests in
UniFormE2ESuite.scala
.Does this PR introduce any user-facing changes?
no.