Skip to content

Conversation

@ilicmarkodb
Copy link
Contributor

@ilicmarkodb ilicmarkodb commented Oct 29, 2025

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

In this PR, getStatsSchema is extended to include the collated stats schema for any collation referenced by DataSkippingPredicate (schema example). Also, added E2E tests for collated data skipping.

This PR is an extension of https://github.com/delta-io/delta/pull/5380.

How was this patch tested?

New tests.

Does this PR introduce any user-facing changes?

No.

@ilicmarkodb ilicmarkodb force-pushed the stats2 branch 6 times, most recently from 0223c1f to bd5abb6 Compare October 30, 2025 00:28
@ilicmarkodb ilicmarkodb force-pushed the stats2 branch 9 times, most recently from f7825f2 to 3f863c0 Compare October 30, 2025 14:17
Copy link
Collaborator

@allisonport-db allisonport-db left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A lot simpler thank you! Just a few comments + a question

Copy link
Collaborator

@allisonport-db allisonport-db left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@allisonport-db allisonport-db merged commit 78e8242 into delta-io:master Nov 3, 2025
19 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants