Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Treat legacy TIMSETAMP_X converted types as UTC #4309

Merged

Conversation

sergiimk
Copy link
Contributor

Which issue does this PR close?

Closes #4308.

Are there any user-facing changes?

This will change the behavior for users who read Parquet produced by older systems that still use legacy converted types (e.g. Apache Spark is one of them), but it changes it to be aligned with Parquet spec.

@github-actions github-actions bot added the parquet Changes to the parquet crate label May 29, 2023
@sergiimk
Copy link
Contributor Author

Let me know if you think user-facing impact is acceptable and I can try to align the tests with this change.

@tustvold
Copy link
Contributor

This change seems fine to me, thank you

@sergiimk sergiimk force-pushed the bug/treat_legacy_timestamps_as_utc branch from 0df03b0 to 69e8fba Compare May 30, 2023 23:30
@sergiimk sergiimk force-pushed the bug/treat_legacy_timestamps_as_utc branch from 69e8fba to ff1d152 Compare May 30, 2023 23:31
@sergiimk sergiimk marked this pull request as ready for review May 30, 2023 23:44
@sergiimk
Copy link
Contributor Author

Thanks for taking a look @tustvold, I've updated the tests and this PR should be ready for final review.

@tustvold tustvold added the api-change Changes to the arrow API label May 31, 2023
@tustvold
Copy link
Contributor

I've marked this API change, as whilst the prior behaviour was technically a bug, this will serve to highlight this change in the changelog

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api-change Changes to the arrow API parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[parquet] Not following the spec for TIMESTAMP_MILLIS legacy converted types
2 participants