[Spark] Resolve inconsistencies between the V2 Checkpoint specification and implementation #2249

Closed
wants to merge 1 commit into from

Conversation

dhruvarya-db
Collaborator

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Follow-up for #2214.

The V2 Checkpoint implementation deviates from the PROTOCOL in four places.

It does not write two fields that the PROTOCOL specifies for the V2 Checkpoint-related actions:

  1. flavor in checkpointMetadata
  2. type in sidecar

It also differs from the PROTOCOL in two other ways:

  3. The implementation writes a field called version (the checkpoint version) in checkpointMetadata and relies on it, but the PROTOCOL does not specify any such field.
  4. The PROTOCOL requires the sidecar’s relative file path to be specified under the field fileName in the sidecar action, but the implementation writes it under the field name path.

This PR updates the specification so that it correctly reflects the implementation.
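
For illustration only, the field-name differences listed above can be sketched as a small check. This is not code from the PR or from Delta. The helper `check_action` and all sample values are hypothetical, and only the field names (flavor, type, fileName, version, path) come from the description.

```python
# Field names come from the PR description; everything else is hypothetical.

# Fields the old PROTOCOL text described but the implementation never writes.
SPEC_ONLY_FIELDS = {
    "checkpointMetadata": {"flavor"},
    "sidecar": {"type", "fileName"},
}

# Fields the implementation writes and relies on.
IMPLEMENTATION_FIELDS = {
    "checkpointMetadata": {"version"},
    "sidecar": {"path"},
}


def check_action(action):
    """Return a list of field-name problems for one single-key action dict."""
    problems = []
    for name, body in action.items():
        present = set(body)
        # Fields that only existed in the old specification text.
        for field in sorted(SPEC_ONLY_FIELDS.get(name, set()) & present):
            problems.append(f"{name}.{field}: old spec text only")
        # Fields the implementation writes but this action lacks.
        for field in sorted(IMPLEMENTATION_FIELDS.get(name, set()) - present):
            problems.append(f"{name}.{field}: missing")
    return problems


# Actions shaped the way the implementation writes them (values are made up).
as_written = [
    {"checkpointMetadata": {"version": 10}},
    {"sidecar": {"path": "hypothetical-sidecar.parquet"}},
]

# An action shaped the way the old specification text described it.
old_spec_sidecar = {"sidecar": {"fileName": "hypothetical-sidecar.parquet", "type": "x"}}
```

Under these assumptions, `check_action` returns an empty list for each entry in `as_written` and reports three problems for `old_spec_sidecar`: the two spec-only fields and the missing path.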

Does this PR introduce any user-facing changes?

No

@tdas tdas closed this in cacb7a3 Nov 14, 2023