Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(athena): Enable use of dataframe type, in athena2pyarrow type #2953

Merged
merged 7 commits into from
Sep 10, 2024

Conversation

eliabrio
Copy link
Contributor

@eliabrio eliabrio commented Sep 5, 2024

Feature or Bugfix

  • Bugfix

Detail

Fixing a bug of failing to save a dataframe to parquet files, when the range of the timestamp is outside of the bounds of pyarrows Timestamp[ns]. With this fix the new supported time units (and thus the ranges), are: ‘s’ [second], ‘ms’ [millisecond], ‘us’ [microsecond], or ‘ns’ [nanosecond] , based on https://arrow.apache.org/docs/python/generated/pyarrow.timestamp.html .

Any other dataframe type will be defaulted to timestamp[ns], similar to current behaviour.

Relates

Bug fix for: #2950

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@eliabrio eliabrio changed the title Enable use of dataframe type, in athena2pyarrow type fix(athena) Enable use of dataframe type, in athena2pyarrow type Sep 5, 2024
@eliabrio eliabrio changed the title fix(athena) Enable use of dataframe type, in athena2pyarrow type fix(athena): Enable use of dataframe type, in athena2pyarrow type Sep 5, 2024
@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant

This comment was marked as outdated.

@malachi-constant
Copy link
Contributor

AWS CodeBuild CI Report

  • CodeBuild project: GitHubDistributedCodeBuild6-jWcl5DLmvupS
  • Commit ID: b071f0b
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@malachi-constant
Copy link
Contributor

AWS CodeBuild CI Report

  • CodeBuild project: GitHubCodeBuild8756EF16-4rfo0GHQ0u9a
  • Commit ID: 2bac024
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@malachi-constant
Copy link
Contributor

AWS CodeBuild CI Report

  • CodeBuild project: GitHubDistributedCodeBuild6-jWcl5DLmvupS
  • Commit ID: 2bac024
  • Result: SUCCEEDED
  • Build Logs (available for 30 days)

Powered by github-codebuild-logs, available on the AWS Serverless Application Repository

@jaidisido jaidisido merged commit 119dc4e into aws:main Sep 10, 2024
17 checks passed
@eliabrio eliabrio deleted the AllowSavingNonNSDatetimeInAthena branch September 10, 2024 17:56
@eliabrio
Copy link
Contributor Author

@jaidisido thanks for approving and merging the PR. What is the release cadence, in which version this change is expected to be released?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Save to S3 with timestamp[us] data fails if Athena table already exists
4 participants