Merge latest upstream #393

dvogelbacher · 2018-07-30T19:46:30Z

Merging latest upstream, particularly for SPARK-24957.

robert3005 · 2018-07-30T20:14:46Z

You need to run `./dev/test-dependencies.sh --replace-manifest` since this PR involves dependency changes.

LGTM - remember not to squash these

robert3005 · 2018-07-30T23:14:16Z

You might want to rebase on #394, since it fixes a very annoying flake, which is the thing you're seeing.

…ns wrong result

## What changes were proposed in this pull request?

When we compute an average, the result is obtained by dividing the sum of the values by their count. When the result is a DecimalType, the way we cast and manage the precision and scale is not really optimized and is not coherent with what we normally do.

In particular, a problem can happen when the `Divide` operator returns a result whose precision and scale differ from the ones expected as the output of `Divide`. In the case reported in the JIRA, for instance, the result of `Divide` is a `Decimal(38, 36)`, while the output data type for `Divide` is `Decimal(38, 22)`. This is not an issue when the `Divide` is followed by a `CheckOverflow` or a `Cast` to the right data type, as these operations return a decimal with the defined precision and scale. Although the `Average` operator does have a `Cast`, it may be bypassed if the result of `Divide` already has the type it is cast to, hence the issue reported in the JIRA may arise.

The PR proposes to use the normal rules and handling of the arithmetic operators for the Decimal data type, so we both reuse the existing code (keeping a single logic for operations between decimals) and fix this problem, as the result is always guarded by `CheckOverflow`.

## How was this patch tested?

Added UT.

Author: Marco Gaido <marcogaido91@gmail.com>

Closes apache#21910 from mgaido91/SPARK-24957.
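To make the precision/scale mismatch described in this commit message more concrete, here is a minimal sketch in plain Scala using `java.math.BigDecimal`. It does not use Spark's code: the object `DecimalAverageSketch` and the helper `toPrecisionAndScale` are hypothetical names, and returning `None` on overflow is an assumption of this sketch that only loosely mirrors the guarding role the message attributes to `CheckOverflow`.

```scala
import java.math.{BigDecimal => JBigDecimal, MathContext, RoundingMode}

object DecimalAverageSketch {
  // Hypothetical helper (not Spark's API): force a value to a declared
  // (precision, scale). Returns None if the rounded value needs more
  // digits than `precision` allows.
  def toPrecisionAndScale(
      value: JBigDecimal,
      precision: Int,
      scale: Int): Option[JBigDecimal] = {
    val rounded = value.setScale(scale, RoundingMode.HALF_UP)
    if (rounded.precision() > precision) None else Some(rounded)
  }

  def main(args: Array[String]): Unit = {
    val sum = new JBigDecimal("10")
    val count = new JBigDecimal("3")

    // The raw quotient carries as many fractional digits as the math
    // context allows, which need not match the declared output scale.
    val raw = sum.divide(count, MathContext.DECIMAL128)

    // Guarding the quotient brings it back to the declared type,
    // here assumed to be (38, 22) as in the commit message.
    val guarded = toPrecisionAndScale(raw, 38, 22)

    println(s"raw scale = ${raw.scale()}, guarded = $guarded")
  }
}
```

The point of the sketch is only that an unguarded quotient can carry a scale different from the declared one, so a consumer that trusts the declared type may misread the value unless a guarding step is always applied.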

## What changes were proposed in this pull request?

It looks like Avro calls `getLogger` directly to create an SLF4J logger. It would be better to use `internal.Logging` instead.

## How was this patch tested?

Existing tests.

Author: hyukjinkwon <gurwls223@apache.org>

Closes apache#21914 from HyukjinKwon/avro-log.

## What changes were proposed in this pull request?

Upgrade Apache Avro from 1.7.7 to 1.8.2. The major new features:

1. More logical types. From the spec of 1.8.2 https://avro.apache.org/docs/1.8.2/spec.html#Logical+Types we can see that, compared to [1.7.7](https://avro.apache.org/docs/1.7.7/spec.html#Logical+Types), the new version supports:
   - Date
   - Time (millisecond precision)
   - Time (microsecond precision)
   - Timestamp (millisecond precision)
   - Timestamp (microsecond precision)
   - Duration
2. Single-object encoding: https://avro.apache.org/docs/1.8.2/spec.html#single_object_encoding

This PR aims to update Apache Spark to support these new features.

## How was this patch tested?

Unit test

Author: Gengliang Wang <gengliang.wang@databricks.com>

Closes apache#21761 from gengliangwang/upgrade_avro_1.8.

## What changes were proposed in this pull request?

This PR adds support for Date/Timestamp types in a JDBC partition column (only numeric columns are supported in master). This PR also modifies the code to verify the partition column type:

```
val jdbcTable = spark.read
 .option("partitionColumn", "text")
 .option("lowerBound", "aaa")
 .option("upperBound", "zzz")
 .option("numPartitions", 2)
 .jdbc("jdbc:postgresql:postgres", "t", options)

// with this pr
org.apache.spark.sql.AnalysisException: Partition column type should be numeric, date, or timestamp, but string found.;
  at org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.verifyAndGetNormalizedPartitionColumn(JDBCRelation.scala:165)
  at org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.columnPartition(JDBCRelation.scala:85)
  at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:36)
  at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:317)

// without this pr
java.lang.NumberFormatException: For input string: "aaa"
  at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
  at java.lang.Long.parseLong(Long.java:589)
  at java.lang.Long.parseLong(Long.java:631)
  at scala.collection.immutable.StringLike$class.toLong(StringLike.scala:277)
```

Closes apache#19999

## How was this patch tested?

Added tests in `JDBCSuite`.

Author: Takeshi Yamamuro <yamamuro@apache.org>

Closes apache#21834 from maropu/SPARK-22814.

…ode test

## What changes were proposed in this pull request?

Don't set the service account name for the pod created in client mode.

## How was this patch tested?

Test should continue running smoothly in Jenkins.

Author: mcheah <mcheah@palantir.com>

Closes apache#21900 from mccheah/fix-integration-test-service-account.

dvogelbacher force-pushed the dv/upstream branch from f163925 to 3ff1116 Compare July 30, 2018 19:55

dvogelbacher requested a review from robert3005 July 30, 2018 20:04

mgaido91 and others added 6 commits July 31, 2018 09:50

updating manifest

ab0e4e7

robert3005 force-pushed the dv/upstream branch from a198e63 to ab0e4e7 Compare July 31, 2018 08:51

robert3005 merged commit dcd5aae into master Jul 31, 2018

robert3005 deleted the dv/upstream branch July 31, 2018 12:57

8 participants
