Commit
Fix files truncating according to maxRecordPerFile (databricks#180)
* Fix files truncating according to maxRecordPerFile

* toDouble
Guo Chenzhao authored and cloud-fan committed May 29, 2019
1 parent 3f92a09 commit 6b2bf9f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/main/scala/com/databricks/spark/sql/perf/Tables.scala
@@ -222,7 +222,7 @@ abstract class Tables(sqlContext: SQLContext, scaleFactor: String,
log.info(s"Data has $numRows rows clustered $clusterByPartitionColumns for $maxRecordPerFile")

if (maxRecordPerFile > 0 && numRows > maxRecordPerFile) {
-val numFiles = ((numRows)/maxRecordPerFile).ceil.toInt
+val numFiles = (numRows.toDouble/maxRecordPerFile).ceil.toInt
println(s"Coalescing into $numFiles files")
log.info(s"Coalescing into $numFiles files")
data.coalesce(numFiles).write
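
For readers skimming the one-line change: the old expression divided before converting to a floating-point value, so the fractional part was already gone by the time `ceil` ran. The sketch below reproduces the arithmetic in isolation. It assumes `numRows` is a `Long` and `maxRecordPerFile` is an `Int` (the diff does not show their declarations), and the object and method names are hypothetical, not part of `Tables.scala`.

```scala
object NumFilesSketch {
  // Pre-fix arithmetic (assumed types: Long / Int). Integer division drops the
  // remainder first, so ceil receives a whole number and has nothing to round up.
  def numFilesTruncating(numRows: Long, maxRecordPerFile: Int): Int =
    (numRows / maxRecordPerFile).toDouble.ceil.toInt

  // Post-fix arithmetic: convert to Double before dividing so the fractional
  // part survives and ceil rounds up to the next whole file.
  def numFilesCeil(numRows: Long, maxRecordPerFile: Int): Int =
    (numRows.toDouble / maxRecordPerFile).ceil.toInt

  def main(args: Array[String]): Unit = {
    val numRows = 2500L          // illustrative value, not from the commit
    val maxRecordPerFile = 1000  // illustrative value, not from the commit
    println(numFilesTruncating(numRows, maxRecordPerFile)) // prints 2
    println(numFilesCeil(numRows, maxRecordPerFile))       // prints 3
  }
}
```

With 2500 rows and a limit of 1000, the truncating form asks for 2 files, so at least one file must hold more than 1000 rows; the fixed form asks for 3. When `numRows` is an exact multiple of `maxRecordPerFile`, both forms agree.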