[SPARK-13899][SQL] Produce InternalRow instead of external Row at CSV data source #11717
Test build #53179 has finished for PR 11717 at commit
Test build #53180 has finished for PR 11717 at commit
This PR would allow to infer |
Test build #53189 has finished for PR 11717 at commit
    case dt: DecimalType =>
      val value = new BigDecimal(datum.replaceAll(",", ""))
      Decimal(value, dt.precision, dt.scale)
    // TODO(hossein): would be good to support other common timestamp formats
remove this todo?
Thanks - I'm going to merge this. There is one tiny comment. Can you remove that TODO comment in one of your other PRs?
… data source

## What changes were proposed in this pull request?

https://issues.apache.org/jira/browse/SPARK-13899

This PR makes CSV data source produce `InternalRow` instead of `Row`. Basically, this resembles JSON data source. It uses the same codes for casting.

## How was this patch tested?

Unit tests were used within IDE and code style was checked by `./dev/run_tests`.

Author: hyukjinkwon <gurwls223@gmail.com>

Closes apache#11717 from HyukjinKwon/SPARK-13899.
What changes were proposed in this pull request?
https://issues.apache.org/jira/browse/SPARK-13899
This PR makes the CSV data source produce `InternalRow` instead of `Row`. This mirrors the JSON data source, and it uses the same code for casting.
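
The decimal branch quoted in the review thread strips commas from the raw CSV token before building the value. The following is a minimal standalone sketch of that casting step, written against `java.math.BigDecimal` only so that it runs without Spark on the classpath. The object name `CsvDecimalSketch`, the method `parseDecimal`, the `HALF_UP` rounding, and the `Option`-based overflow handling are all assumptions made for illustration; they are not Spark's `Decimal` API and may not match its behavior.

```scala
import java.math.{BigDecimal => JBigDecimal, RoundingMode}

// Hypothetical helper for illustration only; not part of Spark.
object CsvDecimalSketch {
  // Parse a CSV token such as "1,234.50" into a decimal with the given
  // precision (total digits) and scale (digits after the decimal point).
  def parseDecimal(datum: String, precision: Int, scale: Int): Option[JBigDecimal] = {
    // Drop grouping commas, as the snippet under review does.
    val value = new JBigDecimal(datum.replaceAll(",", ""))
    // Assumption: round to the target scale with HALF_UP.
    val scaled = value.setScale(scale, RoundingMode.HALF_UP)
    // Assumption: signal overflow with None instead of throwing.
    if (scaled.precision() > precision) None else Some(scaled)
  }
}
```

For example, `CsvDecimalSketch.parseDecimal("1,234.50", 10, 2)` yields a value equal to `new java.math.BigDecimal("1234.50")`, while a token with more digits than the declared precision yields `None`.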
How was this patch tested?
Unit tests were run within the IDE, and code style was checked with `./dev/run_tests`.