This repository was archived by the owner on Mar 24, 2025. It is now read-only.

Conversation

@HyukjinKwon
Member

#105

Currently, this library does not support the PERMISSIVE parse mode. As in the JSON data source, this can be done in the same way with _corrupt_record.

This PR adds support for PERMISSIVE mode and makes this behaviour consistent with the other data sources that support parse modes (the JSON and CSV data sources).

Also, this PR adds support for _corrupt_record.

This PR is similar to apache/spark#11756 and apache/spark#11881.
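
For readers unfamiliar with parse modes, here is a minimal sketch of the semantics described above. It is not this library's implementation: it uses only Python's standard `xml.etree.ElementTree`, and the function name `parse_records`, its parameters, and the sample records are hypothetical names chosen for illustration. FAILFAST is not mentioned in this PR; it is included on the assumption that the mode behaves as it does in Spark's JSON and CSV data sources.

```python
import xml.etree.ElementTree as ET

CORRUPT_COLUMN = "_corrupt_record"  # column name taken from the PR description
KNOWN_MODES = {"PERMISSIVE", "DROPMALFORMED", "FAILFAST"}


def parse_records(raw_records, fields, mode="PERMISSIVE"):
    """Parse XML snippets into row dicts (hypothetical helper, illustration only).

    PERMISSIVE    keeps a malformed record as a row of nulls plus the raw text.
    DROPMALFORMED skips a malformed record.
    FAILFAST      re-raises the parse error (assumed to match Spark's JSON/CSV modes).
    """
    if mode not in KNOWN_MODES:
        raise ValueError(f"unknown parse mode: {mode}")

    rows = []
    for raw in raw_records:
        try:
            element = ET.fromstring(raw)
        except ET.ParseError:
            if mode == "FAILFAST":
                raise
            if mode == "PERMISSIVE":
                # Null out the data fields and preserve the raw text.
                row = {name: None for name in fields}
                row[CORRUPT_COLUMN] = raw
                rows.append(row)
            # DROPMALFORMED: fall through and skip the record.
            continue

        # Well-formed record: read each requested child element as text.
        row = {name: element.findtext(name) for name in fields}
        row[CORRUPT_COLUMN] = None
        rows.append(row)
    return rows


records = [
    "<book><title>A</title><price>1</price></book>",
    "<book><title>B</title><price>2</book>",  # malformed: <price> is never closed
]
fields = ["title", "price"]

permissive_rows = parse_records(records, fields, mode="PERMISSIVE")
dropped_rows = parse_records(records, fields, mode="DROPMALFORMED")
```

In this sketch, `permissive_rows` holds two rows, the second carrying the raw malformed text under `_corrupt_record`, while `dropped_rows` holds only the well-formed row.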

@codecov-io

codecov-io commented Apr 4, 2016

Current coverage is 90.53% (diff: 88.88%)

Merging #107 into master will increase coverage by 0.06%

@@             master       #107   diff @@
==========================================
  Files            14         15     +1   
  Lines           682        697    +15   
  Methods         621        638    +17   
  Messages          0          0          
  Branches         61         59     -2   
==========================================
+ Hits            617        631    +14   
- Misses           65         66     +1   
  Partials          0          0          

Powered by Codecov. Last update 5ca4579...4c9677c

@HyukjinKwon changed the title from "Support for PERMISSIVE mode and corrupt record option." to "Support for PERMISSIVE/DROPMALFORMED mode and corrupt record option." on Apr 4, 2016
@HyukjinKwon
Member Author

@cloud-fan Would you mind taking a quick look at this, please? I remember the JSON parse modes were reviewed by you.

@cloud-fan

Haven't we absorbed it into Spark already?

@HyukjinKwon
Member Author

HyukjinKwon commented Apr 25, 2016

@cloud-fan Ah, no. CSV was ported into Spark, but the XML one has not been. I have been managing this and was confused about who I should cc.

@cloud-fan

cc @rxin

@HyukjinKwon
Member Author

Sorry for cc'ing here and there, @cloud-fan. Could you please take a quick look?

@HyukjinKwon
Member Author

HyukjinKwon commented Aug 31, 2016

@cloud-fan If you don't have time to review this PR, I may merge it after double-checking it myself (but I will wait a bit longer just in case).

@HyukjinKwon
Member Author

I am going to merge this as soon as the tests pass.

@HyukjinKwon HyukjinKwon mentioned this pull request Sep 10, 2016
HyukjinKwon added a commit that referenced this pull request Sep 10, 2016
This PR prepares the release for 0.4.0.

This will include the changes below:
  - Support for PERMISSIVE/DROPMALFORMED mode and corrupt record option. #107
  - Changes default values for valueTag and attributePrefix to avoid always requiring field escapes for some APIs. #142
  - Deprecates saveAsXmlFile and promotes the usage of write(). #150
  - Deprecates xmlFile and promotes the usage of read(). #150
  - Drops 1.x compatibility from 0.4.0. #150
  - Drops support for UserDefinedType as it became private. #150

Author: hyukjinkwon <gurwls223@gmail.com>

Closes #176 from HyukjinKwon/version-0.4.0.