-
Notifications
You must be signed in to change notification settings - Fork 18.3k
Description
I'm working with hundreds of 100,000+ line tsv files. They contain quotes in inappropriate places, so I am using LazyQuotes. However, somewhere in the guts of a few of the files, there are quotes at the beginning of a field, which breaks csv.Reader. That is #6287. When that happens, and LazyQuotes is on, the reported error location is at the very end of the file. See https://play.golang.org/p/y0jOObyBTV for an illustration of this.
However, if I turn LazyQuotes off, I get an error location on some other quotes that do no harm (with LazyQuotes on). I am left with an unreadable file that I cannot even fix manually, since there are some errors somewhere in hundreds of thousands of lines, but encoding/csv does not tell me where.
The error reported when LazyQuotes is on should probably at the beginning of the field in question if EOF is reached.
Also related: #3150.
I have some workarounds available. Just adding this to the list of encoding/csv quote issues. :)