-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Quote handling issue #713
Comments
It seems that the issue is parsed due to using a bad delimiter. Is the file parsed correctly when using the right delimiter? I guess you should use the guessDelimiter functionality to avoid this kind of errors. |
Using a comma would behave better here of course, although would still be the wrong data since the file is malformed. The main point though is that the after-quote handling in papa parse would not be correct in the case of either the delimiter being present in the quoted string or newline being in the quoted string. Both of these cases are part of the reason quoting exists. It should at a minimum update these indexes if the starting point, in this case |
@jseter I've merged all your pull request but I'm not sure if this issues is already solved. Could you please check? |
I've been incrementally doing PR's for each item, but I have not submitted a PR to close #713 yet. I need to implement the strict quote option to handle the issue for this ticket. |
In my test 'questionably formatted' file, which is 2 MB the browser freezes, trying the parse the whole file as a single quote even though the quoting is properly formed where it is stuck. This completely locks the browser tab at 100% cpu, never making it into the step function until it exhausts the entire file or runs out of memory queuing errors.
In debugging, the main issue i see is that:
nextDelim and nextNewline have not been updated at this point in the code, so if they are within the quoted text they will be less than quoteSearch, used in extraSpaces function.
File delimiter is "~", but the user saved the file using ',' and settings such that all the contents ended up in the first column. the parser should have ended parsing the line at the first newline, but ends up in a loop trying to load the whole file as a single column.
ex.
The text was updated successfully, but these errors were encountered: