-
Notifications
You must be signed in to change notification settings - Fork 206
Consume invalid input #43
Comments
I saw it going into that mode when testing by hand. Also not something I don't know much about. BTW, pipes-parse is not a requirement, conduit or some such could be used too, if anyone knows how |
So I have a fix, but it requires patching I have a poc of that locally, so it does actually work, but I'm not sure if it's worth the trouble. Also I'm not sure if we'll manage to get those upstream, pinging @k0001 for that. I guess it's best to leave it for now and see if we can come up with an easier solution or if @k0001 is willing to accept the required changes. Otherwise I would just go for handrolling it without any streaming library, since I don't know anything about conduit either :) |
Whatever works. One thing I considered is to use line-delimited JSON instead of concatenated. That way it becomes a case of reading a line at a time from the stream, then parsing/processing it. If the parse fails, the line can be discarded anyway. |
I thought about that too, I thought that this might cause problems when we encode/decode some json string that contains newlines, but it looks like they are escaped so we that should be fine. |
I checked, json requires them to be escaped so this should be safe. Seems like the best solution for now, I'll have a go at it somewhat soon™. |
Great |
Long-term I'd love to use concatenated, it is great to have a user-friendly protocol that accepts pretty-indented json as input. |
I agree, but I think some sort of message framing separate from the actual message it a good idea. I was actually wondering whether we to not use one of STX / ETX / EOT or similar as a frame delimiter, they will be escaped in JSON anyway. |
Since you both think concatenated is nicer and I agree, I made a proposal in #49 of how it looks if we just pull in the necessary code from |
It seems to bring in a lot of complexity for something that should be simple. I still side with a separation between framing and parsing. |
The problem I have with using one of STX, ETX or EOT is that I don't even know how to type that. Ofc I could look it up, but it makes it harder for people to use. On the other hand adding a bit of complexity makes it pretty obvious for users and it's encapsulated so I don't think the complexity is that bad here. |
What happens if we get malformed JSON, with unbalanced braces? |
Attoparsec supports partial parses, so if you input a single |
So if I send
and then do the rest of my processing with valid JSON will will wait On Mon, Nov 9, 2015 at 10:51 AM, Moritz Kiefer notifications@github.com
|
Yep, I hadn't thought about that. Ok, I'll drop the idea of concatenated input, we really want some terminator. |
Thats the conclusion I had come to. I suggest STX. On Mon, Nov 9, 2015 at 11:08 AM, Moritz Kiefer notifications@github.com
|
It is not a good idea to have Nevertheless, that shouldn't be a problem. Once you have delimiters in your stream (say, newline markers) you can try different things: For example, by using Another alternative that works without bar = (fmap Maybe foo <|> (Nothing <$ takeWhile (/= '\n)) <* char '\n' That is, |
I wasn't trying to suggest that you replace the existing functions but rather add additional functions with the discard behavior. Anyway, since we decided to go for a separator approach this is not necessary. I tried figuring out how to to do it using |
@k0001 are you prepared to give it a go? |
@alanz I don't follow the development of haskell-ide-engine myself so I'm not sure what issue is being discussed here. But sure, I can help. @cocreature what do you think about working on this together? That way I can learn a bit more about about haskell-ide-engine and you can learn a bit more about pipes. |
@k0001 So I was trying to nicely write up where I failed but during that I think I found a solution :) I used |
I'm against trying to fix up a broken stream. Desync problems should be fatal. Rationale: I see two cases when invalid input on json-concatenated channel may happen:
Note that using STX or similar does not help on higher level as it causes at least one innocent message to be garbled and lost. Emacs treats this channel as reliable and I do not foresee any mechanisms to compensate for lost messages. I do not think any other integration will implement such a thing either. That is why I would prefer to just die when such an error happens. |
yes, but we need to be able to detect it. If we just use concatenated JSON and get a malformed message (missing If we have a delimeter, the transport can report a message back indicating Just dying will lose the debug opportunity. On Wed, Nov 11, 2015 at 9:52 AM, Gracjan Polak notifications@github.com
|
Reporting issue is good. But please do not do any kind of recovery there.
|
Expect json input separated by STX, fix #43
pipes-parse doesn't consume the input if it doesn't parse which results in us repeatedly trying to parse the same input which results in an perpetual flood of error messages.
My pipes foo is too weak to figure out how to fix this right now.
The text was updated successfully, but these errors were encountered: