Zero copy separatedby #197

Zij-IT · 2022-08-26T17:35:11Z

Thanks to your assistance with getting the parser to compile, I was able to give it a run and found that despite having changed none of the parsing logic itself, my Lua parser couldn't correctly parse some of the previous test items, such as the following:

x, y = 2, 3

The error was that the token representing 3 wasn't expected... which is odd because both the left and the right are basically comma separated lists, so it should have failed on y. I realized a subtle difference though:

The lhs was parsed effectively with this:

just(ident).then(just(comma).ignore_then(ident).repeated())

And the rhs was parsed with this:

expr.separated_by(comma).at_least(1)

I also noticed that this: x = 2, would successfully parse without having set allow_trailing. Since these should (I believe) always parse the same input to the same end result, I looked into how parsing for separated was done, and found this subtle bug:

match self.parser.go::<M>(inp) {
    Ok(out) => {
        /* snip */

        let before_separator = inp.save();
        if let Err(e) = self.separator.go::<Check>(inp) {
            inp.rewind(before_separator);
            break if count >= self.at_least {
                Ok(output)
            } else {
                Err(e)
            };
        }

        /* snip */
    }
    Err(e) => { /* snip */ }
}

The current implementation will always parse the separator after parsing the item. I was able to confirm this by using the tests from master and just moved them over to zero-copy, as well as updated the implementation. I tried to document my implementation well enough without going overboard, so hopefully it is up to par with what you expect.

…-zero-copy SeparatedBy * Both `separated_by_at_least_without_trailing` and `separated_by_leaves_last_separator` fail

zesterer · 2022-08-26T18:09:20Z

Nice catch! It's frustrating how incredibly subtle the logic for separated_by is. Do you know whether this bug affects separated_by_exactly too?

Zij-IT · 2022-08-26T18:21:42Z

As far as I can tell the bug only affects SeparatedBy because with SeparatedByExactly you can safely parse the item-separator pairs until the last one, in which the separator is only parsed if the allow_trailing is set

zesterer · 2022-08-26T18:23:49Z

Makes sense. Thanks for the change!

zesterer · 2022-08-26T18:27:37Z

Oh by the way, make sure to add yourself to the authors field in the Cargo.toml. I keep forgetting to mention this to contributors, but I think it's worth doing :)

Zij-IT added 2 commits August 26, 2022 16:58

ZeroCopySeparatedBy: Add tests from master used for testing the non…

60b58b8

…-zero-copy SeparatedBy * Both `separated_by_at_least_without_trailing` and `separated_by_leaves_last_separator` fail

ZeroCopySeparatedBy: Add reworked version that passes tests

8b71975

zesterer merged commit 88f3b04 into zesterer:zero-copy Aug 26, 2022

Zij-IT mentioned this pull request Aug 26, 2022

Support for zero-copy parsing #9

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zero copy separatedby #197

Zero copy separatedby #197

Zij-IT commented Aug 26, 2022 •

edited

Loading

zesterer commented Aug 26, 2022

Zij-IT commented Aug 26, 2022 •

edited

Loading

zesterer commented Aug 26, 2022

zesterer commented Aug 26, 2022

Zero copy separatedby #197

Zero copy separatedby #197

Conversation

Zij-IT commented Aug 26, 2022 • edited Loading

zesterer commented Aug 26, 2022

Zij-IT commented Aug 26, 2022 • edited Loading

zesterer commented Aug 26, 2022

zesterer commented Aug 26, 2022

Zij-IT commented Aug 26, 2022 •

edited

Loading

Zij-IT commented Aug 26, 2022 •

edited

Loading