Grammar pattern matching ignoring 'end' rule #52

garborg · 2015-07-23T19:51:13Z

A line matches the 'begin' pattern of two rules. It doesn't match the 'end' pattern for the first rule and does match the 'end' pattern for the second rule. I'd expect the second rule to win, but the first rule does.

Example at end.

Is this expected, and if so, any ideas on workarounds given that the contents between 'begin' and 'end' must match '$source' (or a large subset thereof)?

test-spec.coffee

describe "Test grammar", ->
  grammar = null
  beforeEach ->
    waitsForPromise ->
      atom.packages.activatePackage("language-test")
    runs ->
      grammar = atom.grammars.grammarForScopeName("source.test")

  it "parses the grammar", ->
    expect(grammar).toBeDefined()
    expect(grammar.scopeName).toBe "source.test"

 # this fails
  it "does what I want", ->
    {tokens} = grammar.tokenizeLine("f(x)yy")
    console.log(tokens)
    expect(tokens[0]).toEqual value: "f", scopes: ["source.test", "fcall.test", "support.function.test"]
    expect(tokens[1]).toEqual value: "(", scopes: ["source.test", "fcall.test", "paren.open.call.test"]
    expect(tokens[2]).toEqual value: "x", scopes: ["source.test", "fcall.test"]
    expect(tokens[3]).toEqual value: ")", scopes: ["source.test", "fcall.test", "paren.close.call.test"]
    expect(tokens[4]).toEqual value: "yy", scopes: ["source.test"]

 # this passes
  it "does what I don't want", ->
    {tokens} = grammar.tokenizeLine("f(x)yy")
    console.log(tokens)
    expect(tokens[0]).toEqual value: "f", scopes: ["source.test", "fdecl.test", "entity.name.function.test"]
    expect(tokens[1]).toEqual value: "(", scopes: ["source.test", "fdecl.test", "paren.open.decl.test"]
    expect(tokens[2]).toEqual value: "x)yy", scopes: ["source.test", "fdecl.test"]

test.cson

fileTypes: [
  "tst"
]
name: "Test"
patterns: [
  {
    include: "#function_decl"
  }
  {
    include: "#function_call"
  }
]
repository:
  function_call:
    begin: "([[:alpha:]_][[:word:]!]*)(\\()"
    beginCaptures:
      "1":
        name: "support.function.test"
      "2":
        name: "paren.open.call.test"
    end: "(\\))"
    endCaptures:
      "1":
        name: "paren.close.call.test"
    patterns: [
      {
        include: "$self"
      }
    ]
    name: "fcall.test"
  function_decl:
    begin: "([[:alpha:]_][[:word:]!]*)(\\()"
    beginCaptures:
      "1":
        name: "entity.name.function.test"
      "2":
        name: "paren.open.decl.test"
    end: "(\\))(\\s*=)"
    endCaptures:
      "1":
        name: "paren.close.decl"
      "2":
        name: "keyword.operator.update.test"
    patterns: [
      {
        include: "$self"
      }
    ]
    name: "fdecl.test"
scopeName: "source.test"

garborg · 2015-07-24T00:30:32Z

Cloning https://github.com/garborg/atom-language-test, apm link, atom ., & cmd+alt+ctrl+p replicates it for me. Thanks for looking into this. With the issue, I seem to be left with some ugly, brittle workarounds, so I'd really like to know if it should work as I had expected.

garborg · 2015-07-28T00:01:22Z

Apologies, if you cloned my repo and the and had trouble testing -- I pushed the package.json now so apm link gets the package name right.

garborg · 2015-07-28T00:05:29Z

In case the previous example isn't simple enough for a quick response, here's a minimal example (the boots branch of https://github.com/garborg/atom-language-test):

test.cson

#...
patterns: [
  {
    include: "#p1"
  }
  {
    include: "#p2"
  }
]
repository:
  p1:
    begin: "cat"
    end: "hat"
    name: "p1.hat.test"
  p2:
    begin: "cat"
    end: "boots"
    name: "p2.boots.test"

test-spec.coffee

# ...
  it "passes the baseline", -> # passes
    {tokens} = grammar.tokenizeLine("catinhat")
    console.log(tokens)
    expect(tokens[0]).toEqual value: "cat", scopes: ["source.test", "p1.hat.test"]
    expect(tokens[1]).toEqual value: "in", scopes: ["source.test", "p1.hat.test"]
    expect(tokens[2]).toEqual value: "hat", scopes: ["source.test", "p1.hat.test"]

  it "understands 'inboots' doesn't contain 'hat' but contains 'boots'", -> # fails
    {tokens} = grammar.tokenizeLine("catinboots")
    console.log(tokens)
    expect(tokens[0]).toEqual value: "cat", scopes: ["source.test", "p2.boots.test"]
    expect(tokens[1]).toEqual value: "in", scopes: ["source.test", "p2.boots.test"]
    expect(tokens[2]).toEqual value: "boots", scopes: ["source.test", "p2.boots.test"]

garborg · 2015-07-28T00:18:07Z

Context: This is coming up improving the grammar for Julia, where in between 'begin' and 'end', the contents may be nearly arbitrary Julia.

Without a way around this, I can't see how to consistently differentiate between, say, a function call (f([...])) and a one-line function definition (f([...]) = [...]), or between a parameterized type (S{[...]}), constructing a parameterized type (S{[...]}([...])), and a one-line, parameterized method definition (f{[...]}([...]) = [...]).

Is ignoring 'end' like this expected?

Thanks.

winstliu · 2015-07-28T00:32:51Z

Reproduced. Also, as expected, if you move p2 to become the first pattern, then test 1 will fail and test 2 will pass.

garborg · 2015-08-17T19:15:32Z

Is there anything I can do to help this along?

I don't spend too much of my time in JS, but if someone pointed me to the right place to start, I'd be happy to take a look, with the caveat that if finding a fix meant stealing more developer time, I'd throw it back rather than creating extra work for everyone.

garborg · 2015-08-24T17:10:09Z

Are any collaborators on this project planning to dig into this? Thanks, Sean

sglyon · 2015-08-28T12:46:20Z

Ping. I could also take a look and work under the same conditions that @garborg laid out. I just don't know where to look

Ingramz · 2015-08-31T08:09:59Z

https://manual.macromates.com/en/language_grammars#language_rules

The behavior that it only matches the first rule only is by design.

garborg · 2015-09-04T16:06:59Z

Thanks, @Ingramz. I suppose this is a dead end then. It will unfortunately keep us from having reliable highlighting, etc., unless/until a more powerful grammar is supported in Atom.

Here's what I found on that topic:
#50
atom/atom#8669
#34

Perhaps this close this once someone in the know links any other relevant discussions?

winstliu added the needs-reproduction label Jul 24, 2015

winstliu added bug and removed needs-reproduction labels Jul 28, 2015

garborg mentioned this issue Aug 2, 2015

Docstring Highlighting JuliaEditorSupport/atom-language-julia#33

Closed

winstliu added the help-wanted label Aug 28, 2015

mojavelinux mentioned this issue Apr 25, 2016

Code block for properties continue parsing after the end of the code block asciidoctor/atom-language-asciidoc#59

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grammar pattern matching ignoring 'end' rule #52

Grammar pattern matching ignoring 'end' rule #52

garborg commented Jul 23, 2015

garborg commented Jul 24, 2015

garborg commented Jul 28, 2015

garborg commented Jul 28, 2015

garborg commented Jul 28, 2015

winstliu commented Jul 28, 2015

garborg commented Aug 17, 2015

garborg commented Aug 24, 2015

sglyon commented Aug 28, 2015

Ingramz commented Aug 31, 2015

garborg commented Sep 4, 2015

Grammar pattern matching ignoring 'end' rule #52

Grammar pattern matching ignoring 'end' rule #52

Comments

garborg commented Jul 23, 2015

garborg commented Jul 24, 2015

garborg commented Jul 28, 2015

garborg commented Jul 28, 2015

garborg commented Jul 28, 2015

winstliu commented Jul 28, 2015

garborg commented Aug 17, 2015

garborg commented Aug 24, 2015

sglyon commented Aug 28, 2015

Ingramz commented Aug 31, 2015

garborg commented Sep 4, 2015