Parser should point to the name of non-existent functions rather than the end of the line #2637

bgoodri · 2018-09-13T17:06:01Z

Summary:

If a user tries to use a function that does not exist or otherwise specifies it wrong, the parser says something like error in ... at line X, column Y where Y is the end of the line. This confuses RStudio's error flagging (which goes off Y) and makes it look like the problem is with whatever is the last argument to the function, rather than the function itself.

Description:

See rstudio/rstudio#3474

Reproducible Steps:

Parse this

data {
  real mu;
  real<lower=0> sigma;
}
transformed data {
  real z = foo(mu, sigma);
}

Current Output:

SYNTAX ERROR, MESSAGE(S) FROM PARSER:

No matches for: 

  foo(real, real)

Function foo not found.
  error in 'model573502d9042_foo' at line 6, column 26
  -------------------------------------------------
     4: }
     5: transformed data {
     6:   real z = foo(mu, sigma);
                                 ^
     7: }
  -------------------------------------------------

Expected Output:

error in 'model573502d9042_foo' at line 6, column 14 (or maybe 12, something pointing to foo instead of the semicolon)

Additional Information:

Not a huge priority

Current Version:

v2.18.0

The text was updated successfully, but these errors were encountered:

bob-carpenter · 2018-09-14T13:39:48Z

What's wrong is that there's no signature to match the argument types. So it really is something that happens after you see the arguments. It could happen earlier if there's a way to tell if some initial arguments are inconsistent with all possibilities.

It might be possible to change the error message, but given the way it happens in the parser which is as things go along, it might not be so easy.

VMatthijs · 2018-12-13T12:30:52Z

Stanc3 now produces

Semantic error at file "test.stan", line 6, characters 11-25:
   -------------------------------------------------
     4:  }
     5:  transformed data {
     6:    real z = foo(mu, sigma);
                    ^
     7:  }
   -------------------------------------------------

A returning function was expected but an undeclared identifier foo was supplied.

bob-carpenter · 2018-12-13T13:39:41Z

I'm not sure how much we want the language to be things like "identifier", which is compiler jargon more than something that'll be familiar to most users. I'd say something like "unknown function foo". What happens if foo is declared but not for those arguments?

…

On Dec 13, 2018, at 7:30 AM, Matthijs Vákár ***@***.***> wrote: Stanc3 now produces Semantic error at file "test.stan", line 6, characters 11-25: ------------------------------------------------- 4: } 5: transformed data { 6: real z = foo(mu, sigma); ^ 7: } ------------------------------------------------- A returning function was expected but an undeclared identifier foo was supplied.

VMatthijs · 2018-12-13T14:15:03Z

Sure. Perhaps "variable" would be better? We could always specialize the message further to functions etc., but currently it is thrown elsewhere in a more generic fashion, when it's looking up an identifier in the symbol table.

If we have a model

functions {
  real foo(real[] x, vector y, int z) {return 3;}
}
data {
  real mu;
  real<lower=0> sigma;
}
transformed data {
  real z = foo(mu, sigma);
}

it says

Semantic error at file "test.stan", line 9, characters 11-25:
   -------------------------------------------------
     7:  }
     8:  transformed data {
     9:    real z = foo(mu, sigma);
                    ^
    10:  }
   -------------------------------------------------

Ill-typed arguments supplied to function foo. Available signatures:
(real[], vector, int) => real
Instead supplied arguments of incompatible type: real, real.

In case of a library function, it would list all the signatures here.

I feel like we should decide on a notation for function types for when we add them to the language (could be done swiftly once we decide on a notation) and I'd like to make the notation in error messages forward compatible with that notation. Currently, I'm writing (X, Y) => Z, but I could also imagine Fun(X, Y; Z) or (X, Y) -> Z.

bob-carpenter · 2018-12-13T14:37:20Z

That's great. But I wouldn't refer to "foo" as a variable. Our users aren't going to be thinking like a parser. They're going to think of "foo" as a function, not as a general identifier or as a variable, even though we can think of it as an identifier being used as a variable in programming language terms.

…

On Dec 13, 2018, at 9:15 AM, Matthijs Vákár ***@***.***> wrote: Sure. Perhaps "variable" would be better? We could always specialize the message further to functions etc., but currently it is thrown elsewhere in a more generic fashion, when it's looking up an identifier in the symbol table. If we have a model functions { real foo(real[] x, vector y, int z) {return 3;} } data { real mu; real<lower=0> sigma; } transformed data { real z = foo(mu, sigma); } it says Semantic error at file "test.stan", line 9, characters 11-25: ------------------------------------------------- 7: } 8: transformed data { 9: real z = foo(mu, sigma); ^ 10: } ------------------------------------------------- Ill-typed arguments supplied to function foo. Available signatures: (real[], vector, int) => real Instead supplied arguments of incompatible type: real, real. In case of a library function, it would list all the signatures here. — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

mcol · 2020-02-21T17:27:32Z

Fixed in stanc3. Current message is:

Semantic error in 'tmp.stan', line 6, column 11 to column 25:
   -------------------------------------------------
     4:  }
     5:  transformed data {
     6:    real z = foo(mu, sigma);
                    ^
     7:  }
   -------------------------------------------------

A returning function was expected but an undeclared identifier 'foo' was supplied.

bob-carpenter added language code cleanup labels Sep 14, 2018

bob-carpenter added this to the v3 milestone Sep 14, 2018

mcol closed this as completed Feb 21, 2020

mcol modified the milestones: v3, 2.22.0 Feb 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parser should point to the name of non-existent functions rather than the end of the line #2637

Parser should point to the name of non-existent functions rather than the end of the line #2637

bgoodri commented Sep 13, 2018

bob-carpenter commented Sep 14, 2018

VMatthijs commented Dec 13, 2018

bob-carpenter commented Dec 13, 2018 via email

VMatthijs commented Dec 13, 2018 •

edited

Loading

bob-carpenter commented Dec 13, 2018 via email

mcol commented Feb 21, 2020

Parser should point to the name of non-existent functions rather than the end of the line #2637

Parser should point to the name of non-existent functions rather than the end of the line #2637

Comments

bgoodri commented Sep 13, 2018

Summary:

Description:

Reproducible Steps:

Current Output:

Expected Output:

Additional Information:

Current Version:

bob-carpenter commented Sep 14, 2018

VMatthijs commented Dec 13, 2018

bob-carpenter commented Dec 13, 2018 via email

VMatthijs commented Dec 13, 2018 • edited Loading

bob-carpenter commented Dec 13, 2018 via email

mcol commented Feb 21, 2020

VMatthijs commented Dec 13, 2018 •

edited

Loading