Parsing future opam files take 2 #44

dra27 · 2021-04-23T11:39:43Z

This is complete reimplementation of #43 with the following changes:

The first attempt couldn't cope with, say, opam-version: "42" < because the parsing error occurs before the Variable has been completely parsed. This revised version instead reads three tokens from the lexer and so "sniff" the opam-version before parsing starts. With this approach, there's no need to do all of the global state mucking around - we already know if the opam file is a "future" one and so any exception can instead just return the opam-version line we successfully parsed. In order to allow a future version of opam to intentionally stick with an old parser (e.g. because opam 2.2 adds no new lexing/parsing rules), as well as the opam-version item, a sentinel section of kind # is returned which opam can then use to determine that there was actually a parse error.

Finally, I thought of a devious, yet curiously actually valid, way of raising OpamLexer.Error from OpamBaseParser.main which means that opam-version in the wrong place now includes a description of the problem rather than just "parse error".

Completely reimplements the best-effort parsing mode. The previous attempt failed if a lexing or parsing error occured before the opam-version item had been fully parsed. This revised version begins by reading three tokens from the lexer directly to parse the opam-version, if it's present. It _then_ calls the ocamlyacc parser (feeding the three tokens back to it). If the opam-version parsed is greater than the library's internal version, then _any_ exception causes just the parsed opam-version header to be returned (which is sufficient for the client to display that the file is newer than itself). Finally, in order to permit, for example, opam 2.2 to have new fields but no new lexer or parser, exceptions cause both the opam-version variable to be returned and also a sentinel group with kind `#` which can be used by the client to determine that a parsing/lexing error occurred (and where).

AltGr · 2021-05-05T12:52:41Z

Wow, quite complex but seems to do the job — I trust you have tested it in various scenarios ;)

The parsing part is LGTM, but I think you should also have a look at the printer to enforce the invariants there too: may make some mistakes much easier to spot and fix.
Also the Printer.Normalise / Printer.FullPos.Normalise I think needs to be patched to still be valid syntax with this change ?

dra27 · 2021-05-05T13:51:15Z

Oh, I hadn't checked Normalise. For the others (Preserved, etc.) I don't think anything wants doing - there's a certain of "garbage in garbage out" - so if you feed an invalid file to the printer (e.g. where you've put the opam-version: "2.1" field further down the list then it will print it. I wasn't sure if we want it to fail (or even silently correct) the file at that point?

dra27 · 2021-05-05T13:51:38Z

For the correctness, I added some additional tests for it 🙂

opam-version with a value greater than 2.1 always comes first.

dra27 · 2021-05-06T13:30:14Z

Extra commit pushed - I should concoct some tests for it, though.

AltGr · 2021-05-06T16:13:21Z

Does the Normalise module outside of FullPos not need the change because it is older anyway ? Just making sure.

dra27 · 2021-05-06T16:37:23Z

🤦

dra27 · 2021-05-06T16:41:39Z

We should probably refactor the printer code completely to convert the old format to the full pos one (with dummy locations) and then print that...

dra27 · 2021-05-19T16:39:18Z

That extra commit causes all of the opamfile and items functions to raise Invalid_argument if they're passed a list of items which violate the opam-version placement rules.

I think this is ready to go now?

dra27 mentioned this pull request Apr 23, 2021

Bump to opam-file-format 2.1.3 ocaml/opam#4639

Merged

Demonstrate why old compilers are baaaad

1ef7b8f

Fix OpamPrinter.Normalise.items

fa9b402

opam-version with a value greater than 2.1 always comes first.

Fix OpamPrinter.FullPos.Normalise *and* OpamPrinter.Normalise!

1bb9405

Raise Invalid_argument in printers

b6eac43

A few more tests

5211b61

AltGr merged commit 007cda4 into ocaml:master May 20, 2021

dra27 deleted the back-to-the-future branch March 10, 2024 13:21

dra27 mentioned this pull request Mar 10, 2024

Silently mark packages requiring an unsupported version of opam as unavailable ocaml/opam#5665

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parsing future opam files take 2 #44

Parsing future opam files take 2 #44

dra27 commented Apr 23, 2021 •

edited

Loading

AltGr commented May 5, 2021

dra27 commented May 5, 2021

dra27 commented May 5, 2021

dra27 commented May 6, 2021

AltGr commented May 6, 2021

dra27 commented May 6, 2021

dra27 commented May 6, 2021

dra27 commented May 19, 2021

Parsing future opam files take 2 #44

Parsing future opam files take 2 #44

Conversation

dra27 commented Apr 23, 2021 • edited Loading

AltGr commented May 5, 2021

dra27 commented May 5, 2021

dra27 commented May 5, 2021

dra27 commented May 6, 2021

AltGr commented May 6, 2021

dra27 commented May 6, 2021

dra27 commented May 6, 2021

dra27 commented May 19, 2021

dra27 commented Apr 23, 2021 •

edited

Loading