test_runner: add TAP parser #43525

manekinekko · 2022-06-21T16:19:49Z

This PR adds initial support for a TAP LL(1) parser. This implementation is based on the grammar for TAP14 from https://testanything.org/tap-version-14-specification.html

TODO:

add a TAP checker (by design, the current parser does only parsing).
add parallel tests for the TAP lexer
add parallel tests for the TAP parser
add parallel tests for the TAP checker
add async parsing
add more js docs
integrate the new parser into the existing node:test runner implementation (shoutout to @MoLow for their help)
fix linting errors (make lint)

Refs: #43344
Signed-off-by: Wassim Chegham github@wassim.dev

@benjamincburns @fhinkel @cjihrig

lib/internal/test_runner/tap_lexer.js

lib/internal/test_runner/tap_checker.js

cjihrig · 2022-09-12T14:05:16Z

@manekinekko is there any update on this? I think this is probably the highest priority item for the test runner at this point.

MoLow · 2022-09-12T14:14:16Z

I think this is probably the highest priority item for the test runner at this point.

+1.
if there is any help needed with this LMK.

additionaly - will this solution support arbitrary console.log statements? #44372

manekinekko · 2022-09-12T15:15:49Z

Sorry for the delay guys. Had to deliver some work lately.

So, the biggest challenge I need to work on solving right now is redesigning the lexer so it can support async scanning and make the parser support partial parsing, and emit the results once they are available (as discussed with @cjihrig over Twitter).

A draft of a public API I am thinking of would look something like this:

  const parser = new TapParser(/*  probably a flag to enable stream support */);
  child.stdout.on('data', (chunk) => {
   parser.parse(chunk);
  });
  child.stdout.on('end', () => {
   const status = parser.end();
  assert.strictEqual(status.ok, true);
  });

@MoLow @cjihrig do you have any ideas about a different design / or a concern?

cjihrig · 2022-09-12T15:22:09Z

Is the lexer right now reading individual characters? If so, I think we can leverage the fact that TAP is line based. That would make it significantly simpler than a parser for something like a programming language. I think that would also make it simpler to handle streaming.

manekinekko · 2022-09-12T16:27:59Z

The parser is already splitting tokens (scanned by the lexer) into subsets, separated by EOL. The resulting array basically contains all tokens scanned at each TAP line.

from   [ token1, EOL, token2, EOL, ..., EOL, tokenN-1, tokenN ]
to     [ [token1], [token2], [...], [tokenN-1, tokenN] ]

So I was thinking of leveraging this logic for streams. is that was you were referring to?

cjihrig · 2022-09-12T17:00:40Z

I guess I'm just wondering if we need a full blown lexer. Would it be simpler (less code, faster for us to get something shipped) to read input, turn it into lines, and parse each line. There are only a handful of line types outside of yaml blocks, and we can generally distinguish them by looking at just the first few characters of the line (whitespace, 'ok', 'not ok', 'TAP version', etc.).

MoLow · 2022-09-12T17:25:07Z

@cjihrig how would you expect a console.log or any invalid tap to behave?

MoLow · 2022-09-12T17:26:59Z

we can generally distinguish them by looking at just the first few characters of the line (whitespace, 'ok', 'not ok', 'TAP version', etc.).

There is something vere similar here https://github.com/nodejs/tap2junit just FYI

cjihrig · 2022-09-12T17:38:43Z

@cjihrig how would you expect a console.log or any invalid tap to behave?

According to the spec:

Any output that is not a version, a plan, a test line, a YAML block,
a diagnostic or a bail out is incorrect. How a harness handles the
incorrect line is undefined. Test::Harness silently ignores incorrect
lines, but will become more stringent in the future. TAP::Harness
reports TAP syntax errors at the end of a test run.

and

In this document, the “harness” is any program analyzing TAP
output. Typically this will be Perl’s runtests program, or the underlying
TAP::Harness-runtests> method. A harness must only read TAP
output from standard output and not from standard error. Lines written
to standard output matching /^(not )?ok\b/ must be interpreted as test
lines. After a test line a block of lines starting with ‘—’ and ending with
‘…’ will be interpreted as an inline YAML document providing extended
diagnostic information about the preceding test. All other lines must not
be considered test output.

I think if there are extra lines, that might come from console.log()s, we should try to keep them but definitely not error because users would not be able to add logging statements in their tests. If the TAP lines themselves are malformed I think we should report an error (unless I missed something about this in the spec, in which case I defer to the spec).

manekinekko · 2022-09-12T19:18:55Z

I agree with @cjihrig on the specs. The parser in this PR will error if a line starts with an unrecognized token. Pragmas are parsed but not yet applied.

Could we print those console.logs as comments?

MoLow · 2022-09-12T19:33:48Z

I think it is ok to print invalid tap as a comment or something else, but we should not just ignore it

manekinekko · 2022-09-16T10:31:42Z

@cjihrig @MoLow wdyt about this API?

  const args = ['--test', testFixtures];
  const child = spawn(process.execPath, args);

  const parser = new TapParser();          //<-- create a parser instance

  child.stdout.on('data', (chunk) => {
    const line = chunk.toString('utf-8');
    const test = parser.parseChunk(line);  //<--- call parseChunk() 
    console.log(test);
  });

The test output can be either:

undefined (when we parse non-testpoint entries like pragmas or comments)
or the last parsed test node (extracted from the AST):

{
  "status": { fail: false, pass: true, todo: false, skip: false },
  "id": "1",
  "description": "/home/wassimchegham/oss/@nodejs/node/test/fixtures/test-runner/index.test.js",
  "reason": "",
  diagnostics: [ 'duration_ms: 0.033821156', 'duration_ms: 0.033821156' ]
}

MoLow · 2022-09-16T11:40:52Z

@manekinekko I would expect something based on node:readline, like:

class TapParser extends readline.InterfaceConstructor {
	constructor(input, output) {
      super(input, output);
      this.on('line', (line) => {
		..parse the line and
		this.emit('test') / this.emit('diagnostic') etc
	  })
    }
}

either that or

class TapParser extends TapStream {
	constructor(input) {
      this.handle = readline.createInterface({ input });
      this.handle.on('line', (line) => {
		..parse the line and
		this.ok(data) / this.fail(data) etc
	  })
    }
}

MoLow · 2022-09-16T11:48:24Z

that is obviously very simplistic since TAP parsing can include multilines, etc

nodejs-github-bot · 2022-11-21T22:50:23Z

Landed in f8ce911

manekinekko · 2022-11-22T06:09:56Z

OMG!! That was one hell of a ride!! Thanks y'all for taking the time to review and comment on this work!

Special Thank You to @cjihrig @MoLow @fhinkel @benjamingr for your help and for being such great mentors ❤️

Already looking forward to my next contributions 👍

ljharb · 2022-11-22T22:52:47Z

cc @nodejs/testing

Work in progress PR-URL: nodejs#43525 Refs: nodejs#43344 Reviewed-By: Franziska Hinkelmann <franziska.hinkelmann@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Moshe Atlow <moshe@atlow.co.il>

Work in progress PR-URL: #43525 Refs: #43344 Reviewed-By: Franziska Hinkelmann <franziska.hinkelmann@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Moshe Atlow <moshe@atlow.co.il>

Work in progress PR-URL: nodejs#43525 Refs: nodejs#43344 Reviewed-By: Franziska Hinkelmann <franziska.hinkelmann@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Moshe Atlow <moshe@atlow.co.il>

Work in progress PR-URL: #43525 Refs: #43344 Reviewed-By: Franziska Hinkelmann <franziska.hinkelmann@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Moshe Atlow <moshe@atlow.co.il>

Work in progress PR-URL: nodejs/node#43525 Refs: nodejs/node#43344 Reviewed-By: Franziska Hinkelmann <franziska.hinkelmann@gmail.com> Reviewed-By: Colin Ihrig <cjihrig@gmail.com> Reviewed-By: Moshe Atlow <moshe@atlow.co.il> (cherry picked from commit f8ce9117b19702487eb600493d941f7876e00e01)

nodejs-github-bot added needs-ci PRs that need a full CI run. test_runner Issues and PRs related to the test runner subsystem. labels Jun 21, 2022

manekinekko force-pushed the tap-14-parser branch from 3eec379 to 01e0520 Compare June 21, 2022 16:34

MoLow reviewed Jun 21, 2022

View reviewed changes

lib/internal/test_runner/tap_lexer.js Show resolved Hide resolved

manekinekko force-pushed the tap-14-parser branch from 01e0520 to 180d2fb Compare June 21, 2022 17:57

aduh95 reviewed Jun 22, 2022

View reviewed changes

lib/internal/test_runner/tap_lexer.js Outdated Show resolved Hide resolved

manekinekko changed the title ~~feat: add TAP parser~~ test_runner: add TAP parser Jun 22, 2022

manekinekko force-pushed the tap-14-parser branch 2 times, most recently from 5db6310 to 4f44c29 Compare June 27, 2022 15:23

aduh95 added dont-land-on-v14.x labels Jul 18, 2022

This was referenced Jul 29, 2022

test_runner: update output TAP format to follow TAP 14 specs #44040

Closed

test: update tap reporter version to 14 #44041

Closed

Mifrill reviewed Jul 29, 2022

View reviewed changes

lib/internal/test_runner/tap_checker.js Show resolved Hide resolved

targos removed the dont-land-on-v16.x label Jul 31, 2022

manekinekko force-pushed the tap-14-parser branch from 097797e to a94927e Compare September 16, 2022 13:29

nodejs-github-bot merged commit f8ce911 into nodejs:main Nov 21, 2022

github-actions bot mentioned this pull request Nov 22, 2022

CI Reliability 2022-11-22 nodejs/reliability#436

Open

15 tasks

This was referenced Nov 22, 2022

console.log in node:test ? #44372

Closed

Test count may not be as useful as it could be #43344

Closed

github-actions bot mentioned this pull request Nov 23, 2022

CI Reliability 2022-11-23 nodejs/reliability#437

Open

13 tasks

github-actions bot mentioned this pull request Nov 24, 2022

CI Reliability 2022-11-24 nodejs/reliability#438

Open

16 tasks

MoLow mentioned this pull request Nov 24, 2022

[v16.x backport] backport multiple test runner features #45602

Closed

ruyadorno mentioned this pull request Nov 24, 2022

v19.2.0 proposal #45615

Merged

cjihrig mentioned this pull request Nov 27, 2022

Feature request: test runner reporters #45648

Closed

danielleadams mentioned this pull request Dec 30, 2022

v18.13.0 release proposal #46025

Merged

cjihrig mentioned this pull request Jan 4, 2023

Tap parser fails if a test logs a number #46048

Closed

MoLow mentioned this pull request May 15, 2023

Test Runner: delegate arbitrary output formatting to each reporter #48011

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test_runner: add TAP parser #43525

test_runner: add TAP parser #43525

manekinekko commented Jun 21, 2022 •

edited

Loading

cjihrig commented Sep 12, 2022

MoLow commented Sep 12, 2022 •

edited

Loading

manekinekko commented Sep 12, 2022

cjihrig commented Sep 12, 2022

manekinekko commented Sep 12, 2022 •

edited

Loading

cjihrig commented Sep 12, 2022

MoLow commented Sep 12, 2022

MoLow commented Sep 12, 2022

cjihrig commented Sep 12, 2022

manekinekko commented Sep 12, 2022 •

edited

Loading

MoLow commented Sep 12, 2022

manekinekko commented Sep 16, 2022 •

edited

Loading

MoLow commented Sep 16, 2022 •

edited

Loading

MoLow commented Sep 16, 2022

nodejs-github-bot commented Nov 21, 2022

manekinekko commented Nov 22, 2022 •

edited

Loading

ljharb commented Nov 22, 2022

test_runner: add TAP parser #43525

test_runner: add TAP parser #43525

Conversation

manekinekko commented Jun 21, 2022 • edited Loading

cjihrig commented Sep 12, 2022

MoLow commented Sep 12, 2022 • edited Loading

manekinekko commented Sep 12, 2022

cjihrig commented Sep 12, 2022

manekinekko commented Sep 12, 2022 • edited Loading

cjihrig commented Sep 12, 2022

MoLow commented Sep 12, 2022

MoLow commented Sep 12, 2022

cjihrig commented Sep 12, 2022

manekinekko commented Sep 12, 2022 • edited Loading

MoLow commented Sep 12, 2022

manekinekko commented Sep 16, 2022 • edited Loading

MoLow commented Sep 16, 2022 • edited Loading

MoLow commented Sep 16, 2022

nodejs-github-bot commented Nov 21, 2022

manekinekko commented Nov 22, 2022 • edited Loading

ljharb commented Nov 22, 2022

manekinekko commented Jun 21, 2022 •

edited

Loading

MoLow commented Sep 12, 2022 •

edited

Loading

manekinekko commented Sep 12, 2022 •

edited

Loading

manekinekko commented Sep 12, 2022 •

edited

Loading

manekinekko commented Sep 16, 2022 •

edited

Loading

MoLow commented Sep 16, 2022 •

edited

Loading

manekinekko commented Nov 22, 2022 •

edited

Loading