Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Iteration on TLC wrapper script #6513

Merged
merged 37 commits into from
Oct 2, 2024
Merged

Iteration on TLC wrapper script #6513

merged 37 commits into from
Oct 2, 2024

Conversation

achamayou
Copy link
Member

@achamayou achamayou commented Sep 30, 2024

We have centralised some useful things in tlc.sh, but a lot of flags remain expanded inline in YAML, and tend to be copy-pasted per invocation. We then require extra descriptions, which can fall out of sync, to get readable CI steps. We also increasingly rely on environment variables, which are not necessarily validated nor discoverable.

tlc.py sets context-appropriate defaults, and exposes as CLI flags all options we currently use:

$ ./tlc.py --help
usage: tlc.py [-h] [-x] [-n] [--disable-cdot] [--workers WORKERS] [--checkpoint CHECKPOINT] [--lncheck {final,default}] [--jmx] [--dot]
              [--trace-name TRACE_NAME] [--config CONFIG]
              {mc,tv,sim} ... spec

TLC model checker wrapper for the CCF project

positional arguments:
  {mc,tv,sim}
    mc                  Model checking
    tv                  Trace validation
    sim                 Simulation
  spec                  Path to the TLA+ specification

optional arguments:
  -h, --help            show this help message and exit
  -x                    Print out command and environment before running
  -n                    Print out command and environment, but do not run
  --disable-cdot        Disable \cdot support
  --workers WORKERS     Number of workers to use, default is 'auto'
  --checkpoint CHECKPOINT
                        Checkpoint interval, default is 0
  --lncheck {final,default}
                        Liveness check, set to 'default' to run periodically or 'final' to run once at the end, default is final
  --jmx                 Enable JMX to allow monitoring, use echo 'get -b tlc2.tool:type=ModelChecker CurrentState' | jmxterm -l localhost:55449 -i to query
  --dot                 Generate a dot file for the state graph
  --trace-name TRACE_NAME
                        Name to give to the trace files, defaults to the config name if provided, otherwise to the base name of the spec
  --config CONFIG       Path to the TLA+ configuration, defaults to spec name

The following subcommands are available:

Model Checking

$ ./tlc.py mc --help
usage: tlc.py mc [-h] [--term-count TERM_COUNT] [--request-count REQUEST_COUNT] [--raft-configs RAFT_CONFIGS] [--disable-check-quorum]

optional arguments:
  -h, --help            show this help message and exit
  --term-count TERM_COUNT
                        Number of terms the nodes are allowed to advance through, defaults to 0
  --request-count REQUEST_COUNT
                        Number of requests the nodes are allowed to advance through, defaults to 3
  --raft-configs RAFT_CONFIGS
                        Raft configuration sequence, defaults to 1C2N
  --disable-check-quorum
                        Disable CheckQuorum action

Trace Validation

$ ./tlc.py tv --help
usage: tlc.py tv [-h] [--disable-dfs] [--ccf-raft-trace CCF_RAFT_TRACE]

optional arguments:
  -h, --help            show this help message and exit
  --disable-dfs         Set TLC to use depth-first search
  --ccf-raft-trace CCF_RAFT_TRACE
                        Path to a CCF Raft trace .ndjson file, for example produced by make_traces.sh

Simulation

$ ./tlc.py sim --help
usage: tlc.py sim [-h] [--num NUM] [--depth DEPTH] [--max-seconds MAX_SECONDS]

optional arguments:
  -h, --help            show this help message and exit
  --num NUM             Number of behaviours to simulate per worker thread
  --depth DEPTH         Set the depth of the simulation, defaults to 500
  --max-seconds MAX_SECONDS
                        Set the timeout of the simulation, defaults to 1200 seconds

This PR also includes some renaming to follow platform conventions, and remove the ambiguity caused by Limit variables with regards to the state space being explored.

@lemmy
Copy link
Contributor

lemmy commented Sep 30, 2024

We also increasingly rely on environment variables, which are not necessarily validated nor discoverable

The .tla specifications validate the environment variables and define their valid values.

@achamayou achamayou changed the title Iteration TLC wrapper script Iteration on TLC wrapper script Sep 30, 2024
@achamayou
Copy link
Member Author

We also increasingly rely on environment variables, which are not necessarily validated nor discoverable

The .tla specifications validate the environment variables and define their valid values.

They do for the values to some extent, but not the variables themselves, so if something is unset or mistyped, a default is picked, and it's easy to miss because the output is very verbose. There is also no way to --help and get a documented list.

@achamayou achamayou added the run-long-verification Run Long Verification jobs label Oct 1, 2024
scripts/ci-checks.sh Outdated Show resolved Hide resolved
tla/actions.py Outdated Show resolved Hide resolved
achamayou and others added 2 commits October 1, 2024 15:57
Co-authored-by: Heidi Howard <1835251+heidihoward@users.noreply.github.com>
Co-authored-by: Heidi Howard <1835251+heidihoward@users.noreply.github.com>
tla/tlc.py Outdated Show resolved Hide resolved
tla/tlc_debug.sh Show resolved Hide resolved
Co-authored-by: Heidi Howard <1835251+heidihoward@users.noreply.github.com>
tla/tlc.py Show resolved Hide resolved
@achamayou achamayou requested review from heidihoward and lemmy October 1, 2024 17:55
tla/consensus/MCccfraft.tla Outdated Show resolved Hide resolved
@achamayou achamayou enabled auto-merge October 2, 2024 10:48
@achamayou achamayou added this pull request to the merge queue Oct 2, 2024
Merged via the queue into microsoft:main with commit 50ffc62 Oct 2, 2024
18 of 19 checks passed
@achamayou achamayou deleted the tlc_cli branch October 2, 2024 11:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
run-long-verification Run Long Verification jobs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants