2 changes: 2 additions & 0 deletions build-tools/block-data-plots/collect_data.py

```diff
@@ -1,4 +1,5 @@
 import argparse
+import os
 import subprocess
 from pathlib import Path
 
@@ -14,6 +15,7 @@
 
 def collect_data(args):
     if args.output_file is None:
+        os.makedirs(DEFAULT_OUTPUT_DIR, exist_ok=True)
         output_file = DEFAULT_OUTPUT_DIR.joinpath(
             DEFAULT_OUTPUT_FILE_NAME_FMT.format(chain_type=args.chain_type)
         )
```
53 changes: 53 additions & 0 deletions build-tools/fork-detection/README.md
## Fork detection script, for extra peace of mind

Here we have `detector.py`, which is a relatively crude way of detecting a permanent fork (split)
in the network, should one ever happen.

The script runs a full sync in a loop, checking the node's log output for certain errors
and comparing its mainchain block ids with those obtained from the API server.\
If anything suspicious is detected during the full sync, the script saves the node's data
directory and log file.\
In any case, it temporarily bans some of the peers that participated in the sync
(so that the next iteration has a chance to use different ones, and to reduce the strain on
the network) and starts the full sync all over again, reusing the peerdb from the previous iteration.

The node is always run with checkpoints disabled, so that it has a chance to find older forks too.
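
For orientation, the overall control flow can be condensed as in the sketch below; this is a
hypothetical skeleton, not code from `detector.py`, and the helper is a stub.

```python
# Hypothetical skeleton of the detection loop described above; the helper
# body is a stub, not the actual implementation from detector.py.
from pathlib import Path

def run_full_sync(attempt_dir: Path) -> None:
    """Run the node (checkpoints disabled) through a full sync, watching its
    log for errors and comparing mainchain block ids with the API server;
    create flag files under attempt_dir / "flags" for anything suspicious."""

def detection_loop(work_dir: Path) -> None:
    while True:
        attempt_dir = work_dir / "current_attempt"
        run_full_sync(attempt_dir)
        flags_dir = attempt_dir / "flags"
        if flags_dir.is_dir() and any(flags_dir.iterdir()):
            ...  # save the attempt's data directory and log under saved_attempts
        ...  # temporarily ban this sync's peers, keep the peerdb, go again
```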

The structure of the script's working directory (specified via the command line):
- `current_attempt` - corresponds to the current sync attempt (iteration).
- `saved_attempts` - contains subdirectories corresponding to attempts that
  are considered suspicious; each subdirectory's name is the datetime of the moment
  when the attempt finished.
- `saved_peer_dbs` - where peer dbs from previous attempts are stored; the script
  only needs the one from the latest attempt, but, just in case, the penultimate one is
  also kept.
- `log.txt` - the log of the script itself.
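
As a concrete illustration, setting up this layout could look like the sketch below; only the
directory names come from the list above, the function itself is illustrative.

```python
# Illustrative setup of the working-directory layout described above;
# the names mirror the list, the function is not part of detector.py.
from pathlib import Path

def init_work_dir(root: Path) -> dict[str, Path]:
    layout = {
        "current_attempt": root / "current_attempt",
        "saved_attempts": root / "saved_attempts",
        "saved_peer_dbs": root / "saved_peer_dbs",
        "log": root / "log.txt",  # the script's own log file
    }
    for key in ("current_attempt", "saved_attempts", "saved_peer_dbs"):
        layout[key].mkdir(parents=True, exist_ok=True)
    return layout
```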

Each attempt's directory has the following structure:
- `flags` - contains flag files (usually zero-length) indicating that certain problems
  were found during the sync. This is what determines whether the attempt's directory will be
  saved in the end (i.e. the attempt is saved if and only if `flags` is non-empty); see the
  sketch after this list.
- `node_data` - the node's data directory for this attempt.
- `node_log.txt` - the node's log.
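
The save rule could be implemented roughly as follows; the timestamp format for the saved
directory's name is an assumption, and so is discarding the data of clean attempts.

```python
# Illustrative save rule: an attempt is kept iff its `flags` directory is
# non-empty. The timestamp format is an assumption; so is discarding the
# data of non-suspicious attempts.
import shutil
from datetime import datetime
from pathlib import Path

def finish_attempt(attempt_dir: Path, saved_attempts_dir: Path) -> None:
    flags_dir = attempt_dir / "flags"
    if flags_dir.is_dir() and any(flags_dir.iterdir()):
        saved_attempts_dir.mkdir(parents=True, exist_ok=True)
        dest = saved_attempts_dir / datetime.now().strftime("%Y-%m-%d_%H-%M-%S")
        shutil.move(attempt_dir, dest)
    else:
        shutil.rmtree(attempt_dir, ignore_errors=True)
```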

Some notes:
* Currently the script requires Python 3.13 to run, though we may lift this requirement later.
* When it detects an issue, the script can send an email via the local SMTP server
  (if you're on Linux, search for a Postfix SMTP tutorial to set one up); a minimal example
  is sketched after this list.
* Even if the script finds a problem (e.g. a checkpoint mismatch), you're still likely
  to end up on the correct chain. To download the actual fork for further investigation,
  you can initiate a separate full sync while using the node's `--custom-checkpoints-csv-file`
  option to override the correct checkpoints with the wrong ones.
* Once the fork has been downloaded, you'll want to examine the contents of its chainstate db.
  Currently we have the `chainstate-db-dumper` tool, which can dump certain info about blocks
  to a CSV file (the most interesting part being the ids of the pools that continue producing
  blocks on that fork).
* Once the fork has been investigated, you can "permanently" ban the peers that have been sending
  it to you, to prevent it from being reported again and again. To do so, add their IP
  addresses to `permabanned_peers.txt` (one address per line) in the script's working directory
  (the file doesn't exist by default, so you'll have to create it first). Note that the file is
  checked on every iteration, so you can update it while the script is running and the changes
  will take effect when the next iteration starts (see the reader sketch after this list).
* The script is likely to fail if a networking error occurs, e.g. if it can't query the API server.
  So run it in a loop in a shell script, with some delay after each run to keep it from spamming
  you with warning emails (a minimal wrapper is sketched after this list).
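
A minimal example of the email notification mentioned above, assuming a local SMTP server
(e.g. Postfix) listening on the default port; the addresses are placeholders.

```python
# Minimal sketch: send a warning email through a local SMTP server.
# Sender and recipient addresses are placeholders.
import smtplib
from email.message import EmailMessage

def send_warning(subject: str, body: str) -> None:
    msg = EmailMessage()
    msg["Subject"] = subject
    msg["From"] = "fork-detector@localhost"
    msg["To"] = "you@example.com"
    msg.set_content(body)
    with smtplib.SMTP("localhost") as server:
        server.send_message(msg)
```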
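
Reading `permabanned_peers.txt` on every iteration could look like this; the one-address-per-line
format is from the note above, skipping blank lines is an assumption.

```python
# Illustrative reader for permabanned_peers.txt (one IP address per line).
# Re-reading it each iteration is what lets edits take effect without a
# restart; skipping blank lines is an assumption.
from pathlib import Path

def read_permabanned_peers(work_dir: Path) -> set[str]:
    path = work_dir / "permabanned_peers.txt"
    if not path.exists():  # the file is optional and absent by default
        return set()
    return {line.strip() for line in path.read_text().splitlines() if line.strip()}
```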
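
And one way to write the restart loop from the last note (a plain shell `while` loop works just
as well); the script's command-line arguments and the delay are placeholders.

```python
# Hypothetical supervisor: rerun detector.py forever with a pause between
# runs; the command line and the delay value are placeholders.
import subprocess
import time

while True:
    subprocess.run(["python3.13", "detector.py", "/path/to/work-dir"], check=False)
    time.sleep(600)  # pause so repeated failures don't spam warning emails
```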