Experiments: Add breakpoint to an experiment #722

mattseddon · 2021-08-16T23:04:41Z

A nice to have for experiments would be the ability to add a breakpoint and debug the workings of the experiment.

Original point can be found here in the review ticket.

rogermparent · 2021-08-17T22:16:48Z

@iterative/dvc Is there any existing protocol for running a debugger against pipeline executions? I can't find anything in the DVC docs.

From when I explored using the debugger API to launch exp run, the way pipeline scripts are run from a static string on the shell (e.g. python train.py) prevents us from both basic Python debugging or debugpy attaching even if the script is also Python like our demo's train.py. We can see dvc in the debugger, and that it opens another process, but we can't inspect the child process. I suppose we could run the train script directly in the debugger, but I have a hunch that would be too far from dvc exp run to be useful.

pmrowla · 2021-08-18T00:42:37Z

Is there any existing protocol for running a debugger against pipeline executions? I can't find anything in the DVC docs.

There is not. For one thing, this would all have to be language specific, and there is no guarantee that pipeline stages are going to be python scripts. I think the normal workflow here would be that users debug their stages individually outside of DVC.

If you really want to debug something that is running inside DVC, the way to do it would be to use remote debugging. So you would configure your pipeline stage command to run your stage inside the appropriate debugger session, and then connect to it remotely from a separate debugger process.

So for debugpy something you would configure dvc.yaml like:

stages:
  train:
    cmd: python -m debugpy --listen 1234 --wait-for-client train.py ...

and to debug it, you would start dvc exp run and then connect to the debugger session with

$ python -m debugpy --connect 1234

For non-python stages it would work the same way. For a compiled executable and GDB example:

stages:
  train:
    cmd: gdbserver :1234 ./train ...

$ gdb -q train
(gdb) target remote 172.0.0.1:1234

mattseddon · 2021-09-07T23:34:20Z

Closing for now as this is a lower priority and we cannot progress without changes in DVC.

mattseddon added the A: experiments Area: experiments table webview and everything related label Aug 16, 2021

rogermparent mentioned this issue Sep 3, 2021

Add debug description to README #774

Merged

mattseddon closed this as completed Sep 7, 2021

mattseddon mentioned this issue Nov 30, 2023

Ability to attach debugger to any code within DVC pipeline #5048

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experiments: Add breakpoint to an experiment #722

Experiments: Add breakpoint to an experiment #722

mattseddon commented Aug 16, 2021

rogermparent commented Aug 17, 2021 •

edited

Loading

pmrowla commented Aug 18, 2021

mattseddon commented Sep 7, 2021

Experiments: Add breakpoint to an experiment #722

Experiments: Add breakpoint to an experiment #722

Comments

mattseddon commented Aug 16, 2021

rogermparent commented Aug 17, 2021 • edited Loading

pmrowla commented Aug 18, 2021

mattseddon commented Sep 7, 2021

rogermparent commented Aug 17, 2021 •

edited

Loading