A useful feature would be to implement a rapids profile command to provide metrics for python scripts similar to the jupyterlab-nvdashboard for non-Jupyter users.
A few design choices to think about are:
- Real-time metrics similar to the nvdashboard vs metrics written to a file to avoid interfering with the outputs of the script
- Can existing tools such as nsight be integrated for this purpose
- At what level of the code should this profiling be done in order to better identify bottlenecks, maybe as decorators for important functions?