Support for profiling an algorithm's resource use #889

mccabete · 2024-01-08T19:28:17Z

mccabete
Jan 8, 2024

Those of us who are working on trying to improve the FEDS algorithm (me, @ranchodeluxe, @jsignell, and @eorland) have hit limits on how much we can improve the code without some profiling of how the algorithm uses memory, CPU, etc. What support exists for implementing that?

@wildintellect I know you mentioned that this was something worth following up on in the new year.

wildintellect · 2024-01-08T20:17:09Z

wildintellect
Jan 8, 2024
Maintainer

@mccabete you can see an example of Python memory profiling implemented by @omshinde in https://github.com/orgs/MAAP-Project/discussions/863

@chuckwondo and I have discussed some other potential tools in the past. Perhaps the next step is an in-depth discussion session reviewing the current code and which tools would be worth trying?

1 reply

chuckwondo Jan 8, 2024
Maintainer

@wildintellect, I'd be happy to help with this, if it makes sense for me to spend time here.

omshinde · 2024-01-08T23:02:52Z

omshinde
Jan 8, 2024
Collaborator

I will post a documentation example, possibly by tomorrow, explaining the steps for memory profiling a Python script.

0 replies

mccabete · 2024-01-09T19:06:29Z

mccabete
Jan 9, 2024
Author

Thanks for being willing to help @chuckwondo @omshinde and @wildintellect! Documentation would be helpful. For a more in-depth discussion, would it make sense to schedule a meeting?

FEDS-specific difficulties that might crop up could include: profiling with dask-parallelized code, getting the profiler to run/report out on DPS jobs, and profiling across a wide range of data-ingest levels (the algorithm runs slower when there are more fires to keep track of).
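One way to approach the data-ingest question is to time a run and record its peak memory at several input sizes. The following is a minimal sketch using only the standard library (`time.perf_counter` and `tracemalloc`). `track_fires` is a hypothetical stand-in for a FEDS step, not the real algorithm, and `tracemalloc` only reports memory traced in the current Python process, so it would not cover separate dask worker processes.

```python
import time
import tracemalloc


def track_fires(n_fires):
    """Hypothetical stand-in for a FEDS step; work grows with the fire count."""
    perimeters = [[(i, j) for j in range(50)] for i in range(n_fires)]
    return len(perimeters)


def profile_at_size(func, size):
    """Run func(size) once; return (seconds elapsed, peak bytes traced)."""
    tracemalloc.start()
    start = time.perf_counter()
    try:
        func(size)
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
    finally:
        # Stopping also clears the traces, so each run starts from zero.
        tracemalloc.stop()
    return elapsed, peak


def profile_across_sizes(func, sizes):
    """Profile func at each input size; return {size: (seconds, peak_bytes)}."""
    return {size: profile_at_size(func, size) for size in sizes}


if __name__ == "__main__":
    results = profile_across_sizes(track_fires, [10, 100, 1000])
    for size, (seconds, peak) in results.items():
        print(f"{size:>5} fires: {seconds:.4f} s, peak {peak / 1e6:.2f} MB")
```

Plotting seconds and peak memory against the number of fires would show whether the cost grows roughly linearly or faster as ingest increases.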

If a meeting makes sense let me know the best platform to coordinate that: email, slack, here, or something else.

0 replies

ranchodeluxe · 2024-01-10T14:41:31Z

ranchodeluxe
Jan 10, 2024

Thanks for being willing to help @chuckwondo @omshinde and @wildintellect! Documentation would be helpful. For a more in-depth discussion, would it make sense to schedule a meeting?

If it's helpful, we can invite these folks to next Monday's meeting on the 15th since we'll be talking about All The Work ™️ for the next quarter. Then folks can set up smaller meetings at different cadences if needed.

FEDS-specific difficulties that might crop up could include: profiling with dask-parallelized code, getting the profiler to run/report out on DPS jobs, and profiling across a wide range of data-ingest levels (the algorithm runs slower when there are more fires to keep track of).

Another difficulty to mention here is that the Fire environment is pinned to old (and probably now unsupported) versions of some libraries, such as pandas, and some libs can't be upgraded b/c the existing code would then lose the ability to read the archived pickled objects that it created. Julia and I have a goal this PI to experiment with different data models and formats and recommend one to move away from pickle. But this could limit what can be installed/upgraded, FWIW.
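To illustrate why a pickle archive ties code to particular library versions: a pickle records which module and class built each object, so loading it depends on a compatible version of that module being importable, whereas a plain-text format stores only the data. The sketch below uses a made-up fire record (the field names are invented for illustration and are not the FEDS data model) and the standard library's `datetime.date` as a stand-in for a library type such as a pandas object. It is not a recommendation of JSON over the formats being evaluated.

```python
import datetime
import json
import pickle

# Hypothetical record; the keys are illustrative only.
record = {"fire_id": 1, "date": datetime.date(2020, 9, 5), "area_km2": 12.5}

# The pickle embeds the name of the module that defines the date type, so a
# reader needs a compatible version of that module to rebuild the object.
pickled = pickle.dumps(record)
assert b"datetime" in pickled

# A neutral format keeps only values; here the date is written as ISO text.
as_json = json.dumps({**record, "date": record["date"].isoformat()})

# Reading it back needs no knowledge of how the writer represented dates.
loaded = json.loads(as_json)
loaded["date"] = datetime.date.fromisoformat(loaded["date"])
assert loaded == record
print(as_json)
```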

1 reply

wildintellect Jan 10, 2024
Maintainer

Looks like we'll coordinate on the MAAP slack and set up a meeting. The data model task will factor into the overall planning of how to approach the algorithm(s).

mccabete · 2024-01-16T18:24:44Z

mccabete
Jan 16, 2024
Author

@chuckwondo Summarizing your feedback from the call -- use the scalene package if we are kicking off the DPS jobs from a single command

0 replies

mccabete · 2024-01-16T18:28:45Z

mccabete
Jan 16, 2024
Author

@wildintellect suggested that folks need a high-level diagram that specifies when there is a file-read, the ingest steps, etc. @wildintellect is there a place with best practices for this, or an example? I'm not 100% sure what to include, or what the common parlance is.

0 replies

mccabete · 2024-01-16T18:31:54Z

mccabete
Jan 16, 2024
Author

We also need to design some "small tests" and decide what they should be. Options from the discussion:

CONUS-wide example for a single timestep that we know ran and is small (Snapshot only)
A single fire (e.g., the Creek fire)
A time/region with big fires present, where the alpha hull algorithms can be tested
Monthly files vs. daily files
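The small tests listed above could be captured as data and run under a profiler one at a time. The sketch below is one possible shape, using only the standard library's `cProfile` and `pstats`. The case names mirror part of the list above, while `run_feds`, its `n_fires` argument, and the fire counts are hypothetical placeholders for however the real algorithm is invoked.

```python
import cProfile
import io
import pstats

# Case names follow the discussion; the fire counts are made-up placeholders.
SMALL_TESTS = {
    "conus_single_timestep": 200,
    "single_fire": 1,
    "large_fires_region": 50,
}


def run_feds(n_fires):
    """Hypothetical placeholder for invoking the algorithm on one test case."""
    return sum(i * i for i in range(n_fires * 1000))


def profile_case(name, n_fires, top=5):
    """Profile one case; return a text report of its top functions by cumulative time."""
    profiler = cProfile.Profile()
    profiler.enable()
    try:
        run_feds(n_fires)
    finally:
        profiler.disable()
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(top)
    return f"== {name} ==\n{buffer.getvalue()}"


if __name__ == "__main__":
    for case_name, fires in SMALL_TESTS.items():
        print(profile_case(case_name, fires))
```

Keeping the cases in one table makes it easy to rerun the same set after each code change and compare the reports.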

0 replies

chuckwondo · 2024-01-24T21:46:31Z

chuckwondo
Jan 24, 2024
Maintainer

Scalene is a Python CPU+GPU+memory profiler that can even profile multi-threaded and multi-core programs. It's extremely easy to incorporate without requiring any code changes.

The scalene package is available via conda from the conda-forge channel, so it should be added to the list of dependencies in a conda yaml file. (Also available via pip, if necessary.)

Initially, I suggest using a "reduced" profile to keep the profile output on the smaller side, ideally pinpointing some hotspots for further investigation. To do so with scalene is straightforward.

For example, assume you want to profile a script (such as from a DPS algorithm's "run" script) that is executed in the following form (with or without python in front, depending on how you constructed your script file):

python my_script.py arg1 ... argN

In order to use scalene to produce a reduced profile, simply add scalene and some options to the front of the above command, like so:

scalene --reduced-profile --html --no-browser --outfile profile.html python my_script.py arg1 ... argN

This will run your script like normal, but will also profile its execution and produce a profiling report in the file profile.html once the script completes.
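If the run command is assembled in Python rather than written directly in a shell script, the same wrapping can be expressed as a small helper. This is a sketch only: `with_scalene` is a hypothetical function name, it uses just the options shown above, and it assumes the `scalene` executable is on the PATH when the resulting command is actually run.

```python
import shlex


def with_scalene(command, outfile="profile.html"):
    """Prefix a command (a list of strings) with scalene and the reduced-profile options."""
    return [
        "scalene",
        "--reduced-profile",
        "--html",
        "--no-browser",
        "--outfile",
        outfile,
        *command,
    ]


if __name__ == "__main__":
    # Build and display the wrapped command; running it is left to the caller,
    # for example via subprocess.run(wrapped, check=True).
    wrapped = with_scalene(["python", "my_script.py", "arg1", "argN"])
    print(shlex.join(wrapped))
```

Building the command as a list keeps the original arguments untouched, so switching profiling on or off is a matter of whether the helper is applied.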

The following is an example report that I produced from the GEDI Subsetter algorithm by adjusting the "run" script as described above. This file contains HTML, but attachments with a .html extension are not supported here, so after downloading the file, change the extension to .html and open in a browser: profile.txt

0 replies
