Memory profiler for cuda #1996

jainapurva · 2025-04-01T20:25:40Z

No description provided.

pytorch-bot · 2025-04-01T20:25:43Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1996

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 2c62286 with merge base 70fc520:

NEW FAILURES - The following jobs have failed:

.github/workflows/float8nocompile_test.yaml (gh)
PR Label Check / Check PR Labels (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull Request Overview

This PR introduces memory profiling support for CUDA in the microbenchmarks, integrating both model profiling and memory profiling into the benchmarking workflow. Key changes include the addition of utility functions for uploading trace files and generating URLs for Perfetto UI, modifications in the benchmark configuration to enable profiling, and updates to the benchmark runner to execute the new profiling functionalities.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File	Description
benchmarks/microbenchmarks/utils.py	Added functions to upload trace files, generate model and memory profiles, and generate Perfetto URLs.
benchmarks/microbenchmarks/test/benchmark_config.yml	Enabled profiler and memory profile flags for one benchmark configuration.
benchmarks/microbenchmarks/benchmark_runner.py	Updated error handling and conditional CSV generation based on collected results.
benchmarks/microbenchmarks/benchmark_inference.py	Integrated calls to the newly added profiling functions with proper error logging.
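
The benchmark_runner.py row mentions error handling and writing the CSV only when results were collected. A minimal sketch of that pattern follows. The names `run_all` and `write_results_csv` and the list-of-dicts result format are assumptions for illustration and are not taken from the PR.

```python
import csv
import logging
from typing import Callable, Dict, Iterable, List, Optional

logger = logging.getLogger(__name__)


def run_all(
    configs: Iterable[dict], run_one: Callable[[dict], Optional[Dict]]
) -> List[Dict]:
    """Run every config; log and skip failures instead of aborting the sweep."""
    results: List[Dict] = []
    for config in configs:
        try:
            result = run_one(config)
        except Exception:
            # Keep going so one bad config does not discard the other results.
            logger.exception("Benchmark failed for config %s", config)
            continue
        if result is not None:
            results.append(result)
    return results


def write_results_csv(results: List[Dict], path: str) -> bool:
    """Write results to `path`; return False and write nothing if there are none."""
    if not results:
        logger.warning("No benchmark results collected; skipping CSV generation")
        return False
    # Use the union of keys so rows with missing fields still fit the header.
    fieldnames = sorted({key for row in results for key in row})
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(results)
    return True
```

Returning a boolean from the writer lets the caller decide whether an empty run is an error or just a warning.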

Copilot · 2025-04-01T20:26:03Z

benchmarks/microbenchmarks/utils.py

+    DEFAULT_TTL_SEC = 28 * 24 * 60 * 60
+    file_name = os.path.basename(local_path)
+    manifold_path = os.path.join(
+        MANIFOLD_FOLDER, f"{os.getlogin()}_{str(uuid.uuid4())}_{file_name}"


Using os.getlogin() can raise an OSError in non-interactive or service environments; consider using getpass.getuser() for improved robustness.

Suggested change

-        MANIFOLD_FOLDER, f"{os.getlogin()}_{str(uuid.uuid4())}_{file_name}"
+        MANIFOLD_FOLDER, f"{getpass.getuser()}_{str(uuid.uuid4())}_{file_name}"
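
To illustrate the review comment: `os.getlogin()` depends on a controlling terminal, while `getpass.getuser()` first checks the `LOGNAME`, `USER`, `LNAME`, and `USERNAME` environment variables and only then falls back to the password database. Below is a minimal sketch of a path builder using the suggested call. The helper name `build_remote_path`, its `folder` parameter, and the `"unknown"` fallback are hypothetical and not part of the PR.

```python
import getpass
import os
import uuid


def build_remote_path(local_path: str, folder: str) -> str:
    """Return a unique remote path for a trace file (hypothetical helper)."""
    try:
        user = getpass.getuser()
    except Exception:
        # getuser() can still fail when no user name can be determined;
        # the exception type differs between Python versions, so catch broadly.
        user = "unknown"
    file_name = os.path.basename(local_path)
    # uuid4 keeps uploads from the same user and file name from colliding.
    return os.path.join(folder, f"{user}_{uuid.uuid4()}_{file_name}")
```

Applying the suggestion in utils.py would also require `import getpass` at the top of the module.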

Memory profiler for cuda

2c62286

jainapurva requested a review from Copilot April 1, 2025 20:25

facebook-github-bot added the CLA Signed label Apr 1, 2025

jainapurva changed the base branch from main to profiler_combined_new April 1, 2025 20:25

Copilot AI reviewed Apr 1, 2025

