cmd/pgopack: cache packed pgo profile during build #62400

jinlin-bayarea · 2023-08-31T18:03:39Z

PGO-enabled Go compilation significantly improves the latency of the generated binary, which benefits from profile-guided inlining and code specialization. However, the workflow for processing and extracting profiling data is surprisingly suboptimal in the community Go compiler. Based on observation and instrumented timing data from a small benchmark and a real service, we found that pprof data parsing accounts for 70% to over 95% of total compilation time when the PGO flow is enabled. Furthermore, the existing PGO flow reads and parses the pprof file in every single package compilation, so the cumulative time of this process is long and unnecessary. It is critical to process the profiling data once and cache the result in a well-designed format that every package compilation can use for its optimization transformations.

To optimize the existing flow and reduce the build time of Go services and projects, we propose creating a standalone tool, a PGO preprocessor, that extracts information from the collected profiling files and caches the WeightedCallGraph in a single pass. Adding the new tool to the Go toolchain eliminates the repeated profile parsing done by the current Go compiler. The new tool supports all existing PGO-enabled optimizations in the Go compiler, including inlining and devirtualization.
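As a rough illustration of the one-shot idea, the sketch below aggregates simplified samples into weighted caller/callee edges and serializes them once, so later consumers only decode the compact result. The `Sample` and `WeightedEdge` types, the function names, and the JSON encoding are all assumptions made for this example; they are not the compiler's data structures or the tool's actual output format.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
)

// Sample is a simplified, hypothetical stand-in for one pprof sample.
// Stack[0] is the leaf frame; Stack[i+1] is the caller of Stack[i].
type Sample struct {
	Stack  []string
	Weight int64
}

// WeightedEdge is one caller -> callee edge with its summed sample weight.
type WeightedEdge struct {
	Caller string `json:"caller"`
	Callee string `json:"callee"`
	Weight int64  `json:"weight"`
}

// BuildEdges sums sample weights per caller/callee pair and returns the
// edges sorted by descending weight (ties broken by name), so the output
// is deterministic.
func BuildEdges(samples []Sample) []WeightedEdge {
	type key struct{ caller, callee string }
	sums := make(map[key]int64)
	for _, s := range samples {
		for i := 0; i+1 < len(s.Stack); i++ {
			sums[key{caller: s.Stack[i+1], callee: s.Stack[i]}] += s.Weight
		}
	}
	edges := make([]WeightedEdge, 0, len(sums))
	for k, w := range sums {
		edges = append(edges, WeightedEdge{Caller: k.caller, Callee: k.callee, Weight: w})
	}
	sort.Slice(edges, func(i, j int) bool {
		a, b := edges[i], edges[j]
		if a.Weight != b.Weight {
			return a.Weight > b.Weight
		}
		if a.Caller != b.Caller {
			return a.Caller < b.Caller
		}
		return a.Callee < b.Callee
	})
	return edges
}

// Encode serializes the edges. JSON is an illustrative choice only.
func Encode(edges []WeightedEdge) ([]byte, error) {
	return json.Marshal(edges)
}

// Decode reads edges previously written by Encode.
func Decode(data []byte) ([]WeightedEdge, error) {
	var edges []WeightedEdge
	err := json.Unmarshal(data, &edges)
	return edges, err
}

func main() {
	samples := []Sample{
		{Stack: []string{"leaf", "mid", "main"}, Weight: 2},
		{Stack: []string{"leaf", "mid"}, Weight: 3},
	}
	// Preprocess once...
	packed, err := Encode(BuildEdges(samples))
	if err != nil {
		panic(err)
	}
	// ...then each consumer only decodes the packed form.
	edges, err := Decode(packed)
	if err != nil {
		panic(err)
	}
	for _, e := range edges {
		fmt.Printf("%s -> %s: %d\n", e.Caller, e.Callee, e.Weight)
	}
}
```

In this sketch the expensive aggregation happens once in `BuildEdges`; a per-package consumer would only pay for `Decode`.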

In summary, we propose adding a standalone preprocessing tool to reduce compile time when PGO is enabled. Input is welcome.

cespare · 2023-08-31T18:29:19Z

For the record, speeding up the underlying work is #58102.

cespare · 2023-08-31T18:30:08Z

@jinlin-bayarea Are your .pgo files in version control, so that all builds use PGO? I know that's the Go team's recommendation (here, for example).

At my company we've only started playing around with PGO a bit but our current approach is to only build with PGO as part of production deploys, not local development/testing. So having PGO builds be a bit slower doesn't really matter much.

(We don't want to put the .pgo files in our repo because we are collecting very large profiles and we are updating them with automated processes.)

jinlin-bayarea · 2023-09-01T02:02:08Z

The pgo files are not in version control. We collect our pgo profiles daily and always use the most recent profile to build the service code.

prattmic · 2023-09-14T19:08:33Z

cc @aclements @cherrymui

cherrymui · 2023-09-19T15:54:26Z

Discussed briefly offline. The idea of preprocessing sounds good overall.

In the usual case we'd want the go command to automatically run the preprocessor and cache the result. So this step is transparent to the user, and go build -pgo still works as before.
The preprocessor can be a standalone command, so external build tools like Bazel can invoke it (and do their own caching).
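One way to picture "run the preprocessor and cache the result" is a content-addressed cache: hash the raw profile, and run the preprocessing step only when no entry exists for that hash. The sketch below is a hypothetical illustration of that pattern, not how the go command's build cache is implemented; `CacheKey`, `Preprocessed`, `runDemo`, and the stand-in `preprocess` callback are all names invented for this example.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"os"
	"path/filepath"
)

// CacheKey returns the hex-encoded SHA-256 of the raw profile bytes.
func CacheKey(profile []byte) string {
	sum := sha256.Sum256(profile)
	return hex.EncodeToString(sum[:])
}

// Preprocessed returns the preprocessed form of profile. It calls
// preprocess only on a cache miss and reports whether the cache was hit.
func Preprocessed(cacheDir string, profile []byte,
	preprocess func([]byte) ([]byte, error)) (data []byte, hit bool, err error) {

	path := filepath.Join(cacheDir, CacheKey(profile))
	if data, err := os.ReadFile(path); err == nil {
		return data, true, nil
	}
	data, err = preprocess(profile)
	if err != nil {
		return nil, false, err
	}
	if err := os.WriteFile(path, data, 0o644); err != nil {
		return nil, false, err
	}
	return data, false, nil
}

// runDemo requests the same profile three times from a temporary cache
// directory and reports the hit flags and how often preprocessing ran.
func runDemo() (hits []bool, calls int, err error) {
	dir, err := os.MkdirTemp("", "pgocache")
	if err != nil {
		return nil, 0, err
	}
	defer os.RemoveAll(dir)

	// Stand-in for the real (expensive) preprocessing step.
	preprocess := func(b []byte) ([]byte, error) {
		calls++
		return append([]byte("packed:"), b...), nil
	}
	for i := 0; i < 3; i++ {
		_, hit, err := Preprocessed(dir, []byte("raw profile bytes"), preprocess)
		if err != nil {
			return nil, calls, err
		}
		hits = append(hits, hit)
	}
	return hits, calls, nil
}

func main() {
	hits, calls, err := runDemo()
	if err != nil {
		panic(err)
	}
	fmt.Println("cache hits:", hits)
	fmt.Println("preprocess calls:", calls)
}
```

Keying on the profile contents rather than its path means a freshly collected profile naturally gets a new cache entry, while repeated builds against an unchanged profile reuse the existing one.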

It would be good if you could share a prototype CL. Then we can discuss the details from there, like the exact format of the preprocessed profile. Thanks.

jinlin-bayarea · 2023-09-20T20:10:06Z

Hi Cherry. I have submitted the prototype in this CL. https://go-review.googlesource.com/c/go/+/529738.
Please help review the changes. Thanks.

rsc · 2023-11-01T18:07:43Z

I would suggest calling it pgopack or something like that instead of "preprofile" since Go has lots of profiles. Otherwise this seems fine, and it should be invisible to users, so I think it can proceed with compiler/runtime team agreement instead of a proposal.

/cc @bcmills

cherrymui · 2023-11-01T18:14:55Z

Taking out of proposal as this is invisible to users using the go command. The tool will be internal (the go command will run it automatically if needed), so is the preprocessed profile format. Users are not expected to check in profiles generated by the preprocessing tool.

jinlin-bayarea · 2023-11-01T21:51:23Z

Agree with @cherrymui. Bazel build users will need to run the preprocess command separately to obtain the preprocessed profile.

prattmic · 2024-01-22T21:51:40Z

Tool is submitted in https://go.dev/cl/529738. It still needs integration with cmd/go. I'm going to close this issue in favor of #58102, which is effectively a duplicate of this.

jinlin-bayarea added the Proposal label Aug 31, 2023

gopherbot added this to the Proposal milestone Aug 31, 2023

cespare added compiler/runtime Issues related to the Go compiler and/or runtime. ToolSpeed labels Aug 31, 2023

ianlancetaylor added this to Proposals Aug 31, 2023

ianlancetaylor moved this to Incoming in Proposals Aug 31, 2023

mknyszek added this to Go Compiler / Runtime Sep 6, 2023

cherrymui removed the Proposal label Nov 1, 2023

cherrymui changed the title ~~proposal: cmd/preprofile: perform preprocessing on the profile file to accelerate the compilation process~~ cmd/preprofile: perform preprocessing on the profile file to accelerate the compilation process Nov 1, 2023

rsc changed the title ~~cmd/preprofile: perform preprocessing on the profile file to accelerate the compilation process~~ cmd/pgopack: cache packed pgo profile during build Nov 1, 2023

bcmills added the GoCommand cmd/go label Nov 1, 2023

cherrymui modified the milestones: Proposal, Backlog Nov 2, 2023

cherrymui removed this from Proposals Nov 2, 2023

cherrymui added the NeedsFix The path to resolution is known, but the work has not been done. label Nov 2, 2023

mknyszek assigned jinlin-bayarea Nov 8, 2023

mknyszek moved this to In Progress in Go Compiler / Runtime Nov 8, 2023

cherrymui mentioned this issue Nov 27, 2023

cmd/compile: reduce PGO profile processing overhead #58102

Closed

prattmic closed this as completed Jan 22, 2024

github-project-automation bot moved this from In Progress to Done in Go Compiler / Runtime Jan 22, 2024

mknyszek removed this from Go Compiler / Runtime Feb 28, 2024

gabyhelp mentioned this issue Nov 16, 2024

cmd/compile: provide default PGO profiles that cover the runtime #70393

Open

golang locked and limited conversation to collaborators Jan 21, 2025

gopherbot added the FrozenDueToAge label Jan 21, 2025
