[AUTOTUNER] adding simple report flag for autotuner runs #3411

bringlein · 2024-03-19T13:42:43Z

For our application, we wanted to investigate when, how often, and for how long the triton autotuner is triggered (to asses the impact on the latency of the application).
Therefore, we implemented a simple report flag to the autotuner decorator:

@triton.autotune(
    configs=[...],  
    key=[...],
    report=True,  # new, optional
)
@triton.jit
def fused_add_rmsnorm_triton(...

If this flag is set to true, the autotuner will print the following statement, every time a new autotune run is triggered:

Autotuner for function JITFunction(__main__:fused_add_rmsnorm_triton) finished after 4.15s; best config selected: BLOCK_N_SIZE: 512, num_warps: 8, num_ctas: 1, num_stages: 3;

There are no prints if a cached configuration is used.

(We thought this could be also a helpful feature for others, therefore we created this PR directly and we can make/discuss requested changes here. However, if you think this should be discussed instead in an issue, let us know and sorry).

Jokeren · 2024-03-19T14:38:13Z

I'm OK with the change. Should it be a log or just print? Triton doesn't have a logging system though

cc @jlebar @ThomasRaoux

jlebar

I think this is a good idea!

jlebar · 2024-03-19T18:53:14Z

python/triton/runtime/autotuner.py

    :type warmup: int
    :param rep: Repetition time (in ms) to pass to benchmarking, defaults to 100.
    :type rep: int
+    :param report: Flag to enable printing the selected configuration


Rename the flag to print? "report" can be a noun or a verb, and that ambiguity is confusing here. (Do I pass the report in as a parameter? What is the report? Or is the autotuning report passed as an outparameter somehow?)

ok, changed it to print_autotune_stats

jlebar · 2024-03-19T18:54:37Z

python/triton/runtime/autotuner.py

    :type warmup: int
    :param rep: Repetition time (in ms) to pass to benchmarking, defaults to 100.
    :type rep: int
+    :param report: Flag to enable printing the selected configuration


Nit: Reword to something like what you wrote in the commit message, which is more helpful. For example: "If print is true, Triton will print a log message each time it autotunes a function."

Fair point, I updated it.

ThomasRaoux · 2024-03-19T19:17:44Z

I wonder why this can't be done by using a profiler?

Jokeren · 2024-03-19T20:11:49Z

I wonder why this can't be done by using a profiler?

Indeed this is a case I used the profiler a lot...as proton can rename kernels based on constants. The problem is probably because proton is not available yet...

jlebar · 2024-03-19T20:57:59Z

Personally I think something lightweight like this is nice to have even if we have heavier-weight like Proton. It's basically zero complexity overhead and can be really useful for quick-and-dirty debugging.

Jokeren · 2024-03-19T21:36:05Z

Personally I think something lightweight like this is nice to have even if we have heavier-weight like Proton. It's basically zero complexity overhead and can be really useful for quick-and-dirty debugging.

I think it also depends on whether you want to check tuning time on CPU + GPU or just GPU time. This PR seems to get end to end tuning time.

Jokeren · 2024-03-19T21:36:44Z

python/triton/runtime/autotuner.py

+        if self.report and not used_cached_result:
+            autotune_stop = time.time()
+            print(
+                f"Autotuner for function {self.fn} finished after {autotune_stop-autotune_start:.2f}s; best config selected: {self.best_config};"


Can we use self.bench_time?

Good point, the hook won't make such a difference. I changed it

bringlein · 2024-03-20T09:22:49Z

Thanks for your helpful comments!

I think it also depends on whether you want to check tuning time on CPU + GPU or just GPU time. This PR seems to get end to end tuning time.

Exactly, we were/are interested in the end-to-end time of the autotuner, so that we could tell easily if a variance in latency of our application was caused by the triton autotuner or smth else.

bringlein · 2024-03-20T09:25:00Z

Should it be a log or just print? Triton doesn't have a logging system though

I was also thinking if besides True/False maybe a streaming object could be passed to this argument, so to route where the log message should be printed to. But I guess this would assume more about an existing/coming logging infrastructure than just a simple print.

Jokeren · 2024-03-20T13:59:48Z

Thanks for your helpful comments!

I think it also depends on whether you want to check tuning time on CPU + GPU or just GPU time. This PR seems to get end to end tuning time.

Exactly, we were/are interested in the end-to-end time of the autotuner, so that we could tell easily if a variance in latency of our application was caused by the triton autotuner or smth else.

Hi @ThomasRaoux , since they are interested in end-to-end statistics. It might be fine to print some debugging information? What's your thought?

ptillet · 2024-03-24T18:23:04Z

Yeah I think this can be helpful to some people and really doesn't add much complexity since best_config is already here

ThomasRaoux · 2024-03-24T21:34:36Z

Having debug logs makes sense, do we want this to be a front end option or an env variable? In general debug features are controlled by env variables.

jlebar · 2024-03-25T20:35:43Z

I'm fine with either an envvar or the in-code flag. It sound like we'd have consensus if we went with the env var? I propose TRITON_PRINT_AUTOTUNING as a strawperson.

bringlein · 2024-03-28T09:42:36Z

I agree, controlling the logging via an environment variable serves the purpose better.
I changed the implementation to check for TRITON_PRINT_AUTOTUNING=1.

jlebar

Thanks!

python/triton/runtime/autotuner.py

jlebar · 2024-04-01T23:27:02Z

I can merge this once we make the final few remaining changes!

jlebar · 2024-04-02T21:33:42Z

Rebased and am trying to merge this.

bringlein · 2024-04-04T09:53:14Z

Thanks @jlebar! I was on a trip and didn't had the time to fix/react to your comments.

bringlein requested a review from ptillet as a code owner March 19, 2024 13:42

jlebar reviewed Mar 19, 2024

View reviewed changes

Jokeren reviewed Mar 19, 2024

View reviewed changes

bringlein force-pushed the ngl_pr_autotuner_report branch 2 times, most recently from 4731aad to 7d8b4b6 Compare March 20, 2024 09:20

bringlein force-pushed the ngl_pr_autotuner_report branch from 2918810 to 927f09f Compare March 28, 2024 09:42

jlebar approved these changes Mar 28, 2024

View reviewed changes

python/triton/runtime/autotuner.py Outdated Show resolved Hide resolved

python/triton/runtime/autotuner.py Outdated Show resolved Hide resolved

bringlein and others added 4 commits April 2, 2024 14:24

[AUTOTUNER] adding simple report flag for autotuner runs

ab55087

use existing bench_time; rename flag; update docstring

661a373

autotuner print stats controlled by env variable TRITON_PRINT_AUTOTUNING

9b740cf

Address review comments

23308be

jlebar force-pushed the ngl_pr_autotuner_report branch from 927f09f to 23308be Compare April 2, 2024 21:31

jlebar enabled auto-merge (squash) April 2, 2024 21:33

jlebar merged commit feb13ca into triton-lang:main Apr 2, 2024

jlebar added a commit that referenced this pull request Apr 2, 2024

[Docs] Document the TRITON_PRINT_AUTOTUNING flag added in #3411.

b878416

jlebar added a commit that referenced this pull request Apr 2, 2024

[Docs] Document the TRITON_PRINT_AUTOTUNING flag added in #3411. (#3542)

e14516a

[AUTOTUNER] adding simple report flag for autotuner runs #3411

[AUTOTUNER] adding simple report flag for autotuner runs #3411

Uh oh!

Conversation

bringlein commented Mar 19, 2024

Uh oh!

Jokeren commented Mar 19, 2024

Uh oh!

jlebar left a comment

Choose a reason for hiding this comment

Uh oh!

jlebar Mar 19, 2024

Choose a reason for hiding this comment

Uh oh!

bringlein Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

jlebar Mar 19, 2024

Choose a reason for hiding this comment

Uh oh!

bringlein Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

ThomasRaoux commented Mar 19, 2024

Uh oh!

Jokeren commented Mar 19, 2024

Uh oh!

jlebar commented Mar 19, 2024

Uh oh!

Jokeren commented Mar 19, 2024

Uh oh!

Jokeren Mar 19, 2024

Choose a reason for hiding this comment

Uh oh!

bringlein Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

bringlein commented Mar 20, 2024

Uh oh!

bringlein commented Mar 20, 2024

Uh oh!

Jokeren commented Mar 20, 2024

Uh oh!

ptillet commented Mar 24, 2024

Uh oh!

ThomasRaoux commented Mar 24, 2024

Uh oh!

jlebar commented Mar 25, 2024

Uh oh!

bringlein commented Mar 28, 2024

Uh oh!

jlebar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jlebar commented Apr 1, 2024

Uh oh!

jlebar commented Apr 2, 2024

Uh oh!

bringlein commented Apr 4, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants