
device_prop.hpp: move static map to helper function and initialize there #1763

Open
wants to merge 1 commit into base: develop
Conversation

coconutruben

Summary:

Why

  • The map in its current location causes hard-to-debug segfaults, for reasons not yet understood, when running under Inductor without ASan. With the helper, the map is still a static const, so it is still initialized only once, on the first call.

What

  • Move the static name-lookup map out of the inlined get_device_name function and into a helper function at file scope, initializing it there.
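The change described above can be sketched as follows. This is a minimal illustration of the pattern, not the actual contents of device_prop.hpp: the identifiers and the arch-to-name entries are placeholders.

```
#include <string>
#include <unordered_map>

// Helper that owns the lookup table. The map is still a static const, so
// it is initialized exactly once, on the first call; since C++11 that
// initialization is also guaranteed to be thread-safe.
inline const std::unordered_map<std::string, std::string>& device_name_map()
{
    static const std::unordered_map<std::string, std::string> names = {
        {"gfx90a", "MI200"}, // placeholder entries for illustration
        {"gfx942", "MI300"},
    };
    return names;
}

// The inlined getter now only consults the helper instead of defining
// the static map in its own body.
inline std::string get_device_name(const std::string& arch)
{
    const auto& names = device_name_map();
    const auto it     = names.find(arch);
    // Fall back to the raw arch string when no friendly name is known.
    return it == names.end() ? arch : it->second;
}
```

Keeping the static inside a single helper gives the map one well-defined initialization point, instead of a function-local static embedded in an inline function that every including translation unit instantiates.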

Test Plan:

Something like this, run without ASan:

```
import torch

import torch.nn as nn
from torch._inductor import config as inductor_config
from torch._inductor.utils import fresh_inductor_cache

class SimpleModel(nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, x, y):
        return torch.mm(x, y)

M, N, K = 128, 128, 128
dtype = torch.float16
A = torch.randn(M, K, dtype=dtype).cuda()
B = torch.randn(K, N, dtype=dtype).cuda()

# create a fresh inductor cache
with fresh_inductor_cache():
    # sample the different backends independently
    with inductor_config.patch(
        {"max_autotune_gemm_backends": "ATEN,CK"}
    ):
        # compile the model
        compiled_model = torch.compile(SimpleModel(), mode="max-autotune")
        # run the compiled model
        _ = compiled_model(A, B)
```

Checklist

Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.

  • I have added tests relevant to the introduced functionality, and the unit tests are passing locally
  • I have added inline documentation which enables the maintainers to understand the motivation
  • I have removed the stale documentation which is no longer relevant after this pull request
  • (If this change is user-facing) I have added release notes which provide the end users with a brief summary of the improvement from this pull request
  • I have run clang-format on all changed files
  • Any dependent changes have been merged

zjing14 (Contributor) commented Dec 18, 2024

@illsilin @carlushuang Could you review it?
