Update on "[bc-breaking] Generalize FakeQuantizeConfig beyond intx"

andrewor14 · andrewor14 · commit 8245cee7b6cf · 2025-07-31T14:06:43.000-07:00
**Summary:** The existing `FakeQuantizeConfig` performs only
intx quantization, but we plan to extend QAT to other dtypes
such as fp8 and nvfp4 in the near future. This is the necessary
refactor before that. Specifically:

```
# New abstract class
FakeQuantizeConfigBase
# Rename
FakeQuantizeConfig -&gt; IntxFakeQuantizeConfig
```

In the future, we will have other types of `FakeQuantizeConfigBase`
for float dtypes that users can pass in instead of the existing
Intx one.

**BC-breaking notes:** For BC, we keep around the old names to
reference the new ones. However, this commit is still BC-breaking
in the sense that a few APIs now accept the abstract
`FakeQuantizeConfigBase` instead. For the most part, this abstract
class will be hidden from the user.

Before:
```
activation_config = FakeQuantizeConfig(torch.int8, "per_token", is_symmetric=False)
weight_config = FakeQuantizeConfig(torch.int4, group_size=32)
```

After:
```
activation_config = IntxFakeQuantizeConfig(torch.int8, "per_token", is_symmetric=False)
weight_config = IntxFakeQuantizeConfig(torch.int4, group_size=32)
```

**Test Plan:**
python test/quantization/test_qat.py

[ghstack-poisoned]
diff --git a/torchao/quantization/qat/fake_quantize_config.py b/torchao/quantization/qat/fake_quantize_config.py
@@ -25,7 +25,6 @@
 )
 
 
-@dataclass
 class FakeQuantizeConfigBase(abc.ABC):
     """
     Base class for representing fake quantization config.

Original file line number	Diff line number	Diff line change
`@@ -25,7 +25,6 @@`
`25`	`25`	`)`
`26`	`26`
`27`	`27`
`28`		`-@dataclass`
`29`	`28`	`class FakeQuantizeConfigBase(abc.ABC):`
`30`	`29`	`"""`
`31`	`30`	`Base class for representing fake quantization config.`