Skip to content

Conversation

wenhuach21
Copy link
Contributor

@wenhuach21 wenhuach21 commented Sep 25, 2025

  • support 16 bits in options
  • support gguf
  • support mixed data types; the algorithm can mix them, but exporting cannot
  • gguf group_size fallback
  • check gguf scale_dtype
  • layer_config ut
  • support FP8 model
  • AutoScheme could also be patched by group_size, etc, fill with default value
  • gguf could not be mixed with other formats
  • support non_str dataset
  • support shared_layers
  • support naive methods
  • test_large_model

wenhuach21 and others added 30 commits September 25, 2025 14:14
@wenhuach21 wenhuach21 changed the title [WIP]try to enable auto_scheme API [WIP]support automatic mixed bits assignment Oct 9, 2025
@wenhuach21
Copy link
Contributor Author

@xin3he @n1ck-guo
please have a review of this function first

def compute_layer_bits(
    layer: torch.nn.Module,
    ignore_scale_zp_bits: bool = False,
) -> tuple[int, float]:
    """Compute total and average bitwidth for a single quantized layer.

    Counts the raw weight bits plus the overhead of quantization metadata:
    either per-group scales/zero-points, or GGUF-style double quantization
    (per-group scale/min quantized to ``super_bits`` plus 32-bit values per
    super-group).

    Args:
        layer: A PyTorch layer carrying quantization attributes (``bits``,
            ``group_size``, and optionally ``super_group_size``/``super_bits``).
        ignore_scale_zp_bits: Whether to ignore scale/zero-point overhead and
            count only the weight bits themselves.

    Returns:
        A tuple ``(total_bits, avg_bits)`` where ``avg_bits`` always equals
        ``total_bits / n_param``.

    Raises:
        ValueError: If ``group_size`` is negative and not the sentinel ``-1``.
    """
    n_param = layer.weight.numel()
    weight_bits = getattr(layer, "bits", 16)
    group_size = getattr(layer, "group_size", 128)
    super_group_size = getattr(layer, "super_group_size", None)
    super_weight_bits = getattr(layer, "super_bits", None)

    # Unquantized layer, or caller asked to ignore scale/zp overhead.
    if weight_bits >= 16 or ignore_scale_zp_bits:
        # GGUF stores nominally 16-bit (unquantized) tensors as fp32, so reset
        # to 32 bits. This must only apply to genuinely unquantized layers:
        # a low-bit GGUF layer evaluated with ignore_scale_zp_bits=True still
        # costs its real weight bits (the original unconditionally returned 32
        # here whenever super_bits was set).
        if super_weight_bits is not None and weight_bits >= 16:
            return 32 * n_param, 32.0
        # avg must equal total / n_param, i.e. the weight bitwidth (the
        # original hard-coded 16.0, which was wrong for low-bit layers when
        # ignoring scale/zp overhead).
        return weight_bits * n_param, float(weight_bits)

    in_features, out_features = get_layer_features(layer)

    # Determine number of quantization groups from the group size.
    if group_size > 0:
        # ceil(in_features / group_size) groups per output row. The extra
        # parentheses matter: multiplying before the floor-division (as the
        # original did) over-counts groups whenever in_features is not a
        # multiple of group_size.
        n_group = out_features * ((in_features + group_size - 1) // group_size)
    elif group_size == 0:
        n_group = 1  # per-tensor quantization: a single group
    elif group_size == -1:
        n_group = out_features  # per-output-channel quantization
    else:
        raise ValueError(f"Invalid group_size {group_size}")

    # Auxiliary bits: scales/zero-points, or double-quantization metadata.
    if not super_group_size:
        scale_bits = 16  # fp16 scale per group
        zp_bits = weight_bits  # zero-point stored at the weight precision
        aux_total_bits = n_group * (scale_bits + zp_bits)
    else:
        # GGUF k-quant style: per-group scale and min quantized to super_bits…
        aux_total_bits = n_group * super_weight_bits * 2
        n_super_group = (n_group + super_group_size - 1) // super_group_size
        # …plus one 32-bit scale and one 32-bit min per super-group.
        aux_total_bits += n_super_group * 32 * 2

    total_bits = weight_bits * n_param + aux_total_bits
    return total_bits, total_bits / n_param

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant