
Migrate export_llama to new ao quantize API #8422

@jackzhxng


🚀 The feature, motivation and pitch

`Int8DynActInt4WeightQuantizer`, used for `-qmode 8da4w`, is no longer being developed by ao and does not support bias. Migrate to the new `quantize_` API, which can take in `int8_dynamic_activation_int4_weight`.

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @mergennachin @iseeyuan @lucylq @helunwencser @tarun292 @kimishpatel @cccclai

Metadata

Labels

module: examples — Issues related to demos under examples/
module: llm — Issues related to LLM examples and apps, and to the extensions/llm/ code
triaged — This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Status

Done
