Skip to content

Conversation

Copy link

Copilot AI commented Jan 12, 2026

Plan to add iq4_nl quantization support based on iq4_nl branch

  • Checkout files from iq4_nl branch containing mixed precision quantization support
  • Add "iq4_nl" entry to GGML_FTYPE_MAP for ftype parsing
  • Add "iq4_nl" entry to GGML_TYPE_MAP for qtype parsing (for --tensor-type option)
  • Update ggml_parse_ftype to handle 'i' prefix for iq4_nl type
  • Add GGML_FTYPE_MOSTLY_IQ4_NL case to first ggml_common_quantize_0 function
  • Add GGML_TYPE_IQ4_NL case to first quantization switch statement
  • Add GGML_FTYPE_MOSTLY_IQ4_NL case to second ggml_common_quantize_0 function (with tensor_quant_specs)
  • Add GGML_TYPE_IQ4_NL case to second quantization switch statement
  • Build the project to verify the changes compile
  • Test the quantize tool with iq4_nl parameter to ensure it works correctly
  • Fix typo in ggml_quantize_chunk call
  • Code review and final verification
Original prompt

请基于iq4_nl分支,添加对iq4_nl的量化支持,主要是修改quantize代码


💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Co-authored-by: lhpqaq <63844184+lhpqaq@users.noreply.github.com>
Copilot AI changed the title [WIP] Add quantization support for iq4_nl Add iq4_nl quantization support to whisper.cpp quantize tool Jan 12, 2026
Copilot AI requested a review from lhpqaq January 12, 2026 07:26
@lhpqaq
Copy link
Owner

lhpqaq commented Jan 12, 2026

@copilot 你要基于iq4_nl 分支修改,注意iq4_nl 分支里有关于混合精度量化的内容,参考master...lhpqaq:whisper.cpp:iq4_nl

Copy link
Author

Copilot AI commented Jan 12, 2026

@copilot 你要基于iq4_nl 分支修改,注意iq4_nl 分支里有关于混合精度量化的内容,参考master...lhpqaq:whisper.cpp:iq4_nl

我已经基于iq4_nl分支重新实现了修改。现在的代码包含了iq4_nl分支中的混合精度量化功能,并添加了对iq4_nl量化类型的完整支持。

主要变更:

  • 从iq4_nl分支合并了混合精度量化功能(包括 ggml_parse_qtype 函数和 --tensor-type 选项)
  • 在两个 GGML_FTYPE_MAPGGML_TYPE_MAP 中添加了 "iq4_nl" 条目
  • 在两个 ggml_common_quantize_0 函数(标准版本和带 tensor_quant_specs 的扩展版本)中都添加了 IQ4_NL 支持
  • 更新了parser以支持 'i' 前缀

提交: 5f562c4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants