faiss_hnsw support INT8 #991

cydrain · 2024-12-17T08:19:18Z

Issue: #977

Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

sre-ci-robot · 2024-12-17T08:19:23Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cydrain

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [cydrain]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mergify · 2024-12-17T08:20:30Z

@cydrain 🔍 Important: PR Classification Needed!

For efficient project management and a seamless review process, it's essential to classify your PR correctly. Here's how:

If you're fixing a bug, label it as kind/bug.
For small tweaks (less than 20 lines without altering any functionality), please use kind/improvement.
Significant changes that don't modify existing functionalities should be tagged as kind/enhancement.
Adjusting APIs or changing functionality? Go with kind/feature.

For any PR outside the kind/improvement category, ensure you link to the associated issue using the format: “issue: #”.

Thanks for your efforts and contribution to the community!.

cydrain · 2024-12-17T09:39:59Z

/kind improvement

codecov · 2024-12-17T09:45:43Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 74.00%. Comparing base (3c46f4c) to head (9023778).
Report is 272 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff            @@
##           main     #991       +/-   ##
=========================================
+ Coverage      0   74.00%   +74.00%     
=========================================
  Files         0       82       +82     
  Lines         0     6948     +6948     
=========================================
+ Hits          0     5142     +5142     
- Misses        0     1806     +1806

see 82 files with indirect coverage changes

alexanderguzhva

lgtm overall, but please confirm about QT_8bit_direct_signed. Thanks.

alexanderguzhva · 2024-12-17T15:12:34Z

src/index/hnsw/faiss_hnsw.cc

+    } else if (dst_data_format == DataFormatEnum::int8) {
+        knowhere::int8* const dst = reinterpret_cast<knowhere::int8*>(dst_in);
+        for (size_t i = 0; i < nrows * dim; i++) {
+            KNOWHERE_THROW_IF_NOT_MSG(src[i] >= std::numeric_limits<knowhere::int8>::min() &&


it is better to use std::numeric_limilts<knowhere::int8>::lowest() here

Hi Alex, what's the difference between min() and lowest() here?

there's no difference for this particular use case. But lowest() is better to use, because of the connotations with std::numeric<float>::lowest() (which is -1e+40, the least value) and std::numeric<float>::min() (which is 1e-40, the least representable positive value)

alexanderguzhva · 2024-12-17T15:15:05Z

src/index/hnsw/faiss_hnsw.cc

@@ -1327,6 +1350,9 @@ class BaseFaissRegularIndexHNSWFlatNode : public BaseFaissRegularIndexHNSWNode {
            } else if (data_format == DataFormatEnum::bf16) {
                hnsw_index = std::make_unique<faiss::IndexHNSWSQCosine>(dim, faiss::ScalarQuantizer::QT_bf16,
                                                                        hnsw_cfg.M.value());
+            } else if (data_format == DataFormatEnum::int8) {
+                hnsw_index = std::make_unique<faiss::IndexHNSWSQCosine>(
+                    dim, faiss::ScalarQuantizer::QT_8bit_direct_signed, hnsw_cfg.M.value());


please DO confirm that you want to use QT_8bit_direct_signed here, because the use case is not clear to me. Basically, I can imagine a use case that works with the input data of [0..255] range (QT_8bit_direct), or the traditional QT_8bit that remaps input float values into [0..255] range, but what is the use case for the input data of [-128..127] range? Or is it just the requirement from Milvus?

it's the requirement from Milvus, since vespa and qdrant already support Vector_Int8 now

Hi Alex, I see no obvious difference between min() and lowest(), I prefer to use min() and max() in pair.

alexanderguzhva · 2024-12-18T20:40:31Z

@cydrain lgtm
please let me know if you'd like to change min() to lowest() and I'll lgtm this diff in either cases

alexanderguzhva · 2024-12-19T01:27:16Z

/lgtm

faiss_hnsw support INT8

9023778

Signed-off-by: Cai Yudong <yudong.cai@zilliz.com>

sre-ci-robot requested review from chasingegg and cqy123456 December 17, 2024 08:19

sre-ci-robot added approved size/L labels Dec 17, 2024

mergify bot added the dco-passed label Dec 17, 2024

mergify bot added the do-not-merge/missing-related-issue label Dec 17, 2024

sre-ci-robot added the kind/improvement label Dec 17, 2024

mergify bot added ci-passed and removed do-not-merge/missing-related-issue labels Dec 17, 2024

cydrain mentioned this pull request Dec 17, 2024

Faiss native support multi data types #977

Open

6 tasks

alexanderguzhva reviewed Dec 17, 2024

View reviewed changes

sre-ci-robot assigned alexanderguzhva Dec 19, 2024

sre-ci-robot added the lgtm label Dec 19, 2024

sre-ci-robot merged commit ca4ba32 into zilliztech:main Dec 19, 2024
14 checks passed

cydrain deleted the caiyd_977_faiss_native_support_multi_datatype branch December 19, 2024 01:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

faiss_hnsw support INT8 #991

faiss_hnsw support INT8 #991

cydrain commented Dec 17, 2024 •

edited

Loading

sre-ci-robot commented Dec 17, 2024

mergify bot commented Dec 17, 2024

cydrain commented Dec 17, 2024

codecov bot commented Dec 17, 2024 •

edited

Loading

alexanderguzhva left a comment

alexanderguzhva Dec 17, 2024

cydrain Dec 18, 2024

alexanderguzhva Dec 18, 2024

alexanderguzhva Dec 17, 2024

cydrain Dec 18, 2024

cydrain Dec 19, 2024

alexanderguzhva commented Dec 18, 2024

alexanderguzhva commented Dec 19, 2024

faiss_hnsw support INT8 #991

faiss_hnsw support INT8 #991

Conversation

cydrain commented Dec 17, 2024 • edited Loading

sre-ci-robot commented Dec 17, 2024

mergify bot commented Dec 17, 2024

cydrain commented Dec 17, 2024

codecov bot commented Dec 17, 2024 • edited Loading

Codecov Report

alexanderguzhva left a comment

Choose a reason for hiding this comment

alexanderguzhva Dec 17, 2024

Choose a reason for hiding this comment

cydrain Dec 18, 2024

Choose a reason for hiding this comment

alexanderguzhva Dec 18, 2024

Choose a reason for hiding this comment

alexanderguzhva Dec 17, 2024

Choose a reason for hiding this comment

cydrain Dec 18, 2024

Choose a reason for hiding this comment

cydrain Dec 19, 2024

Choose a reason for hiding this comment

alexanderguzhva commented Dec 18, 2024

alexanderguzhva commented Dec 19, 2024

cydrain commented Dec 17, 2024 •

edited

Loading

codecov bot commented Dec 17, 2024 •

edited

Loading