hnsw support fp16/bf16 #494

cqy123456 · 2024-04-09T13:08:06Z

issue: #287

mergify · 2024-04-09T13:09:01Z

@cqy123456 🔍 Important: PR Classification Needed!

For efficient project management and a seamless review process, it's essential to classify your PR correctly. Here's how:

If you're fixing a bug, label it as kind/bug.
For small tweaks (less than 20 lines without altering any functionality), please use kind/improvement.
Significant changes that don't modify existing functionalities should be tagged as kind/enhancement.
Adjusting APIs or changing functionality? Go with kind/feature.

For any PR outside the kind/improvement category, ensure you link to the associated issue using the format: “issue: #”.

Thanks for your efforts and contribution to the community!

alexanderguzhva · 2024-04-09T13:42:31Z

src/common/utils.cc

+    for (auto i = 0; i < d; i++) {
+        norm_l2_sqr += (float)x[i] * (float)x[i];
+    }
+    if (norm_l2_sqr > 0 && std::abs(1.0f - norm_l2_sqr) > FloatAccuracy) {


just in case for the future: should FloatAccuracy remain the same for bf16 and fp16?

I think float16 can use its smallest positive normal value (about 6.1 x 10^(-5)) as the tolerance, and bfloat16 can use the same tolerance as float32.
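A minimal sketch of what a per-type tolerance along these lines could look like. The tag types `Fp16Tag` and `Bf16Tag`, the helper names, and the float32 tolerance value are all hypothetical placeholders, since the actual `FloatAccuracy` value and the project's 16-bit types are not shown in this thread.

```cpp
#include <cmath>

// Hypothetical tag types standing in for the real 16-bit element types.
struct Fp16Tag {};
struct Bf16Tag {};

// Assumed placeholder: the actual FloatAccuracy value is not shown in this thread.
constexpr float kFloat32Tolerance = 1e-5f;

// Per-type tolerance following the suggestion above:
// float32 and bf16 share one value, fp16 gets a looser one.
template <typename T>
constexpr float NormTolerance() {
    return kFloat32Tolerance;
}

template <>
constexpr float NormTolerance<Fp16Tag>() {
    return 6.1e-5f;  // approximately the smallest positive normal fp16 value
}

// Returns true when a vector with this squared L2 norm is not yet unit length
// within the tolerance for element type T. The norm itself is accumulated in
// float32, as in the diff above.
template <typename T>
bool NeedsNormalize(float norm_l2_sqr) {
    return norm_l2_sqr > 0 && std::abs(1.0f - norm_l2_sqr) > NormTolerance<T>();
}
```

With these assumed values, a squared norm of 1.00003 would be treated as already normalized for fp16 but not for float32 or bf16.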

alexanderguzhva · 2024-04-09T13:45:40Z

tests/ut/test_iterator.cc

@@ -317,65 +317,65 @@ TEST_CASE("Test Iterator IVFFlatCC With Newly Insert Vectors", "[float metrics]
    }
 }

-TEST_CASE("Test Iterator Mem Index With Binary Metrics", "[float metrics]") {


any comments / TODO about the reason for disabling this test?

The metric type and data type did not match in this case. I changed the data type to binary, and the test works now.

alexanderguzhva · 2024-04-09T13:48:45Z

thirdparty/hnswlib/hnswlib/space_ip.h

-    float res = 0;
-    for (unsigned i = 0; i < qty; i++) {
-        res += ((float*)pVect1)[i] * ((float*)pVect2)[i];
+    if constexpr (!std::is_same<DataType, float>::value) {


is_same_v
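For reference, a small sketch of the suggested spelling. `std::is_same_v<A, B>` (C++17) is shorthand for `std::is_same<A, B>::value`. The function name `InnerProduct` and its typed-pointer signature are illustrative only and do not match the `void*`-based signature in the diff.

```cpp
#include <cstddef>
#include <type_traits>

// Illustrative inner product: non-float element types are widened to float
// before multiplying, float inputs are used directly.
template <typename DataType>
float InnerProduct(const DataType* a, const DataType* b, std::size_t n) {
    float res = 0.0f;
    for (std::size_t i = 0; i < n; i++) {
        if constexpr (!std::is_same_v<DataType, float>) {
            res += static_cast<float>(a[i]) * static_cast<float>(b[i]);
        } else {
            res += a[i] * b[i];
        }
    }
    return res;
}
```

The same substitution applies to the other `if constexpr` checks flagged in this review.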

alexanderguzhva · 2024-04-09T13:49:34Z

thirdparty/hnswlib/hnswlib/space_l2.h

+template <typename DataType, typename DistanceType>
+static DistanceType
+NormSqr(const void* pVect1v, const void* qty_ptr) {
+    if constexpr (!std::is_same<DataType, float>::value) {


is_same_v

alexanderguzhva · 2024-04-09T13:49:47Z

thirdparty/hnswlib/hnswlib/space_l2.h

+template <typename DataType, typename DistanceType>
+static DistanceType
+L2Sqr(const void* pVect1v, const void* pVect2v, const void* qty_ptr) {
+    if constexpr (!std::is_same<DataType, float>::value) {


is_same_v

codecov · 2024-04-10T07:49:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 71.11%. Comparing base (3c46f4c) to head (4f4908e).
Report is 14 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff            @@
##           main     #494       +/-   ##
=========================================
+ Coverage      0   71.11%   +71.11%     
=========================================
  Files         0       67       +67     
  Lines         0     4393     +4393     
=========================================
+ Hits          0     3124     +3124     
- Misses        0     1269     +1269

see 67 files with indirect coverage changes

cqy123456 · 2024-04-10T08:12:22Z

When the metric type is cosine, an fp16 query vector is normalized into a new fp16 vector. Compared with float32 (fraction = 23 bits), bf16 has far fewer fraction bits (fraction = 7), so precision is lost during normalization. As a result, the e2e test fails the fp16 recall check (0.9817 >= 0.99 does not hold).

alexanderguzhva · 2024-04-10T14:23:39Z

/lgtm

src/common/utils.cc

cydrain · 2024-04-11T07:44:27Z

src/index/hnsw/hnsw.cc

-        auto index = new (std::nothrow) hnswlib::HierarchicalNSW<DistType, quant_type>(space, rows, hnsw_cfg.M.value(),
-                                                                                       hnsw_cfg.efConstruction.value());
+
+        auto index = new (std::nothrow) hnswlib::HierarchicalNSW<DataType, DistType, quant_type>(


should use "quantType" instead of "quant_type" to sync up the coding style

cydrain · 2024-04-11T07:52:04Z

thirdparty/hnswlib/hnswlib/hnswlib.h

@@ -183,7 +183,7 @@ struct IteratorWorkspace {
    // normalized_query_data(if any). Thus storing the normalized_query_data
    // separately in a unique_ptr so it can be freed when finished.
    IteratorWorkspace(const void* query_data, const size_t num_elements, const size_t seed_ef, const bool for_tuning,
-                      std::unique_ptr<float[]> normalized_query_data, const knowhere::BitsetView& bitset,
+                      std::unique_ptr<char[]> normalized_query_data, const knowhere::BitsetView& bitset,


why change float[] to char[] ?

To support float16 and bfloat16. hnswlib also uses char[] to store data.

Presburger · 2024-04-12T08:07:32Z

thirdparty/hnswlib/hnswlib/hnswalg.h

+        if constexpr (knowhere::KnowhereFloatTypeCheck<data_t>::value) {
+            if (metric_type_ == Metric::COSINE) {
+                auto normalized_query = std::make_unique<char[]>(space_->get_data_size());
+                std::memcpy(normalized_query.get(), query_data, space_->get_data_size());


memcpy_s maybe safer?

The signature is memcpy_s(void *dest, size_t numberOfElements, const void *src, size_t count). In this code, numberOfElements and count cannot be distinguished, and memcpy_s is not as portable as memcpy.
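A minimal sketch of the pattern under discussion: the query is copied into a byte buffer whose size is a byte count, so the same code path serves float32, fp16, and bf16 elements. `CopyQueryBytes` is a hypothetical helper name, and `data_size` stands in for the value returned by `space_->get_data_size()`.

```cpp
#include <cstddef>
#include <cstring>
#include <memory>

// Copies data_size bytes of query data into a freshly allocated byte buffer.
// The destination is allocated with exactly data_size bytes, so the
// destination capacity and the copy length are the same value here.
std::unique_ptr<char[]> CopyQueryBytes(const void* query_data, std::size_t data_size) {
    auto buffer = std::make_unique<char[]>(data_size);
    std::memcpy(buffer.get(), query_data, data_size);
    return buffer;
}
```

Because the buffer is sized from the same value that is passed as the copy length, a bounds-checked copy would receive two identical size arguments in this sketch.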

alexanderguzhva · 2024-04-16T12:09:26Z

/lgtm

Signed-off-by: cqy123456 <[email protected]>

Presburger · 2024-04-22T02:39:21Z

/lgtm

Signed-off-by: cqy123456 <[email protected]>

Presburger · 2024-04-22T06:57:08Z

/lgtm

Presburger · 2024-04-22T06:57:25Z

/approve

cqy123456 · 2024-04-22T14:04:07Z

/approve

sre-ci-robot · 2024-04-22T14:04:19Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cqy123456, Presburger

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [Presburger,cqy123456]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

alexanderguzhva · 2024-04-22T14:11:07Z

/lgtm

sre-ci-robot requested review from foxspy and zhengbuqian April 9, 2024 13:08

sre-ci-robot added approved size/XL labels Apr 9, 2024

mergify bot added the dco-passed label Apr 9, 2024

mergify bot added the do-not-merge/missing-related-issue label Apr 9, 2024

alexanderguzhva reviewed Apr 9, 2024

cqy123456 force-pushed the hnsw-fp16 branch 2 times, most recently from 5a468bb to 3221ccf Compare April 10, 2024 07:01

sre-ci-robot added size/L and removed size/XL labels Apr 10, 2024

sre-ci-robot assigned alexanderguzhva Apr 10, 2024

sre-ci-robot added the lgtm label Apr 10, 2024

cydrain reviewed Apr 11, 2024

cydrain reviewed Apr 11, 2024

Presburger reviewed Apr 12, 2024

cqy123456 force-pushed the hnsw-fp16 branch from 3221ccf to c0a5fbc Compare April 15, 2024 04:07

sre-ci-robot removed the lgtm label Apr 15, 2024

cqy123456 force-pushed the hnsw-fp16 branch from c0a5fbc to 369aaca Compare April 15, 2024 04:08

sre-ci-robot added the lgtm label Apr 16, 2024

cqy123456 force-pushed the hnsw-fp16 branch from 369aaca to 3ef5738 Compare April 19, 2024 03:32

sre-ci-robot removed the lgtm label Apr 19, 2024

zhengbuqian mentioned this pull request Apr 19, 2024

improve the iterator implementation #500

Closed

hnsw support fp16/bf16

4f4908e

Signed-off-by: cqy123456 <[email protected]>

cqy123456 force-pushed the hnsw-fp16 branch from 3ef5738 to 4f4908e Compare April 19, 2024 10:35

sre-ci-robot added size/XL and removed size/L labels Apr 19, 2024

mergify bot added the ci-passed label Apr 19, 2024

sre-ci-robot assigned Presburger Apr 22, 2024

sre-ci-robot added the lgtm label Apr 22, 2024

cqy123456 added the kind/improvement label Apr 22, 2024

mergify bot removed the do-not-merge/missing-related-issue label Apr 22, 2024

Presburger pushed a commit to Presburger/knowhere that referenced this pull request Apr 22, 2024

hnsw support fp16/bf16 (zilliztech#494)

6efb02b

Signed-off-by: cqy123456 <[email protected]>

cqy123456 closed this May 6, 2024
