Conversation

@metonymic-smokey metonymic-smokey commented May 9, 2025

Include a binary vector index in the vector index section, alongside the float vector index.
Currently, BFlat and BIVF indexes are supported.

  1. Modify the vector index cache to cache binary indexes after reading.
  2. Indexing path: Modify the write and merge segment functionality to include binary vector indexes.
  3. Search path: Perform an oversampled search on the binary index, then use the candidate results to perform individual exact distance computations (rescoring and reranking) to pick the top K.
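The rescoring step in (3) can be sketched roughly as below. This is an illustration, not the PR's actual code: `rescoreTopK` and `exactDistance` are hypothetical helpers, and the `candidates` slice is assumed to be the oversampled ID set returned by the binary index search.

```go
package main

import (
	"fmt"
	"sort"
)

// exactDistance computes the squared L2 distance between two float vectors
// (stand-in for the per-document rescoring step; hypothetical helper).
func exactDistance(a, b []float32) float32 {
	var sum float64
	for i := range a {
		d := float64(a[i] - b[i])
		sum += d * d
	}
	return float32(sum)
}

// rescoreTopK takes the oversampled candidate IDs returned by the binary
// index, recomputes exact distances against the stored float vectors, and
// keeps the k closest. vectors maps candidate ID -> original float vector.
func rescoreTopK(query []float32, candidates []int64,
	vectors map[int64][]float32, k int) ([]int64, []float32) {
	type scored struct {
		id   int64
		dist float32
	}
	rescored := make([]scored, 0, len(candidates))
	for _, id := range candidates {
		rescored = append(rescored, scored{id, exactDistance(query, vectors[id])})
	}
	sort.Slice(rescored, func(i, j int) bool { return rescored[i].dist < rescored[j].dist })
	if k > len(rescored) {
		k = len(rescored)
	}
	ids := make([]int64, k)
	dists := make([]float32, k)
	for i := 0; i < k; i++ {
		ids[i], dists[i] = rescored[i].id, rescored[i].dist
	}
	return ids, dists
}

func main() {
	vectors := map[int64][]float32{
		1: {0, 0}, 2: {1, 1}, 3: {3, 3},
	}
	// Binary search returned candidates {3, 1, 2}; exact rescoring keeps the 2 closest.
	ids, _ := rescoreTopK([]float32{0, 0}, []int64{3, 1, 2}, vectors, 2)
	fmt.Println(ids) // [1 2]
}
```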

@metonymic-smokey metonymic-smokey marked this pull request as ready for review May 12, 2025 15:34
// required info to create a cache entry.
type cacheEntryReqs struct {
	alpha float64
	index *faiss.IndexImpl

maybe just have the indexes array itself here

indexes []*faiss.IndexImpl

func (vc *vectorIndexCache) insertLOCKED(fieldIDPlus1 uint16,
	index *faiss.IndexImpl, vecDocIDMap map[int64]uint32, loadDocVecIDMap bool,
	docVecIDMap map[uint32][]int64) {
func (vc *vectorIndexCache) insertLOCKED(fieldIDPlus1 uint16, ce cacheEntryReqs) {

Take a pointer to the cacheEntryReqs struct; don't copy the struct.

@metonymic-smokey metonymic-smokey force-pushed the bq branch 2 times, most recently from 097fc02 to e652071 Compare May 19, 2025 11:22
}

// return packed binary vectors.
func convertToBinary(vecs []float32) []uint8 {

Rename to vec, as it's only one vector being binarized here (nit).


// return packed binary vectors.
func convertToBinary(vecs []float32) []uint8 {
var packed []uint8

Preallocate packed:

packedLen := (len(vec) + 7) / 8
packed := make([]uint8, 0, packedLen)

bitCount = 0
}

// Optionally, you can handle cases where the number of floats isn't a multiple of 8

Remove this comment, as it's misleading: this case is already handled at L596.

}

// Shift the bit into the correct position in the byte
currentByte |= (bit << (7 - bitCount))

The bit variable seems redundant to me. Isn't the code below equivalent?

if value >= 0.0 {
	currentByte |= (1 << (7 - bitCount))
}
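Putting the review suggestions above together (a single `vec` parameter, a preallocated output slice, and no intermediate `bit` variable), a hedged sketch of the binarization helper might look like this. It is an illustration of the suggested shape, not the merged code:

```go
package main

import "fmt"

// convertToBinary packs one float vector into a sign-bit representation:
// each component >= 0 contributes a 1 bit, negatives a 0 bit, MSB first.
// The output is preallocated and padded to a whole number of bytes.
func convertToBinary(vec []float32) []uint8 {
	packedLen := (len(vec) + 7) / 8
	packed := make([]uint8, packedLen)
	for i, value := range vec {
		if value >= 0.0 {
			// Set the bit directly, no intermediate "bit" variable needed.
			packed[i/8] |= 1 << (7 - i%8)
		}
	}
	return packed
}

func main() {
	// Signs 1,0,1,1 followed by implicit zero padding -> 10110000.
	fmt.Printf("%08b\n", convertToBinary([]float32{0.5, -1, 2, 3})[0])
}
```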

}
defer binaryFaissIndex.Close()

bvecs := convertToBinary(indexData)

Same issue as before: if D is a multiple of 8 this works, but otherwise we will create a wrong index itself (a binary index where multiple X-dimensional vectors from different documents get compressed into one packed vector, thereby losing the posting data).
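One way to address this concern is to pack each vector separately and pad it to a byte boundary, so adjacent documents' vectors never share a byte even when the dimension is not a multiple of 8. The sketch below is a hypothetical batch helper illustrating that idea, not the PR's code; each vector then occupies (dims+7)/8 bytes, matching a binary index declared with the padded dimension.

```go
package main

import "fmt"

// convertVectorsToBinary packs each dims-dimensional vector in data
// separately, padding every vector to a whole number of bytes so that
// vectors from different documents never bleed into one another.
func convertVectorsToBinary(data []float32, dims int) []uint8 {
	bytesPerVec := (dims + 7) / 8
	numVecs := len(data) / dims
	packed := make([]uint8, numVecs*bytesPerVec)
	for v := 0; v < numVecs; v++ {
		base := v * bytesPerVec // byte offset where this vector starts
		for i, value := range data[v*dims : (v+1)*dims] {
			if value >= 0.0 {
				packed[base+i/8] |= 1 << (7 - i%8)
			}
		}
	}
	return packed
}

func main() {
	// Two 4-dimensional vectors: each gets its own byte despite 4 < 8.
	p := convertVectorsToBinary([]float32{1, 1, 1, 1, -1, -1, -1, 1}, 4)
	fmt.Printf("%08b %08b\n", p[0], p[1]) // 11110000 00010000
}
```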

Comment on lines +401 to +416
h := &maxHeap{}
heap.Init(h)
for i := 0; i < len(binIDs); i++ {
	heap.Push(h, &distanceID{distance: distances[i], id: binIDs[i]})
	if h.Len() > int(k) {
		heap.Pop(h)
	}
}

// Pop the top K in reverse order to get them in ascending order
ids := make([]int64, k)
scores := make([]float32, k)
for i := int(k) - 1; i >= 0; i-- {
	distanceID := heap.Pop(h).(*distanceID)
	scores[i] = distanceID.distance
	ids[i] = distanceID.id
}
@CascadingRadium commented May 29, 2025


At this stage, we already have the complete set of ids and their corresponding distances available for Top-K selection. This differs from the K-sized heap approach used in Bleve (see knn.go#L116), where the full result set (N*K) is not known ahead of time and must be streamed or iterated over, which makes the K-sized heap optimal there.

In our case, since the entire dataset is available upfront, a heap is unnecessarily expensive, particularly in space complexity, O(K). This overhead can compound significantly with the number of segments in a highly distributed index, as the heap becomes temporary query-specific state for each segment.

Given that, it would be more efficient to use a Randomized QuickSelect algorithm (https://courses.grainger.illinois.edu/cs473/sp2015/w/lec/15_notes.pdf) for in-place Top-K selection: O(1) instead of O(K) extra space, and expected O(N) time instead of O(N log K) for the heap. This approach reduces memory usage, avoids unnecessary allocations, and suits batch-style processing where the full dataset is already materialized. It also supports the point that we do not actually need a sorted list of distances and ids; we just need the top K in any order, since the upstream implementations already sort the final result set by score. Although this might be a bit of a stretch and potentially not impactful, would you mind coding this out and benchmarking the performance? I understand it may be a time sink, but it could help validate whether the Randomized QuickSelect approach is worthwhile in our context. Thanks a lot!
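The proposed alternative could look roughly like the sketch below: an in-place randomized QuickSelect that partitions the distances/ids slices so the k smallest distances land in the first k slots, in no particular order. This is a sketch of the reviewer's suggestion, not the merged implementation; the function and variable names are hypothetical.

```go
package main

import (
	"fmt"
	"math/rand"
)

// topKQuickSelect partitions distances and ids in place so that the k
// smallest distances (and their matching ids) occupy the first k slots,
// unordered. O(1) extra space, expected O(N) time.
func topKQuickSelect(distances []float32, ids []int64, k int) {
	lo, hi := 0, len(distances)-1
	for lo < hi {
		// Move a random pivot to the end, then partition around it.
		p := lo + rand.Intn(hi-lo+1)
		distances[p], distances[hi] = distances[hi], distances[p]
		ids[p], ids[hi] = ids[hi], ids[p]
		pivot, store := distances[hi], lo
		for i := lo; i < hi; i++ {
			if distances[i] < pivot {
				distances[i], distances[store] = distances[store], distances[i]
				ids[i], ids[store] = ids[store], ids[i]
				store++
			}
		}
		distances[store], distances[hi] = distances[hi], distances[store]
		ids[store], ids[hi] = ids[hi], ids[store]
		switch {
		case store == k-1 || store == k:
			return // first k slots now hold the k smallest
		case store < k:
			lo = store + 1 // top-k boundary lies to the right
		default:
			hi = store - 1 // recurse into the left partition
		}
	}
}

func main() {
	d := []float32{9, 1, 8, 2, 7, 3}
	ids := []int64{90, 10, 80, 20, 70, 30}
	topKQuickSelect(d, ids, 3)
	fmt.Println(ids[:3]) // ids of the 3 smallest distances, in any order
}
```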

@CascadingRadium

Please:

  • rebase on top of master
  • update zap.md to reflect the new index file format
  • update command line tooling to work with binary index
