bfloat16 (bf16) support in faiss #3862

mlomeli1 · 2024-09-16T09:48:32Z

Many LLMs are trained with bf16, if we want to use the hidden states of LLMs for retrieval, those vectors will be in bf16 dtype. It would be helpful to support bf16 in Faiss so that we can use LLMs as retriever or embedding model.

mdouze · 2024-09-16T11:14:07Z

Note that we cannot pass through the numpy wrapper because numpy does not support bf16.
Adapting gpu_knn code for pytorch should be easy

faiss/contrib/torch_utils.py

Line 497 in dc55e11

    
           def torch_replacement_knn_gpu(res, xq, xb, k, D=None, I=None, metric=faiss.METRIC_L2, device=-1, use_raft=False):

mdouze · 2024-09-17T13:09:45Z

add bfloat16 support here

https://github.com/facebookresearch/faiss/blob/main/faiss/gpu/GpuDistance.h#L76

mlomeli1 · 2024-10-18T17:49:18Z

Example of how we currently would use a PQ codec to encode/decode pytorch bf16 tensors:

torch.from_numpy( codec.sa_decode(codec.sa_encode(x.to(device='cpu', dtype=torch.float32).numpy()) )

this piece of code showcases all the cpu moves + up casting + converting to numpy array. Successively, at decoding, we need to convert back to a tensor. Ideally, we could avoid some of these if this was supported for pytorch tensors.

alexanderguzhva · 2024-10-18T21:47:27Z

just in case, there is a ScalarQuantizer implementation for bf16, maybe portions of it can be reused

asadoughi changed the title ~~bf16 support in faiss~~ bfloat16 (bf16) support in faiss Sep 16, 2024

mnorris11 added the Implementation label Sep 16, 2024

asadoughi added the feature request label Sep 17, 2024

asadoughi added the backlog label Oct 28, 2024

mdouze mentioned this issue Nov 5, 2024

Faiss GPU: bfloat16 brute-force kNN support #4014

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bfloat16 (bf16) support in faiss #3862

bfloat16 (bf16) support in faiss #3862

mlomeli1 commented Sep 16, 2024

mdouze commented Sep 16, 2024

mdouze commented Sep 17, 2024

mlomeli1 commented Oct 18, 2024

alexanderguzhva commented Oct 18, 2024

bfloat16 (bf16) support in faiss #3862

bfloat16 (bf16) support in faiss #3862

Comments

mlomeli1 commented Sep 16, 2024

mdouze commented Sep 16, 2024

mdouze commented Sep 17, 2024

mlomeli1 commented Oct 18, 2024

alexanderguzhva commented Oct 18, 2024