Approximate Top-K operation for TensorFlow

The operation allows to use approximate nearest neighbors search to do faster top-k retrievals over a set of embeddings with the assumption that the underlying embeddings do not change very quickly.

Installation

$ git clone https://github.com/criteo-research/tensorflow_approximate_top_k
$ cd tensorflow_approximate_top_k
$ cmake .
$ make

Example

This will index all_embs 2-D Tensor and will query it with target_embs retrieving 10 (=k) closest embeddings. With parameters num_trees and num_iters_per_update we can control the quality of our approximation vs running time.

import tensorflow as tf
import numpy as np
tf.enable_eager_execution()

lib_path = "~/tensorflow_approximate_top_k/approximate_top_k"
lib = tf.load_op_library(lib_path)

dim = 5
all_embs = rng.rand(10, dim).astype(np.float32)
target_embs = rng.rand(2, dim).astype(np.float32)
sample_ids = lib.approximate_top_k(
    all_embs, 
    target_embs,
    k=10, 
    num_trees=16,
    num_iters_per_update=100, 
    metric="cosine")

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
annoy @ 211ebe4		annoy @ 211ebe4
cmake/modules		cmake/modules
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
cmake_install.cmake		cmake_install.cmake

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Approximate Top-K operation for TensorFlow

Installation

Example

About

Releases

Packages

Languages

License

criteo-research/tensorflow_approximate_top_k

Folders and files

Latest commit

History

Repository files navigation

Approximate Top-K operation for TensorFlow

Installation

Example

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages