Show HN: A GPU-accelerated binary vector index This is a vector index I built that supports insertion and k-nearest neighbors (k-NN) querying, optimized for GPUs. It operates entirely in CUDA and can process queries on half a billion vectors in under 200 milliseconds. The codebase is structured as a standalone library with an HTTP API for remote access. It’s intended for high-performance search tasks—think similarity search, AI model retrieval, or reinforcement learning replay buffers. The codebase is located at https://ift.tt/RuCU0Dz . https://ift.tt/g0aE6n5 February 17, 2025 at 06:15AM
0 Comments: