Binary Vector Search at 350GB/S Using ARM Neon
4 Equiet 2 7/18/2025, 7:02:41 PM topk.io ↗
Comments (2)
MarekDlugos · 2h ago
re: optimization for 1024b vectors — do you pad shorter ones, or fallback to a more general kernel?
marekgalovic · 2h ago
We do a projection of the original vectors so that it matches one of our optimized kernel. This generally gives us better recall vs. simple padding since all bits are utilized.