KNN early termination in Manticore Search (manticoresearch.com) AI

Manticore Search’s blog explains how “early termination” for HNSW-based KNN vector search detects when the result set has converged and stops graph traversal before the exploration budget is exhausted. Convergence is detected with a discovery-rate signal, adaptive quantile thresholds, and a patience counter. Benchmarks on a 1M-vector dataset report substantial reductions in distance computations at large k (e.g., ~65% of the work at k=60, ~30% at k=1000, ~20% at k=10000) while keeping precision loss within ~2–4%; latency improves further under concurrent load. Early termination is enabled by default, is disabled automatically for small k (<=10), and can be turned off per query when maximum recall or deterministic benchmarking is needed.
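
The idea of stopping once new results stop arriving can be illustrated with a small sketch. This is not Manticore's implementation: the function name, the windowing scheme, the way the threshold is derived from a quantile of past discovery rates, and all default parameters are assumptions made for illustration. A plain stream of distances stands in for HNSW graph traversal.

```python
import heapq


def quantile(values, q):
    """Nearest-rank style quantile of a non-empty sequence, q in [0, 1]."""
    ordered = sorted(values)
    return ordered[min(len(ordered) - 1, int(q * len(ordered)))]


def search_with_early_termination(distances, k, window=50, q=0.9,
                                  fraction=0.05, patience=3, min_k=10):
    """Keep the k smallest distances, stopping early once results converge.

    Hypothetical sketch: `distances` is the order in which candidates are
    visited. Returns (sorted top-k distances, number of distances examined).
    """
    early_stop = k > min_k      # assumption: skip the check for small k
    heap = []                   # max-heap of the current top-k (negated values)
    rates = []                  # discovery rate of each completed window
    hits = stale = examined = 0

    for d in distances:
        examined += 1
        if len(heap) < k:
            heapq.heappush(heap, -d)
            hits += 1
        elif d < -heap[0]:      # candidate beats the current worst result
            heapq.heapreplace(heap, -d)
            hits += 1

        if early_stop and examined % window == 0:
            rate = hits / window            # discovery-rate signal
            hits = 0
            if rates and len(heap) == k:
                # Adaptive threshold: a fraction of a high quantile of
                # the discovery rates seen so far (an assumed formula).
                threshold = fraction * quantile(rates, q)
                # Patience counter: consecutive low-discovery windows.
                stale = stale + 1 if rate <= threshold else 0
                if stale >= patience:
                    break
            rates.append(rate)

    return sorted(-x for x in heap), examined


if __name__ == "__main__":
    # Good candidates arrive first, as a rough stand-in for greedy traversal.
    stream = [float(i) for i in range(2000)]
    top, examined = search_with_early_termination(stream, k=20)
    print(len(top), examined)
```

In this toy setup the loop stops after a few windows with no new top-k entries, while a query with k at or below `min_k` scans the whole stream. Real traversal order, thresholds, and savings depend on the index and the data.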

June 08, 2026 01:00 Source: Hacker News