Show HN: I embedded 685M public texts in 32 minutes (on 8x A100, Rust, TensorRT) (github.com) AI

IgniteMS, a GitHub project from Artain-AI, is a self-hosted batch text-embedding engine written in Rust and built on NVIDIA TensorRT. It claims high throughput on multi-GPU setups for search, RAG, and reindexing workloads. The authors report production results, including embedding about 685M public texts in roughly 32 minutes on 8x A100 GPUs, and provide benchmark comparisons against Hugging Face TEI and other tools. They also note that the first run compiles the TensorRT engine and caches it, which makes subsequent runs faster.
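For scale, the headline figures can be converted into a throughput rate. The sketch below is back-of-the-envelope arithmetic only: it treats the rounded numbers from the summary (685M texts, 32 minutes, 8 GPUs) as exact and assumes the work was spread evenly across GPUs, neither of which the source confirms. The `throughput` helper is a hypothetical name, not part of IgniteMS.

```python
def throughput(total_texts: float, minutes: float, gpus: int) -> tuple[float, float]:
    """Return (aggregate texts/s, per-GPU texts/s) for a batch run."""
    seconds = minutes * 60
    aggregate = total_texts / seconds
    # Assumption: load is split evenly across all GPUs.
    return aggregate, aggregate / gpus


# Rounded figures from the summary, treated as exact for illustration.
aggregate, per_gpu = throughput(685e6, 32, 8)
print(f"{aggregate:,.0f} texts/s aggregate, {per_gpu:,.0f} texts/s per GPU")
```

With these inputs the result is on the order of 357,000 texts per second overall, or about 44,600 per GPU. Whether that window includes the first-run TensorRT compilation is not stated in the summary.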

June 05, 2026 04:15 Source: Hacker News