LLM rerankers for production RAG: tips and tricks

4 mathcircler 1 9/15/2025, 10:02:23 AM fin.ai

Comments (1)

alexpivnenko · 45m ago
Surprised that removing spaces actually had such a big effect on latency.

Also, props for including the prompt and the A/B results.