Show HN: VerbatimRAG – RAG that returns only exact text from documents
Instead of letting an LLM generate responses based on retrieved context, VerbatimRAG extracts and returns exact text spans from source documents. Every word in the output exists verbatim in your documents.
Technical approach:
- Fine-tuned ModernBERT on the RAGBench dataset for span classification (relevant/not relevant)
- Documents chunked with Docling/Chonkie and indexed with SPLADE for sparse retrieval
- Query-time: retrieve → classify spans → compose response from exact quotes using dynamic templates (rough sketch after this list)
- Each span includes a citation back to its source document
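For a concrete picture, here's roughly what the query-time flow looks like in Python. This is a hand-wavy sketch, not our actual API: the chunk format and helper names are made up, and it assumes the HF checkpoint loads as a standard sequence-classification head over (question, sentence) pairs. Check the repo for the real code.

```python
# Illustrative sketch only; the chunk format, label mapping, and the
# assumption that the checkpoint is a sequence-classification model
# are all simplifications, not the verbatim-rag API.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_ID = "KRLabsOrg/verbatim-rag-modern-bert-v1"
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)
model.eval()

def is_relevant(question: str, span: str) -> bool:
    """Classify one (question, span) pair with the fine-tuned extractor."""
    inputs = tokenizer(question, span, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return logits.argmax(dim=-1).item() == 1  # assumes index 1 == "relevant"

def answer(question: str, chunks: list[dict]) -> str:
    """Compose a response from exact quotes only, each cited to its source."""
    lines = []
    for chunk in chunks:  # chunks come from the sparse retriever
        for sent in chunk["sentences"]:
            if is_relevant(question, sent):
                # every word here exists verbatim in the source document
                lines.append(f'"{sent}" [{chunk["source"]}]')
    return "\n".join(lines)
```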
Trade-offs:
- Responses can be choppy since they're composed of exact quotes
- No summarization or synthesis across documents
- Works poorly for conversational/creative tasks
One interesting property: you can run the entire pipeline without any LLM, just embeddings plus our ModernBERT extractor. With SPLADE embeddings, it runs entirely on CPU.
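To illustrate the LLM-free path, here's a rough SPLADE encoder sketch. The checkpoint name is just one public SPLADE model used for illustration (not necessarily what we ship), and real usage would build an inverted index over the sparse vectors rather than scoring pairs like this toy does.

```python
# CPU-only sketch of SPLADE sparse encoding plus dot-product scoring.
# The checkpoint is an assumption; see the repo for the actual encoder.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

SPLADE_ID = "naver/splade-cocondenser-ensembledistil"  # one public SPLADE model
tok = AutoTokenizer.from_pretrained(SPLADE_ID)
mlm = AutoModelForMaskedLM.from_pretrained(SPLADE_ID)
mlm.eval()

def splade_vector(text: str) -> torch.Tensor:
    """Encode text as a vocab-sized sparse term-weight vector (no GPU needed)."""
    inputs = tok(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = mlm(**inputs).logits                   # (1, seq_len, vocab)
    # SPLADE pooling: max over tokens of log(1 + relu(logit)), padding masked out
    weights = torch.log1p(torch.relu(logits)) * inputs["attention_mask"].unsqueeze(-1)
    return weights.max(dim=1).values.squeeze(0)         # (vocab,)

def score(query: str, doc: str) -> float:
    """Sparse dot product; production code would use an inverted index."""
    return float(splade_vector(query) @ splade_vector(doc))
```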
- Code: https://github.com/KRLabsOrg/verbatim-rag (MIT)
- Paper: https://aclanthology.org/2025.bionlp-share.8.pdf
- HuggingFace model: https://huggingface.co/KRLabsOrg/verbatim-rag-modern-bert-v1
We see this approach fitting best in applications where accuracy matters more than fluency (e.g. compliance-heavy domains).
Curious if others have tried similar "constrained generation" approaches.