Author here. Our paper “Calibrating LLM Confidence by Probing Perturbed
Representation Stability” was accepted to EMNLP 2025 Main Conference (top 15%) with a final rating of 9 (strong accept).
High-level summary:
We probe LLM hidden states with slight perturbations to check whether the answer remains stable: stability implies confidence; instability implies uncertainty. This lightweight method cuts calibration error by more than 50% (down to ~4.5%) across LLaMA, Mistral, and Qwen models on MMLU and MMLU-Pro, with no LLM fine-tuning.
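To make the core intuition concrete, here is a minimal, hypothetical sketch (not our released pipeline): perturb a hidden state slightly, re-read the predicted answer, and treat the fraction of unchanged answers as a confidence signal. The toy `answer_head`, perturbation scale, and sample count below are illustrative stand-ins, not values from the paper.

```python
import torch

def stability_confidence(hidden_state, answer_head, n_perturb=32, sigma=0.05):
    """Estimate confidence as the fraction of small perturbations of the
    final hidden state that leave the predicted answer unchanged.

    hidden_state: (d,) tensor -- last-layer representation at the answer position
    answer_head:  maps (d,) -> logits over answer options (toy stand-in for the
                  model's LM head restricted to the option tokens)
    """
    base_pred = answer_head(hidden_state).argmax().item()
    stable = 0
    for _ in range(n_perturb):
        noise = sigma * torch.randn_like(hidden_state)            # small Gaussian perturbation
        pred = answer_head(hidden_state + noise).argmax().item()  # re-read the answer
        stable += int(pred == base_pred)
    return stable / n_perturb  # high fraction -> stable -> confident


if __name__ == "__main__":
    torch.manual_seed(0)
    d, n_options = 4096, 4                # hypothetical hidden size / MCQ options
    head = torch.nn.Linear(d, n_options)  # toy answer head for illustration
    h = torch.randn(d)
    print(f"stability-based confidence: {stability_confidence(h, head):.2f}")
```

In the paper, the actual probe trained on perturbed-representation features is more involved than this raw stability fraction; see the repo below for the real implementation.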
Results, code, and dataset are available at:
- Code: https://github.com/ledengary/CCPS
- Data: https://huggingface.co/datasets/ledengary/CCPS
Happy to discuss technical details or calibration deployment strategies.