CCPS: Calibrating LLM Confidence via Perturbation Stability – EMNLP 2025
2 erfan_mhi 1 8/28/2025, 6:23:58 PM arxiv.org ↗
High-level summary: We apply slight perturbations to an LLM's hidden states and check whether its answer stays stable: a stable answer signals confidence, an unstable one signals uncertainty. This lightweight method cuts calibration error by more than 50% (down to ~4.5%) across LLaMA, Mistral, and Qwen on MMLU and MMLU-Pro, with no LLM fine-tuning.
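For readers who want a feel for the idea, here is a rough toy sketch in plain Python. It is not the CCPS implementation (see the linked repo for that). It assumes a made-up linear answer head (`weights`), random Gaussian noise as the perturbation, and the fraction of unchanged answers as the stability signal. All names and numbers are hypothetical. The second function is the standard binned expected calibration error, included only to show what kind of quantity "calibration error" refers to.

```python
import random


def predict(hidden, weights):
    """Toy answer head: return the index of the highest-scoring option."""
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in weights]
    return max(range(len(scores)), key=scores.__getitem__)


def stability_score(hidden, weights, eps=0.1, trials=200, seed=0):
    """Fraction of random perturbations that leave the predicted answer unchanged.

    Assumption: Gaussian noise with standard deviation `eps` stands in for
    whatever perturbation scheme a real method would use.
    """
    rng = random.Random(seed)
    base = predict(hidden, weights)
    unchanged = 0
    for _ in range(trials):
        noisy = [h + rng.gauss(0.0, eps) for h in hidden]
        if predict(noisy, weights) == base:
            unchanged += 1
    return unchanged / trials


def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: per-bin |accuracy - mean confidence|, weighted by bin size."""
    bins = [[] for _ in range(n_bins)]
    for c, y in zip(confidences, correct):
        idx = min(int(c * n_bins), n_bins - 1)  # confidence 1.0 goes in the last bin
        bins[idx].append((c, y))
    total = len(confidences)
    ece = 0.0
    for b in bins:
        if b:
            avg_conf = sum(c for c, _ in b) / len(b)
            acc = sum(y for _, y in b) / len(b)
            ece += len(b) / total * abs(acc - avg_conf)
    return ece


# Hypothetical 4-option head over a 3-dimensional hidden state.
weights = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.5, 0.5, 0.0]]
clear_state = [3.0, 0.1, 0.1]         # one option wins by a wide margin
borderline_state = [1.0, 0.99, 0.1]   # two options are nearly tied

if __name__ == "__main__":
    print("clear:", stability_score(clear_state, weights))
    print("borderline:", stability_score(borderline_state, weights))
```

In this toy setup the wide-margin state keeps its answer under noise while the near-tie state flips often, which is the intuition behind treating stability as a confidence signal.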
Results, code, and dataset are available at:
- Code: https://github.com/ledengary/CCPS
- Data: https://huggingface.co/datasets/ledengary/CCPS
Happy to discuss technical details or calibration deployment strategies.