Show HN: Phare: A Safety Probe for Large Language Models
dberenstein1957 · 5/21/2025, 10:08:05 AM · arxiv.org
We've just published a benchmark and an accompanying arXiv paper that challenge conventional leaderboard-driven LLM evaluation.
Phare focuses on factual reliability, prompt sensitivity, multilingual support, and how models handle false premises: issues that actually matter when you're building serious applications.
Some insights:
- Preference scores ≠ factual correctness.
- Framing effects can cause models to miss obvious falsehoods (see the probe sketch after this list).
- Safety metrics like sycophancy and stereotype reproduction show surprising results across popular models.
Would love feedback from the community.