New Angular OpenAPI Client gen (looking for testers) (ng-openapi.dev)

1 points by tjami 28s ago 0 comments

Ask HN: Does No Response Mean a Bad Idea?

1 points by samehsbs 53s ago 0 comments

Jim Lovell Has Died (en.wikipedia.org)

1 points by ColinWright 2m ago 0 comments

ChatGPT Will Apologize for Anything (aiweirdness.com)

2 points by xnx 2m ago 0 comments

Apollo 13 Commander Jim Lovell has passed away (nasa.gov)

2 points by LorenDB 3m ago 0 comments

Show HN: HackMaster Pi – A $30 Flipper Zero Alternative Built with Raspberry Pi (github.com)

1 points by 1ping 4m ago 0 comments

How to Teach Your Kids to Play Poker: Start with One Card (bloomberg.com)

1 points by ioblomov 4m ago 1 comments

ChatGPT-5 Can't Do Basic Math

5 points by MarcellusDrum 8m ago 0 comments

Security alerts in Gmail. What a mess

2 points by chrisjj 9m ago 0 comments

GPT-5 AMA (reddit.com)

2 points by IdealeZahlen 10m ago 0 comments

Johns Hopkins is building its AI wargaming tools for DoD (breakingdefense.com)

1 points by geox 11m ago 0 comments

Fears of population collapse in the US are based on faulty assumptions (theconversation.com)

1 points by PaulHoule 11m ago 0 comments

GPT-5 Rollout Updates (twitter.com)

2 points by tosh 13m ago 0 comments

Cordoomceps – replacing an Amiga's brain with Doom (mjg59.dreamwidth.org)

1 points by LorenDB 13m ago 0 comments

Millions are flocking to grow virtual gardens in Roblox game created by teenager (apnews.com)

1 points by petethomas 16m ago 1 comments

The Illustrated TLS 1.2 Connection (tls12.xargs.org)

1 points by dmazin 17m ago 0 comments

The surprising economics of the meat industry – Lewis Bollard (dwarkesh.com)

2 points by paulpauper 17m ago 0 comments

Job growth has slowed sharply; the question is why (stayathomemacro.substack.com)

13 points by paulpauper 17m ago 4 comments

Campaigning for Extinction:Eradication of Sparrows and the Great Famine in China (nber.org)

1 points by paulpauper 18m ago 0 comments

GRETA to Open a New Eye on the Nucleus (newscenter.lbl.gov)

1 points by gnabgib 18m ago 0 comments

HTTP Is Not Simple (daniel.haxx.se)

4 points by thunderbong 20m ago 1 comments

Looking for Testers for an AI Privacy Platform (scanonai.carrd.co)

1 points by lotuslabs 21m ago 1 comments

Three Tiers of Responses to Fact (medium.com)

2 points by wsgeorge 24m ago 0 comments

Toxic convenience: what science tells us about plastic's hidden costs (rfi.fr)

2 points by everybodyknows 25m ago 0 comments

ChatGPT users hate GPT-5's overworked secretary energy, miss their GPT-4o buddy (arstechnica.com)

5 points by rntn 26m ago 0 comments

Welcome to DIY Rich Guy Fantasy Camp (theglobeandmail.com)

2 points by throw0101a 29m ago 1 comments

FIN - Fish Extensible Text Editor Written in Fish (codeberg.org)

2 points by ashitlerferad 29m ago 0 comments

json2dir: a JSON-to-directory converter, a fast alternative to home-manager (github.com)

5 points by alurm 29m ago 0 comments

M5 MacBook Pro No Longer Coming in 2025 (macrumors.com)

7 points by behnamoh 32m ago 0 comments

(Evil)Doggie: An open-source CAN bus research and penetration testing tool (blackhat.com)

1 points by wslh 34m ago 0 comments

LVFS Sustainability Plan (blogs.gnome.org)

3 points by Bogdanp 34m ago 0 comments

Query-Mutating Data Race in Go (coder.com)

3 points by kylecarbs 37m ago 0 comments

How Samsung Missed the AI Moment [video] (youtube.com)

1 points by mgh2 38m ago 0 comments

uses this (usesthis.com)

3 points by bookofjoe 38m ago 0 comments

The Mother of All Currency Crises Is on the Horizon (foreignpolicy.com)

2 points by voxleone 38m ago 0 comments

Mary Shields, First Woman to Finish the Iditarod, Dies at 80 (wsj.com)

1 points by impish9208 39m ago 2 comments

Omron took AppleHealth data without consent then silently updated privacy policy (substack.com)

1 points by aranypucek 40m ago 1 comments

Show HN: The calendar that schedules everything for you (rhythm.to)

1 points by georgeslz 42m ago 0 comments

Show HN: MCP Document Indexer – Local AI search for your documents using Ollama (github.com)

1 points by yairwein 45m ago 0 comments

Frequent Nightmares Predict Early Death More Strongly Than Smoking or Obesity (science.slashdot.org)

5 points by pizza 46m ago 3 comments

Fastest 5x5 Piston Door / Showcase [video] (youtube.com)

1 points by campital 47m ago 0 comments

Banning VPNs to protect kids? Good luck with that (theregister.com)

2 points by dp-hackernews 48m ago 0 comments

Modest solar boost could cut US CO2 by 8.5M tons (thenewlede.org)

1 points by PaulHoule 49m ago 0 comments

Nobelium becomes heaviest element with identified compounds (chemistryworld.com)

3 points by bookofjoe 49m ago 0 comments

Where do meetups go when they die? (now.beehiiv.com)

2 points by davekiss 50m ago 0 comments

MemSync - persistent memory for AI across apps (memsync.ai)

1 points by advaitjayant 51m ago 1 comments

Smartwatches offer little insight into stress levels, researchers find (theguardian.com)

2 points by giuliomagnifico 52m ago 0 comments

Show HN: Regolith – Regex library for TypeScript made to prevent ReDoS attacks (github.com)

2 points by roggenbuck 53m ago 0 comments

Garlic-Hub Digital Signage Enters Release Candidate Stage (github.com)

1 points by sagiadinos 54m ago 1 comments

Consent and Compromise (research.eye.security)

1 points by colonCapitalDee 54m ago 0 comments

Benchmarking GPT-5

9 aravindputrevu 1 8/8/2025, 4:27:36 PM coderabbit.ai ↗

Comments (1)

aravindputrevu · 2h ago

We put GPT-5 through our Golden PR Dataset.

Here is the TL;DR

- GPT-5 outperformed Opus-4, Sonnet-4, and OpenAI’s O3 across a battery of 300 varying difficulty, error-diverse pull requests.

- GPT-5 scored highest on our comprehensive test and found 254 out of 300 bugs or 85% where other models found between 200 and 207 – 16% to 22% less.

- On our 25 hardest PRs from our evaluation dataset, GPT-5 achieved the highest ever overall pass rate (77.3%), representing a 190% improvement over Sonnet-4, 132% over Opus-4, and 76% over O3.