GPT-5 vs. Sonnet: Complex Agentic Coding

37 intellectronica 11 8/8/2025, 3:38:48 PM elite-ai-assisted-coding.dev ↗

Comments (11)

macawfish · 6m ago
Claude is just so well rounded and considerate. A lot of this probably comes down to prompt and context engineering, though surely there's something magical about Anthropic's principled training methodologies. They invented constitutional AI and I can only imagine that behind the scenes they're doing really cool stuff. Can't wait to see Claude 5!
arcticfox · 16m ago
> Note that Claude 4 Sonnet isn’t the strongest model from Anthropic’s Claude series. Claude Opus is their most capable model for coding, but it seemed inappropriate to compare it with GPT-5 because it costs 10 times as much.

Well - I would have been interested in GPT-5 vs. Opus. Claude Code Max is affordable with Opus.

swader999 · 13m ago
You're absolutely right!
stitched2gethr · 6m ago
This take rings true for me after admittedly only a couple of hours of use of gpt-5. I had an issue I had been working with Claude on but it was difficult to give it real-time feedback so it floundered. gpt-5 struggled in the same areas but after about $2 of tokens it did fix the issue. It was far from a 1 shot like I might have expected from the hype, but it did get the job in about an hour done where Claude could not in 3.

For reference my Claude usage was mostly Sonnet, but with consulting from Opus.

0xfaded · 2m ago
Would you be comfortable sharing a brief description of what the issue was?
indigodaddy · 4m ago
What does the 1x and .33x mean on the list of models in copilot? (Never used but thinking about trying on the free tier)
carterparks · 12m ago
I'm getting an SSL error in Chrome: ERR_SSL_PROTOCOL_ERROR
OJFord · 1m ago
I get 'unable to connect' in Firefox Android for this and many little blogs on HN lately, idk what's going on. Cloudflare blocking me (but not for all sites)? Geo-restriction (UK)?
bn-l · 3m ago
Github copilot is utter garbage. The diffing crawls along at a snail’s pace. I think it’s coming up on two years and this must criticised aspect of it still isn’t fixed—-even with all the reverse engineering of how cursor did it. I wish I could find an alternative to cursor (which has other issues). Honestly, that company just threw away a golden opportunity as the first mover.
sourcecodeplz · 42s ago
Why did they throw it away? Because of the new opaque pricing?
SV_BubbleTime · 12m ago
> but when I'd point out the missing implementation, it would give its usual "you're absolutely right" and try to fix it.

I really trying to not be annoyed by Claude’s “You’re absolutely right” because I know I cannot control it but this is an increasingly difficult task.