This should have been compared with Opus... I know OP says he didn't because of cost but if you're comparing who is better then you need to compare the best to the best... if Claude Opus 4.1 is significantly better than GPT 5 then that could offset the extra expense. Not saying it will... but forget cost if we want to compare solely the quality
arcticfox · 33m ago
> Note that Claude 4 Sonnet isn’t the strongest model from Anthropic’s Claude series. Claude Opus is their most capable model for coding, but it seemed inappropriate to compare it with GPT-5 because it costs 10 times as much.
Well - I would have been interested in GPT-5 vs. Opus. Claude Code Max is affordable with Opus.
swader999 · 29m ago
You're absolutely right!
macawfish · 22m ago
Claude is just so well rounded and considerate. A lot of this probably comes down to prompt and context engineering, though surely there's something magical about Anthropic's principled training methodologies. They invented constitutional AI and I can only imagine that behind the scenes they're doing really cool stuff. Can't wait to see Claude 5!
anotheryou · 1m ago
I think we need to stop testing models raw.
Claude is trained for claude code and that's how it's used in the field too.
stitched2gethr · 22m ago
This take rings true for me after admittedly only a couple of hours of use of gpt-5. I had an issue I had been working with Claude on but it was difficult to give it real-time feedback so it floundered. gpt-5 struggled in the same areas but after about $2 of tokens it did fix the issue. It was far from a 1 shot like I might have expected from the hype, but it did get the job done in about an hour where Claude could not in 3.
For reference my Claude usage was mostly Sonnet, but with consulting from Opus.
0xfaded · 18m ago
Would you be comfortable sharing a brief description of what the issue was?
carterparks · 28m ago
I'm getting an SSL error in Chrome: ERR_SSL_PROTOCOL_ERROR
OJFord · 17m ago
I get 'unable to connect' in Firefox Android for this and many little blogs on HN lately, idk what's going on. Cloudflare blocking me (but not for all sites)? Geo-restriction (UK)?
indigodaddy · 20m ago
What does the 1x and .33x mean on the list of models in copilot? (Never used but thinking about trying on the free tier)
commandar · 8m ago
They're multipliers against your quota of requests. GPT-4.1 is "free" with a copilot sub, and then the premium models would burn credits against a multiplier. So higher multipliers count more against your monthly quota.
GPT-5, Sonnet 4, and Gemini 2.5 Pro are all 1x. Opus is 10x, for comparison.
Also worth keeping in mind that Copilot has reduced context windows even for the premium models, which has a very real impact on agentic performance.
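To make the multiplier arithmetic concrete, here is a rough sketch in Python. The multipliers are the ones quoted above, with GPT-4.1 treated as 0x because it is described as "free" with a subscription. The model keys, the helper names, and the quota of 300 requests are hypothetical, chosen only for illustration and not taken from Copilot's documentation.

```python
# Multipliers as quoted in the comment above. The dictionary keys are
# illustrative labels, not official model identifiers.
MULTIPLIERS = {
    "gpt-4.1": 0,          # described as "free" with a subscription
    "gpt-5": 1,
    "sonnet-4": 1,
    "gemini-2.5-pro": 1,
    "opus": 10,
}


def quota_used(requests):
    """Return premium-request credits consumed by a {model: count} mapping."""
    return sum(MULTIPLIERS[model] * count for model, count in requests.items())


def requests_affordable(model, quota):
    """Return how many requests to `model` fit in `quota`, or None if unmetered."""
    multiplier = MULTIPLIERS[model]
    if multiplier == 0:
        return None  # a 0x model never draws down the quota
    return quota // multiplier


if __name__ == "__main__":
    # 20 GPT-5 requests, 3 Opus requests, and 50 GPT-4.1 requests.
    print(quota_used({"gpt-5": 20, "opus": 3, "gpt-4.1": 50}))
    # Hypothetical monthly quota of 300 spent entirely on one model.
    print(requests_affordable("opus", 300))
    print(requests_affordable("gpt-5", 300))
```

Under these assumptions, a single Opus request costs as much quota as ten GPT-5 requests, so the same allowance stretches a tenth as far.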
SV_BubbleTime · 28m ago
> but when I'd point out the missing implementation, it would give its usual "you're absolutely right" and try to fix it.
I’m really trying not to be annoyed by Claude’s “You’re absolutely right” because I know I cannot control it, but this is an increasingly difficult task.
jpalawaga · 10m ago
I think it's because "you're right!" somehow presupposes it knew the answer and was just testing you.
an intern never says that. they say "oh, I see."
bn-l · 20m ago
GitHub Copilot is utter garbage. The diffing crawls along at a snail’s pace. I think it’s coming up on two years and this most criticised aspect of it still isn’t fixed, even with all the reverse engineering of how Cursor did it. I wish I could find an alternative to Cursor (which has other issues). Honestly, that company just threw away a golden opportunity as the first mover.
sourcecodeplz · 17m ago
Why did they throw it away? Because of the new opaque pricing?
https://docs.github.com/en/copilot/reference/ai-models/suppo...