Mercury, the first commercial-scale diffusion language model (inceptionlabs.ai)

106 points by HyprMusic 1h ago 37 comments

Espressif's ESP32-C5 Is Now in Mass Production (espressif.com)

40 points by radeeyate 1h ago 20 comments

Linux Kernel Exploitation: Attack of the Vsock (hoefler.dev)

116 points by todsacerdoti 4h ago 30 comments

Zhaoxin's KX-7000 (chipsandcheese.com)

64 points by ryandotsmith 3h ago 10 comments

Reversible computing with mechanical links and pivots (tennysontbardwell.com)

97 points by tennysont 6h ago 53 comments

NotebookLM Audio Overviews are now available in over 50 languages (blog.google)

209 points by saikatsg 6h ago 65 comments

Xiaomi MiMo Reasoning Model (github.com)

389 points by thm 14h ago 149 comments

Google Play sees 47% decline in apps since start of last year (techcrunch.com)

231 points by GeekyBear 4h ago 141 comments

Future of OSU Open Source Lab in Jeopardy (osuosl.org)

113 points by aendruk 4h ago 34 comments

I created Perfect Wiki and reached $250k in annual revenue without investors (habr.com)

541 points by sochix 15h ago 307 comments

Show HN: Create your own finetuned AI model using Google Sheets (promptrepo.com)

68 points by QueensGambit 7h ago 30 comments

DeepSeek-Prover-V2 (github.com)

278 points by meetpateltech 7h ago 52 comments

Someone at YouTube needs glasses (jayd.ml)

936 points by jaydenmilne 8h ago 533 comments

Show HN: ART – a new open-source RL framework for training agents (github.com)

64 points by kcorbitt 8h ago 8 comments

Sycophancy in GPT-4o (openai.com)

473 points by dsr12 20h ago 388 comments

The best – but not good – way to limit string length (adam-p.ca)

14 points by adam-p 3h ago 9 comments

Archil (YC F24) Is Hiring a Distributed Systems Engineer (In-Person, SF)

1 points by huntaub 6h ago 0 comments

NYC home prices rise 10% in early 2025 (qns.com)

76 points by geox 3h ago 55 comments

You Wouldn't Download a Hacker News (jasonthorsness.com)

355 points by jasonthorsness 22h ago 187 comments

Jepsen: Amazon RDS for PostgreSQL 17.4 (jepsen.io)

563 points by aphyr 1d ago 137 comments

Home washing machines fail to remove important pathogens from textiles (medicalxpress.com)

23 points by bookmtn 1h ago 26 comments

RFK Jr. Rejects Germ Theory (arstechnica.com)

6 points by Avshalom 18m ago 0 comments

The True Size Of (thetruesize.com)

208 points by thunderbong 4d ago 91 comments

The Leaderboard Illusion (arxiv.org)

142 points by pongogogo 15h ago 45 comments

Joining Sun Microsystems – 40 years ago (2022) (akapugs.blog)

159 points by TMWNN 8h ago 129 comments

Retailers will soon have only about 7 weeks of full inventories left (fortune.com)

402 points by andrewfromx 12h ago 644 comments

Show HN: Kexa.io – Open-Source IT Security and Compliance Verification

61 points by patrick4urcloud 10h ago 13 comments

What Is "Induced Atmospheric Vibration"? (physics.stackexchange.com)

130 points by belter 1d ago 68 comments

New atomic fountain clock joins group that keeps the world on time (nist.gov)

87 points by austinallegro 1d ago 31 comments

OCaml's Wings for Machine Learning (github.com)

98 points by musha68k 11h ago 55 comments

Port of Los Angeles says shipping volume will plummet 35% next week (cnbc.com)

635 points by perihelions 10h ago 548 comments

JetBrains defends removal of negative reviews for unpopular AI Assistant (devclass.com)

126 points by przemub 3h ago 66 comments

What It Takes to Defend a Cybersecurity Company from Today's Adversaries (sentinelone.com)

161 points by gnabgib 20h ago 73 comments

Linux in Excel (github.com)

168 points by radeeyate 20h ago 62 comments

I Found Malware in a BeamNG Mod (lemonyte.com)

7 points by davikr 4h ago 1 comments

My sourdough starter has twins (brainbaking.com)

191 points by Tomte 1d ago 76 comments

The missteps that led to a fatal plane crash at Reagan National Airport (nytimes.com)

127 points by keepamovin 2d ago 155 comments

Researchers are studying how to minimize human impact on public lands (undark.org)

61 points by droptext 1d ago 9 comments

The Mira Pro Color is Boox's first color E Ink monitor (theverge.com)

70 points by tortilla 6h ago 33 comments

Show HN: Beatsync – perfect audio sync across multiple devices (github.com)

396 points by freemanjiang 1d ago 117 comments

"AI-first" is the new Return To Office (anildash.com)

287 points by LorenDB 10h ago 175 comments

Bamba: An open-source LLM that crosses a transformer with an SSM (research.ibm.com)

197 points by shallow-mind 1d ago 66 comments

Doom GPU Flame Graphs (brendangregg.com)

15 points by LorenDB 8h ago 0 comments

Finland Bans Smartphones in Schools (yle.fi)

711 points by freetonik 14h ago 440 comments

The Group Chat from Hell Has Been Exposed (thenation.com)

25 points by johnshades 1h ago 4 comments

An illustrated guide to automatic sparse differentiation (iclr-blogposts.github.io)

128 points by mariuz 1d ago 22 comments

Everything we announced at our first LlamaCon (ai.meta.com)

203 points by meetpateltech 1d ago 107 comments

It's School time: Adventures in hacking an old Kindle (samkhawase.com)

159 points by FlyingSnake 2d ago 46 comments

I use zip bombs to protect my server (idiallo.com)

953 points by foxfired 2d ago 417 comments

WorldGen: Open-source 3D scene generator for Game/VR/XR (worldgen.github.io)

124 points by ziyangxie 2d ago 19 comments

Ask HN: OpenAI models vs. Gemini 2.5 Pro for coding and swe

4 endorphine 5 4/23/2025, 3:35:15 PM

In your experience, which of the two models (all of OpenAI vs Gemini 2.5 Pro) are better for having as assistants to ask SWE/software systems related questions and doing long and complex reasoning?

I'm debating whether there's any point in paying for ChatGPT vs. paying (or even using the free version) of Gemini 2.5 Pro.

I have the feeling that most HNers prefer the latter, however in livebench I think OpenAI surpasses Gemini for coding.

Comments (5)

JeduDev · 6d ago

I've been using Gemini 2.5 Pro, Claude 3.7 Sonnet, and GPT-4.1 recently and here are my thoughts.

Regarding context windows, Gemini currently offers 1M tokens (reportedly increasing to 2M soon), GPT-4.1 also handles a large window of 1m tokens, and Claude provides 200k. In my experience testing them with large code files (around 3-4k lines), I found Gemini 2.5 Pro and Claude 3.7 Sonnet performed quite similarly, both handling the large context well and providing good solutions.

However, my impression was that GPT-4.1 didn't perform quite as well, While GPT-4.1 is certainly capable, I feel Gemini has a slight edge in this area right now. Based on this, I'd lean towards using Gemini 2.5 Pro for extremely large contexts needing high-quality results, GPT-4.1 for backend logic, and found Claude 3.7 particularly effective for UI interface tasks.

TheKelsbee · 7d ago

I'm not sure its easy to say one is better than the other. I've used ChatGPT pro, it's good. I've also use Gemini, and it's also good. Claude is surprisingly good as well. And I've recently been using Q-cli, which was extremely easy to get integrated into my Neovim/Tmux workflow.

Purely from a code quality perspective, they're all about the same, and they all generate code that rarely works for the first time. At least from my experience, and highly depending on language. For instance, Q-cli with Rust seems to generate better output for me than Gemini with Rust. And ChatGPT with JS gives me way better code than Claude with JS.

I honestly think that currently in the market, it's not really a choice of which is better, but which is the right tool for workflow and language.

bn-l · 7d ago

It’s tricky. o3 is better (usually) but much much lazier IME. You probably have to pay for pro.

codingwagie · 5d ago

O3 is far ahead of the competition.

ginger_beer_m · 5d ago

Tiny context window is killing it.