Ask HN: How does GPT-OSS compare to other open-source models?

2 el_hacker 1 8/10/2025, 3:21:41 PM

How does it compare to other open-source LLMs such as DeepSeek, Qwen, and Gemma, especially in terms of reasoning & coding ability?

If you’ve tested it, did anything surprise you (good or bad)? Is it worth switching from an existing OSS model?

Looking for real-world impressions, not just benchmarks.

Comments (1)

roscas · 51m ago

I've only compared it with qwen3-coder, and gpt-oss is very bad.

My first comparison was a 500-line Python program. Five minutes later, gpt-oss:20b had still produced no output, so I canceled it.

I gave the same program to qwen3-coder, and in about 20 to 30 seconds it produced a summary of what the program does. Excellent!

Other examples were also very bad. I haven't removed gpt-oss yet so that I can run a few more tests, but I will remove it soon.

qwen3-coder:30b is the best model I have tested so far. Almost every prompt gets output in a second or a little more.

Sometimes I put the same prompt into ChatGPT and Perplexity, and almost every time I get what I need from qwen3.

Since it is really fast and produces quality output, it has become pretty much my go-to for help.

