DeepSWE: Training an Open-Sourced Coding Agent by Scaling RL

Comments (1)

sijuntan · 11h ago

We introduce *`DeepSWE-Preview`*, a reasoning-enabled coding agent trained from `Qwen3-32B` with only reinforcement learning (RL). It achieves an impressive 59.0*%* on SWE-Bench-Verified with test-time scaling, reaching SOTA for open-weight coding agents (*42.2%* Pass@1, *71.0%* Pass@16).

DeepSWE is trained using [*rLLM*](https://www.notion.so/rLLM-A-Framework-for-Post-Training-Lan...), our framework for post-training language agents. We’ve *open sourced* everything—our dataset, code, training, and eval logs, for everyone to progress on scaling and improving agents with RL.

Show HN: I'm an airline pilot – I built interactive graphs/globes of my flights (jameshard.ing)

The Fed says this is a cube of $1M. They're off by half a million (calvin.sh)

IDF officers ordered to fire at unarmed crowds near Gaza food distribution sites (haaretz.com)

More on Apple's Trust-Eroding 'F1 the Movie' Wallet Ad (daringfireball.net)

The new skill in AI is not prompting, it's context engineering (philschmid.de)

JavaScript Trademark Update (deno.com)

MCP: An (Accidentally) Universal Plugin System (worksonmymachine.substack.com)

Introducing tmux-rs (richardscollin.github.io)

Writing Code Was Never the Bottleneck (ordep.dev)

Engineered Addictions (masonyarbrough.substack.com)

I made my VM think it has a CPU fan (wbenny.github.io)

Xfinity using WiFi signals in your house to detect motion (xfinity.com)

Proton joins suit against Apple for practices that harm developers and consumers (proton.me)

I built something that changed my friend group's social fabric (blog.danpetrolito.xyz)

I deleted my second brain (joanwestenberg.com)

Exploiting the IKKO Activebuds “AI powered” earbuds (2024) (blog.mgdproductions.com)

Websites hosting major US climate reports taken down (apnews.com)

Cloudflare to introduce pay-per-crawl for AI bots (blog.cloudflare.com)

My open source project was relicensed by a YC company [license updated] (twitter.com)

Facebook is asking to use Meta AI on photos you haven’t yet shared (theverge.com)

Figma files for proposed IPO (figma.com)

Private sector lost 33k jobs, badly missing expectations of 100k increase (cnbc.com)

Don’t use “click here” as link text (2001) (w3.org)

There are no new ideas in AI, only new datasets (blog.jxmo.io)

ICEBlock, an app for anonymously reporting ICE sightings, goes viral (techcrunch.com)

YouTube No Translation (addons.mozilla.org)

Ask HN: What Are You Working On? (June 2025)

Gridfinity: The modular, open-source grid storage system (gridfinity.xyz)

Cloudflare Introduces Default Blocking of A.I. Data Scrapers (nytimes.com)

US Supreme Court limits federal judges' power to block Trump orders (theguardian.com)

Show HN: Spegel, a Terminal Browser That Uses LLMs to Rewrite Webpages (simedw.com)

Many ransomware strains will abort if they detect a Russian keyboard installed (2021) (krebsonsecurity.com)

Show HN: CSS generator for a high-def glass effect (glass3d.dev)

I write type-safe generic data structures in C (danielchasehooper.com)

I'm dialing back my LLM usage (zed.dev)

Fakespot shuts down today after 9 years of detecting fake product reviews (blog.truestar.pro)

Introducing Gemma 3n (developers.googleblog.com)

Melbourne man discovers extensive model train network underneath house (sbs.com.au)

Alternative Layout System (alternativelayoutsystem.com)

XSLT – Native, zero-config build system for the Web (github.com)

Claude Code now supports hooks (docs.anthropic.com)

ICEBlock climbs to the top of the App Store charts after officials slam it (engadget.com)

The Rise of Whatever (eev.ee)

OpenFLOW – Quickly make beautiful infrastructure diagrams local to your machine (github.com)

US economy shrank 0.5% in the first quarter, worse than earlier estimates (apnews.com)

Major reversal in ocean circulation detected in the Southern Ocean (icm.csic.es)

Show HN: Octelium – FOSS Alternative to Teleport, Cloudflare, Tailscale, Ngrok (github.com)

Gene therapy restored hearing in deaf patients (news.ki.se)

JWST reveals its first direct image discovery of an exoplanet (smithsonianmag.com)

Sam Altman Slams Meta’s AI Talent Poaching: 'Missionaries Will Beat Mercenaries' (wired.com)

DeepSWE: Training an Open-Sourced Coding Agent by Scaling RL

Comments (1)