R-Zero: Self-Evolving Reasoning LLM from Zero Data
35 lawrenceyan 4 9/10/2025, 2:02:17 AM arxiv.org ↗
Comments (4)
jasonjmcghee · 4h ago
Conceptually, it's effectively a GAN
thom · 1h ago
For values of zero quite far above zero.
falcor84 · 1h ago
What am I missing? From my skimming, there's zero external data beyond what is needed for the Challenger to generate questions.
cyberge99 · 5h ago
What could go wrong?