R-Zero: Self-Evolving Reasoning LLM from Zero Data

Comments (1)

vineethy · 2d ago

Interesting twist on automated curriculum learning. This paper uses an LLM as both the environment and the policy, whereas other papers use LLMs for the policy or value function. Would be cool to see other reward strategies that tie all these threads together.
