I've been experimenting with GRPO lately. I'm fascinated by models learning from prompts and rewards alone - no example answers needed, unlike in Supervised Fine-Tuning.
After the DeepSeek boom, everyone is trying GRPO with GSM8K or the Countdown Game, but I wanted a different challenge.
So I opted for teaching a model to create a schedule from a list of events and priorities.
Choosing an original problem forced me to think about the problem setting, generate data, choose the base model, design reward functions,
and run multiple rounds of training, hoping that my model would learn something.
A fun and rewarding experience. :-)
I learned a lot along the way, and I want to share it with you.
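To give a flavor of the reward design part: here is a minimal sketch of one possible reward for this kind of scheduling task. The event format, weights, and function name are my own simplifications for illustration - not the actual reward functions used in the project (see the blog post for those).

    # Hypothetical sketch: score a proposed schedule of (start, end, priority)
    # tuples - penalize overlapping events, reward total scheduled priority.
    def schedule_reward(events):
        events = sorted(events, key=lambda e: e[0])  # sort by start time
        overlaps = sum(
            1 for (_, end1, _), (start2, _, _) in zip(events, events[1:])
            if start2 < end1  # next event starts before the previous one ends
        )
        total_priority = sum(p for _, _, p in events)
        return total_priority - 5.0 * overlaps

    # Example: the last two events overlap, so the reward is penalized.
    print(schedule_reward([(9, 10, 2), (10, 11, 3), (10.5, 12, 1)]))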
Blog post: https://huggingface.co/blog/anakin87/qwen-scheduler-grpo
Code: https://github.com/anakin87/qwen-scheduler-grpo
Hugging Face collection (dataset and model): https://huggingface.co/collections/anakin87/qwen-scheduler-g...