Show HN: Raman-01 – A Pocket Physics Solver LLM

3 Sai_Praneeth 0 5/5/2025, 6:13:12 AM huggingface.co ↗

I built a tiny physics solver LLM that performs surprisingly well on easy-to-medium physics problems. Most LLMs today still struggle with physics QA (as PhyBench recently highlighted), so I wanted to see how far I could push a small model with careful data and minimal compute.

Model: Qwen3-1.7B

Supervised fine-tuning: ~1,500 curated examples spanning kinematics, EM, acoustics, and more

RL fine-tuning: GRPO, 1-shot RLVR style (single example, 70 steps)

Total cost: ~$5 on an H100

I started with a cold-start SFT run (~3 epochs, loss down to 0.3), then ran RL with an accuracy reward, which climbed from 0.1 to 0.8.
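
For readers unfamiliar with the setup, here is a minimal sketch of what a verifiable accuracy reward and GRPO-style group normalization can look like. It is not the training code used for Raman-01: the `Answer: <number>` output format, the 1% tolerance, and the names `accuracy_reward` and `group_advantages` are all assumptions made for illustration.

```python
import re
import statistics

# Assumption: the model ends its solution with "Answer: <number> <unit>".
FINAL_ANSWER = re.compile(r"Answer:\s*([-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)")


def extract_final_number(completion):
    """Return the number after the last "Answer:" marker, or None."""
    matches = FINAL_ANSWER.findall(completion)
    return float(matches[-1]) if matches else None


def accuracy_reward(completion, target, rel_tol=0.01):
    """Binary reward: 1.0 if the final number is within rel_tol of target."""
    value = extract_final_number(completion)
    if value is None:
        return 0.0
    return 1.0 if abs(value - target) <= rel_tol * abs(target) else 0.0


def group_advantages(rewards, eps=1e-6):
    """Normalize each reward against the mean and std of its own group.

    GRPO samples several completions per prompt and uses this
    group-relative score in place of a learned value baseline.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]
```

With a group of sampled completions for one prompt, `[accuracy_reward(c, target) for c in completions]` gives the rewards and `group_advantages` turns them into the per-sample signal. If every completion in a group gets the same reward, all advantages are zero, so that group contributes no learning signal.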

Goal: Create a lightweight physics solver that’s small enough to deploy anywhere. Think of it as a "pocket tutor" for foundational physics.

I’m still working on evaluations. Most benchmarks focus on very hard problems, while I want something that checks basic correctness, reasoning, and unit sense on easy-to-medium problems. If anyone has suggestions, I’d love to hear them.
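
One possible starting point for the "unit sense" part is a grader that scores the unit and the value separately. The sketch below is only an illustration under stated assumptions: answers are a number followed by a single unit string, the `UNITS` table is a tiny hand-written stand-in for a real units library, and `parse_quantity` and `grade` are hypothetical names. It does not attempt to score reasoning.

```python
import re

# Hypothetical lookup: unit -> (scale to SI, (length, time, mass) exponents).
UNITS = {
    "m": (1.0, (1, 0, 0)),
    "km": (1000.0, (1, 0, 0)),
    "s": (1.0, (0, 1, 0)),
    "min": (60.0, (0, 1, 0)),
    "kg": (1.0, (0, 0, 1)),
    "m/s": (1.0, (1, -1, 0)),
    "km/h": (1000.0 / 3600.0, (1, -1, 0)),
    "N": (1.0, (1, -2, 1)),
    "J": (1.0, (2, -2, 1)),
}

QUANTITY = re.compile(r"^\s*([-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s*(\S+)\s*$")


def parse_quantity(text):
    """Parse "<number> <unit>" into (value in SI, dimensions), or None."""
    match = QUANTITY.match(text)
    if match is None or match.group(2) not in UNITS:
        return None
    scale, dims = UNITS[match.group(2)]
    return float(match.group(1)) * scale, dims


def grade(predicted, reference, rel_tol=0.02):
    """Score parsing, dimensional agreement, and numeric agreement separately."""
    ref = parse_quantity(reference)
    if ref is None:
        raise ValueError(f"unparseable reference answer: {reference!r}")
    pred = parse_quantity(predicted)
    if pred is None:
        return {"parsed": False, "unit_ok": False, "value_ok": False}
    unit_ok = pred[1] == ref[1]
    # Values are only compared once both are in SI and dimensions agree.
    value_ok = unit_ok and abs(pred[0] - ref[0]) <= rel_tol * abs(ref[0])
    return {"parsed": True, "unit_ok": unit_ok, "value_ok": value_ok}
```

Reporting the three flags separately makes it possible to tell a model that gets the number right with the wrong dimensions (for example `grade("20 m", "20 m/s")`) from one that is simply off numerically, while an equivalent answer in other units such as `"72 km/h"` still counts as correct.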
