Pretraining an LLM on a budget of less than $50 that outperforms Google BERT

6 points | mrxhacker99 | 9/1/2025, 8:02:32 PM | medium.com ↗

Comments (1)

spindump8930 · 3h ago
The title makes it sound nice, but the reported results are worse than random baselines on several benchmarks, including the ones used to claim superiority over BERT. At a glance, HellaSwag, BoolQ, and WinoGrande are all at or below random guessing. At best this is a fun model with broken evaluation. At worst it's Medium spam for clout farming - which won't work on anyone who can read the tables.