SOTA Model in 8B Size?

Comments (2)

ConteMascetti71 · 20h ago

I think it's not possible to have the same knowledge capabilities of greater models...but.... reasoning?

ConteMascetti71 · 20h ago

..we distilled the chain-of-thought from DeepSeek-R1-0528 to post-train Qwen3 8B Base, obtaining DeepSeek-R1-0528-Qwen3-8B. This model achieves state-of-the-art (SOTA) performance among open-source models on the AIME 2024, surpassing Qwen3 8B by +10.0% and matching the performance of Qwen3-235B-thinking

To boost nuclear power, controversial rewrite of radiation safety rules (science.org)

Show HN: An automation tool built upon MCPs (bagpiper.dev)

Escaping Enshittification: We're All Digital Migrants (bugwhisperer.dev)

Show HN: I made an ensemble model to find underpriced properties (propertydealfinder.com)

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents (arxiv.org)

RISC-V assembly board game (punkx.org)

Modern C++ – RAII (green7ea.github.io)

A tribute to Mario Kart 8 (ravi64.com)

Radio Astronomy Software Defined Radio (Rasdr) (radio-astronomy.org)

The U.S. Can't Afford to Lose the Biotech Race with China (time.com)

The Complete Stripe Alternatives List (stripealternatives.com)

Western Blotting Must Die (randombio.com)

Split Keyboards Are Superior (aftermath.site)

Brain drugs can now cross the once impenetrable blood–brain barrier (nature.com)

An Internet controlled by no-one, owned by us all (autonomi.com)

Space Pins: Spatial annotation layer for OSes (prabros.com)

Tokenization for language modeling: BPE vs. Unigram Language Modeling (2020) (ndingwall.github.io)

Strengthening Kotlin: A Strategic Partnership with Spring (blog.jetbrains.com)

MCP Server help MCP palyer deploying HTML and obtaining an accessible public URL (github.com)

Tip: Put your Rails app on a SQL Query diet (andyatkinson.com)

Kinesis MWave Mac Mechanical Keyboard: A Short Review (davidgomes.com)

Electric Go Kart (2023) (badar.tech)

Why do we get earworms? (theneuroscienceofeverydaylife.substack.com)

Ask HN: Apps exposing users through share links

The UK wants you to sign up for £1B cyber defense force (theregister.com)

FLOSS/Fund: First tranche of funding to 9 global FOSS projects (floss.fund)

Tell HN: Namecheap pre-purchasing searched domain names?

The case for using a web browser as your terminal (blog.pomdtr.me)

Memvid – Video-Based AI Memory (github.com)

PHP Pipe operator v3 Accepted (wiki.php.net)

Ping A real-life social app to meet people nearby (WIP) (figma.com)

Show HN: Overlay Images (overlayimages.app)

Anduril and Meta Partner for US Army VR Headsets (techcrunch.com)

Show HN: I redirected 10K+ URLs in 5 minutes (redirectifyapp.com)

Ask HN: Do we need AGI to filter spam?

Grammarly secures $1B from General Catalyst to build AI productivity platform (reuters.com)

First and Best Offers (brilliantorg.notion.site)

Writing is everywhere: write more (catalinpit.substack.com)

US suspends engine sales to Chinese planemaker COMAC (reuters.com)

JavaScript Errors (haydenbleasel.com)

Free Online Media Converter Tool (rendley.com)

Worldlines: Visualizing Special Relativity (2010) (worldlines.sourceforge.net)

Show HN: Fine-tune your image through intuitive dialogue (flux1kontext.org)

Apple Executives Won't Be Appearing at This Year's WWDC "The Talk Show Live" (macrumors.com)

Show HN: Edit photos with simple text prompts (fluxkontext.im)

Quaternion formulation claimed to resolve Navier Stokes Millennium Problem (arxiv.org)

RFK Jr's 'Maha' report found to contain citations to nonexistent studies (theguardian.com)

Robert Jarvik, who designed the first permanent artificial heart, dies (sltrib.com)

Neuroscience needs to empower early-career researchers, not fund moon shots (thetransmitter.org)

Passkey can detect auth cloning via signCount, but big tech do not support it (uzyn.com)

SOTA Model in 8B Size?

Comments (2)