The Big LLM Architecture Comparison

Comments (4)

strangescript · 1m ago

The diagrams in this article are amazing if you are somewhere in between a novice and expert. Seeing all of the new models laid out next to each other is fantastic.

Chloebaker · 35m ago

Honestly its crazy to think how far we’ve come since GPT-2 (2019), today comparing LLMs to determine their performance is notoriously challenging and it feels like every 2 weeks a models beats a new benchmark. I’m really glad DeepSeek was mentioned here, bc the key architectural techniques it introduced in V3 that improved its computational efficiency and distinguish it from many other LLMs was really transformational when it came out.

bravesoul2 · 3h ago

This is a nice catchup for some who hasn't been keeping up like me

dmezzetti · 1h ago

While all these architectures are innovative and have helped improve either accuracy or speed, the same fundamental problem of generating factual information still exists.

Retrieval Augmented Generation (RAG), Agents and other similar methods help mitigate this. It will be interesting to see if future architectures eventually replace these techniques.

The bewildering phenomenon of declining quality (english.elpais.com)

Async I/O on Linux in databases (blog.canoozie.net)

I'm betting against AI agents in 2025, despite building them (utkarshkanwat.com)

The Big LLM Architecture Comparison (magazine.sebastianraschka.com)

Hungary's oldest library is fighting to save books from a beetle infestation (npr.org)

Make Your Own Backup System – Part 1: Strategy Before Scripts (it-notes.dragas.net)

Show HN: MCP server for Blender that builds 3D scenes via natural language (blender-mcp-psi.vercel.app)

Show HN: ggc – A terminal-based Git CLI written in Go (github.com)

Death by AI (davebarry.substack.com)

Nobody knows how to build with AI yet (worksonmymachine.substack.com)

Local LLMs versus offline Wikipedia (evanhahn.com)

I tried vibe coding in BASIC and it didn't go well (goto10retro.com)

Beyond Meat fights for survival (foodinstitute.com)

How to run an Arduino for years on a battery (2021) (makecademy.com)

Borg – Deduplicating archiver with compression and encryption (borgbackup.org)

How the 'Minecraft' Score Became Big Business for Its Composer (billboard.com)

Will the Fear of Being Confused for AI Mean That We Will Now Write Differently? (3quarksdaily.com)

Mushroom learns to crawl after being given robot body (2024) (the-independent.com)

Roman Roads Research Association (UK) (romanroads.org)

Robot metabolism: Toward machines that can grow by consuming other machines (science.org)

Matterport walkthrough of the original Microsoft Building 3 (my.matterport.com)

Open-Source BCI Platform with Mobile SDK for Rapid Neurotech Prototyping (preprints.org)

What were the earliest laws like? (worldhistory.substack.com)

Ring introducing new feature to allow police to live-stream access to cameras (eff.org)

Rethinking CLI interfaces for AI (notcheckmark.com)

The curious case of the Unix workstation layout (thejpster.org.uk)

“Bypassing” specialization in Rust (oakchris1955.eu)

AI is killing the web. Can anything save it? (economist.com)

Piano Keys (mathpages.com)

How we tracked down a Go 1.24 memory regression (datadoghq.com)

A Treatise for One Network – Anonymous National Deliberation [pdf] (simurgh-beau.github.io)

The borrowchecker is what I like the least about Rust (viralinstruction.com)

Pimping My Casio: Part Deux (blog.jgc.org)

The AGI Final Frontier: The CLJ-AGI Benchmark (raspasov.posthaven.com)

Airbnb allowed rampant price gouging following L.A. fires, city attorney alleges (latimes.com)

I Used Arch, BTW: macOS, Day 1 (yberreby.com)

OpenAI claims gold-medal performance at IMO 2025 (twitter.com)

Zig Interface Revisited (williamw520.github.io)

The future of ultra-fast passenger travel (spaceambition.substack.com)

Erythritol linked to brain cell damage and stroke risk (sciencedaily.com)

“I noticed a clear violation of our contributing guidelines” (github.com)

Fstrings.wtf (fstrings.wtf)

TSMC to start building four new plants with 1.4nm technology (taipeitimes.com)

New York’s bill banning One-Person Train Operation (etany.org)

What the Fuck Python (colab.research.google.com)

Trigon: Exploiting coprocessors for fun and for profit (part 2) (alfiecg.uk)

First electronic-photonic quantum chip manufactured in commercial foundry (news.northwestern.edu)

Show HN: Am-I-vibing, detect agentic coding environments (github.com)

Valve confirms credit card companies pressured it to delist certain adult games (pcgamer.com)

Piramidal (YC W24) is hiring a full stack engineer (ycombinator.com)

The Big LLM Architecture Comparison

Comments (4)