Show HN: Explainer/docs for GGUF quantization (unofficial)

1 irt24 0 7/16/2025, 4:02:07 PM github.com ↗

"GGUF quantization" is the most popular tech stack for quantizing Llama-like models for CPU. But the documentation is very sparse, and the maintainers have made it clear that writing a paper is not their priority. So I spent about a week reading through the code to understand the various concepts (K-quants, I-quants, importance matrix, etc.) and put together this (unofficial) repo with explainers.

It was mostly written by hand, without the usual AI slop. My main use of AI was interrogating Claude Code about the llama.cpp codebase to help me understand it.

It's possible that I made mistakes or missed things here and there. If you have in-depth knowledge, I'd love your contributions!

