Show HN: An educational Local Qwen3 LLM Inference project written in Rust

Comments (1)

eiskalt · 2h ago

Hey all! I've just released qwen3-rs, a Rust project for running and exporting Qwen3 models (Qwen3-0.6B, 4B, 8B, DeepSeek-R1-0528-Qwen3-8B, etc.) with minimal dependencies and no Python required.

- Educational: Core algorithms are reimplemented from scratch for learning and transparency.
- CLI tools: Export HuggingFace Qwen3 models to a custom binary format, then run inference on CPU.
- Modular: Clean separation between export, inference, and CLI.
- Safety: Some unsafe code is used, mostly for memory-mapping files (which helps lower memory requirements during export and inference).
- Future plans: I'm curious to see how to extend it to support:
  * fine-tuning of small models
  * optimized inference performance (e.g. matmul operations)
  * a WASM build to run inference in a browser
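
For anyone wondering what "matmul operations" means in practice: in CPU inference loops of this style, most of the time typically goes into a matrix-vector product between a weight matrix and the current activation vector. Below is a minimal sketch of the naive version, the baseline an optimization pass would start from. The function name `matvec` and the flat row-major weight layout are assumptions for illustration, not code taken from qwen3-rs.

```rust
/// Naive matrix-vector product: out[i] = sum over j of w[i * n_in + j] * x[j].
///
/// `w` is assumed to be a row-major (n_out x n_in) weight matrix stored as a
/// flat slice, where n_out = out.len() and n_in = x.len().
fn matvec(out: &mut [f32], x: &[f32], w: &[f32]) {
    let n_in = x.len();
    assert_eq!(w.len(), out.len() * n_in, "weight shape mismatch");

    // An empty input vector means every dot product is an empty sum.
    if n_in == 0 {
        out.fill(0.0);
        return;
    }

    // Each chunk of n_in weights is one row; dot it with x.
    for (o, row) in out.iter_mut().zip(w.chunks_exact(n_in)) {
        *o = row.iter().zip(x).map(|(a, b)| a * b).sum();
    }
}
```

Calling it with a 2x3 weight matrix `[1, 2, 3, 4, 5, 6]` and input `[1, 0, -1]` fills a two-element `out` with one dot product per row. Typical next steps from here would be splitting rows across threads or using SIMD, which this sketch does not attempt.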

Basically, I used https://github.com/adriancable/qwen3.c as a reference implementation and translated it from C/Python to Rust with the help of commercial LLMs (mostly Claude Sonnet 4). Please note that my primary goal is self-learning in this field, so there are likely some inaccuracies.

GitHub: https://github.com/reinterpretcat/qwen3-rs

