Andrej Karpathy's YC AI SUS talk on the future of the industry (donnamagi.com)

121 points by pudiklubi 3h ago 57 comments

The Unreasonable Effectiveness of Fuzzing for Porting Programs (rjp.io)

92 points by Bogdanp 3h ago 11 comments

Show HN: Workout.cool – Open-source fitness coaching platform (github.com)

419 points by surgomat 7h ago 141 comments

Game Hacking – Valve Anti-Cheat (VAC) (codeneverdies.github.io)

57 points by LorenDB 2h ago 37 comments

Writing documentation for AI: best practices (docs.kapa.ai)

67 points by mooreds 3h ago 18 comments

Show HN: I built a tensor library from scratch in C++/CUDA (github.com)

62 points by nirw4nna 4h ago 5 comments

Homomorphically Encrypting CRDTs (jakelazaroff.com)

154 points by jakelazaroff 7h ago 49 comments

"poline" is an enigmatic color palette generator using polar coordinates (meodai.github.io)

135 points by zdw 3d ago 33 comments

Yes I Will Read Ulysses Yes (theatlantic.com)

29 points by petethomas 2h ago 23 comments

Terpstra Keyboard (terpstrakeyboard.com)

178 points by xeonmc 9h ago 62 comments

Introduction to the A* Algorithm (redblobgames.com)

192 points by auraham 1d ago 70 comments

Attimet (YC F24) – Quant Trading Research Lab – Is Hiring Founding Engineer (ycombinator.com)

1 points by kbanothu 3h ago 0 comments

My iPhone 8 Refuses to Die: Now It's a Solar-Powered Vision OCR Server (terminalbytes.com)

41 points by hemant6488 4h ago 13 comments

MiniMax-M1 open-weight, large-scale hybrid-attention reasoning model (github.com)

286 points by danboarder 13h ago 64 comments

Framework Laptop 12 review (arstechnica.com)

141 points by moelf 4h ago 173 comments

Is There a Half-Life for the Success Rates of AI Agents? (tobyord.com)

147 points by EvgeniyZh 9h ago 85 comments

Scrappy - make little apps for you and your friends (pontus.granstrom.me)

383 points by 8organicbits 14h ago 125 comments

Revisiting Minsky's Society of Mind in 2025 (suthakamal.substack.com)

35 points by suthakamal 4h ago 9 comments

Locally hosting an internet-connected server (mjg59.dreamwidth.org)

116 points by pabs3 15h ago 118 comments

Spatializing 6k years of global urbanization from 3700 BC to AD 2000 (nature.com)

17 points by talonx 3d ago 1 comments

Building agents using streaming SQL queries (morling.dev)

74 points by rmoff 4h ago 7 comments

I counted all of the yurts in Mongolia using machine learning (monroeclinton.com)

188 points by furkansahin 12h ago 67 comments

Should we design for iffy internet? (bytes.zone)

42 points by surprisetalk 2d ago 18 comments

After millions of years, why are carnivorous plants still so small? (smithsonianmag.com)

175 points by gmays 5d ago 71 comments

Real-time action chunking with large models (pi.website)

52 points by pr337h4m 1d ago 7 comments

A different take on S-expressions (gist.github.com)

26 points by tearflake 3d ago 16 comments

The Grug Brained Developer (2022) (grugbrain.dev)

978 points by smartmic 23h ago 476 comments

Spherical CNNs (2018) (arxiv.org)

7 points by rkp8000 2d ago 1 comments

Reasoning by Superposition: A Perspective on Chain of Continuous Thought (arxiv.org)

41 points by danielmorozoff 7h ago 1 comments

Show HN: Free local security checks for AI coding in VSCode, Cursor and Windsurf

20 points by jaimefjorge 7h ago 10 comments

Show HN: Trieve CLI – Terminal-based LLM agent loop with search tool for PDFs (github.com)

16 points by skeptrune 6h ago 7 comments

Show HN: Lstr – A modern, interactive tree command written in Rust (github.com)

199 points by w108bmg 17h ago 57 comments

Show HN: Delve, an open source (AGPL) enterprise-grade data analytics platform (github.com)

9 points by ilovetux 3h ago 5 comments

Building Effective AI Agents (anthropic.com)

493 points by Anon84 1d ago 84 comments

Honda conducts successful launch and landing of experimental reusable rocket (global.honda)

1230 points by LorenDB 1d ago 387 comments

Think of a Number (xenaproject.wordpress.com)

29 points by IdealeZahlen 3d ago 4 comments

What Google Translate can tell us about vibecoding (ingrids.space)

266 points by todsacerdoti 1d ago 155 comments

The Bethesda Declaration (science.org)

75 points by perihelions 8h ago 21 comments

Now might be the best time to learn software development (substack.com)

322 points by nathanfig 1d ago 315 comments

3D-printed device splits white noise into an acoustic rainbow without power (phys.org)

219 points by rbanffy 3d ago 59 comments

OpenSERDES – Open Hardware Serializer/Deserializer (SerDes) in Verilog (2020) (github.com)

75 points by peter_d_sherman 16h ago 9 comments

The Brute Squad (sourcegraph.com)

7 points by tosh 3h ago 1 comments

Steam Beta enables Proton on Linux making Linux gaming simpler (gamingonlinux.com)

19 points by haunter 3h ago 2 comments

Munich from a Hamburger's Perspective (mertbulan.com)

64 points by toomuchtodo 2d ago 38 comments

A Straightforward Explanation of the Good Regulator Theorem (lesswrong.com)

40 points by surprisetalk 4d ago 4 comments

Resurrecting a dead torrent tracker and finding 3M peers (kianbradley.com)

615 points by k-ian 1d ago 195 comments

Making 2.5 Flash and 2.5 Pro GA, and introducing Gemini 2.5 Flash-Lite (blog.google)

359 points by meetpateltech 1d ago 209 comments

Why JPEGs still rule the web (2024) (spectrum.ieee.org)

209 points by purpleko 1d ago 378 comments

LLMs pose an interesting problem for DSL designers (kirancodes.me)

204 points by gopiandcode 1d ago 137 comments

Proofs Without Words (artofproblemsolving.com)

94 points by squircle 4d ago 23 comments

Show HN: I built a tensor library from scratch in C++/CUDA

62 nirw4nna 5 6/18/2025, 3:20:05 PM github.com ↗

Hi HN,

Over the past few months, I've been building `dsc`, a tensor library from scratch in C++/CUDA. My main focus has been on getting the basics right, prioritizing a clean API, simplicity, and clear observability for running small LLMs locally.

The key features are: - C++ core with CUDA support written from scratch. - A familiar, PyTorch-like Python API. - Runs real models: it's complete enough to load a model like Qwen from HuggingFace and run inference on both CUDA and CPU with a single line change[1]. - Simple, built-in observability for both Python and C++.

Next on the roadmap is adding BF16 support and then I'll be working on visualization for GPU workloads.

The project is still early and I would be incredibly grateful for any feedback, code reviews, or questions from the HN community!

GitHub Repo: https://github.com/nirw4nna/dsc

[1]: https://github.com/nirw4nna/dsc/blob/main/examples/models/qw...

Comments (5)

aklein · 1h ago

I noticed you interface with the native code via ctypes. I think cffi is generally preferred (eg, https://cffi.readthedocs.io/en/stable/overview.html#api-mode...). Although you'd have more flexibility if you build your own python extension module (eg using pybind), which will free you from a simple/strict ABI. Curious if this strict separation of C & Python was a deliberate design choice.

rrhjm53270 · 38m ago

Do you have any plan for the serialization and deserialization of your tensor and nn library?

kajecounterhack · 2h ago

Cool stuff! Is the goal of this project personal learning, inference performance, or something else?

Would be nice to see how inference speed stacks up against say llama.cpp

liuliu · 1h ago

Both uses cublas under the hood. So I think it is similar for prefilling (of course, this framework is too early and don't have FP16 / BF16 support for GEMM it seems). Hand-roll gemv is faster for token generation hence llama.cpp is better.

helltone · 3h ago

This is very cool. I'm wondering if some of the templates and switch statements would be nicer if there was an intermediate representation and a compiler-like architecture.

I'm also curious about how this compares to something like Jax.

Also curious about how this compares to zml.