Show HN: I built a tensor library from scratch in C++/CUDA

73 points by nirw4nna | 7 comments | 6/18/2025, 3:20:05 PM | github.com

Hi HN,

Over the past few months, I've been building `dsc`, a tensor library from scratch in C++/CUDA. My main focus has been on getting the basics right, prioritizing a clean API, simplicity, and clear observability for running small LLMs locally.

The key features are:

- C++ core with CUDA support written from scratch.
- A familiar, PyTorch-like Python API.
- Runs real models: it's complete enough to load a model like Qwen from HuggingFace and run inference on both CUDA and CPU with a single line change[1].
- Simple, built-in observability for both Python and C++.

Next on the roadmap is adding BF16 support and then I'll be working on visualization for GPU workloads.

The project is still early and I would be incredibly grateful for any feedback, code reviews, or questions from the HN community!

GitHub Repo: https://github.com/nirw4nna/dsc

[1]: https://github.com/nirw4nna/dsc/blob/main/examples/models/qw...

Comments (7)

aklein · 2h ago

I noticed you interface with the native code via ctypes. I think cffi is generally preferred (e.g., https://cffi.readthedocs.io/en/stable/overview.html#api-mode...). You'd have even more flexibility if you built your own Python extension module (e.g., using pybind), which would free you from a simple/strict ABI. Was this strict separation of C and Python a deliberate design choice?
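For readers unfamiliar with what binding through ctypes involves, here is a minimal sketch. It calls `cos` from the system C math library as a stand-in; it does not use any of dsc's actual functions, and the wrapper name `c_cos` is made up for illustration. The point is that with ctypes every argument and return type has to be declared by hand against a plain C ABI.

```python
import ctypes
import ctypes.util

# Locate the C math library. find_library may return None on some systems;
# on POSIX, CDLL(None) then falls back to symbols already loaded in the process.
libm = ctypes.CDLL(ctypes.util.find_library("m"))

# ctypes knows nothing about the C signature, so it must be declared manually.
# Without these two lines, the double would be passed and returned as an int.
libm.cos.argtypes = [ctypes.c_double]
libm.cos.restype = ctypes.c_double


def c_cos(x: float) -> float:
    """Hypothetical thin wrapper: call the native cos() through the C ABI."""
    return libm.cos(x)


if __name__ == "__main__":
    print(c_cos(0.0))
```

cffi's API mode and extension modules built with pybind take those signatures from C/C++ declarations rather than from hand-written `argtypes`/`restype` assignments like the ones above.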

kajecounterhack · 3h ago

Cool stuff! Is the goal of this project personal learning, inference performance, or something else?

Would be nice to see how inference speed stacks up against say llama.cpp

liuliu · 3h ago

Both use cuBLAS under the hood, so I think performance is similar for prefill (of course, this framework is still early and doesn't seem to have FP16/BF16 support for GEMM yet). A hand-rolled GEMV is faster for token generation, hence llama.cpp is better there.
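A rough way to see why prefill and token generation behave so differently is to compare their arithmetic intensity (FLOPs per byte moved). The sketch below is a back-of-the-envelope model, not a benchmark: the layer size (4096 x 4096), the prompt length (512) and FP32 storage are assumptions chosen for illustration, and caches are ignored.

```python
def arithmetic_intensity(tokens: int, k: int, n: int, bytes_per_elem: int = 4) -> float:
    """FLOPs per byte for multiplying a [tokens, k] activation by a [k, n] weight.

    Assumes each operand is read once and the output is written once.
    """
    flops = 2 * tokens * k * n  # one multiply and one add per term
    bytes_moved = (tokens * k + k * n + tokens * n) * bytes_per_elem
    return flops / bytes_moved


K = N = 4096  # assumed weight matrix shape, for illustration only

prefill = arithmetic_intensity(512, K, N)  # many prompt tokens at once: GEMM
decode = arithmetic_intensity(1, K, N)     # one new token per step: GEMV

if __name__ == "__main__":
    print(f"prefill (GEMM): {prefill:.1f} FLOPs/byte")
    print(f"decode  (GEMV): {decode:.2f} FLOPs/byte")
```

With one token per step, the whole weight matrix is read to do very little arithmetic, so generation tends to be limited by memory bandwidth rather than compute. That is the regime where a kernel specialized for matrix-vector products can matter.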

rrhjm53270 · 1h ago

Do you have any plans for serialization and deserialization support in your tensor and nn library?

helltone · 5h ago

This is very cool. I'm wondering if some of the templates and switch statements would be nicer if there was an intermediate representation and a compiler-like architecture.

I'm also curious about how this compares to something like Jax.

Also curious about how this compares to zml.

amtk2 · 1h ago

Super n00b question: what kind of laptop do you need for a project like this? Is a Mac OK, or do you need a dedicated Linux laptop?

kadushka · 50m ago

Any laptop with an Nvidia card