The 96GB (HBM2e) SKU is the PPU from T-Head Semiconductor (basically a subsidiary of Alibaba). Its spec is very similar to the H20. Other chips they were using include the Huawei Ascend 910B (64GB) and possibly other domestically designed chips.
boulos · 32d ago
I was surprised not to see a Kunlun P800 there.
rahen · 33d ago
I'm pretty surprised by the claimed memory usage for 300B parameters (table 1).
If we compare similar models:
- Llama 3.1 with 405B parameters: 2 TB of memory (FP32), 500 GB (FP8)
- DeepSeek R1 with 671B parameters: 1.3 TB (scaling linearly, around 600 GB for 300B parameters)
Ling claims no more than 96 GB of memory, most likely for inference. That's far more than a 20% reduction. Am I missing something?
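For reference, here's a rough weight-only back-of-the-envelope check (a sketch; it ignores KV cache, activations, and optimizer state):

```python
# Weight-only memory estimate: params * bytes per parameter.
def weight_memory_gb(params_billion, bits_per_param):
    return params_billion * 1e9 * bits_per_param / 8 / 1e9

for label, bits in [("FP32", 32), ("BF16", 16), ("FP8", 8), ("INT4", 4)]:
    print(f"300B @ {label}: {weight_memory_gb(300, bits):.0f} GB")

# 300B @ FP32: 1200 GB
# 300B @ BF16:  600 GB
# 300B @ FP8:   300 GB
# 300B @ INT4:  150 GB
```

Even at 4-bit, the weights of a 300B-parameter model alone don't fit in 96 GB, which is why the single-device reading seems off to me.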
cavisne · 33d ago
I think they only claim that their "Ling-Lite" 17B model fits on a single 96GB GPU; their 300B model needs 8 of them (768GB of HBM).
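A rough sanity check of why eight 96GB devices are enough (a sketch, assuming BF16 weights sharded evenly across devices; the numbers are my own arithmetic, not from the paper):

```python
# Per-device weight footprint for a 300B-parameter model sharded across 8 GPUs.
params = 300e9
bytes_per_param = 2          # BF16
num_devices = 8

total_gb = params * bytes_per_param / 1e9        # ~600 GB of weights
per_device_gb = total_gb / num_devices           # ~75 GB per device

print(f"total {total_gb:.0f} GB, per device {per_device_gb:.0f} GB of 96 GB")
```

That leaves roughly 20 GB per device for KV cache and activations.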
fxtentacle · 33d ago
Some of these models still produce great results at something as low as 2.7 bits per weight.
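For scale, a quick sketch of what that implies for a 300B-parameter model (my own arithmetic, not a figure from the paper):

```python
# Weight footprint of a 300B-parameter model at 2.7 bits per weight.
params = 300e9
bits_per_weight = 2.7

gb = params * bits_per_weight / 8 / 1e9
print(f"{gb:.0f} GB")   # ~101 GB, still just over a single 96 GB card
```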
vednig · 31d ago
They've shared some interesting optimization techniques for bigger LLMs, that's all; it's not about low-powered devices in the power-consumption sense. Still a good read.
osti · 33d ago
I think this is the one where they train LLMs without NVIDIA GPUs.
cavisne · 33d ago
They talk about CUDA-level tracing in their framework. I assume it's just consumer GPUs that Nvidia says aren't meant to be used in datacenters.
Table 1 is the closest thing. Device specs for six devices: 120-989 TFLOPS and 64-96 GB RAM.
An RTX 5090 is about 105 TFLOPS (FP32).
https://www.techpowerup.com/gpu-specs/geforce-rtx-5090.c4216