Agents Built from Alloys (xbow.com)

1 points by azhenley 39s ago 0 comments

US EPA cutting workforce by 23%, closing research division (reuters.com)

2 points by pseudolus 4m ago 0 comments

I'm Rebelling Against the Algorithm (varunraghu.com)

1 points by Varun08 5m ago 0 comments

My worst tech purchase became my best DIY desk lamp (medium.com)

1 points by philjw 7m ago 1 comments

Show HN: Vizr – Ask questions about your marketing data, get real answers (vizr.app)

1 points by arifliftos 11m ago 0 comments

With One Call, Trump Alters the Fate of a Contested Power Project (nytimes.com)

1 points by zekrioca 13m ago 0 comments

Is Translation the Killer App? (substack.com)

1 points by mathattack 14m ago 0 comments

California wood pellet plants canceled amid market decline and public pushback (news.mongabay.com)

2 points by PaulHoule 15m ago 0 comments

Show HN: RunAgent: Model Context Protocol (MCP) and Vercel but for AI Agents (github.com)

2 points by adewba 16m ago 0 comments

Two-photon 3D printing of functional microstructures inside living cells [pdf] (arxiv.org)

1 points by thunderbong 17m ago 0 comments

'Utopian' city California Forever announces tech manufacturing park (techcrunch.com)

1 points by geox 19m ago 0 comments

Marathon fusion claims to invent alchemy, making 5000 kgs gold per gigawatt (marathonfusion.com)

3 points by apugoneappu 20m ago 1 comments

I Became the First Linux User in India (medium.com)

2 points by GuinansEyebrows 23m ago 0 comments

How to write Rust in the Linux kernel: part 3 (lwn.net)

3 points by chmaynard 24m ago 0 comments

Two Simple Rules to Fix Code Reviews (serce.me)

1 points by ghuntley 25m ago 0 comments

Show HN: Tech docs → video explainers in seconds (symvol.io)

2 points by feliks22 26m ago 0 comments

Pitfalls of Customer Feedback That Create Bad Products (jasonevanish.com)

2 points by jevanish 26m ago 0 comments

"So what if ChatGPT wrote it?" (sciencedirect.com)

1 points by bookofjoe 28m ago 0 comments

Ask HN: Will AI Usage Make Frameworks Last Longer?

1 points by CM30 29m ago 0 comments

Show HN: Numbl – A daily number puzzle inspired by Wordle and Sudoku (henryjburg.github.io)

1 points by henryjburg 30m ago 0 comments

Anthropic Is Expanding Their Compute (trust.anthropic.com)

1 points by JLO64 30m ago 1 comments

Amazon EKS ultra scale clusters (aws.amazon.com)

1 points by sbmthakur 31m ago 0 comments

A Short Story of the Google Error Page (meiert.com)

1 points by varun_ch 31m ago 0 comments

CCO of private investment firm SMH caught cheating on Series 24 exam [pdf] (sec.gov)

1 points by amendegree 33m ago 0 comments

Show HN: AI File Sorter: Organize Files and Folders with AI (Local LLMs) (github.com)

2 points by hyperfield 35m ago 0 comments

Who Hates YouTube?

1 points by thoth001 35m ago 2 comments

Stealth Macintosh Portable case mod (biosrhythm.com)

1 points by classichasclass 35m ago 0 comments

Fcrand (Go language): drop-in replacement for crypto/rand, up to 10x faster (github.com)

1 points by sdrapkin 36m ago 2 comments

Test Code Like Zelda: When to Implement Automated Testing (usetusk.ai)

2 points by Marceltan 52m ago 0 comments

Target to end price-matching policy amid business challenges (time.com)

2 points by hhs 53m ago 0 comments

How do you compute the midpoint of an interval? (2014) [pdf] (hal.science)

1 points by todsacerdoti 58m ago 0 comments

Show HN: Benchstreet – the stock prediction AI benchmark (github.com)

4 points by ColonelParrot 59m ago 0 comments

Dead Zone Dragging (steveruiz.me)

1 points by kierangill 1h ago 0 comments

Show HN: ts-explicit-errors – A TypeScript library for treating errors as values (github.com)

1 points by genshii 1h ago 0 comments

Psilocybin therapy for mood dysfunction in Parkinson's disease: open-label trial (nature.com)

1 points by nick__m 1h ago 4 comments

Easy Agents: Build autonomous agents with just natural language (github.com)

2 points by kpolls 1h ago 0 comments

Teufel Mynd open source / open hardware Bluetooth speaker (lu.teufelaudio.com)

3 points by Eduard 1h ago 0 comments

Virtual Humans for Hire (holostaff.ai)

3 points by dergalem 1h ago 0 comments

Slow Adoption Applies to Evil AI, Too (secondthoughts.ai)

3 points by gk1 1h ago 0 comments

Shape-shifting particles allow temperature control over fluid flow and stiffness (phys.org)

2 points by PaulHoule 1h ago 0 comments

Building Your Personal Assistant with Multi-Modal Memory (mirix.io)

3 points by wangyu164 1h ago 2 comments

Standardization of Office Open XML (en.wikipedia.org)

3 points by fsflover 1h ago 0 comments

Apple sues leaker Jon Prosser for allegedly stealing iOS26 info from an employee (engadget.com)

2 points by apparent 1h ago 0 comments

Canadian Cross (en.wikipedia.org)

7 points by tripdout 1h ago 0 comments

Arch Linux pulls AUR packages that installed Chaos RAT malware (bleepingcomputer.com)

4 points by mikece 1h ago 1 comments

Silence Is a Commons by Ivan Illich (1983) (davidtinapple.com)

31 points by entaloneralie 1h ago 4 comments

Detroit pitches Silicon Valley-types: Bring your next factory here (subscribe.detroitnews.com)

2 points by rmason 1h ago 1 comments

A Rare Object Found Deep in the Kuiper Belt – Universe Today (universetoday.com)

2 points by rbanffy 1h ago 0 comments

Dennis Gustafsson – Parallelizing the physics solver [video] (youtube.com)

3 points by SanJacobs 1h ago 0 comments

New Evidence of Obama Admin Conspiracy to Subvert President Trump's 2016 Victory (dni.gov)

2 points by odni 1h ago 0 comments

Multiplatform Matrix Multiplication Kernels

38 homarp 7 7/18/2025, 7:59:49 PM burn.dev ↗

Comments (7)

airstrike · 4m ago

[delayed]

nathanielsimard · 54m ago

One of the author here, don't hesitate if you have any question or comment!

raphaelty · 1h ago

Very interesting, willing to try burn

almostgotcaught · 34m ago

I'm sorry this is a low brow comment but this is the dumbest thing you can do in this space:

> Unit (thread in CUDA, invocation in Vulkan/Wgpu): the smallest execution entity performing computations.

> Plane (warp in CUDA, subgroup in Vulkan/Wgpu): a group of (typically 32) units executing in lockstep and able to share data efficiently through registers.

> Cube (thread block in CUDA, workgroup in Vulkan/Wgpu): a group of units that execute on the same SM, sharing memory and able to synchronize

It's already bad enough that the vendors themselves insisted on different names but why in the bejesus would you rename these concepts and diverge from literally all existing naming conventions when you're providing middleware. Ie when using your tool I'm still going to reference NVIDIA's or AMD's to understand how the hardware actually works. Like do you really think otherwise - that your thing is gonna be end of the line???

FYI the word warp isn't random techno babble but is actually a very clever pun that actually fits very well conceptually:

https://en.m.wikipedia.org/wiki/Warp_and_weft

nathanielsimard · 24m ago

Using the naming from one of the existing API would put too much bias towards that API. It started as a WebGPU project early on, but some features are not present so mixing terms wasn't ideal. We're also working on extending CubeCL to CPU, so we want terms not only tied to the GPU word.

sroussey · 3m ago

Why unit instead of point?

Unit, plane (as vs train), and cube?

Or point, plane, cube (1d, 2d, 3d)?

almostgotcaught · 14m ago

Thread, subgroup, workgroup.

There you go you've hit basically two of 3 completely (AMD and Vulkan) and are close enough CUDA that people would get it.

I have no idea what a plane connotes and a cube literally gives a distinct enough picture from block that I will be continuously reminding myself of the mapping.

What you did was pointless - you assigned new words to objects that you don't own and now your conceptual framework is askew from the actual underlying (true) conceptual framework.

> CubeCL to CPU

There is zero affinity between GPU programing models and multicore CPU programing models. If you don't believe me go ask the openmp people how they're doing supporting GPUs.

nathanielsimard · 2m ago

Well we can agree to disagree, CubeCL also has the concept of instruction parallelism, which would be used to target simd instructions on CPU. Our algorithms are normally flexible on both the plane size and the line size, adapting to the hardware with comptime logique. You are free to dislike the naming, but imo a mix of multiple APIs is worse than something new.