Show HN: InvestAI – An AI assistant to help people invest more independently (invest--ai.vercel.app)

Every couple of years I refresh my own parallel reduction benchmarks (https://github.com/ashvardanian/ParallelReductionsBenchmark), which are also memory-bound. Mine mostly focus on the boring simple throughput-maximizing cases on CPUs and GPUs.

Lately, as GPUs are pulled into more general data-processing tasks, I keep running into non-coalesced, pointer-chasing patterns — but I still don’t have a good mental model for estimating the cost of different access strategies. A crossover between these two topics — running MLP-style loads on GPUs — might be exactly the benchmark missing, in case someone is looking for a good weekend project!

ericye16 · 14h ago

I wish the chart extended past 28, otherwise how do we know that it tops out there?

saagarjha · 14h ago

You don't; the author explains that testing beyond that produces noise that makes it hard to analyze.

pixelpoet · 13h ago

It's pretty trivial to keep randomising the array and plot some min/max bands, or just the average.

No comments yet

Fp8 runs faster when the kernel name has "cutlass" in it (github.com)

Spacelift Raises $51M Series C to Redefine Enterprise Infrastructure Automation (spacelift.io)

What Every Data Scientist Needs to Know About GPUs [video] (youtube.com)

Sweden and Norway racing to launch satellites from mainland Europe (reuters.com)

Show HN: InvestAI – An AI assistant to help people invest more independently (invest--ai.vercel.app)

Bastille 1.0 – Bastille Day 2025 (github.com)

The Lazy Marketer's Guide to Not Writing Terrible AI Prompts (aistackmarketer.substack.com)

What Keeps the Lights On (thenewatlantis.com)

Arm estimates a 14-fold increase in data center customers since 2021 (reuters.com)

Japan Wires the Ocean with an Earthquake-Sensing 'Nervous System' (scientificamerican.com)

Robot performs realistic gallbladder surgery 'with 100% accuracy' (news.sky.com)

Jupiter endangers Earth, and may have extincted the dinosaurs (bigthink.com)

Parsing 1 Billion Rows in Bun/TypeScript Under 10s (taekim.dev)

Upgrading agentic coding capabilities with the new Devstral models (mistral.ai)

End-to-End News Sentiment Pipeline with Serverless AWS, DuckDB and Streamlit (github.com)

Pump Fiction (youtube.com)

Multi-Player Durable Stream Playground (s2.dev)

Satellite data indicates recent Arctic peatland expansion with warming (nature.com)

Robot performs first realistic surgery without human help (hub.jhu.edu)

Underwater turbine spinning for 6 years off Scotland's coast is a breakthrough (apnews.com)

Searchcraft: Advanced Search Developer Tools (searchcraft.io)

Psalm v7: up to 10x performance (blog.daniil.it)

Challenges for no code tools for data science (medium.com)

Elon Musk's X faces an uncertain future (axios.com)

Integrating Long-Term Memory with Gemini 2.5 (philschmid.de)

Show HN: Coherence – 5 min agentic chat SDK (withcoherence.com)

Auto Generating Blog Feed (pliutau.com)

I Found a Lost Music Generator from the 90s [video] (youtube.com)

Show HN: Perennial Task (Prn) (github.com)

SHOW HN: Stripe Ignoring Legal Letters and Holding $800k+

Go-EUVD: Go Library for Interacting with Enisa EU Vulnerability Database (EUVD) (github.com)

Tiny aquarium fish net 3D print (blog.qiqitori.com)

The Grip That Race and Identity Have on My Students (nytimes.com)

Evaluating the Critical Risks of Amazon’s Nova Premier (alphaxiv.org)

How I ported Penko Park to Switch (ghostbutter.com)

Europe's Great Founders Must Unretire (generalist.com)

Intel CEO says it's "too late" for them to catch up with AI competition (tomshardware.com)

Every Company Will Need a Chief Agent Officer (mike-hostetler.com)

If I Ran X (werd.io)

Microsoft says regulations and environmental issues are cramping Euro expansion (theregister.com)

Reframe – Open‑Source ISO20022 Message Transformer in Rust (github.com)

My Freelance Journey – From Zero to Building and Running My Own Agency (preetsuthar.me)

Andrew Ng: Building Faster with AI [video] (youtube.com)

Flix – A powerful effect-oriented programming language (flix.dev)

WPP names senior Microsoft boss Cindy Rose as new CEO (theguardian.com)

Epistemological Primes

Show HN: I rebuilt few years old project and now it covers my expenses (hextaui.com)

An almost catastrophic OpenZFS bug and the humans that made it (despairlabs.com)

Heat and Team Production: Experimental Evidence from Bangladesh (papers.ssrn.com)

Ask HN: Handheld Digital Writing Device

Memory-Level Parallelism: Apple M2 vs. Apple M4

Comments (7)