This is my favorite type of HN post, and definitely going to be a classic in the genre for me.
> Memory optimization on ultra-high core count systems differs a lot from single-threaded memory management. Memory allocators themselves become contention points, memory bandwidth is divided across more cores, and allocation patterns that work fine on small systems can create cascading performance problems at scale. It is crucial to be mindful of how much memory is allocated and how memory is used.
In bioinformatics, one of the most popular alignment algorithms is roughly bottlenecked on random RAM access (the FM-index on the BWT of the genome), so I always wonder how these algorithms are going to perform on these beasts. It's been a decade since I spent any time optimizing large system performance for it though. NUMA was already challenging enough! I wonder how many memory channels these new chips have access to.
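To make the random-access point concrete, here is a minimal, hypothetical sketch of FM-index backward search (not taken from any particular aligner). The `FMIndex` layout with a dense `Occ` table is an assumption made for brevity; real tools use sampled or compressed occurrence structures, but the dependent, data-driven loads per query character are the same:

```cpp
// Sketch of FM-index backward search (exact matching) over the BWT of a genome.
// Each pattern character triggers two essentially random lookups into the
// occurrence table, and the next iteration cannot start until they return,
// which is why the algorithm tends to be bound by random RAM access.
#include <array>
#include <cstdint>
#include <vector>

struct FMIndex {
    std::array<uint64_t, 4> C;                 // C[c] = # of symbols in the text smaller than c
    std::vector<std::array<uint64_t, 4>> Occ;  // |BWT|+1 rows: Occ[i][c] = count of c in BWT[0, i)

    uint64_t occ(uint64_t i, int c) const { return Occ[i][c]; }  // one cache-missy lookup
};

// Returns how many times `pattern` (bases encoded as 0..3) occurs in the indexed text.
uint64_t count(const FMIndex & fm, const std::vector<int> & pattern) {
    uint64_t lo = 0, hi = fm.Occ.size() - 1;   // half-open interval [lo, hi) over the BWT
    for (auto it = pattern.rbegin(); it != pattern.rend() && lo < hi; ++it) {
        int c = *it;                           // pattern is consumed right to left
        lo = fm.C[c] + fm.occ(lo, c);          // two dependent loads at unpredictable
        hi = fm.C[c] + fm.occ(hi, c);          // addresses; little for the prefetcher to do
    }
    return hi - lo;                            // size of the final suffix-array interval
}
```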
bee_rider · 14m ago
288 cores is an absurd number of cores.
Do these things have AVX512? It looks like some of the Sierra Forest chips do have AVX512 with 2xFMA…
That’s pretty wide. Wonder if they should put that thing on a card and sell it as a GPU (a totally original idea that has never been tried, sure…).
ashvardanian · 11m ago
Sadly, no! On the bright side, they support the new AVX-VNNI extensions, which help with low-precision integer dot products for vector search!
SimSIMD (inside USearch (inside ClickHouse)) already has those SIMD kernels, but I don’t yet have the hardware to benchmark :(
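For context, a rough sketch of the kind of kernel AVX-VNNI enables. This is not the actual SimSIMD code; the function name, the u8 × i8 operand choice (that is what `vpdpbusd` natively computes; signed × signed needs an extra fix-up), and the `-mavxvnni` build flag are assumptions for the illustration:

```cpp
// Hedged sketch of an AVX-VNNI integer dot product. Build with e.g. `g++ -O3 -mavxvnni`
// and gate on CPUID in real code. `vpdpbusd` fuses four u8*i8 multiplies and a 32-bit
// accumulate per lane, so one instruction handles 32 byte-pairs.
#include <immintrin.h>
#include <cstddef>
#include <cstdint>

// Dot product of n unsigned 8-bit and signed 8-bit values; n is a multiple of 32.
int32_t dot_u8i8_avxvnni(const uint8_t * a, const int8_t * b, size_t n) {
    __m256i acc = _mm256_setzero_si256();
    for (size_t i = 0; i < n; i += 32) {
        __m256i va = _mm256_loadu_si256(reinterpret_cast<const __m256i *>(a + i));
        __m256i vb = _mm256_loadu_si256(reinterpret_cast<const __m256i *>(b + i));
        acc = _mm256_dpbusd_avx_epi32(acc, va, vb);  // 32 multiply-accumulates per instruction
    }
    // Horizontal sum of the eight 32-bit partial sums.
    __m128i lo = _mm256_castsi256_si128(acc);
    __m128i hi = _mm256_extracti128_si256(acc, 1);
    __m128i s  = _mm_add_epi32(lo, hi);
    s = _mm_hadd_epi32(s, s);
    s = _mm_hadd_epi32(s, s);
    return _mm_cvtsi128_si32(s);
}
```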
pixelpoet · 39m ago
This post looks like excellent low-level optimisation writing just in the first sections, and (I know this is kinda petty, but...) my heart absolutely sings at their use of my preferred C++ coding convention where & (ref) neither belongs to the type nor the variable name!
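For anyone unfamiliar with the convention being praised, a contrived illustration (the signature is made up): the ampersand is set off by whitespace on both sides rather than attached to the type or to the name:

```cpp
#include <vector>
void scale(std::vector<double> & values, double factor);  // the convention in question
void scale(std::vector<double>& values, double factor);   // & attached to the type
void scale(std::vector<double> &values, double factor);   // & attached to the name
```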