This paper had the side effect of comparing NEON and SVE auto-vectorization on the Neoverse V1 and V2.
V1 has 256-bit SVE, which groups two of the four 128-bit execution units together to execute the 256-bit instructions.
V2 has 128-bit SVE.
On V2 there was no speedup from SVE, which suggests the compiler either made no use of, or gained nothing from, the SVE instructions that have no NEON equivalent.
On V1, SVE was ~15% faster on average, even though SVE and NEON share the same execution resources on that core, the only difference being the vector length.
Abstract:
> Interleaving/Unrolling and Vectorization are two popular means to optimize applications. While the first one creates multiple copies of the loop body content, the second one focuses on operating on multiple data elements in parallel thanks to SIMD units available in the CPU. In theory, interleaving and vectorization are orthogonal optimizations, one relying on instruction-level parallelism/superscalarity, and the other on data-level parallelism within a single instruction. Modern CPU architectures provide both of these parallelism mechanisms at once, and the combination of vectorization and interleaving is complex, influencing each other due to instruction selection and complexity of underlying hardware, and the programmer often has to rely on the compiler's auto-vectorization.
> Based on a large evaluation of 642 loops coming from the literature, this paper demonstrates that significant gains (up to 20%) can be obtained by adapting the LLVM auto-vectorizer to better exploit interleaving and vectorization for a given AArch64 architecture. The proposed approach is flexible and can be easily applied at both loop and application level. Experiments on 5 mini-apps coming from the HPC realm show similar improvements and demonstrate the co-design potential of the presented approach.