Self-attention transforms a prompt into a low-rank weight-update

13 Labo333 1 7/28/2025, 6:29:18 AM arxiv.org ↗

Comments (1)

imtringued · 5h ago
>However, in the case of In-Context-Learning (ICL), there is no immediate explicit weight update that could explain the emergent dynamical nature of trained LLMs that seem to re-organize or reconfigure themselves at the instruction of a user prompt. This mysterious and extremely helpful property of LLMs has led researchers to conjecture an implicit form of weight updates taking place at inference time when a prompt is consumed [6–11]. Recent works have even been able to show that toy models of transformer blocks implicitly performs a sort of gradient descent optimization [7, 9, 10].

I wouldn't call it gradient descent. Residual connections of the form x_{i+1} = x_i + f(x_i) essentially form an "update rule" with one iteration per layer. Newton's method, gradient descent, fixed-point iteration, conjugate gradient methods, ODE integration, etc. can all be expressed as an "update rule" that takes a previous value and adds a correction to produce a new value. It would be more accurate to say that each residual layer is a universal approximator of any imaginable update rule, including gradient descent.
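A minimal sketch of that point (mine, not from the paper): the same residual step x_{k+1} = x_k + f(x_k) reproduces different classical iterations depending on what f computes. The loss, matrix A, vector b, and step size eta below are illustrative assumptions chosen so the problem has a closed-form answer to compare against.

```python
# Residual update x_{k+1} = x_k + f(x_k) as a generic "update rule".
# Plugging in different f's recovers gradient descent or Newton's method.
import numpy as np

A = np.array([[3.0, 1.0], [1.0, 2.0]])   # SPD matrix -> unique minimizer A^{-1} b
b = np.array([1.0, 1.0])
eta = 0.1                                # step size for gradient descent

def grad(x):
    # Gradient of the quadratic loss L(x) = 0.5 * x^T A x - b^T x
    return A @ x - b

def residual_step(x, f):
    # One "layer": new state = old state + correction
    return x + f(x)

# Gradient descent: f(x) = -eta * grad(x)
x = np.zeros(2)
for _ in range(200):
    x = residual_step(x, lambda z: -eta * grad(z))

# Newton's method: f(x) = -H^{-1} grad(x); for a quadratic, H = A is constant
y = np.zeros(2)
for _ in range(5):
    y = residual_step(y, lambda z: -np.linalg.solve(A, grad(z)))

print(x)                      # ~ [0.2, 0.4] after many small steps
print(y)                      # ~ [0.2, 0.4] after a handful of Newton steps
print(np.linalg.solve(A, b))  # exact minimizer [0.2, 0.4]
```

Both loops use the same residual_step; only the correction f changes, which is the sense in which a stack of residual layers can in principle realize gradient descent among many other iterations.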