I think it would be very good if they can make this work. I suspect we do something not entirely unlike this ourselves, and that's why spaced repetition is so good at stuffing things into our long-term memories.
iknownothow · 1h ago
> Log-linear attention replaces the fixed-size hidden state with a logarithmically growing set of hidden states
Does this mean the models can be smaller too (on top of the primary benefit of being faster)?
Lerc · 29m ago
Reduced memory consumption for context, perhaps, but the hidden state is different from the weights. I don't think this would improve the model's capability per parameter (though, as with everything in ML, I wouldn't bet against it until it's been tested).
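To make the state-vs-weights distinction concrete, here's a rough back-of-the-envelope sketch. The numbers and the power-of-two bucketing are my assumptions (my reading of the abstract is that the logarithmic growth comes from something like a Fenwick-tree partition of the context), not the paper's exact scheme:

    import math

    # Hypothetical sizes, just for illustration.
    D = 8          # per-state dimension (assumption)
    T = 1_000_000  # context length in tokens

    # A plain linear-attention hidden state is one D x D matrix,
    # regardless of context length.
    fixed_state_floats = D * D

    # Log-linear variant (as I understand it): one fixed-size state
    # per power-of-two bucket of the context, so ~log2(T) states.
    num_states = max(1, math.ceil(math.log2(T)))
    log_linear_floats = num_states * D * D

    # A full softmax-attention KV cache instead grows linearly in T.
    kv_cache_floats = 2 * T * D

    print(fixed_state_floats, log_linear_floats, kv_cache_floats)

The point being: none of these quantities depends on the parameter count, so a smaller context memory doesn't by itself mean a smaller model.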