> Despite being sparse, NSA surpasses Full Attention baseline on average across general benchmarks, long-context tasks, and reasoning evaluation.
Isn't it notable that the latency improvement didn't come with a performance loss? I'm not super familiar with all the technical aspects, but that seems like it should be one of the main focuses of the paper.
gnabgib · 9m ago
Title: Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
For the first time, it introduced native sparse attention into the full training process, achieving up to 11× inference speedup while maintaining model performance.
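Not the paper's actual method, but to give a feel for why block-sparse attention can be much faster than full attention at long context without necessarily losing quality, here's a toy sketch in plain NumPy of one ingredient: scoring contiguous key/value blocks cheaply and running exact attention only over the top few. The `block_size` and `top_k` values here are made up for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def blockwise_topk_attention(q, K, V, block_size=64, top_k=4):
    """Toy block-sparse attention for a single query vector.

    Keys/values are grouped into contiguous blocks; a cheap per-block
    score (query dot mean key) picks the top_k blocks, and exact
    attention runs only over those tokens. Illustrative only -- not
    NSA's real kernel or training recipe.
    """
    d = q.shape[-1]
    n_blocks = K.shape[0] // block_size
    Kb = K[: n_blocks * block_size].reshape(n_blocks, block_size, d)
    Vb = V[: n_blocks * block_size].reshape(n_blocks, block_size, d)

    # Cheap block importance: query against each block's mean key.
    block_scores = Kb.mean(axis=1) @ q
    keep = np.argsort(block_scores)[-top_k:]

    # Exact attention restricted to the selected blocks.
    K_sel = Kb[keep].reshape(-1, d)
    V_sel = Vb[keep].reshape(-1, d)
    weights = softmax(K_sel @ q / np.sqrt(d))
    return weights @ V_sel

# Example: 8k-token context, but attention touches only 4 blocks of 64 tokens.
rng = np.random.default_rng(0)
q = rng.standard_normal(128)
K = rng.standard_normal((8192, 128))
V = rng.standard_normal((8192, 128))
out = blockwise_topk_attention(q, K, V)
print(out.shape)  # (128,)
```

As I understand the paper, NSA's actual design combines a compression branch, a blockwise selection branch, and a sliding-window branch with hardware-aligned kernels, and trains with that sparsity from the start; the sketch only shows the selection idea, i.e. that attending over a few blocks instead of the whole context is where the latency win comes from.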
The awards page for ACL seems to disagree with this editorialized title: https://2025.aclweb.org/program/awards/