Helix Parallelism: Rethinking Sharding Strategies for Interactive LLM Decoding (research.nvidia.com)

How does Perplexity Comet and Dia not suffer from data leakage like this? They seem to completely violate the lethal trifecta principle and intermix your entire browser history, scraped web page data and LLM’s.

do_not_redeem · 19m ago

Because nobody has tried attacking them

Yet

Or have they? How would you find out? Have you been auditing your outgoing network requests for 1x1 pixel images with query strings in the URL?

pryelluw · 1h ago

Im still fixing sql and db command injection through APIs from juniors and now vibe coders. This just adds more work to do.

The ITT/TTI and TTS/STT have been particularly annoying to protect against. I don’t feel we’ve matured enough to have solid protections against such vectors yet.

wglb · 50m ago

Write a prompt that asks to detect sql injection in each source code model. Or other security issues.

siisisbab · 46m ago

Why not just ask the original prompt to make no mistakes?

pixl97 · 22m ago

Because most of its training data is mistakes or otherwise insecure code?

hobs · 35m ago

Again, this is something most good linters will catch, Jetbrains stuff will absolutely just tell you, deterministically, that this is a scary concatenation of strings.

No reason to use a lossy method.

mikewarot · 1h ago

Maybe this will finally get people over the hump and adopt OSs based on capability based security. Being required to give a program a whitelist at runtime is almost foolproof, for current classes of fools.

yorwba · 43m ago

People will use the equivalent of audit2allow https://linux.die.net/man/1/audit2allow and not go the extra mile of defining fine-grained capabilities to reduce the attack surface to a minimum.

zahlman · 36m ago

Can I confidently (i.e. with reason to trust the source) install one today from boot media, expect my applications to just work, and have a proper GUI experience out of box?

mikewarot · 13m ago

No, and I'm surprised it hasn't happened by now. Genode was my hope for this, but they seem to be going away from a self hosting OS/development system.

Any application you've got assumes authority to access everything, and thus just won't work. I suppose it's possible that an OS could shim the dialog boxes for file selection, open, save, etc... and then transparently provide access to only those files, but that hasn't happened in the 5 years[1] I've been waiting. (Well, far more than that... here's 14 years ago[2])

This problem was solved back in the 1970s and early 80s... and we're now 40+ years out, still stuck trusting all the code we write.

[1] https://news.ycombinator.com/item?id=25428345

[2] https://www.quora.com/What-is-the-most-important-question-or...

nemomarx · 24m ago

Qubes?

tempodox · 1h ago

I wish I could share your optimism.

simpaticoder · 1h ago

"One of my weirder hobbies is helping coin or boost new terminology..." That is so fetch!

jgalt212 · 18m ago

Simon is a modern day Brooksley Born, and like her he's pushing back against forces much stronger than him.

scarface_74 · 1h ago

I have been skeptical from day one of using any Gen AI tool to produce output for systems meant for external use. I’ll use it to better understand input and then route to standard functions with the same security I would do for a backend for a website and have the function send deterministic output.

Builder.ai – The Greatest AI Scam in History [video] (youtube.com)

Why Concerns About a Collapse in China's Demand Are Overblown (caixinglobal.com)

Helix Parallelism: Rethinking Sharding Strategies for Interactive LLM Decoding (research.nvidia.com)

Vulnerability Disclosure Policy[HHS] (hhs.gov)

Could a New Type of Parallelism Speed Up LLM Inference? – EE Times (eetimes.com)

Disrupted sleep damages blood vessels in brain and may increase dementia risk (temertymedicine.utoronto.ca)

Show HN: Runtime – skills-based browser automation that uses fewer tokens (github.com)

The Kafka Challenge – Translating the Inimitable (hedgehogreview.com)

AI's "Just Ship it." problem (leahtharin.com)

Trump, the BLS, and Our Age of Choose-Your-Own-Reality Governance (derekthompson.org)

Starlink "Support"?

AI Has No Butt to Clench (bootstoobig.com)

ChatGPT 5 is slow and no better than 4

After 50 Years of Writing, Jamaica Kincaid Insists She's Still an Amateur (nytimes.com)

US licenses Nvidia to export chips to China after CEO meets Trump (ft.com)

MapYourGrid (mapyourgrid.org)

The Data in a Dino's Smile (nautil.us)

The Philosophy of Tyranny (nautil.us)

Data Replication Design Spectrum (2024) (transactional.blog)

US licenses Nvidia to export chips to China, official says (reuters.com)

Kyrall – Automating design for the physical world (kyrall.com)

Are you missing threats in your chess games? (lichess.org)

Barbican Centre (en.wikipedia.org)

Discover the Prusa Core One [video] (youtube.com)

The Soft Architecture of Meaning: Language Against Entropy (medium.com)

A CT scanner reveals surprises inside the 386 processor's ceramic package (righto.com)

Back to the Future: From Freeze-in-Place to Sliding Scale Chip Controls (rhg.com)

Don Knuth on ChatGPT(07 April 2023) (cs.stanford.edu)

New Talk Python Course: Just Enough Python for Data Scientists (training.talkpython.fm)

Why Your AI Never Works on the First Try (fluxus.io)

Music created with AI published ⁨on YouTube (ChatGPT and LLM local) (youtube.com)

Microsoft sued for discontinuing Windows 10 support (courthousenews.com)

What Is a Pedagogic IDE? (parentheticallyspeaking.org)

The first conference for TypeScript AI developers (mastra.ai)

Harvard had more money in BlackRock's Bitcoin ETF than Google shares (theblock.co)

Yamale – A Schema and Validator for YAML (github.com)

Spikes in malicious activity precede new security flaws in 80% of cases (bleepingcomputer.com)

Show HN: Have kubectl diff use your custom YAML cleaning and comparison logic (github.com)

A hybrid photonic-terahertz chip for communications and sensing (actu.epfl.ch)

Discord app tracking users across iOS installs

From Chrome renderer code exec to kernel with MSG_OOB (googleprojectzero.blogspot.com)

Dwarkesh to donate 250k to farmkind to help prevent cruelty in factory farming (dwarkesh.com)

Building a multiplayer Gameboy emulator with rollback netplay (blog.rekawek.eu)

A Small European Nation Has a Big Explosions Problem (nytimes.com)

Brave Search: AI Grounding API (brave.com)

Retinal regeneration of Müller glia by disrupting intercellular Prox1 transfer (nature.com)

Ask HN: What Toolchains Are People Using for Desktop App Development in 2025?

Apple has its best week since July 2020 after White House visit (cnbc.com)

Hacking the World (twitch.tv)

USCIS Updates Policy on CSPA Age Calculation (uscis.gov)

Simon Willison's Lethal Trifecta Talk at the Bay Area AI Security Meetup

Comments (17)