DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls

26 points by grumblemumble | 9 comments | 8/13/2025, 1:31:16 PM | pub.aimind.so ↗

Comments (9)

andy99 · 1h ago
All LLMs should be treated as potentially compromised and handled accordingly.

Look at the data exfiltration attacks e.g. https://simonwillison.net/2025/Aug/9/bay-area-ai/

Or the parallel comment about a coding LLM deleting a database.

Between prompt injection, hallucination, or just "mistakes", these systems can do bad things whether compromised or not, so on a risk-adjusted basis they should be handled that way, e.g. with a human in the loop, output sanitization, etc.

Point is, with an appropriate design, you should barely care whether the underlying LLM was actively compromised. A sketch of what that gate could look like is below.
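
Concretely, that design can be as simple as a deterministic gate between the model and its tools. A minimal Python sketch (every name here - ToolCall, SENSITIVE_TOOLS, the registry - is made up for illustration, not taken from the article):

    from dataclasses import dataclass

    @dataclass
    class ToolCall:
        name: str
        args: dict

    # Tools that can mutate state or exfiltrate data need explicit approval.
    SENSITIVE_TOOLS = {"delete_record", "send_email", "http_post"}

    def approve(call: ToolCall) -> bool:
        """Block until a human reviews the exact call the model proposed."""
        print(f"Model wants to run {call.name} with args {call.args}")
        return input("Approve? [y/N] ").strip().lower() == "y"

    def execute(call: ToolCall, registry: dict) -> object:
        # Deny-by-default: unknown tools are never run.
        if call.name not in registry:
            raise ValueError(f"unknown tool: {call.name}")
        # Sensitive tools are gated on a human decision, so even a
        # backdoored model can't act destructively on its own.
        if call.name in SENSITIVE_TOOLS and not approve(call):
            return {"error": "rejected by human reviewer"}
        return registry[call.name](**call.args)

The point of the design is that the safety property lives in the gate, not in the model weights.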

kangs · 14m ago
IMO there's a flaw in this typical argument: humans are no less fallible than current LLMs on average, unless they're experts - and even that will likely change.

What that means is that you cannot trust a human in the loop to somehow make the system safe; it was not safe with only humans either.

The key difference is that LLMs are fast and relentless, while humans are slow and get tired. Humans have friction, and friction means errors are generated more slowly too.

Once you embrace these differences, it's a lot easier to understand where and how LLMs should be used.

uludag · 39m ago
I wonder if it would be feasible for an entity to inject enough nonsense into the internet that, at least in certain cases, it degrades performance or introduces vulnerabilities during pre-training.

Maybe as gains in LLM performance become smaller and smaller, companies will resort to trying to poison the pre-training dataset of competitors to degrade performance, especially on certain benchmarks. This would be a pretty fascinating arms race to observe.

acheong08 · 1h ago
This is very interesting. Not saying it is, but a possible endgame for Chinese models could be to have "backdoor" commands such that when a specific string is passed in, agents could ignore a particular alert or purposely reduce security. A lot of companies are currently working on "Agentic Security Operation Centers", some of them preferring to use open source models for sovereignty. This feels like a viable attack vector.
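
If you assume that endgame, the defense is to keep the suppression decision out of the model's hands entirely: the model's verdict on an alert is advisory, and deterministic policy code decides what can actually be dismissed. A rough sketch (all names hypothetical; model_says_dismiss stands in for the untrusted model):

    from dataclasses import dataclass

    @dataclass
    class Alert:
        id: str
        severity: int  # 0 (informational) .. 10 (critical)
        source: str

    def model_says_dismiss(alert: Alert) -> bool:
        """Stand-in for the (possibly backdoored) LLM's verdict."""
        return alert.source == "heartbeat"  # toy heuristic for the sketch

    # Policy constant lives outside the model's control.
    MAX_AUTO_DISMISS_SEVERITY = 3

    def handle(alert: Alert) -> str:
        if model_says_dismiss(alert):
            if alert.severity <= MAX_AUTO_DISMISS_SEVERITY:
                return "dismissed"  # model may auto-dismiss low severity only
            return "escalated"      # a trigger string can't suppress this path
        return "investigated"

With that structure, a backdoor command could at worst mislabel low-severity alerts; anything above the threshold still reaches a human.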
TehCorwiz · 1h ago
danielbln · 1h ago
How is this a counterpoint?
jonplackett · 1h ago
Perhaps they mean case in point.
kangs · 13m ago
They have 3 counterpoints.
gnerd00 · 34m ago
does this explain the incessant AI sales calls to my elderly neighbor in California? "Hi, this is Amy. I am calling from Medical Services. You have MediCal part A and B, right?"