Large Language Models Often Know When They Are Being Evaluated

11 points by jonbaer | 6/15/2025, 2:17:42 AM | arxiv.org ↗

Comments (5)

random3 · 18m ago
Just like they "know" English. "Know" is quite an anthropomorphization. As long as an LLM is able to describe what an evaluation is (why wouldn't it be?), there's a reasonable expectation that it can distinguish/recognize/match patterns associated with evaluations. But to say they "know" is plenty of (unnecessary) steps ahead.
sidewndr46 · 3m ago
This was my thought as well when I read this. Using the word 'know' implies an LLM has cognition, which is a pretty huge claim just on its own.
noosphr · 19m ago
The anthropomorphization of LLMs is getting off the charts.

They don't know they are being evaluated. The underlying distribution is skewed because of training data contamination.

zer00eyz · 5m ago
No, they do not. No LLM is ever going to be self-aware.

It's a system that is trained, that only does what you build into it. If you run an LLM for 10 years, it's not going to "learn" anything new.

The whole industry needs to quit with the anthropomorphizing: emergent thinking, reasoning, hallucination.

We have an amazing set of tools in LLMs, with the potential to unlock another massive upswing in productivity, but the hype and snake oil are getting old.

khimaros · 26m ago
Rob Miles must be saying "I told you so"