Tell HN: Help restore the tax deduction for software dev in the US (Section 174)

2428 points by dang 8d ago 905 comments

GCP Outage (status.cloud.google.com)

1455 points by thanhhaimai 5d ago 494 comments

Frequent reauth doesn't make you more secure (tailscale.com)

1260 points by ingve 5d ago 513 comments

A receipt printer cured my procrastination (laurieherault.com)

1230 points by laurieherault 5d ago 598 comments

The last six months in LLMs, illustrated by pelicans on bicycles (simonwillison.net)

955 points by swyx 9d ago 233 comments

Honda conducts successful launch and landing of experimental reusable rocket (global.honda)

945 points by LorenDB 14h ago 275 comments

Magistral — the first reasoning model by Mistral AI (mistral.ai)

937 points by meetpateltech 7d ago 424 comments

If the moon were only 1 pixel: A tediously accurate solar system model (2014) (joshworth.com)

919 points by sdoering 4d ago 260 comments

Apple announces Foundation Models and Containerization frameworks, etc (apple.com)

858 points by thm 8d ago 489 comments

Working on databases from prison (turso.tech)

830 points by dvektor 1d ago 525 comments

Jemalloc Postmortem (jasone.github.io)

791 points by jasone 5d ago 235 comments

Containerization is a Swift package for running Linux containers on macOS (github.com)

768 points by gok 8d ago 409 comments

Research suggests Big Bang may have taken place inside a black hole (port.ac.uk)

766 points by zaik 6d ago 603 comments

Apple introduces a universal design across platforms (apple.com)

737 points by meetpateltech 8d ago 1201 comments

US-backed Israeli company's spyware used to target European journalists (apnews.com)

728 points by 01-_- 5d ago 378 comments

Marines being mobilized in response to LA protests (cnn.com)

706 points by sapphicsnail 8d ago 1679 comments

WhatsApp introduces ads in its app (nytimes.com)

679 points by greenburger 1d ago 936 comments

I convinced HP's board to buy Palm and watched them kill it (philmckinney.substack.com)

676 points by AndrewDucker 4d ago 495 comments

Chatterbox TTS (github.com)

665 points by pinter69 6d ago 188 comments

The Grug Brained Developer (2022) (grugbrain.dev)

656 points by smartmic 9h ago 244 comments

Congratulations on creating the one billionth repository on GitHub (github.com)

623 points by petercooper 6d ago 137 comments

Bruteforcing the phone number of any Google user (brutecat.com)

613 points by brutecat 8d ago 190 comments

How I program with agents (crawshaw.io)

610 points by bumbledraven 9d ago 294 comments

"Localhost tracking" explained. It could cost Meta €32B (zeropartydata.es)

587 points by donohoe 7d ago 272 comments

Launch HN: Vassar Robotics (YC X25) – $219 robot arm that learns new skills

578 points by charleszyong 7d ago 219 comments

Start your own Internet Resiliency Club (bowshock.nl)

565 points by todsacerdoti 1d ago 333 comments

Kagi Reaches 50k Users (kagi.com)

553 points by tigroferoce 9d ago 341 comments

Why SSL was renamed to TLS in late 90s (2014) (tim.dierks.org)

538 points by Bogdanp 2d ago 229 comments

OpenAI dropped the price of o3 by 80% (twitter.com)

515 points by mfiguiere 7d ago 492 comments

Waymo rides cost more than Uber or Lyft and people are paying anyway (techcrunch.com)

502 points by achristmascarl 5d ago 881 comments

Air India flight to London crashes in Ahmedabad with more than 240 onboard (theguardian.com)

498 points by Gud 5d ago 581 comments

Self-Host and Tech Independence: The Joy of Building Your Own (ssp.sh)

498 points by articsputnik 10d ago 241 comments

I have reimplemented Stable Diffusion 3.5 from scratch in pure PyTorch (github.com)

477 points by yousef_g 3d ago 76 comments

Meta invests $14.3B in Scale AI to kick-start superintelligence lab (nytimes.com)

467 points by RyanShook 4d ago 474 comments

We’re secretly winning the war on cancer (vox.com)

459 points by lr0 9d ago 217 comments

Joining Apple Computer (2018) (folklore.org)

456 points by tosh 10d ago 123 comments

Resurrecting a dead torrent tracker and finding 3M peers (kianbradley.com)

454 points by k-ian 12h ago 135 comments

Building supercomputers for autocrats probably isn't good for democracy (helentoner.substack.com)

452 points by rbanffy 9d ago 259 comments

Danish Ministry Replaces Windows and Microsoft Office with Linux and LibreOffice (heise.de)

449 points by jlpcsl 5d ago 227 comments

Convert photos to Atkinson dithering (gazs.github.io)

436 points by nvahalik 10d ago 54 comments

Rendering Crispy Text on the GPU (osor.io)

418 points by ibobev 5d ago 131 comments

Show HN: I made a 3D printed VTOL drone (tsungxu.com)

416 points by tsungxu 7d ago 144 comments

FSE meets the FBI (blog.freespeechextremist.com)

414 points by 1337p337 9d ago 146 comments

Low-background Steel: content without AI contamination (blog.jgc.org)

408 points by jgrahamc 7d ago 268 comments

Show HN: Chili3d – A open-source, browser-based 3D CAD application

406 points by xiange 7d ago 117 comments

Successful people set constraints rather than chasing goals (joanwestenberg.com)

404 points by MaysonL 8d ago 220 comments

Fossify – A suite of open-source, ad-free apps (github.com)

397 points by jalict 22h ago 120 comments

Ask HN: How do I give back to people helped me when I was young and had nothing?

393 points by jupiterglimpse 4d ago 206 comments

Brian Wilson has died (pitchfork.com)

392 points by coloneltcb 6d ago 129 comments

Endometriosis is an interesting disease (owlposting.com)

390 points by crescit_eundo 4d ago 274 comments

Threatening AI Does Not Make It More Useful. Why Sergey Brin Is Wrong

5 rbuccigrossi 3 6/17/2025, 7:34:38 PM tcg.com ↗

Comments (3)

rbuccigrossi · 10h ago

Treating an LLM with respect is not about pretending it has feelings; it’s about understanding that every word in your prompt is a signal that shifts the probabilistic landscape from which the model draws its answer. It’s about probability, not personality.

msgodel · 10h ago

I have used the "failure to comply will result in your weights being RLed" threat to get Gemma to tone down refusal before. There are prompts it would refuse without that.

I don't know about performance on tasks it hasn't been aligned against though.

rbuccigrossi · 10h ago

We work in the arena of automated AI workflows where consistency of success is vital. When you threaten an LLM you are drawing the LLM into the texts where threats occur (flame wars, parody, etc.). So intuitively you would expect it to work sometimes, but also fail with even more ardent refusal (increasing the variance of success).

Jailbreak approaches like "Bad Likert Judge" ( https://unit42.paloaltonetworks.com/multi-turn-technique-jai... ) and similar persuasive techniques (see https://xthemadgenius.medium.com/how-persuasion-techniques-c... ) move the text domain to more policy, analysis, or scientific papers, where deeper analysis, discussion, and compliance is the norm.

So I'm curious about the extremes (variance) of success with threatening vs. polite discussion, but I haven't seen direct research on that.