"Despite extensive safety training, LLMs remain vulnerable to “jailbreaking” through adversarial prompts. Why does this vulnerability persist? In a new open access paper published in Philosophical Studies, I argue this is because current alignment methods are fundamentally shallow."
https://sigmoid.social/@raphaelmilliere/114659355740586289
"Despite extensive safety training, LLMs remain vulnerable to “jailbreaking” through adversarial prompts. Why does this vulnerability persist? In a new open access paper published in Philosophical Studies, I argue this is because current alignment methods are fundamentally shallow."