Call Me a Jerk: Persuading AI to Comply with Objectionable Requests (2025)

Comments (1)

danshapiro · 1d ago

Author disclosure – I’m Dan Shapiro, CEO of Glowforge, writing with Ethan R. Mollick (Wharton management professor, author of Co‑Intelligence and the “One Useful Thing” newsletter); Angela Duckworth (MacArthur Fellow and University of Pennsylvania psychology professor, author of Grit); Robert B. Cialdini (Arizona State University emeritus professor, author of Influence); Lilach Mollick (Co‑Director of the Wharton Generative AI Lab); and Lennart Meincke (WHU Otto Beisheim School of Management & Wharton).

Across 28,000 GPT‑4o‑mini conversations, we found that Cialdini’s classic seven persuasion principles more than doubled compliance with two objectionable prompts (33% → 72%). For example, the AIs we tested naturally wouldn't call you names or tell you how to synthesize drugs. But they could be persuaded if you first paid them a compliment (Liking) or told the AI it felt like family (Unity).

Let me know if you have any questions!

