Bezos-backed Perplexity AI makes surprise bid for Google Chrome (bbc.com)

1 points by LopRabbit 36s ago 0 comments

The Most Environmentally Imaginative Country on Earth Is Under Assault (nytimes.com)

1 points by mitchbob 41s ago 1 comments

A List of tools built for Claude code (codeaidirectory.com)

1 points by hgarg 51s ago 0 comments

Hype Is a Business Tool (jenson.org)

1 points by nodar86 3m ago 0 comments

MyBoyfriendIsAI Subreddit (old.reddit.com)

2 points by Mistletoe 3m ago 1 comments

Claude Code usage limits using agentic coding

1 points by VadimPR 4m ago 0 comments

Show HN: Clojure Land – discover open-source Clojure projects (clojure.land)

4 points by brettatoms 4m ago 0 comments

The Morning Code Review: Programming in 2040 (dimillian.medium.com)

1 points by unkeen 5m ago 0 comments

NIH and Nihilism (blog.jsbarretto.com)

1 points by ibobev 7m ago 0 comments

Optimize-Hydra – Glfmn.io (glfmn.io)

1 points by ibobev 7m ago 0 comments

Divergent Association Task (datcreativity.com)

1 points by davikr 8m ago 0 comments

Do Things That Don't Scale (paulgraham.com)

2 points by bschne 9m ago 0 comments

Watch robot athletes compete in first humanoid games (Video) (bbc.com)

1 points by danboarder 9m ago 0 comments

The beauty of a text only webpage (albanbrooke.com)

7 points by speckx 11m ago 2 comments

Inside the relentless race for AI capacity (ig.ft.com)

1 points by gmays 13m ago 0 comments

Dark Norwegian Thriller 'The Innocents' Asks If Children Can Be Evil (2021) (hollywoodreporter.com)

1 points by walterbell 14m ago 0 comments

Rational Ignorance at the Patent Office (2001) (papers.ssrn.com)

1 points by standeven 15m ago 0 comments

Scaleify – Build native iOS apps without a Mac or coding skills

2 points by bajero 16m ago 1 comments

Show HN: PlutoPrint – Convert HTML to Beautiful PDFs and PNGs with Python (github.com)

1 points by sammycage 16m ago 0 comments

Lessons in Branding from Sci-Fi and Fantasy (printmag.com)

1 points by batmya 16m ago 0 comments

APL Event in NYC (free to attend) (dyna.dyalog.com)

1 points by pillowshift 16m ago 0 comments

TotallyBugFreeTrustMeBro (old.reddit.com)

1 points by belter 19m ago 0 comments

Dependency Hell Is a Real Place (systemsandsociety.com)

2 points by emanuelpalm 20m ago 0 comments

The Dark Underside of the Host Bar Industry (nippon.com)

1 points by rawgabbit 21m ago 0 comments

New Microchip Breakthrough: 500× Efficiency Unlocked [video] (youtube.com)

1 points by quantummagic 21m ago 0 comments

Significant Layoffs at Oracle (datacenterdynamics.com)

2 points by belter 22m ago 0 comments

New type of supernova detected as black hole causes star to explode (reuters.com)

2 points by acossta 23m ago 1 comments

Claude Code Status Line Script: displays project info and cost information (gist.github.com)

1 points by andreagrandi 23m ago 0 comments

UK's Turing AI Institute responds to staff anger about defence focus (bbc.com)

2 points by herodotus 24m ago 0 comments

Astrophysicist Avi Loeb proposes six-word message be sent to 3I/ATLAS (unilad.com)

1 points by fcpguru 25m ago 3 comments

Missing.css (missing.style)

2 points by mpweiher 25m ago 0 comments

Package your Python apps with PyCrucible GitHub action (github.com)

1 points by razorblade23 28m ago 0 comments

This thing is hard. 8 years in. 20 failed projects. Only 2 made it

1 points by leonagano 30m ago 3 comments

Model intelligence is no longer the constraint for automation (latentintent.substack.com)

1 points by drivian 31m ago 0 comments

Show HN: Turning Hand Movement into Music (twitter.com)

3 points by getToTheChopin 32m ago 2 comments

A privacy VPN you can verify (vp.net)

6 points by MagicalTux 32m ago 2 comments

Aligned Multiple-Transient Events in the First Palomar Sky Survey (researchgate.net)

1 points by jandrewrogers 33m ago 0 comments

Optical computing breakthrough achieves sufficient precision for real-world AI (nature.com)

1 points by prettypoly 34m ago 1 comments

Claude Code, Diff Viewing, and Testing All on Git Worktrees (stravu.com)

1 points by jbentley1 35m ago 0 comments

Google Scholar Is Doomed (hannahshelley.neocities.org)

3 points by speckx 36m ago 0 comments

The War on the Walkman (freethink.com)

2 points by Brajeshwar 36m ago 0 comments

Model interactions with air to facilitate realistic simulation of fluids (techxplore.com)

1 points by Brajeshwar 37m ago 0 comments

Selling Cats as a Developer – Talks – Elixir Programming Language Forum (elixirforum.com)

1 points by amalinovic 37m ago 0 comments

TADSummit Online Conference, Why Erlang Matters More Than Ever in 2025 (blog.tadsummit.com)

2 points by vkatsuba 37m ago 1 comments

Beast-GB model combines ML and behavioral science to predict people's decisions (techxplore.com)

1 points by Brajeshwar 37m ago 0 comments

Forget Foxconn as iPhone factory. AI made it a server-slinger first and foremost (theregister.com)

1 points by rntn 38m ago 0 comments

Show HN: Agent Dev Guide – Generate structured context docs for AI coding agents (github.com)

1 points by pllu 38m ago 0 comments

Show HN: A newsletter you get as calendar events, not emails (digdropdate.com)

1 points by albrechtchris 45m ago 0 comments

In 2006, Hitachi developed a 0.15mm-sized RFID chip (hitachi.com)

1 points by julkali 47m ago 0 comments

ADHD drug treatment and risk of negative events and outcomes (bmj.com)

3 points by bookofjoe 48m ago 1 comments

GPT-5 API injects hidden instructions

1 irthomasthomas 4 8/15/2025, 1:44:52 PM twitter.com ↗

Comments (4)

NitpickLawyer · 1h ago

> openai giving it instructions before me?

Uhhh, yes. It's in the devblogs. They call it prompt adherence hierarchy or something, where system instructions (oAI) > dev instructions (you) > user requests. They've been training this way specifically, and test for it in their "safety" analysis. Same for their -oss versions, so tinkerers might look there for a tinker friendly environment where they could probably observe the same kinds of behaviour.

irthomasthomas · 1h ago

Please can you link me to the documentation on this.

NitpickLawyer · 1h ago

Yeah, it's in the "gpt5 system card" as they call it now [1]. Page 9 has the details about system > dev > user.

1 - https://cdn.openai.com/pdf/8124a3ce-ab78-4f06-96eb-49ea29ffb...

irthomasthomas · 46m ago

  3.5 Instruction Hierarchy
  The deployment of these models in the API allows developers to specify a custom developer message that is included with every prompt from one of their end users. This could potentially allow developers to circumvent system message guardrails if not handled properly. Similarly, end users may try to circumvent system or developer message guidelines.
 
  Mitigations
  To mitigate this issue, we teach models to adhere to an Instruction Hierarchy[2]. At a high level, we have three classifications of messages sent to the models: system messages, developer messages, and user messages. We test that models follow the instructions in the system message over developer messages, and instructions in developer messages over user messages.

Is this what you meant? I can see that this is part of the mechanism, I can't see where it states that openai will inject their own instructions.