"OpenAI was plugging Claude into its own internal tools using special developer access (APIs)"
Unless it's actually some internal Claude API which OpenAI were using with an OpenAI benchmarking tool, this sounds like a hyped-up way for Wired to phrase it.
Almost like: `Woah man, OpenAI HACKED Claude's own AI mainframe until Sonnet slammed down the firewall man!` ;D
Seriously though, why phrase API use of Claude as "special developer access"?
I suppose that it's reasonable to disagree on what is reasonable for safety benchmarking, e.g: where you draw a line and say, "hey, that's stealing" vs "they were able to find safety weak spots in their model". I wonder what the best labs are like at efficiently hunting for weak areas!
Funnily enough I think Anthropic have banned a lot of people from their API, myself included - and all I did was see if it could read a letter I got, and they never responded to my query to sort it out! But what does it matter if people can just use OpenRouter?
dylan604 · 4h ago
> Seriously though, why phrase API use of Claude as "special developer access"?
Isn't that precisely what an API is? Normal users do not use the API. Other programs written by developers use it to access Claude from their app. That's like asking why is an SDK phrased as a special kit for developers to build software that works with something they wish to integrate into their app
stavros · 4h ago
If I'm an OpenAI employee, and I use Claude Code via the API, I'm not doing some hacker-fu, I'm just using a tool a company released for the purpose they released it.
I understand that they were technically "using it to train models", which, given OpenAI's stance, I don't have much sympathy for, but it's not some "special developer hackery" that this is making it sound like.
viraptor · 4h ago
Because it's not "special developer access". It's just "normal developer access". The phrasing gives an impression they accessed something other users cannot.
CPLX · 1h ago
It would be normal standard English to assume that special modifies the word access. That would make the sentence semantically be the same as “special access, specifically the type of access used by developers”
Compare with a sentence like “the elevator has special firefighter buttons” which does not mean that only some special type of firefighter uses the button.
toomanyrichies · 1h ago
Counter-point: as a wordsmith, it's incumbent on the article's author to make their point in an unambiguous way. In your example, rather than write "the elevator has special firefighter buttons", the author could choose to write "the elevator has buttons which are only available to firefighters". Or alternatively, "the elevator has buttons which are only available to certain firefighters".
The amount of care the author puts into their phrasing determines whether their point comes across as intended, or not. The average magazine reader can likely figure out that there's no such thing as "special" firefighters with "privileged" access to elevator buttons that other firefighters lack. They may not have the programming knowledge to do likewise with "developer access", even if they are reading a magazine like "Wired".
Calavar · 1h ago
In your example, "firefighter buttons" is a noun phrase which refers to a particular type of button. "Special" applies to the whole of "firefighter buttons," not just to "firefighter" and not just to "buttons." The same would apply for "special developer access."
dgfitz · 3h ago
Normal users use the API constantly, they just don’t realize it.
Isn’t half the schtick of LLMs making software development available for the layman?
ankit219 · 2h ago
The article does not say anything substantial, but just some opposite viewpoints.
1/ Openai's technical staff were using Claude Code (API and not the max plans).
2/ Anthropic's spokesperson says API access for benchmarking and evals will be available to Openai.
3/ Openai said it's using the APIs for benchmarking.
I guess model benchmarking is fine, but tool benchmarking is not. Either openai were trying to see their product works better than Claude code (each with their own proprietary models) on certain benchmarks and that is something Anthropic revoked. How they caught it is far more remarkable. It's one thing to use sonnet 4 to solve a problem on Livebench, its slightly different to do it via the harness where Anthropic never published any results themselves. Not saying this is a right stance, but this seems to be the stance.
hinkley · 2h ago
Feels like something a Jepsen or such should be doing instead of competitors trying to clock each other directly. I can see why they would feel uncomfortable about this situation.
No comments yet
chowells · 5h ago
> According to Anthropic’s commercial terms of service, customers are barred from using the service to “build a competing product or service, including to train competing AI models”
That's... quite a license term. I'm a big fan of tools that come with no restrictions on their use in their licenses. I think I'll stick with them.
ethan_smith · 3h ago
These anti-competitive clauses are becoming standard across all major AI providers - Google, Microsoft, and Meta have similar terms. The industry is converging on a licensing model that essentially creates walled gardens for model development.
werrett · 54m ago
You guys are tripping. EULAs have had anti-competition, anti-benchmarking, anti-reverse engineering and anti-disparagement clauses since the late 90s.
These unknown companies called Microsoft, Oracle, Salesforce, Apple, Adobe, … et al have all had these controversies at various points.
heavyset_go · 17m ago
I am not a fan of Apple or Oracle, but you are not contractually prevented from competing with them if you use Macs or Oracle Cloud to build software.
I wouldn't suggest building on Oracle's property as you drink its milkshake, but the ToS and EULAs don't restrict competition.
dougSF70 · 3h ago
Also Twitter TOS when accessing firehose was that you could not recreate a Twitter client.
dude250711 · 5h ago
Well, Open AI had been whining about DeepSeek back in the day, so it is fair in a way.
bitwize · 5h ago
For years it was a license violation to use Microsoft development tools to build a word processor or spreadsheet. It was also a violation of your Oracle license to publish benchmark results comparing Oracle to other databases.
If you compete with a vendor, or give aid and comfort to their competitors, do not expect the vendor to play nice with you, or even keep you on as a customer.
sroussey · 5h ago
Doesn’t the ban on benchmarking Oracle still stand today?
mdaniel · 1h ago
Given the law firm in question which just happens to develop an RDBMS, I wouldn't want to find out
Besides, lol, who cares how fast a model-T can go when there are much nicer forms of transportation that don't actively hate you
gruez · 3h ago
>For years it was a license violation to use Microsoft development tools to build a word processor or spreadsheet.
source?
ack_complete · 2h ago
You have to go pretty far back, it was in the Visual C++ 6.0 EULA, for instance (for lack of a better link):
It wasn't a blanket prohibition, but a restriction on some parts of the documentation and redistributable components. Definitely was weird to see that in the EULA for a toolchain. This was removed later on, though I forget if it's because they changed their mind or removed the components.
DaSHacka · 5h ago
Hmm so "because you split spending between us and a competitor, we'll force you to give the competitor the whole share instead!"
Certainly a mindset befitting microsoft and Oracle, if I ever saw one.
david38 · 4h ago
I can understand the benchmark issue. It often happens when someone benchmarks something, it’s biased or wrong in some way.
I don’t believe it should be legal, but I see why they would be butt-hurt
valtism · 5h ago
Would something like that hold up in court?
compootr · 4h ago
they can choose who they do & don't want to do business with
manquer · 3h ago
Law does not work like that.
- Contracts can have unenforceable terms that can be declared null and void by a court, any decision not to renew the contract in future would have no bearing on the current one.
- there are plenty of restrictions on when/ whether you can turn down business for example FRAND contracts or patents don’t allow you choose to not work with a competitor and so on.
ronsor · 1h ago
People always say "this wouldn't hold up in court" and "the law doesn't work like that" when it comes to contract, but in reality, contracts can mostly contain whatever you want.
I see no reason why Anthropic can't arbitrarily ban OpenAI, regardless of my opinion on the decision. Anthropic hasn't "patented" access to the Claude API; there are no antitrust concerns that I can see; etc.
staticman2 · 10m ago
Nobody was asking if Anthropic can ban OpenAI. I believe they were asking if the contract that can ban using the output to train an AI would hold up in court.
And no, it isn't clear to me that this contract term would hold up in court as Anthopic doesn't have copyright ownership in the AI output. I don't believe you can enforce copyright related contracts without copyright ownership.
I could be wrong of course, but I find it odd this topic comes up from time to time but apparently nobody has a blog post by a lawyer or similar to share on this issue.
beefnugs · 2h ago
Dumbest thing they could do, why would you cut off insight into what your competitors are doing?
ramoz · 2h ago
Because they don't blatantly read people's prompts. They have a confidential inference architecture.
They don't target and analyze specific user or organizations - that would be fairly nefarious.
Good luck with that! Most of the relevant model providers include similar terms (Grok, OpenAI, Anthropic, Mistal, basically everyone with the exception of some open model providers).
chowells · 5h ago
You're like 50% of the way there...
palata · 4h ago
Can't we say it's "fair use"? They do whatever they want saying it's "fair use", I don't see why I couldn't.
modeless · 4h ago
OpenAI Services Agreement: "Customer will not [...] use Output to develop artificial intelligence models that compete with OpenAI’s products and services"
Live by the sword, die by the sword.
spwa4 · 3h ago
Didn't a whole bunch of AI companies make the news that they refuse to respect X law in AI training. So far, X has been:
* copyright law
* trademark law
* defamation law (ChatGTP often reports wrong facts about specific people, products, companies, ... most seriously claiming someone was guilty of murder. Getting ChatGPT to say obviously wrong things about products is trivial)
* contract law (bypassing scraping restrictions they had agreed as a compay beforehand)
* harassment (ChatGPT made pictures depicting specific individuals doing ... well you can guess where this is going. Everything you can imagine. Women, of course. Minors. Politics. Company politics ...)
So far, they seem to have gotten away with everything.
raincole · 2h ago
> defamation law
Not sure if you're serious... you think OpenAI should be held responsible for everything their LLM ever said? You can't make a token generator unless the tokens generated always happen to represent factual sentences?
spwa4 · 2h ago
Given that they publish everything their AI says? That that's in fact the business model? (in other words, they publish everything their AI says for money) Quite frankly, yes.
If I told people you are a murderer, for money, I'd expect to be sued and I'd expect to be convicted.
llbbdd · 1h ago
If you had a disclaimer (like OpenAI) does, probably different story
Seriously, make a browser extension that people can turn on and off (no need to be dishonest here), and pay people to upload their AI chats, and possibly all the other content they view.
If Reddit wont let you scrape, pay people to automatically upload the Reddit comments they view normally.
If Claude cuts you off, pay people to automatically upload their Claude conversations.
Am I crazy, am I hastening dystopia?
bit1993 · 4h ago
Than I would simply use AI to generate chat histories and get paid (:
manquer · 3h ago
That is not a problem if the price paid is lower than what generating synthetic data of similar size will cost .
bit1993 · 2h ago
Great point. Verifying the synthetic data also has a cost, I wonder if it is cheaper than generating it?
BoorishBears · 3h ago
I've done a lot of post training and data collection for post-training
I think if you're not OpenAI/Anthropic sized (in which case you can do better) you're not going to get much value out of it
It's hard to usefully post-train on wildly varied inputs, and post-training is all most people can afford.
There's too much noise to improve things unless you do a bunch of cleaning and filtering that's also somewhat expensive.
If you constrain the task (for example, use past generations from your own product) you get much further along though.
I've thought about building a Chrome plugin to do something useful for ChatGPT web users doing a task relevant to what my product does, then letting them opt into sharing their logs.
That's probably a bit more tenable for most users since they're getting value, and if your extension can do something like produce prompts for ChatGPT, you'll get data that actually overlaps with what you're doing.
bethekidyouwant · 5h ago
This article says absolutely nothing and appears to be an ad for anthropic
Unless it's actually some internal Claude API which OpenAI were using with an OpenAI benchmarking tool, this sounds like a hyped-up way for Wired to phrase it.
Almost like: `Woah man, OpenAI HACKED Claude's own AI mainframe until Sonnet slammed down the firewall man!` ;D Seriously though, why phrase API use of Claude as "special developer access"?
I suppose that it's reasonable to disagree on what is reasonable for safety benchmarking, e.g: where you draw a line and say, "hey, that's stealing" vs "they were able to find safety weak spots in their model". I wonder what the best labs are like at efficiently hunting for weak areas!
Funnily enough I think Anthropic have banned a lot of people from their API, myself included - and all I did was see if it could read a letter I got, and they never responded to my query to sort it out! But what does it matter if people can just use OpenRouter?
Isn't that precisely what an API is? Normal users do not use the API. Other programs written by developers use it to access Claude from their app. That's like asking why is an SDK phrased as a special kit for developers to build software that works with something they wish to integrate into their app
I understand that they were technically "using it to train models", which, given OpenAI's stance, I don't have much sympathy for, but it's not some "special developer hackery" that this is making it sound like.
Compare with a sentence like “the elevator has special firefighter buttons” which does not mean that only some special type of firefighter uses the button.
The amount of care the author puts into their phrasing determines whether their point comes across as intended, or not. The average magazine reader can likely figure out that there's no such thing as "special" firefighters with "privileged" access to elevator buttons that other firefighters lack. They may not have the programming knowledge to do likewise with "developer access", even if they are reading a magazine like "Wired".
Isn’t half the schtick of LLMs making software development available for the layman?
1/ Openai's technical staff were using Claude Code (API and not the max plans).
2/ Anthropic's spokesperson says API access for benchmarking and evals will be available to Openai.
3/ Openai said it's using the APIs for benchmarking.
I guess model benchmarking is fine, but tool benchmarking is not. Either openai were trying to see their product works better than Claude code (each with their own proprietary models) on certain benchmarks and that is something Anthropic revoked. How they caught it is far more remarkable. It's one thing to use sonnet 4 to solve a problem on Livebench, its slightly different to do it via the harness where Anthropic never published any results themselves. Not saying this is a right stance, but this seems to be the stance.
No comments yet
That's... quite a license term. I'm a big fan of tools that come with no restrictions on their use in their licenses. I think I'll stick with them.
These unknown companies called Microsoft, Oracle, Salesforce, Apple, Adobe, … et al have all had these controversies at various points.
I wouldn't suggest building on Oracle's property as you drink its milkshake, but the ToS and EULAs don't restrict competition.
If you compete with a vendor, or give aid and comfort to their competitors, do not expect the vendor to play nice with you, or even keep you on as a customer.
Besides, lol, who cares how fast a model-T can go when there are much nicer forms of transportation that don't actively hate you
source?
https://proact.eu/wp-content/uploads/2020/07/Visual-Basic-En...
It wasn't a blanket prohibition, but a restriction on some parts of the documentation and redistributable components. Definitely was weird to see that in the EULA for a toolchain. This was removed later on, though I forget if it's because they changed their mind or removed the components.
Certainly a mindset befitting microsoft and Oracle, if I ever saw one.
I don’t believe it should be legal, but I see why they would be butt-hurt
- Contracts can have unenforceable terms that can be declared null and void by a court, any decision not to renew the contract in future would have no bearing on the current one.
- there are plenty of restrictions on when/ whether you can turn down business for example FRAND contracts or patents don’t allow you choose to not work with a competitor and so on.
I see no reason why Anthropic can't arbitrarily ban OpenAI, regardless of my opinion on the decision. Anthropic hasn't "patented" access to the Claude API; there are no antitrust concerns that I can see; etc.
And no, it isn't clear to me that this contract term would hold up in court as Anthopic doesn't have copyright ownership in the AI output. I don't believe you can enforce copyright related contracts without copyright ownership.
I could be wrong of course, but I find it odd this topic comes up from time to time but apparently nobody has a blog post by a lawyer or similar to share on this issue.
They don't target and analyze specific user or organizations - that would be fairly nefarious.
The only exception would be if there are flags for trust and safety. https://support.anthropic.com/en/articles/8325621-i-would-li...
Live by the sword, die by the sword.
* copyright law
* trademark law
* defamation law (ChatGTP often reports wrong facts about specific people, products, companies, ... most seriously claiming someone was guilty of murder. Getting ChatGPT to say obviously wrong things about products is trivial)
* contract law (bypassing scraping restrictions they had agreed as a compay beforehand)
* harassment (ChatGPT made pictures depicting specific individuals doing ... well you can guess where this is going. Everything you can imagine. Women, of course. Minors. Politics. Company politics ...)
So far, they seem to have gotten away with everything.
Not sure if you're serious... you think OpenAI should be held responsible for everything their LLM ever said? You can't make a token generator unless the tokens generated always happen to represent factual sentences?
If I told people you are a murderer, for money, I'd expect to be sued and I'd expect to be convicted.
Seriously, make a browser extension that people can turn on and off (no need to be dishonest here), and pay people to upload their AI chats, and possibly all the other content they view.
If Reddit wont let you scrape, pay people to automatically upload the Reddit comments they view normally.
If Claude cuts you off, pay people to automatically upload their Claude conversations.
Am I crazy, am I hastening dystopia?
I think if you're not OpenAI/Anthropic sized (in which case you can do better) you're not going to get much value out of it
It's hard to usefully post-train on wildly varied inputs, and post-training is all most people can afford.
There's too much noise to improve things unless you do a bunch of cleaning and filtering that's also somewhat expensive.
If you constrain the task (for example, use past generations from your own product) you get much further along though.
I've thought about building a Chrome plugin to do something useful for ChatGPT web users doing a task relevant to what my product does, then letting them opt into sharing their logs.
That's probably a bit more tenable for most users since they're getting value, and if your extension can do something like produce prompts for ChatGPT, you'll get data that actually overlaps with what you're doing.