Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs (whitelightning.ai)

Unless it's actually some internal Claude API which OpenAI were using with an OpenAI benchmarking tool, this sounds like a hyped-up way for Wired to phrase it.

Almost like: `Woah man, OpenAI HACKED Claude's own AI mainframe until Sonnet slammed down the firewall man!` ;D Seriously though, why phrase API use of Claude as "special developer access"?

I suppose that it's reasonable to disagree on what is reasonable for safety benchmarking, e.g: where you draw a line and say, "hey, that's stealing" vs "they were able to find safety weak spots in their model". I wonder what the best labs are like at efficiently hunting for weak areas!

Funnily enough I think Anthropic have banned a lot of people from their API, myself included - and all I did was see if it could read a letter I got, and they never responded to my query to sort it out! But what does it matter if people can just use OpenRouter?

dylan604 · 4h ago

> Seriously though, why phrase API use of Claude as "special developer access"?

Isn't that precisely what an API is? Normal users do not use the API. Other programs written by developers use it to access Claude from their app. That's like asking why is an SDK phrased as a special kit for developers to build software that works with something they wish to integrate into their app

stavros · 4h ago

If I'm an OpenAI employee, and I use Claude Code via the API, I'm not doing some hacker-fu, I'm just using a tool a company released for the purpose they released it.

I understand that they were technically "using it to train models", which, given OpenAI's stance, I don't have much sympathy for, but it's not some "special developer hackery" that this is making it sound like.

viraptor · 4h ago

Because it's not "special developer access". It's just "normal developer access". The phrasing gives an impression they accessed something other users cannot.

CPLX · 1h ago

It would be normal standard English to assume that special modifies the word access. That would make the sentence semantically be the same as “special access, specifically the type of access used by developers”

Compare with a sentence like “the elevator has special firefighter buttons” which does not mean that only some special type of firefighter uses the button.

toomanyrichies · 1h ago

Counter-point: as a wordsmith, it's incumbent on the article's author to make their point in an unambiguous way. In your example, rather than write "the elevator has special firefighter buttons", the author could choose to write "the elevator has buttons which are only available to firefighters". Or alternatively, "the elevator has buttons which are only available to certain firefighters".

The amount of care the author puts into their phrasing determines whether their point comes across as intended, or not. The average magazine reader can likely figure out that there's no such thing as "special" firefighters with "privileged" access to elevator buttons that other firefighters lack. They may not have the programming knowledge to do likewise with "developer access", even if they are reading a magazine like "Wired".

Calavar · 1h ago

In your example, "firefighter buttons" is a noun phrase which refers to a particular type of button. "Special" applies to the whole of "firefighter buttons," not just to "firefighter" and not just to "buttons." The same would apply for "special developer access."

dgfitz · 3h ago

Normal users use the API constantly, they just don’t realize it.

Isn’t half the schtick of LLMs making software development available for the layman?

ankit219 · 2h ago

The article does not say anything substantial, but just some opposite viewpoints.

1/ Openai's technical staff were using Claude Code (API and not the max plans).

2/ Anthropic's spokesperson says API access for benchmarking and evals will be available to Openai.

3/ Openai said it's using the APIs for benchmarking.

I guess model benchmarking is fine, but tool benchmarking is not. Either openai were trying to see their product works better than Claude code (each with their own proprietary models) on certain benchmarks and that is something Anthropic revoked. How they caught it is far more remarkable. It's one thing to use sonnet 4 to solve a problem on Livebench, its slightly different to do it via the harness where Anthropic never published any results themselves. Not saying this is a right stance, but this seems to be the stance.

hinkley · 2h ago

Feels like something a Jepsen or such should be doing instead of competitors trying to clock each other directly. I can see why they would feel uncomfortable about this situation.

No comments yet

chowells · 5h ago

> According to Anthropic’s commercial terms of service, customers are barred from using the service to “build a competing product or service, including to train competing AI models”

That's... quite a license term. I'm a big fan of tools that come with no restrictions on their use in their licenses. I think I'll stick with them.

ethan_smith · 3h ago

These anti-competitive clauses are becoming standard across all major AI providers - Google, Microsoft, and Meta have similar terms. The industry is converging on a licensing model that essentially creates walled gardens for model development.

werrett · 54m ago

You guys are tripping. EULAs have had anti-competition, anti-benchmarking, anti-reverse engineering and anti-disparagement clauses since the late 90s.

These unknown companies called Microsoft, Oracle, Salesforce, Apple, Adobe, … et al have all had these controversies at various points.

heavyset_go · 17m ago

I am not a fan of Apple or Oracle, but you are not contractually prevented from competing with them if you use Macs or Oracle Cloud to build software.

I wouldn't suggest building on Oracle's property as you drink its milkshake, but the ToS and EULAs don't restrict competition.

dougSF70 · 3h ago

Also Twitter TOS when accessing firehose was that you could not recreate a Twitter client.

dude250711 · 5h ago

Well, Open AI had been whining about DeepSeek back in the day, so it is fair in a way.

bitwize · 5h ago

For years it was a license violation to use Microsoft development tools to build a word processor or spreadsheet. It was also a violation of your Oracle license to publish benchmark results comparing Oracle to other databases.

If you compete with a vendor, or give aid and comfort to their competitors, do not expect the vendor to play nice with you, or even keep you on as a customer.

sroussey · 5h ago

Doesn’t the ban on benchmarking Oracle still stand today?

mdaniel · 1h ago

Given the law firm in question which just happens to develop an RDBMS, I wouldn't want to find out

Besides, lol, who cares how fast a model-T can go when there are much nicer forms of transportation that don't actively hate you

gruez · 3h ago

>For years it was a license violation to use Microsoft development tools to build a word processor or spreadsheet.

source?

ack_complete · 2h ago

You have to go pretty far back, it was in the Visual C++ 6.0 EULA, for instance (for lack of a better link):

https://proact.eu/wp-content/uploads/2020/07/Visual-Basic-En...

It wasn't a blanket prohibition, but a restriction on some parts of the documentation and redistributable components. Definitely was weird to see that in the EULA for a toolchain. This was removed later on, though I forget if it's because they changed their mind or removed the components.

DaSHacka · 5h ago

Hmm so "because you split spending between us and a competitor, we'll force you to give the competitor the whole share instead!"

Certainly a mindset befitting microsoft and Oracle, if I ever saw one.

david38 · 4h ago

I can understand the benchmark issue. It often happens when someone benchmarks something, it’s biased or wrong in some way.

I don’t believe it should be legal, but I see why they would be butt-hurt

valtism · 5h ago

Would something like that hold up in court?

compootr · 4h ago

they can choose who they do & don't want to do business with

manquer · 3h ago

Law does not work like that.

- Contracts can have unenforceable terms that can be declared null and void by a court, any decision not to renew the contract in future would have no bearing on the current one.

- there are plenty of restrictions on when/ whether you can turn down business for example FRAND contracts or patents don’t allow you choose to not work with a competitor and so on.

ronsor · 1h ago

People always say "this wouldn't hold up in court" and "the law doesn't work like that" when it comes to contract, but in reality, contracts can mostly contain whatever you want.

I see no reason why Anthropic can't arbitrarily ban OpenAI, regardless of my opinion on the decision. Anthropic hasn't "patented" access to the Claude API; there are no antitrust concerns that I can see; etc.

staticman2 · 10m ago

Nobody was asking if Anthropic can ban OpenAI. I believe they were asking if the contract that can ban using the output to train an AI would hold up in court.

And no, it isn't clear to me that this contract term would hold up in court as Anthopic doesn't have copyright ownership in the AI output. I don't believe you can enforce copyright related contracts without copyright ownership.

I could be wrong of course, but I find it odd this topic comes up from time to time but apparently nobody has a blog post by a lawyer or similar to share on this issue.

beefnugs · 2h ago

Dumbest thing they could do, why would you cut off insight into what your competitors are doing?

ramoz · 2h ago

Because they don't blatantly read people's prompts. They have a confidential inference architecture.

They don't target and analyze specific user or organizations - that would be fairly nefarious.

The only exception would be if there are flags for trust and safety. https://support.anthropic.com/en/articles/8325621-i-would-li...

ygjb · 5h ago

Good luck with that! Most of the relevant model providers include similar terms (Grok, OpenAI, Anthropic, Mistal, basically everyone with the exception of some open model providers).

chowells · 5h ago

You're like 50% of the way there...

palata · 4h ago

Can't we say it's "fair use"? They do whatever they want saying it's "fair use", I don't see why I couldn't.

modeless · 4h ago

OpenAI Services Agreement: "Customer will not [...] use Output to develop artificial intelligence models that compete with OpenAI’s products and services"

Live by the sword, die by the sword.

spwa4 · 3h ago

Didn't a whole bunch of AI companies make the news that they refuse to respect X law in AI training. So far, X has been:

* copyright law

* trademark law

* defamation law (ChatGTP often reports wrong facts about specific people, products, companies, ... most seriously claiming someone was guilty of murder. Getting ChatGPT to say obviously wrong things about products is trivial)

* contract law (bypassing scraping restrictions they had agreed as a compay beforehand)

* harassment (ChatGPT made pictures depicting specific individuals doing ... well you can guess where this is going. Everything you can imagine. Women, of course. Minors. Politics. Company politics ...)

So far, they seem to have gotten away with everything.

raincole · 2h ago

> defamation law

Not sure if you're serious... you think OpenAI should be held responsible for everything their LLM ever said? You can't make a token generator unless the tokens generated always happen to represent factual sentences?

spwa4 · 2h ago

Given that they publish everything their AI says? That that's in fact the business model? (in other words, they publish everything their AI says for money) Quite frankly, yes.

If I told people you are a murderer, for money, I'd expect to be sued and I'd expect to be convicted.

llbbdd · 1h ago

If you had a disclaimer (like OpenAI) does, probably different story

llbbdd · 1h ago

Nope - they do not charge you for truth

Taylor_OD · 5h ago

So it begins!

throwawayoldie · 5h ago

Let's hope.

lossolo · 5h ago

https://archive.is/m4uL7

Buttons840 · 4h ago

Who will pay me for my AI chat histories?

Seriously, make a browser extension that people can turn on and off (no need to be dishonest here), and pay people to upload their AI chats, and possibly all the other content they view.

If Reddit wont let you scrape, pay people to automatically upload the Reddit comments they view normally.

If Claude cuts you off, pay people to automatically upload their Claude conversations.

Am I crazy, am I hastening dystopia?

bit1993 · 4h ago

Than I would simply use AI to generate chat histories and get paid (:

manquer · 3h ago

That is not a problem if the price paid is lower than what generating synthetic data of similar size will cost .

bit1993 · 2h ago

Great point. Verifying the synthetic data also has a cost, I wonder if it is cheaper than generating it?

BoorishBears · 3h ago

I've done a lot of post training and data collection for post-training

I think if you're not OpenAI/Anthropic sized (in which case you can do better) you're not going to get much value out of it

It's hard to usefully post-train on wildly varied inputs, and post-training is all most people can afford.

There's too much noise to improve things unless you do a bunch of cleaning and filtering that's also somewhat expensive.

If you constrain the task (for example, use past generations from your own product) you get much further along though.

I've thought about building a Chrome plugin to do something useful for ChatGPT web users doing a task relevant to what my product does, then letting them opt into sharing their logs.

That's probably a bit more tenable for most users since they're getting value, and if your extension can do something like produce prompts for ChatGPT, you'll get data that actually overlaps with what you're doing.

bethekidyouwant · 5h ago

This article says absolutely nothing and appears to be an ad for anthropic

rs186 · 5h ago

Do you have adblocker on?

Show HN: Draw a fish and watch it swim with the others (drawafish.com)

Show HN: TraceRoot – Open-source agentic debugging for distributed services (github.com)

Show HN: An interactive dashboard to explore NYC rentals data (leaseswap.nyc)

Show HN: Pontoon – Open-source customer data syncs (github.com)

Show HN: Windows XP in the browser, with file system, Word, media, flash (github.com)

Show HN: Agentic AI Frameworks on AWS (LangGraph,Strands,CrewAI,Arize,Mem0) (github.com)

Show HN: I made a website that makes you cry (cryonceaweek.com)

Show HN: Rewindtty – Record and replay terminal sessions as structured JSON (github.com)

Show HN: Mcp-use – Connect any LLM to any MCP (github.com)

Show HN: AgentMail – Email infra for AI agents (chat.agentmail.to)

Show HN: KubeForge – A GUI for Kubernetes YAMLs (github.com)

Show HN: Sourcebot – Self-hosted Perplexity for your codebase (github.com)

Show HN: LinCal – A Calendar View for Linear Issues (lincal.app)

Show HN: Tambo – a tool for building generative UI React apps with tools/MCP (github.com)

Show HN: Tendly – Ephemeral, privacy-first sharing for files and notes (tendly.xyz)

Show HN: An AI agent that learns your product and guides your users (frigade.ai)

Show HN:typed - Markdown app for writers, students, professionals, and creators (play.google.com)

Show HN: Walk-through of rocket landing optimization paper [pdf] (scpowers.github.io)

Show HN: Arbok – Self-hosted HTTP tunnels to localhost using WireGuard (github.com)

Show HN: Open-source alternative to ChatGPT Agents for browsing (github.com)

Show HN: PLATO5 – AI-guided social engine to turn strangers into IRL friends (plato5.us)

Show HN: Astro dev blog template with interactive colorschemes (multiterm.stelclementine.com)

Show HN: IsAgent – Detect agents like ChatGPT Agent on your website (isagent.dev)

Show HN: Open-Source Alternative to Modal (github.com)

Show HN: Kanban-style Phase Board: plan → execute → verify → commit (traycer.ai)

Show HN: Dlg – Zero-cost printf-style debugging for Go (github.com)

Show HN: Companies use AI to take your calls. I built AI to make them for you (pipervoice.com)

Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs (whitelightning.ai)

Show HN: List of Clojure-like projects (github.com)

Show HN: Find paint colours in Ireland and generate your own palettes (swatcher.ie)

Show HN: The easiest accessibility (a11y) checker for VSCode (github.com)

Show HN: A high-altitude low-power flight computer for high-altitude balloons (github.com)

Show HN: Open-source physical rack-mounted GUI for home lab (getubo.com)

Show HN: The Aria Programming Language (github.com)

Show HN: Compress Image – Simple Lossless and Lossy Image Compression Tool (compressimagex.com)

Show HN: Square Images – Make Any Image a Perfect Square in One Click (squareimages.co)

Show HN: Online Ruler – Measuring in inches/centimeters (anruler.com)

Show HN: Convert from MIDI file to ASCII tablature (and more) (github.com)

Show HN: Zero Waste Cloud – Finds 20-40% savings in AWS/GCP bills and CO2 impact (zerowastecloud.io)

Show HN: AgentGuard – Auto-kill AI agents before they burn through your budget (github.com)

Show HN: MoebiusXBIN – ASCII and text-mode art editor with custom font support (blog.glyphdrawing.club)

Show HN: A GitHub Action that quizzes you on a pull request (github.com)

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL (github.com)

Show HN: Cant – Library written in Rust that provides PyTorch-like functionality (github.com)

Show HN: SafeRate – AI chat-native mortgage lender (saferate.com)

Show HN: Use Their ID – Use your local UK MP’s ID for the Online Safety Act (use-their-id.com)

Show HN: I built an AI that turns any book into a text adventure game (kathaaverse.com)

Show HN: Demitter – Distributed Node.js Event Emitter (Pub/Sub) (github.com)

Show HN: Monchromate – Smart greyscale browser extension (monochromate.lirena.in)

Show HN: Publican – an HTML-first static site generator for Node.js (publican.dev)

Anthropic revokes OpenAI's access to Claude

Comments (49)