Show HN: Open-source AI image/deepfake detection that actually works

1 lschneider 0 8/5/2025, 12:03:45 PM nonescape.com ↗

Hi HN,

YCW24 company here, we just open-sourced an AI image detection model that beats the SOTA commercial detectors. AI-generated images/videos have become incredibly good in the last few months and are flooding the internet; being able to detect them reliably gives some power back to consumers and companies that care about high-quality, genuine content.

Detecting AI-generated images is a very hard problem: there are many different techniques to generate images; there is image compression, noise, and other distortions that destroy generator artifacts; there's android phones applying auto-correction to images; etc. And none of the detectors we tried (sightengine.com, decopy.ai, etc.) works reliably even for basic examples (try it out with pirate Rick Astley made with Flux Kontext: https://imgur.com/a/iL3paE8).

We released two models, the full version (~600M params) and a smaller version (~20M params) that can even run in your browser on mobile (see demo)! We've also put up code for running things locally or via an API (free but rate-limited) using javascript/node and python code.

The full model was trained on 1M+ images that were scraped off the internet and the small model is a distillation. We're actively working on extending the dataset and further improving the models.

Classification accuracy: sightengine.com seems to be the best commercial solution out there, as confirmed by this (https://arxiv.org/pdf/2404.14581) paper, which they also cite on their website. Of course, they cherry-picked the results and claim 98.3% accuracy while only achieving (still impressive) ~82.8% over the full dataset. I've downloaded the dataset used in the paper and tested my models against it. The code for running the tests as well as a usable version of the dataset (the original was a big pain to download from OneDrive) are included in the repo code.

Here are our benchmark results for comparison: Total samples: 144,088 Real images: 17,044 Synthetic images: 127,044 Average precision: 0.991

================ threshold: 0.5 ================ Total accuracy: 0.864

PER-CATEGORY ACCURACY: Real 0.875 (17,044 samples) DALL-E_T2I 0.982 (16,110 samples) DreamStudio_T2I 0.968 (16,278 samples) Midjourney_T2I 0.961 (16,148 samples) StarryAI_T2I 0.847 (13,515 samples) DALL-E_IT2I 0.774 (16,665 samples) DreamStudio_IT2I 0.666 (16,139 samples) Midjourney_IT2I 0.897 (15,371 samples) StarryAI_IT2I 0.805 (16,818 samples) ================ threshold: 0.65 ================ Total accuracy: 0.827

PER-CATEGORY ACCURACY: Real 0.914 (17,044 samples) DALL-E_T2I 0.971 (16,110 samples) DreamStudio_T2I 0.949 (16,278 samples) Midjourney_T2I 0.940 (16,148 samples) StarryAI_T2I 0.803 (13,515 samples) DALL-E_IT2I 0.698 (16,665 samples) DreamStudio_IT2I 0.576 (16,139 samples) Midjourney_IT2I 0.845 (15,371 samples) StarryAI_IT2I 0.743 (16,818 samples)

uBlock Origin Lite now available for Safari (apps.apple.com)

Build Your Own Lisp (In C) (buildyourownlisp.com)

Monitor your security cameras with locally processed AI (frigate.video)

A Carnival Attraction That Saved Premature Babies (2016) (smithsonianmag.com)

PHP 8.5 adds pipe operator (thephp.foundation)

Show HN: I spent 6 years building a ridiculous wooden pixel display (benholmen.com)

Apache ECharts 6 New Features (echarts.apache.org)

Qwen-Image: Crafting with native text rendering (qwenlm.github.io)

How we made JSON.stringify more than twice as fast (v8.dev)

New world record Weather satellites detect 515-mile-long lightning flash (space.com)

Thingino: Open-Source Firmware for IP Cameras (thingino.com)

I tried to replace myself with ChatGPT in my English class (lithub.com)

3D Line Drawings (amritkwatra.com)

Show HN: I've been building an ERP for manufacturing for the last 3 years (github.com)

Clojure Civitas – Publish Clojure Ideas and Explorations (github.com)

Where to find ideas (howtogrow.substack.com)

The Disappearance of Saturday Morning (awn.com)

Ask HN: What trick of the trade took you too long to learn?

OpenIPC: Open IP Camera Firmware (openipc.org)

DrawAFish.com Postmortem (aldenhallak.com)

Indian Sign Painting: A typeface designer's take on the craft (bl.ag)

Using drone imagery and AI to rapidly assess damage after hurricanes and floods (stories.tamu.edu)

NASA's Curiosity picks up new skills (jpl.nasa.gov)

Content-Aware Spaced Repetition (giacomoran.com)

Welcome to the IPv4 Games (ipv4.games)

As a linguist, I want to find the words to measure chronic illness (thesicktimes.org)

Customizing tmux (evgeniipendragon.com)

Hiroshima (1946) (newyorker.com)

Perplexity is using stealth, undeclared crawlers to evade no-crawl directives (blog.cloudflare.com)

Introduction to Unikernel: Building, deploying lightweight, secure applications (tallysolutions.com)

Decades of Blunders Put a Lethal Wall at the End of a South Korean Runway (nytimes.com)

Is the interstellar object 3I/ATLAS alien technology? [pdf] (lweb.cfa.harvard.edu)

The history of the Schwartzian Transform (2016) (perl.com)

My Ideal Array Language (ashermancinelli.com)

Objects should shut up (dustri.org)

What Can a Cell Remember? (quantamagazine.org)

Job-seekers are dodging AI interviewers (fortune.com)

Century-old stone “tsunami stones” dot Japan's coastline (2015) (smithsonianmag.com)

Cellular Starlink expands to support IoT devices (me.pcmag.com)

EconTeen – Financial literacy lessons and tools for teens (econteen.com)

Once a death sentence, cardiac amyloidosis is finally treatable (nytimes.com)

Deterministic Simulation Testing in Rust: A Theater of State Machines (polarsignals.com)

Palantir is extending its reach even further into government (wired.com)

Kyoto University team develops pain reliever comparable to morphine (japantimes.co.jp)

Show HN: Sidequest.js – Background jobs for Node.js using your database (docs.sidequestjs.com)

Trust in AI coding tools is plummeting (leaddev.com)

Show HN: Tiny logic and number games I built for my kids (quizmathgenius.com)

Lamport's Byzantine Generals Algorithm in Python (bytepawn.com)

The creative tension between developer and language (krishna.github.io)

Rust, Python, and TypeScript: the new trifecta (smallcultfollowing.com)

Show HN: Open-source AI image/deepfake detection that actually works

Comments (0)