Show HN: Open Operator Evals – real-world benchmarks for LLM web agents

2 points by monoid73 on 6/19/2025, 1:03:56 PM | 0 comments | github.com

We’ve open-sourced a benchmark for LLM-driven web agent setups.

It evaluates real-world tasks, like logging in, scraping dashboards, and submitting forms, using structured criteria: success rate, latency, and task reliability.
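To make the three criteria concrete, here is a minimal sketch of how per-attempt results could be rolled up into them. It is not code from the repository: the `Run` record, the `summarize` helper, the task names, and the sample numbers are all hypothetical, and "task reliability" is assumed here to mean the fraction of tasks that succeed on every attempt, which may differ from the benchmark's own definition.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass
class Run:
    """One attempt at one task (hypothetical record layout)."""
    task: str         # illustrative task name, e.g. "login"
    success: bool     # did the agent complete the task on this attempt?
    latency_s: float  # wall-clock seconds for this attempt


def summarize(runs):
    """Aggregate attempts into success rate, latency, and reliability."""
    if not runs:
        raise ValueError("no runs to summarize")

    # Group attempts by task so reliability can be judged per task.
    by_task = defaultdict(list)
    for run in runs:
        by_task[run.task].append(run)

    # Success rate: fraction of all attempts that succeeded.
    success_rate = sum(run.success for run in runs) / len(runs)

    # Latency: mean seconds per attempt, failures included.
    mean_latency_s = mean(run.latency_s for run in runs)

    # Reliability (assumed definition): fraction of tasks that
    # succeeded on every one of their attempts.
    reliable = sum(all(r.success for r in rs) for rs in by_task.values())
    task_reliability = reliable / len(by_task)

    return {
        "success_rate": success_rate,
        "mean_latency_s": mean_latency_s,
        "task_reliability": task_reliability,
    }


# Made-up sample data: two tasks, two attempts each.
runs = [
    Run("login", True, 4.0),
    Run("login", True, 6.0),
    Run("submit_form", True, 10.0),
    Run("submit_form", False, 20.0),
]
summary = summarize(runs)
print(summary)
```

Keeping reliability separate from the raw success rate is a deliberate choice in this sketch: an agent that passes three of four attempts looks fine on success rate, yet only one of its two tasks can be counted on to pass every time.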

Everything is fully reproducible, with all outputs, logs, and evaluation data available.

https://github.com/nottelabs/open-operator-evals

Feedback, critiques, and contributions are welcome :)
