Show HN: Testronaut – AI-powered mission-based browser testing
I’ve been working on a project called *Testronaut*, an autonomous testing framework that combines AI reasoning with real browser automation. The idea is to let you define end-to-end tests as “missions” in plain English, then have an agent run them through a real browser using Playwright.
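To give a feel for the idea, here is a purely illustrative mission sketch. The file name, export shape, and wording are my own invention for this post, not the actual Testronaut API; the docs linked below have the real quickstart.

```ts
// missions/checkout.mission.ts — illustrative only; see the docs for the real format.
// A mission is just a plain-English description of what the agent should do and verify.
export const mission = `
  Go to https://demo.example.com,
  add the first product to the cart,
  proceed to checkout as a guest,
  and verify the order confirmation page shows a total greater than $0.
`;
```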
Why I built this: I’ve often found end-to-end tests to be fragile, time-consuming to maintain, and difficult to scale. Testronaut tries to reduce the maintenance burden by using AI to adapt tests to small UI changes, while still producing a deterministic report of what passed/failed.
How it works:
- Missions can be written as strings or functions.
- The agent uses GPT-4o with a set of tools (click, type, navigate, get_dom, etc.) to interact with the page; a rough sketch of that kind of loop is below. Support for other models is in the works.
- Browser control is handled by Playwright.
- Reports are generated in both JSON and HTML, with step-by-step breakdowns (including screenshots).
- It runs locally via a CLI (`npx testronaut`) and doesn’t require any hosted service. You will need to provide your own OpenAI API key, however.
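For anyone wondering what "GPT-4o with a set of tools driving Playwright" means in practice, here is a minimal sketch of that general tool-calling pattern using the OpenAI Node SDK and Playwright directly. This is not Testronaut's internal code; the tool names and prompts are assumptions for illustration.

```ts
// Illustrative agent loop: an LLM picks tools, Playwright executes them.
import OpenAI from "openai";
import { chromium } from "playwright";

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

// Tool schemas the model can call. Names mirror the ones mentioned above.
const tools: OpenAI.Chat.Completions.ChatCompletionTool[] = [
  { type: "function", function: { name: "navigate", description: "Go to a URL",
      parameters: { type: "object", properties: { url: { type: "string" } }, required: ["url"] } } },
  { type: "function", function: { name: "click", description: "Click an element by CSS selector",
      parameters: { type: "object", properties: { selector: { type: "string" } }, required: ["selector"] } } },
  { type: "function", function: { name: "type", description: "Type text into an element",
      parameters: { type: "object", properties: { selector: { type: "string" }, text: { type: "string" } },
        required: ["selector", "text"] } } },
  { type: "function", function: { name: "get_dom", description: "Return the current page HTML",
      parameters: { type: "object", properties: {} } } },
];

async function runMission(mission: string) {
  const browser = await chromium.launch();
  const page = await browser.newPage();

  const messages: OpenAI.Chat.Completions.ChatCompletionMessageParam[] = [
    { role: "system", content: "You are a browser-testing agent. Use the tools to complete the mission, then reply with PASS or FAIL and a short reason." },
    { role: "user", content: mission },
  ];

  // Cap the number of reasoning/action steps so a confused agent can't loop forever.
  for (let step = 0; step < 30; step++) {
    const res = await openai.chat.completions.create({ model: "gpt-4o", messages, tools });
    const msg = res.choices[0].message;
    messages.push(msg);

    // No tool calls means the agent is done and has given its verdict.
    if (!msg.tool_calls?.length) {
      console.log("Agent verdict:", msg.content);
      break;
    }

    // Execute each requested tool with Playwright and feed the result back.
    for (const call of msg.tool_calls) {
      if (call.type !== "function") continue;
      const args = JSON.parse(call.function.arguments || "{}");
      let result = "ok";
      if (call.function.name === "navigate") await page.goto(args.url);
      else if (call.function.name === "click") await page.click(args.selector);
      else if (call.function.name === "type") await page.fill(args.selector, args.text);
      else if (call.function.name === "get_dom") result = (await page.content()).slice(0, 20000);
      messages.push({ role: "tool", tool_call_id: call.id, content: result });
    }
  }

  await browser.close();
}

runMission("Go to https://example.com and verify the heading says Example Domain.");
```

The real runner adds the reporting layer on top of a loop like this (screenshots per step, JSON/HTML output), but the core control flow is the same model-proposes, browser-executes cycle.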
Current state:
- Early days: it works for simple flows and demo apps, but I’m still tuning reliability and efficiency.
- It installs with one command and comes with a sample mission.
- Open source on npm/GitHub.
Links:
- Docs & quickstart: https://docs.testronaut.app
- GitHub: https://github.com/mission-testronaut/testronaut-cli
- npm: https://www.npmjs.com/package/testronaut
I’d love feedback from the HN community on:
- Where this could be most useful (CI/CD? flaky-test replacement? exploratory testing?).
- What concerns you’d have about using an AI-driven test runner.
- Any “gotchas” I should watch out for in early adoption.
Thanks for taking a look!