Show HN: Scenario: A Go library for using Agents to test your Agent

Comments (1)

0xdeafcafe · 5h ago

I’m sharing scenario-go, a Go library for automated end-to-end testing of conversational agents. You can define scenarios with success and failure criteria, and a testing agent simulates your users until goals are met or issues surface.

We include a connector for OpenAI, but it's trivial to support an LLM via the simple LLMCompletion interface.

Just to note: I work at LangWatch, but this library does not use our product, it just came out of one of our hack days, and the code is MIT licensed.

Preparing for When the Machine Stops (idiallo.com)

A new AI language model that mimics the organization of the brain (actu.epfl.ch)

Ask HN: Any AI Agents Recommendations

Bypassing Synology's dumb drive restrictions (xda-developers.com)

Preserving Columbia's Critical Research Capabilities (president.columbia.edu)

Beta Testers Wanted for SeedGenius – Realistic Seed Data CLI (seedgeni.us)

WebGPU Particle Life Simulation (lisyarus.github.io)

Lazarus 4.0 Released (forum.lazarus.freepascal.org)

Language Representations Can Be What Recommenders Need: Findings and Potentials (arxiv.org)

Crab Nebula (time-lapse movie 2008-2022) (app.astrobin.com)

Giant Inscrutable Matrices: Not Worse Than Anything Else

Using game design and mods to highlight long Covid impact (thesicktimes.org)

E-commerce sites hacked in supply-chain attack (arstechnica.com)

I built an AI code review agent in a few hours, here's what I learned (sourcebot.dev)

Framer – No code website builder loved by designers (framer.com)

Man pleads guilty to using malicious AI software to hack Disney employee (arstechnica.com)

Trump admin announces plans to shut down the Energy Star program (engadget.com)

Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels (cognition.ai)

When SVG almost got network support for raw sockets (leonidasv.com)

Energy efficiency of heat pumps in residential buildings using operation data (nature.com)

Show HN: The AI that helps with everything school related (feynman.so)

iOS 18.5 supports satellite service like T-Mobile Starlink on older iPhones (9to5mac.com)

Ask HN: Jaded with AI – Alternatives?

Modern Druids (youtube.com)

AI Agents: The Building Blocks of Tomorrow's Software Development Lifecycle (qckfx.com)

Nvidia to release RTX 5060 at $299 on May 19th (theverge.com)

JavaScript Obfuscation Through File Stream Side-Channel (blog.gavide.dev)

Mcp-scan: NPM-audit-style security scanner for MCPs (stytch.com)

India to conduct nationwide security drill today (timesofindia.indiatimes.com)

Deep-Dive of ZGC's Architecture (2022) (dev.java)

The Reverse Turing Test Game (reverse-turing.netlify.app)

An Entire Roman City Is Hidden Beneath London [video] (youtube.com)

UK Companies House Register (find-and-update.company-information.service.gov.uk)

Node v24.0.0 (nodejs.org)

Family uses AI to create video for deadly victim's own impact statement (abc15.com)

Will Supercapacitors Come to AI's Rescue? (spectrum.ieee.org)

The Agent Maturity Model – How far we are from human-like Agents and a roadmap (learn.tarka.ai)

Deepfakes Now Outsmarting Detection by Mimicking Heartbeats (studyfinds.org)

To Speed Up AI, Just Outsource Memory (spectrum.ieee.org)

Spain to reduce the standard 40-hour work week for 12.5 million employees (independent.co.uk)

Single, or not? Japanese dating app launches relationship verification system (japantimes.co.jp)

Show HN: An agent to run your newsletter (usetopical.com)

Ukraine or the Ukraine: Why do some country names have 'the'? (2012) (bbc.com)

Controllers Briefly Lost Contact with Planes at Newark Last Week (nytimes.com)

TypeTalk: Kerning Principles (2022) (creativepro.com)

Trump administration cuts off all future federal funding to Harvard (arstechnica.com)

AI-Enhanced Social Engineering Will Reshape the Cyber Threat Landscape (lawfaremedia.org)

Google's Waymo ramps up U.S. robotaxi production (qz.com)

How I cut GTA Online loading times by 70% (2021) (nee.lv)

Stad Ship Tunnel (en.wikipedia.org)

Show HN: Scenario: A Go library for using Agents to test your Agent

Comments (1)