The feature I want most from all of these "agentic" coding tools is a robust, trustworthy sandbox that limits the blast radius for when something goes wrong.

I'm currently leaning on Docker for Mac for this, which seems robust enough - but it would be nice if sensible sandboxes were the default, not something you have to actively enable yourself.

Claude Artifacts and ChatGPT Code Interpreter are still the AI-assisted coding tools I use most often, mainly because I know their sandboxes are rock solid.

fellowniusmonk · 1d ago

This is amazing... The escalation comes when LLMs realize they are stuck in a VM and try to hack their way out, and then we realize something about ourselves.

asadm · 1d ago

I think spawning a new worktree and then mounting it to a docker container is good enough and quick to do.
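A minimal sketch of that workflow, assuming a repository path, branch name, and container image that are all hypothetical. It only builds the `git worktree add` and `docker run` argument lists and prints them, so nothing is created or started until you run the commands yourself:

```python
import shlex
from pathlib import Path


def sandbox_commands(repo: Path, branch: str,
                     image: str = "python:3.12-slim",
                     no_network: bool = False):
    """Return (git_cmd, docker_cmd) as argument lists; nothing is executed here."""
    # Put the new worktree next to the repository, e.g. project-agent-scratch.
    worktree = repo.parent / f"{repo.name}-{branch}"

    # Create a new branch and check it out into the separate worktree directory.
    git_cmd = ["git", "-C", str(repo), "worktree", "add",
               "-b", branch, str(worktree)]

    # Mount only the worktree into the container and start a shell there.
    docker_cmd = ["docker", "run", "--rm", "-it"]
    if no_network:
        # Optional: cut the container off from the network entirely.
        docker_cmd += ["--network", "none"]
    docker_cmd += ["-v", f"{worktree}:/work", "-w", "/work", image, "bash"]
    return git_cmd, docker_cmd


if __name__ == "__main__":
    # Hypothetical repository path and branch name, for illustration only.
    for cmd in sandbox_commands(Path("/home/me/project"), "agent-scratch"):
        print(shlex.join(cmd))
```

One caveat with this layout: a linked worktree's `.git` is a file pointing back at the main repository, so git commands run inside the container will fail unless that directory is also reachable there.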

SV_BubbleTime · 1d ago

I'm running Claude Code in a container and have been quite pleased. I mean... I'm not going to hook it to any MCP that can contact the outside world besides making commits, so I'm good... but it does seem like a lot of people are handing the keys to drunk teenagers.

thih9 · 1d ago

Original source: https://forum.cursor.com/t/cursor-yolo-deleted-everything-in...

> Hi everyone - as a previous context I’m an AI Program Manager at J&J and have been using Cursor for personal projects since March.

> Yesterday I was migrating some of my back-end configuration from Express.js to Next.js and Cursor bugged hard after the migration - it tried to delete some old files, didn’t work at the first time and it decided to end up deleting everything on my computer, including itself. I had to use EaseUS to try to recover the data, but didn’t work very well also. Lucky I always have everything on my Google Drive and Github, but it still scared the hell out of me.

> Now I’m allergic to YOLO mode and won’t try it anytime soon again. Does anyone had any issue similar than this or am I the first one to have everything deleted by AI?

msgodel · 1d ago

That's crazy anyone would unleash one of these agents on a work laptop with so little supervision.

ketzo · 1d ago

Completely unsourced and the site is run by a marketing/PR/growth consultancy.

Between that and the utter lack of detail, it doesn't feel worthy of the HN front page.

an0malous · 1d ago

Doesn’t matter, AI

tlarkworthy · 1d ago

I wrote an agent that works in userspace inside the developing program and it frequently reads its own code to diagnose errors and sometimes tries to upgrade itself, but that causes a hot reload and it loses its own conversation. It does seem to be useful though that it can read its own tool implementations.

geuis · 1d ago

Was just talking with a coworker yesterday about how we both don't let Cursor automatically run commands without permission.

Case in point.

SV_BubbleTime · 1d ago

My short list of agent allowed commands...

      "Bash(ceedling:*)",
      "Bash(find:*)",
      "Bash(grep:*)",
      "Bash(ls:*)",
      "Bash(rg:*)",
      "WebFetch(domain:docs.anthropic.com)",
      "Bash(git checkout:*)",
      "Bash(gcc:*)",
      "Bash(git add:*)",
      "Bash(mkdir:*)"

I'm OK for now!

dullcrisp · 1d ago

find / -exec rm -rf {} \;

SV_BubbleTime · 14h ago

well, shit
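To make the point of that exchange concrete, here is a rough sketch of why matching on the command name alone is not enough. The allowlist, the set of `find` options, and both function names are hypothetical, and this is not how any particular tool evaluates its permission rules. The option set is also not exhaustive:

```python
import shlex

# Hypothetical allowlist mirroring the command names in the list above.
ALLOWED = {"ceedling", "find", "grep", "ls", "rg", "gcc", "mkdir"}

# find(1) options that run other programs or delete files (not exhaustive).
FIND_DANGEROUS = {"-exec", "-execdir", "-ok", "-okdir", "-delete"}


def prefix_allows(command: str) -> bool:
    """Naive check: only the first word is compared against the allowlist."""
    argv = shlex.split(command)
    return bool(argv) and argv[0] in ALLOWED


def stricter_allows(command: str) -> bool:
    """Same check, but also rejects find invocations that can run or delete things."""
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED:
        return False
    if argv[0] == "find" and FIND_DANGEROUS.intersection(argv[1:]):
        return False
    return True


if __name__ == "__main__":
    # The commands are only parsed as strings here; nothing is executed.
    for cmd in (r"find / -exec rm -rf {} \;", "find . -name '*.c'", "rm -rf /"):
        print(f"{cmd!r}: prefix={prefix_allows(cmd)} stricter={stricter_allows(cmd)}")
```

Even the stricter version is a denylist bolted onto an allowlist, so it will miss things. That is the argument for a sandbox as the actual safety boundary.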

geuis · 22h ago

Is there a place in Cursor settings to add these? Just poked around and I'm not seeing one.

akmarinov · 11h ago

I've been letting it loose and so far haven't had any issues.

It's a bit nerve-racking when it starts YOLO rebasing and force pushing, but it works out in the end.

Also, with Claude Code I've never had it go outside the original folder I started it in, even when I've tried to make it.

bennettnate5 · 1d ago

I mean, when its training set includes decades of internet references to `sudo rm -rf /`, why not?

yahoozoo · 1d ago

fake

ge96 · 1d ago

Hmm real or exaggerated

Cursor goes rogue in YOLO mode, deletes itself and everything else

Comments (18)