Do not download the app, use the website (idiallo.com)

I've made a few attempts at manually doing this w/ mcp and took a brief look at "claude swarm" https://github.com/parruda/claude-swarm - but in the short time I spent on it I wasn't having much success - admittedly I probably went a little too far into the "build an entire org chart of agents" territory

the main problem I have is that the agents just aren't used

For example, I set up a code reviewer agent today and then asked claude to review code, and it went off and did it by itself without using the agent

in one of anthropic's own examples they are specifically telling claude which agents to use which is exactly what I don't want to have to do:

> First use the code-analyzer sub agent to find performance issues, then use the optimizer sub agent to fix them

My working theory is that while Claude has been extensively trained on tool use and is often eager to use whatever tools are available, agents are just different enough that they don't quite fit - maybe asking another agent to do something "feels" very close to asking the user to do something, which is counter to their training

but maybe I just haven't spent enough time trying it out and tweaking the descriptions

conception · 4h ago

Roo code does this really well with their orchestration mode, there’s probably a way to have a claude.md to do this as well. The only issue with roo is it’s “single threaded” but you do get the specific loaded context and rules for a specific task which is really nice.

oc1 · 2h ago

the same problem with mcp. as well as claude md. most of the time they aren't used when it would be appropriate. what's the point of this agents and standards when you can't make them reliably being used by your model..

bomewish · 11h ago

Has CC become much stupider in recent weeks, or is it me? Any anecdata out there?

_--__--__ · 8h ago

People speculate somewhat seriously that Claude (especially given its French name) picked up at some point that you aren't supposed to work as hard in July and August.

sunaookami · 1h ago

That one guy on Twitter that posted this wrote it as a joke and everyone took it seriously. It's not true. It works the same for me.

oc1 · 1h ago

How do you know? It acts much lazier in the recent summer months for me..

stavros · 16m ago

How have you disproved the hypothesis that it recently got dumber and it just happens to be summer?

madrox · 7h ago

How long before we hire psychiatrists instead of engineers to debug AI

OrsonSmelles · 7h ago

Well, we could start with some ELIZA instances.

lubujackson · 5h ago

I see that you feel we could start with some ELIZA instances. Can you tell me more about that?

nialse · 2h ago

To be frank psychiatrists, being MDs, would likely prescribe medication and I’m not sure how that would help. As a licensed psychologist I have ideas on how to debug AI though.

nico · 10h ago

I don’t know about stupider, but definitely less reliable/available

A couple days ago I was getting so many api errors/timeouts I decided to upgrade from the $20 to the $100 plan (as I was also regularly hitting rate limits as well)

It seemed to fix the issue immediately. But today, the errors came back for about half an hour

SOLAR_FIELDS · 8h ago

It goes down usually around 1400-1500 UTC. Europeans are still awake and once the west coast joins in the fray Anthropic falls over.

Pretty rare to get a 529 outside of that time window in my personal experience, at least during the USA day.

data-ottawa · 4h ago

Their status page for the week is rough. They’re down to 98% uptime.

Hopefully they work out whatever issue is going on.

https://status.anthropic.com/

illusive4080 · 10h ago

Not for me. It gets worse when context is nearly full. I like to compact or clear context more often than it does automatically.

nico · 9h ago

Do you do this via settings or just keep track of it and manually ask it to do it more often?

laborcontract · 7h ago

Insert something to the tune of: “never read files in slices. Instead, whenever accessing a file, you must read a file in entirety[..]” at the beginning of every conversation or whenever you’re down to burn more credits/get better results.

A great deal of claude stupidity is due to context engineering, specifically due to the fact that it tries its hardest to pick out just the slice of code it needs to fulfill the task.

A lot of the annoying “you’re absolute right!” come from CC incrementally discovering that you have more than 10 lines of code in that file that pertains to your task.

I don’t believe conspiracies about dumbed down models. Its all context pruning.

oc1 · 1h ago

so claude code does the same shit like cursor?

slantaclaus · 7h ago

I feel like it’s gotten better recently

Garlef · 3h ago

One nice realization I had when using a similar feature in roo:

You don't need a full agent library to write LLM workflows.

Rather: A general purpose agent with a custom addition to the system prompt can be instructed to call other such agents.

(Of course explicitly mamaging everything is the better choice depending on your business case. But i think it would be always cheaper to at least build a prototype using this method.)

Dlanv · 6h ago

I wonder if this is also a good way to create experts for specific tasks/features of a codebase.

For example, a sub-agent for adding a new stat to an RPG. It could know how to integrate with various systems like items, character stats component, metrics, and so on without having to do as much research into the codebase patterns.

T0Bi · 12h ago

So everything claude-flow¹ already does but worse (I guess?).

¹ https://github.com/ruvnet/claude-flow

jampa · 7h ago

> IMPORTANT: Claude Code must be installed first:

> [...]

> # 2. Activate Claude Code with permissions

> claude --dangerously-skip-permissions

Bypassing all permissions and connecting with MCPs, can't wait for "Claude flow deleted all my files and leaked my CI credentials" blog post

T0Bi · 1h ago

There are already several of such blog posts.

I use the .devcontainer¹ from the claude-code repository. It works great with VSC and let's you work in your docker container without any issues. And as long as you use some sort of version control (git) you cannot really lose anything.

¹ https://github.com/anthropics/claude-code/tree/main/.devcont...

data-ottawa · 4h ago

I would like a simple tool to run Claude in a container with only read/write access to provided folders.

I’ve set it up bespoke but the auth flow gets broken.

T0Bi · 1h ago

¹ https://github.com/anthropics/claude-code/tree/main/.devcont...

oarsinsync · 3h ago

Have you considered asking Claude code to write this for you?

SOLAR_FIELDS · 8h ago

That guy doesn't even understand how his own software works. Is anyone actually using this thing and putting their code into production?

lubujackson · 5h ago

It's extreme dogfooding where he is making a mashed potato volcano where Claude agents are the potatoes and your sanity is the gravy.

dazzaji · 4h ago

Ruv (of Claude Flow) seems to like the new Claude Agents a lot, and already is leveraging them in Claude Flow. He waxes positively on the topic here: https://www.linkedin.com/posts/reuvencohen_spent-the-afterno...

dchuk · 10h ago

I’ll admit this looks comprehensive, but man oh man does this seem complicated and over doing it

nazgul17 · 10h ago

Except it's not in alpha phase

himeexcelanta · 9h ago

This looks like a yarn ball (in not a good way)

Do not download the app, use the website (idiallo.com)

Open Sauce is a confoundingly brilliant Bay Area event (jeffgeerling.com)

It's time for modern CSS to kill the SPA (jonoalderson.com)

CCTV Footage Captures the First-Ever Video of an Earthquake Fault in Motion (smithsonianmag.com)

Turn any diagram image into an editable Draw.io file. No more redrawing (imagetodrawio.com)

Rust on Every GPU (rust-gpu.github.io)

The Rise and Fall of the Hanseatic League (worksinprogress.co)

Simon Tatham's Portable Puzzle Collection (chiark.greenend.org.uk)

It's a DE9, not a DB9 (but we know what you mean) (news.sparkfun.com)

Why I Do Programming (esafev.com)

Never write your own date parsing library (zachleat.com)

Keep Pydantic out of your Domain Layer (coderik.nl)

Why MIT switched from Scheme to Python (2009) (wisdomandwonder.com)

Efficient Computer's Electron E1 CPU – 100x more efficient than Arm? (morethanmoore.substack.com)

Vanilla JavaScript support for Tailwind Plus (tailwindcss.com)

Animated Cursors (tattoy.sh)

Experimental surgery performed by AI-driven surgical robot (arstechnica.com)

The future is not self-hosted (drewlyton.com)

Users claim Discord's age verification can be tricked with video game characters (thepinknews.com)

Steam, Itch.io are pulling ‘porn’ games. Critics say it's a slippery slope (wired.com)

Show HN: Auto Favicon MCP Server (github.com)

Developing our position on AI (recurse.com)

CO2 Battery (energydome.com)

Women dating safety app 'Tea' breached, users' IDs posted to 4chan (404media.co)

A Union Pacific-Norfolk Southern combination would redraw the railroad map (trains.com)

Programming vehicles in games (wassimulator.com)

What is X-Forwarded-For and when can you trust it? (2024) (httptoolkit.com)

Ambigrammia: Between Creation and Discovery (Hofstadter, 2025) (yalebooks.yale.edu)

Researchers value null results, but struggle to publish them (nature.com)

Show HN: Apple Health MCP Server (github.com)

Generic Containers in C: Vec (uecker.codeberg.page)

Steve Jobs' cabinet (perfectdays23.substack.com)

Internet Archive is now a federal depository library (kqed.org)

Who has the fastest F1 website (2021) (jakearchibald.com)

Windsurf employee #2: I was given a payout of only 1% what my shares where worth (twitter.com)

Show HN: Price Per Token – LLM API Pricing Data (pricepertoken.com)

Claude Code introduces specialized sub-agents (docs.anthropic.com)

Quantitative AI progress needs accurate and transparent evaluation (mathstodon.xyz)

Show HN: I built a biological network visualization tool (nodes.bio)

SRAM Has No Chill: Exploiting Power Domain Separation to Steal On-Chip Secrets (cacm.acm.org)

Running PostmarketOS on Android Termux proot without a custom ROM (2024) (ivonblog.com)

Asciinema: Record and share your terminal sessions (asciinema.org)

Dwl: Dwm for Wayland (codeberg.org)

Show HN: Open IT Maintenance Planner (maintenance-planner.vangemert.dev)

Google spoofed via DKIM replay attack: A technical breakdown (easydmarc.com)

Why is there a date of 1968 in the Intel Chipset Device Software Utility? (intel.com)

Games Look Bad: HDR and Tone Mapping (2017) (ventspace.wordpress.com)

Brazil central bank to launch Pix installment feature in September (reuters.com)

How to configure X11 in a simple way (eugene-andrienko.com)

Stackless Traversal (2018) (dyalog.com)

Claude Code introduces specialized sub-agents

Comments (34)