New Prompt Engineering Metaheuristic – (NoA) Network of Agents

Comments (1)

scraper01 · 8h ago

I've been looking into the idea of "deep thinking" in AI, but it seems reserved for big models with huge compute budgets. I wanted to see if a different approach was possible: trading instantaneous computation for a slower burn. To explore this, I've been building an open-source research project called Network of Agents (NoA). The goal is to turn a modest laptop (I'm developing on a 32GB RAM machine) into a "solution mining" rig. You can set up a hard problem, and using a local LLM (via Ollama and a quantized Qwen model like Qwen 30b a3b), let a society of agents work on it for hours or days, iteratively refining their collective answer. The Core Idea: Backpropagation with Natural Language The system is built with LangGraph and is inspired by neural networks. It runs in epochs, with each epoch consisting of a "Forward Pass" and a "Reflection Pass". 1. The Forward Pass (Inference): • Instead of numerical weights, the network's "weights" are the natural language system prompts of its agents. • The process starts by procedurally generating a multi-layered network of agents. The first layer gets cognitive diversity from MBTI archetypes and "seed verbs" related to the user's problem. • Subsequent "hidden" layers are built by having an agent-analyst chain create a "hard request" designed to challenge the previous layer, then spawning a new agent specialized for that challenge. • Information flows through the network layer by layer, with the combined JSON outputs of one layer being broadcast as input to all agents in the next. • 2. The Reflection Pass (Learning): This is where I've tried to simulate backpropagation. • Critique as the "Loss Function": After the final layer's outputs are synthesized into a single solution, a critique_agent assesses it against the original problem and generates a constructive critique. • Propagating the "Gradient": This critique is the error signal. It's propagated backward through the network. An agent in layer N-1 receives a targeted critique based on its contribution to the final answer generated by layer N. • The "Optimizer" Meta-Prompt: At each step of the backward pass, an update_agent_prompts_node uses the incoming critique as the main input to a meta-prompt. This meta-prompt's job is to completely rewrite and evolve the receiving agent's system prompt—its skills, attributes, and even its career—to better address the critique.

The entire network learns and adapts its own instructions, not through a central controller, but through a distributed process of peer-to-peer challenge. The Long-Term Vision: A New Kind of Training Data This is the part that I find most exciting. Every run of this system produces a complete, structured trace of a multi-agent collaborative process: the initial agent personas, the layer-by-layer reasoning (CoT traces), the critiques, and the evolution of each agent's prompts across epochs. This is a new kind of dataset that captures the dynamics of reasoning, not just static information. My long-term, ambitious goal is to use this data to train a "World Language Model" – a model trained not just on text, but on the fundamental patterns of collaboration, error correction, and social intelligence. This is an early-stage research project. The code is available for anyone to run, and the immediate roadmap includes dynamic memory for small models, P2P networking for distributed mining, and better visualization. I'd love to get this community's feedback. What do you think of this approach? Is the analogy to backpropagation sound? How would you improve the meta-prompts that drive the evolution? Thanks for reading.

PuTTY has a new website (putty.software)

The future of large files in Git is Git (tylercipriani.com)

AI is different (antirez.com)

I accidentally became PureGym’s unofficial Apple Wallet developer (drobinin.com)

Show HN: Edka – Kubernetes clusters on your own Hetzner account (edka.io)

Occult books digitized and put online by Amsterdam’s Ritman Library (openculture.com)

Best Practices for Building Agentic AI Systems (userjot.com)

Do Things That Don't Scale (2013) (paulgraham.com)

Deep-Sea Desalination Pulls Fresh Water from the Depths (scientificamerican.com)

OpenBSD is so fast, I had to modify the program slightly to measure itself (flak.tedunangst.com)

ADHD drug treatment and risk of negative events and outcomes (bmj.com)

Launch HN: Embedder (YC S25) – Claude code for embedded software

The electric fence stopped working years ago (soonly.com)

Porting Gigabyte MZ33-AR1 Server Board with AMD Turin CPU to Coreboot (blog.3mdeb.com)

Prompting by Activation Maximization (joecooper.me)

TextKit 2 – The Promised Land (blog.krzyzanowskim.com)

Show HN: Prime Number Grid Visualizer (enda.sh)

A privacy VPN you can verify (vp.net)

Model intelligence is no longer the constraint for automation (latentintent.substack.com)

Recto – A Truly 2D Language (masatohagiwara.net)

ARM adds neural accelerators to GPUs (newsroom.arm.com)

A mind–reading brain implant that comes with password protection (nature.com)

America's stock-market dominance is an emergency for Europe (wsj.com)

Bullfrog in the Dungeon (filfre.net)

Compiler Bug Causes Compiler Bug: How a 12-Year-Old G++ Bug Took Down Solidity (osec.io)

Claude Opus 4 and 4.1 can now end a rare subset of conversations (anthropic.com)

Vaultwarden commit introduces SSO using OpenID Connect (github.com)

Open hardware desktop 3D printing is dead? (josefprusa.com)

Is air travel getting worse? (maximum-progress.com)

Thai Air Force seals deal for Swedish Gripen jets (scmp.com)

EasyPost (YC S13) Is Hiring (easypost.com)

When the CIA got away with building a heart attack gun (wisewolfmedia.substack.com)

I let LLMs write an Elixir NIF in C; it mostly worked (overbring.com)

Secret Messengers: Disseminating Sigint in the Second World War [pdf] (media.defense.gov)

Bird signs and cycles, February, 2024 (subject.space)

An interactive guide to sensor fusion with quaternions (quaternion.cafe)

Non-invasive vagus nerve stimulation and exercise capacity in healthy volunteers (academic.oup.com)

Imagen 4 is now generally available (developers.googleblog.com)

Simulating and Visualising the Central Limit Theorem (blog.foletta.net)

Progress towards universal Copy/Paste shortcuts on Linux (mark.stosberg.com)

'Constantine Cavafy' Review: A Poet's Odyssey Within (wsj.com)

The Timmy Trap (jenson.org)

It seems like the AI crawlers learned how to solve the Anubis challenges (social.anoxinon.de)

California unemployment rises to 5.5%, worst in the U.S. as tech falters (sfchronicle.com)

The beauty of a text only webpage (albanbrooke.com)

The Role of Feature Normalization in Ijepa (github.com)

Show HN: JMAP MCP – Email for your agents (github.com)

Gemma 3 270M: Compact model for hyper-efficient AI (developers.googleblog.com)

The 10 Percent Is in a Fit of Rage over Airport Lounges (newrepublic.com)

The new science of “emergent misalignment” (quantamagazine.org)

New Prompt Engineering Metaheuristic – (NoA) Network of Agents

Comments (1)