We're doing context engineering wrong

2 points · brandonin · 8/27/2025, 10:32:25 PM · 4 comments
I think the latest GPT-5 does a great job at the needle-in-a-haystack problem: finding the relevant files to change when I build a feature or fix a bug. Still, it lacks some basic context about the codebase that would really improve the quality of its responses. Currently, agentic development works by running a semantic search with RAG (dense retrieval) over the codebase, or a grep (sparse retrieval), to find the code most relevant to the given problem or feature request.
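
To make the dense vs. sparse distinction concrete, here is a toy TypeScript sketch of the two retrieval modes. This is just my illustration, not how GPT-5 or any particular agent actually implements retrieval, and it assumes code chunks and their embeddings have already been computed elsewhere:

    // Toy illustration of the two retrieval modes above -- not how GPT-5 or any
    // particular agent implements them. Chunks and their embeddings are assumed
    // to be precomputed elsewhere.
    type Chunk = { path: string; text: string; embedding: number[] };

    function cosine(a: number[], b: number[]): number {
      let dot = 0, na = 0, nb = 0;
      for (let i = 0; i < a.length; i++) {
        dot += a[i] * b[i];
        na += a[i] * a[i];
        nb += b[i] * b[i];
      }
      return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
    }

    // Dense search: rank chunks by embedding similarity to the query embedding.
    function denseSearch(queryEmbedding: number[], chunks: Chunk[], k = 5): Chunk[] {
      return [...chunks]
        .sort((a, b) => cosine(queryEmbedding, b.embedding) - cosine(queryEmbedding, a.embedding))
        .slice(0, k);
    }

    // Sparse search: rank chunks by literal keyword hits, roughly what grep gives you.
    function sparseSearch(keywords: string[], chunks: Chunk[], k = 5): Chunk[] {
      const hits = (c: Chunk) => keywords.filter((w) => c.text.includes(w)).length;
      return [...chunks].sort((a, b) => hits(b) - hits(a)).slice(0, k);
    }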

I think that's great, but it also leaves room for improvement in how we think about context. Most of the time, documentation is hidden in some architectural design review in a tool like Notion or Confluence. Those tools are great for human retrieval, but even then the documentation is often forgotten by the time we implement the functionality. Another key issue is that as the code evolves, the documentation goes stale.

We need a tool that fits the agentic approach we are starting to see: ever-evolving documentation, or memories, that our agents can use without creating another needle-in-a-haystack problem.

For the past few weeks I have been building an open source MCP server that lets you create "notes" anchored to specific files, which AI agents can retrieve, create, summarize, search, and ultimately clean up.
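
To give a feel for the idea, a note looks roughly like this (a simplified sketch; names and fields here are illustrative rather than the exact schema):

    // Simplified note shape, just to show the idea.
    interface AnchoredNote {
      id: string;          // unique note id
      anchors: string[];   // repo-relative paths the note is attached to
      note: string;        // the memory itself: decisions, gotchas, context
      tags: string[];      // e.g. ["architecture", "gotcha"]
      createdAt: string;   // ISO timestamp, so agents can judge staleness
    }

    // Example: the kind of note an agent might leave after touching an auth flow.
    const exampleNote: AnchoredNote = {
      id: "2025-08-27-session-retry",
      anchors: ["src/auth/session.ts"],
      note: "Session refresh retries are capped at 3 because the IdP rate-limits token grants.",
      tags: ["gotcha"],
      createdAt: "2025-08-27T22:00:00Z",
    };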

This has solved a lot of issues for me:

- You get the real context for why AI agents did certain things, plus the gotchas that came up along the way, the kind of detail that rarely gets documented or commented.

- It works out of the box without a crazy amount of initial lift.

- It improves as your code evolves.

- It is completely local, living inside your GitHub repository. No complicated vector databases, just notes anchored to files (a rough sketch of that storage model follows below).
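
Conceptually, the storage is about this simple: notes live as plain files committed with the code, and "retrieval" for a given file is just a filter over anchors. The directory name and exact layout below are illustrative, not a spec:

    import { readdirSync, readFileSync } from "node:fs";
    import { join } from "node:path";

    // Same simplified note shape as in the sketch above.
    type AnchoredNote = { id: string; anchors: string[]; note: string; tags: string[]; createdAt: string };

    // Illustrative layout: one JSON file per note, committed alongside the code.
    const NOTES_DIR = ".a24z/notes";

    function loadNotes(repoRoot: string): AnchoredNote[] {
      const dir = join(repoRoot, NOTES_DIR);
      return readdirSync(dir)
        .filter((name) => name.endsWith(".json"))
        .map((name) => JSON.parse(readFileSync(join(dir, name), "utf8")) as AnchoredNote);
    }

    // Retrieval for a file is just a filter over anchors -- no vector index involved.
    function notesFor(repoRoot: string, filePath: string): AnchoredNote[] {
      return loadNotes(repoRoot).filter((n) => n.anchors.includes(filePath));
    }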

I would love to hear your thoughts, whether you think I am approaching the problem completely wrong or you have advice on how to improve the system.

You can find the project at https://github.com/a24z-ai/a24z-Memory.

Comments (4)

gitgallery · 1h ago
Are there any projects using it that you can show?
brandonin · 38m ago
I have only been building it for the past week, so I am trying to get feedback on what is and isn't working. I am building some evaluation tools to see what the baseline is with and without a24z. I'll have those for you and will write another post as the project evolves.
gitgallery · 1h ago
How hard is this?
brandonin · 36m ago
Setup is pretty easy. If you use Cursor or VS Code, you just click the button in the README. If you are using anything else, you just copy and paste the MCP server config.

The second step is to add a rule that tells it to use the a24z-Memory MCP server (rough example below). After that it should automatically do everything for you. I am having some trouble where it doesn't always call the tool, but I get around that by adding "use a24z memory MCP" to my prompt.
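
The rule can be something as simple as this (the wording here is just an example; adapt it to your agent's rules format):

    Always use the a24z-Memory MCP server when working in this repo:
    - before editing a file, retrieve any notes anchored to it
    - after making a change, write a note capturing the decision and any gotchas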