Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context

8 points by vasa_ | 2 comments | 6/3/2025, 1:05:15 PM | github.com
Hey HN! We're Vasilije, Laszlo and Lazar, the authors of a new paper and part of https://www.cognee.ai. cognee lets you build memory layers for AI applications and agents, allowing them to personalize results, connect various data sources, and add custom rules. This enables AI apps to deliver increasingly accurate responses: we reached almost 90% on standard industry benchmarks, as you can see at https://github.com/topoteretes/cognee/tree/main/evals. Our paper is at https://arxiv.org/abs/2505.24478, and a Colab notebook is in the repository.

LLMs don't remember context well: they can't ingest your data and keep it in memory. This limitation leads to shallow interactions, a lack of accuracy, and no cheap way to connect your data sources, because developers must include long, unmanaged context in every prompt.

When we were building RAG systems, we saw that we couldn't find the data we needed and that there were too many knobs to turn in RAG frameworks. We had to tweak many parameters and still couldn't specify the rules we wanted the data to follow: no ontologies, no rulesets, no state, no good data engineering practices, and a lot of manual work. That's why we built cognee.

cognee builds memory that combines graph, vector, and relational stores. Here is how it works:

Adding data: When you use cognee with your AI app, it can take in any message, string, S3 bucket, or even a relational database and automatically ingest it.
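For example, ingestion goes through a single add call, whatever the source. This is a rough sketch based on the add/cognify/retrieve primitives described in this post, so exact signatures may differ from the current docs:

```python
import asyncio
import cognee

async def ingest():
    # Plain strings and local files go straight into add();
    # S3 buckets and relational sources use the same entry point.
    await cognee.add("Our refund policy changed in March 2025.")
    await cognee.add("notes/product_faq.md")  # depending on version, file paths may need a file:// prefix

asyncio.run(ingest())
```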

Managing information: cognee sorts this information into a semantic graph:

- It extracts entities and the connections between them.
- It embeds the data in the vector store.
- It enriches the data with custom ontologies you provide, which help ground the graph and make it more reliable.
- The resulting information is stored across layers of graph and vector stores, so related information can be found later using a variety of search types.

Retrieving data: When given an input query, cognee searches for and retrieves related stored information by combining graph traversal, vector similarity, and chain-of-thought (CoT) techniques. It can use the internal benchmarking system to make sure that your pipelines are returning only accurate data when you need it.
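Once data has been added, the other two steps look roughly like this; it's a sketch, and search arguments such as the search type vary by version, so check the docs for exact signatures:

```python
import asyncio
import cognee

async def build_and_query():
    # Managing information: cognify builds the semantic graph from ingested data,
    # extracting entities and relations and writing embeddings to the vector store.
    await cognee.cognify()

    # Retrieving data: search combines graph traversal and vector similarity
    # to return the stored information related to the query.
    results = await cognee.search(query_text="What changed in our refund policy?")
    for result in results:
        print(result)

asyncio.run(build_and_query())
```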

---

cognee introduces a self-improving set of memory layers that cover various topics and data sources and can be customized. This reduces the need to build everything from scratch: you can use our primitives to get started and move more quickly, or you can do everything yourself, from start to end. We've designed the system to be modular and extensible.

We’ve open-sourced cognee under the Apache 2.0 license: the framework, various vector and graph database adapters, and our default data pipeline, cognify. This includes the ability to add, cognify, and retrieve data within your AI applications, and to extend cognee with custom components that are just pure Python.
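To illustrate the pure-Python extension point, a custom component is essentially an async function that transforms data between pipeline steps. The function below is a hypothetical example (the name and the chunk structure are illustrative; how a step gets registered into a pipeline is covered in the docs):

```python
import re
from typing import Any

EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")

async def redact_emails(chunks: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Hypothetical custom step: scrub email addresses before graph extraction."""
    for chunk in chunks:
        chunk["text"] = EMAIL_PATTERN.sub("[redacted email]", chunk.get("text", ""))
    return chunks
```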

However, we keep many features that are optimized for production use as part of our paid platform. We release some of these too: in the last few weeks, permission management and distributed pipelines (dev) landed in our open-source package, for teams that need things like rate limiting and credential management in production environments. We will also release a paid offering that gives developers easy API access to our platform.

Our automation tooling lets us optimize the pipelines and find the combination of parameters that best answers the questions our stakeholders have.

We’d love to hear what you think! Please feel free to try our demo, check out the code, read the research paper, and share thoughts or suggestions with us. Your feedback will help shape where we take cognee from here!

Comments (2)

dovlex · 1d ago
Hey, this looks super interesting - nice work!

Couple questions:

How does cognee handle temporal evolution of memory (e.g. when facts change over time)? Curious what’s still an open problem here because AFAIK this is really hard to solve.

Is cognee meant to power agent memory that builds up evolving contextual data about individual users and agents acting on their behalf? Say you track evolving user preferences in some domain space e.g. travel. If so, how do you handle user-specific memory separation—especially in multi-user or multi-agent setups?

vasa_ · 1d ago
Hi, founder of cognee here

We are building temporal resolution mechanisms, and the framework is generalizable enough to support any custom logic. We have a few ideas, and some of them will be posted soon.

cognee has a notion of node sets, which work similarly to tags: https://docs.cognee.ai/core-concepts/node-sets
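Rough example of attaching data to a node set at ingestion time (a sketch; check the docs above for the exact parameter names, which may differ by version):

```python
import asyncio
import cognee

async def tag_user_data():
    # Attach this data to a named node set so it can be filtered later.
    await cognee.add(
        "Prefers aisle seats and vegetarian meals.",
        node_set=["travel_preferences"],  # parameter name assumed from the node-sets docs
    )
    await cognee.cognify()

asyncio.run(tag_user_data())
```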

We also now have graphs per user, so: user permissions + graph filtering.