Why do RAG systems feel like they hit a ceiling?
I've been diving into
@helloiamleonie's latest article on agent memory, and it provided so much clarity into the current evolution of RAG systems.
The progression from RAG โ Agentic RAG โ Agent Memory isn't about adding features. It's about changing ๐ต๐ผ๐ ๐ถ๐ป๐ณ๐ผ๐ฟ๐บ๐ฎ๐๐ถ๐ผ๐ป ๐ณ๐น๐ผ๐๐.
๐ฅ๐๐: ๐ฅ๐ฒ๐ฎ๐ฑ-๐ข๐ป๐น๐, ๐ข๐ป๐ฒ-๐ฆ๐ต๐ผ๐
Traditional RAG is like a library where you can only check out books. You retrieve context, generate a response, done. The knowledge base is static - updated offline, queried online. Simple, but inflexible.
๐๐ด๐ฒ๐ป๐๐ถ๐ฐ ๐ฅ๐๐: ๐ฆ๐บ๐ฎ๐ฟ๐ ๐ฅ๐ฒ๐ฎ๐ฑ-๐ข๐ป๐น๐
Agentic RAG adds intelligence to retrieval. The agent decides:
โข Do I even need external information?
โข Which knowledge source should I query?
โข Is this retrieved context actually relevant?
Still read-only, but way more sophisticated about ๐ธ๐ฉ๐ข๐ต it reads.
๐๐ด๐ฒ๐ป๐ ๐ ๐ฒ๐บ๐ผ๐ฟ๐: ๐ฅ๐ฒ๐ฎ๐ฑ-๐ช๐ฟ๐ถ๐๐ฒ ๐ข๐ฝ๐ฒ๐ฟ๐ฎ๐๐ถ๐ผ๐ป๐
Agent memory introduces write operations during inference. The agent can now:
โข Store new information from conversations
โข Update existing knowledge
โข Create memories from important events
โข Build personalized context over time
Your AI assistant doesn't just retrieve your preferences - it ๐น๐ฒ๐ฎ๐ฟ๐ป๐ them through interaction. It's not just searching a static knowledge base - it's actively building one.
Leonie breaks this down with code examples showing how WriteTool extends the SearchTool paradigm. The agent gets tools for storing, updating, even consolidating memories.
But (and this is important) - she's also super clear this is a simplified mental model. Real agent memory systems need sophisticated memory management: deciding what to remember, what to forget, how to handle memory corruption. It's messier than it looks ๐
The article also touches on different memory types - procedural ("use emojis"), episodic ("user mentioned trip on Oct 30"), semantic ("Eiffel Tower is 330m tall") - potentially stored in separate collections.
I love this framing because it shows how each evolution solved specific limitations. RAG was too rigid. Agentic RAG made retrieval smarter. Agent memory made the whole system adaptive.
Full article here:
leoniemonigatti.com/blog/froโฆ