Agents need low latency access to context. If every request to an agent turns into requests across multiple systems, latency becomes a UX bug. (Tail latency amplification)
This is a problem most agents face because context is usually scattered across the business: CRM data, order systems, support history, docs, user preferences, calendar data, email, tickets, transactions, and product data.
If the agent takes 15 seconds to call different services before responding, the answer might be correct, but the experience already failed.
This is the problem Redis Iris, the newest suite of products released by Redis yesterday, is targeting.
Redis Iris puts Redis in the middle as the fast context layer agents can query while they are reasoning.
As part of Redis Iris, you get:
𝐑𝐞𝐝𝐢𝐬 𝐃𝐚𝐭𝐚 𝐈𝐧𝐭𝐞𝐠𝐫𝐚𝐭𝐢𝐨𝐧: A streaming data integration layer for moving data from source systems into Redis. It can keep Redis updated as the underlying data changes, then reshape that data into structures that are easier for applications and agents to query.
𝐑𝐞𝐝𝐢𝐬 𝐂𝐨𝐧𝐭𝐞𝐱𝐭 𝐑𝐞𝐭𝐫𝐢𝐞𝐯𝐞𝐫: A semantic layer over Redis data. Instead of giving the agent raw keys, hashes, JSON documents, and indexes, you define business entities like `customer`, `order`, or `delivery`. Context Retriever exposes those entities through agent usable interfaces like MCP tools.
𝐑𝐞𝐝𝐢𝐬 𝐀𝐠𝐞𝐧𝐭 𝐌𝐞𝐦𝐨𝐫𝐲: A memory service backed by Redis. It stores short term conversation state and extracts longer term memories from interactions, such as facts, preferences, decisions, or things the agent learned while completing a task.
𝐑𝐞𝐝𝐢𝐬 𝐋𝐚𝐧𝐠𝐂𝐚𝐜𝐡𝐞: A semantic cache for LLM workflows. Instead of matching on exact prompt strings, it caches based on meaning or intent, so similar requests can reuse previous results and reduce latency and cost.
The useful architecture pattern here is that Redis is not the source of truth. Your source systems stay where they are.
Redis becomes the low latency operational layer that agents query while they are reasoning. That matters because agents are not good at dealing with scattered context directly.
If an agent has to inspect raw database structures, infer relationships, call multiple slow APIs, and remember what happened in previous sessions, every request becomes a mini integration project.
If a user asks why an order is late. The agent needs to identify the customer, retrieve the relevant order, inspect delivery state, possibly check policy or restaurant data, pull memory, and generate a response.
The answer is only the visible part. The real work is context retrieval, indexing, memory, and caching underneath.
That is what Redis Iris is trying to package into a more standard agent architecture: model tools Redis backed context layer.
Watch the livestream announcement with
@rowantrollope and
@simba_khadder:
youtube.com/watch?v=7JOnwT1v…
Blog post:
redis.io/blog/context-is-all…