Juan David Gómez retweeted

DEV Community

@ThePracticalDev

May 5

React Native on iOS doesn't support ReadableStream. When users switch apps mid-response, the AI keeps generating to nothing. This dev solved it by making the server keep going after disconnect and persist the full response to the DB. { author: @juandastic } dev.to/juandastic/i-built-a-…

I Built a Mobile App in 3 Days. The Hard Part Was Keeping It Connected.

I have been building web apps for 12 years. In that time I never wrote a single line of mobile code....

dev.to

Juan David Gómez

@juandastic

May 4

12 years building web apps. Zero mobile experience.

Last month I needed an iOS app for my side project so I just built one. Took 3 days.

The real unlock was not React Native or Expo. It was having AI in the workflow. It did not write the app for me but it removed the friction of learning a new platform. I moved at the same speed I move on the web.

Then iOS broke everything. Turns out when your user switches to WhatsApp mid-response, iOS kills your network request. The AI keeps generating server-side, burning tokens, but the response never arrives.

Had to rethink the whole streaming architecture to handle disconnects gracefully.

Full writeup: dev.to/juandastic/i-built-a-…

I Built a Mobile App in 3 Days. The Hard Part Was Keeping It Connected.

I have been building web apps for 12 years. In that time I never wrote a single line of mobile code....

dev.to
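The disconnect handling described above (the server keeps generating after the client goes away and persists the full response) can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: `fake_model_stream`, `generate_and_persist`, `client_reads`, and the plain dict standing in for the database are all hypothetical names, and a real server would also need to make sure its framework does not cancel the generation task when the connection drops.

```python
import asyncio

async def fake_model_stream(prompt):
    # Hypothetical stand-in for a streaming model call.
    for token in ["Hello", ", ", "world", "!"]:
        await asyncio.sleep(0)
        yield token

async def generate_and_persist(message_id, prompt, db, queue):
    # Runs as its own task, so it does not depend on the client staying connected.
    parts = []
    async for token in fake_model_stream(prompt):
        parts.append(token)
        queue.put_nowait(token)  # unbounded queue: never blocks on a missing reader
    db[message_id] = "".join(parts)  # persist the full response (dict = assumed DB)
    queue.put_nowait(None)  # sentinel marking the end of the stream

async def client_reads(queue, max_tokens):
    # Simulates a client that stops reading after max_tokens (app backgrounded).
    received = []
    while len(received) < max_tokens:
        token = await queue.get()
        if token is None:
            break
        received.append(token)
    return received

async def demo():
    db = {}
    queue = asyncio.Queue()
    task = asyncio.create_task(generate_and_persist("msg-1", "hi", db, queue))
    seen = await client_reads(queue, max_tokens=1)  # client drops after one token
    await task  # generation still runs to completion
    return seen, db["msg-1"]
```

When the app returns to the foreground, the client could fetch the stored message by its id instead of relying on the interrupted stream.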

Juan David Gómez

@juandastic

Apr 19

Building AI side projects is fun until you have to pay for them.

I built Synapse, an AI companion for my wife with a memory system that makes Gemini know her life, her patterns, her emotional triggers. She uses it daily.

Two weeks ago, I connected PostHog to track costs. $24. One session alone: 28 messages, $2.42.

Every message sends ~30K tokens of context. 80-90% of those tokens are the exact same compiled knowledge, repeated every turn.

Most AI providers offer automatic caching. Send the same prefix enough times, and they might cache it for you. But you have no control. One small change in the prompt (like a datetime updating every turn) breaks prefix matching. You never know if it is working.

Gemini's explicit caching API is different. You create a cache resource, get a name back, and reference it on every request. Guaranteed hit. 75% cheaper on cached tokens. You decide what gets cached and you can verify it is working.

I separated the stable knowledge compilation (~25K tokens) from the volatile parts of the prompt, cached the compilation after hydration, and referenced it by name on every message.

The client sends both the cache_name and the full compilation on every request. If the cache is hot: 75% savings. If it expired, the server inlines the compilation at full price. The user never notices.

The numbers

Before: $0.017-0.039 per generation
After: $0.0088 per generation

That $2.42 session? Would cost ~$0.25 now.

Same knowledge graph. Same memory quality. Just a lot cheaper to remember.

Full breakdown: dev.to/juandastic/my-ai-send…

My AI Sends 30k Tokens Per Message. 80% of Them Were Wasted.

Building AI side projects is fun until you have to pay for them. I built Synapse, an AI companion...

dev.to
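The hot-or-expired fallback and the 75% discount on cached tokens can be sketched like this. It is an illustration under stated assumptions, not the Gemini SDK and not the author's code: `build_request`, `billed_input_tokens`, the `live_caches` set, and the dict keys are hypothetical, and the 5K volatile tokens are simply the ~30K total minus the ~25K compilation quoted in the tweet.

```python
def build_request(volatile_prompt, compilation, cache_name, live_caches):
    # Cache still alive: reference it by name and send only the volatile part.
    if cache_name in live_caches:
        return {"cached_content": cache_name, "contents": volatile_prompt}
    # Cache expired: inline the full compilation at full price.
    return {"cached_content": None, "contents": compilation + "\n\n" + volatile_prompt}

def billed_input_tokens(stable_tokens, volatile_tokens, cache_hit, cached_discount=0.75):
    # Full-price-equivalent input tokens for one request.
    # cached_discount=0.75 mirrors the "75% cheaper on cached tokens" figure.
    if cache_hit:
        stable_cost = stable_tokens * (1 - cached_discount)
    else:
        stable_cost = stable_tokens
    return stable_cost + volatile_tokens

hit = billed_input_tokens(25_000, 5_000, cache_hit=True)    # 11250.0
miss = billed_input_tokens(25_000, 5_000, cache_hit=False)  # 30000
```

Under these assumptions a cache hit bills 11,250 full-price-equivalent input tokens instead of 30,000. The per-generation dollar figures in the tweet also include output tokens and other costs, so they do not follow from this arithmetic alone.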

Juan David Gómez

@juandastic

Apr 3

My wife is a psychologist. She started using an AI between therapy sessions but got frustrated that it forgot everything. So I built her one that doesn't. 297 messages in 15 days. Here's what I learned 🧵

Juan David Gómez

@juandastic

Apr 3

Synapse uses a knowledge graph instead of a flat memory list. With the right therapeutic frameworks (ACT, DBT, Polyvagal Theory) and all her context, she found her ideal personal AI companion: synapse-chat.juandago.dev

Juan David Gómez

@juandastic

Apr 3

In the last 15 days of real production data:
→ 297 messages
→ 15.2M tokens processed
→ 100% daily active usage
→ Peak: 69 messages in one day

It's fully open source. The full product story: dev.to/juandastic/my-wife-se…

Juan David Gómez

@juandastic

Mar 22

I benchmarked @zep_ai vs @mem0ai to find out if I was over-engineering my AI companion's memory. Both solve fact extraction, deduplication, and contradiction handling. But they store data in fundamentally different ways.

Juan David Gómez

@juandastic

Mar 22

You can't ask "find the most connected entity and give me all its facts" if the facts live in a different store than the structure. I'm calling this "context blindness" and for a personal companion it matters more than retrieval scores.
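To make the quoted query concrete, here is a small sketch of "find the most connected entity and give me all its facts" when the facts are stored as the graph's own edges. The triple format, the function name, and the sample data are all hypothetical; this does not reflect how Graphiti or Mem0 store data.

```python
from collections import Counter

def most_connected_entity_facts(triples):
    # Each fact is a (subject, relation, object) edge, so structure and facts
    # live in the same store and one pass answers both parts of the question.
    degree = Counter()
    for subject, _, obj in triples:
        degree[subject] += 1
        degree[obj] += 1
    entity, _ = degree.most_common(1)[0]
    facts = [t for t in triples if entity in (t[0], t[2])]
    return entity, facts

# Made-up sample graph for illustration only.
sample = [
    ("Alice", "works_at", "Clinic"),
    ("Alice", "practices", "Yoga"),
    ("Alice", "sibling_of", "Luis"),
    ("Clinic", "located_in", "Bogotá"),
]
entity, facts = most_connected_entity_facts(sample)
```

If the facts lived in a separate store from the structure, the second step would need a cross-store lookup keyed on the entity, which is the gap the tweet calls "context blindness".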

Juan David Gómez

@juandastic

Mar 22

Full benchmark with architecture diagrams, code, and results (open source): dev.to/juandastic/i-benchmar…

I Benchmarked Graphiti vs Mem0: The Hidden Cost of Context Blindness in AI Memory

A few days ago, Taranjeet, the CEO of Mem0, reacted to one of my articles about building AI memory...

dev.to

Juan David Gómez

@juandastic

Mar 17

I built a bidirectional sync between a Neo4j Knowledge Graph and Notion for the @ThePracticalDev Challenge with @NotionHQ.

AI memory is usually a black box. Now, it's a fully editable, dynamically generated Notion workspace.

Here is the architecture: dev.to/juandastic/full-circl…

Full Circle: Giving My AI's Knowledge Graph a Notion Interface using MCP

This is a submission for the Notion MCP Challenge When I started building AI tools for my wife, it...

dev.to

Juan David Gómez retweeted

DEV Community

@ThePracticalDev

Mar 2

Context windows are expensive resources. This dev demonstrates how to tame a 120k token prompt by implementing a deterministic GraphRAG approach for more efficient memory scaling. { author: @juandastic } dev.to/juandastic/scaling-ai…

Scaling AI Memory: How I Tamed a 120k-Token Prompt with Deterministic GraphRAG

In a past article, I wrote about Synapse, an AI companion I built for my wife. To solve the problem...

dev.to

Juan David Gómez

@juandastic

Mar 1

1 Million token context windows are a luxury, not an architecture. My AI side-project hit a wall when the system prompt bloated to 120,000 tokens per message. Here is how I fixed it using a Knowledge Graph, a "Waterfill" budget, and Agentless GraphRAG. 🧵👇

Juan David Gómez

@juandastic

Mar 1

To get those details back, I built Deterministic GraphRAG. I hate Agent tool-calling (adds 3-5s latency). Instead, a straight-line Hybrid Search runs in <1s. It checks what's already in the prompt and only injects missing facts. Zero redundancy.
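The "only injects missing facts" step can be sketched as a plain filter over search results. This is an assumed shape, not the author's code: the function name and the idea of comparing facts by a stable id are hypothetical, and the hybrid search itself is left out.

```python
def select_missing_facts(prompt_fact_ids, retrieved):
    # prompt_fact_ids: ids of facts already present in the compiled prompt.
    # retrieved: (fact_id, text) pairs from the search step, best match first.
    seen = set(prompt_fact_ids)
    missing = []
    for fact_id, text in retrieved:
        if fact_id not in seen:
            seen.add(fact_id)  # also drops duplicates within the search results
            missing.append((fact_id, text))
    return missing

to_inject = select_missing_facts(
    ["f1", "f2"],
    [("f2", "already in prompt"), ("f3", "new detail"), ("f3", "new detail"), ("f4", "another")],
)
```

Because this is a single deterministic pass with no model call in the loop, it adds no tool-calling round trip.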

Juan David Gómez

@juandastic

Mar 1

The Result: Token usage collapsed back to my 40k limit. Latency stayed under 1s. The AI still feels like it knows everything.

I wrote a full deep dive on @ThePracticalDev with the architecture: dev.to/juandastic/scaling-ai…

Scaling AI Memory: How I Tamed a 120k-Token Prompt with Deterministic GraphRAG

In a past article, I wrote about Synapse, an AI companion I built for my wife. To solve the problem...

dev.to

Juan David Gómez retweeted

DEV Community

@ThePracticalDev

Feb 18

Congrats to our top 7 authors this week! 🏆 @dannwaneri, @madsstoumann, Julien Avezou, Peter Mulligan, Prithwish Nath, Vivek V, and @juandastic. This week's lineup features a CSS recreation of a Pantone color deck, a self-hosted Google Trends alternative using DuckDB, and much more. dev.to/devteam/top-7-feature…

Beyond RAG: Building an AI Companion with "Deep Memory" using Knowledge Graphs

I build AI tools to solve my own problems. A while back, I built NutriAgent to track my calories...

dev.to

