Joined August 2009
1.4M Jupyter notebooks pulled from GitHub. 4% reproduced. Six years of human code-checking: ~116 papers certified. Reproducibility isn't a tooling gap, it's a labor shortage. Three agent patterns for the last mile, receipts included: medium.com/p/reproducibility… #ScientificPython
"Keep Opus out of the loop; it's 19x Haiku." That gap is 5x now. Anthropic cut Opus 3x. Tiered agents still cut bills 40-60% but not because of price. Token volume = 80% of agent performance. Model choice = ~5%. Why this works 👇 medium.com/p/opus-sonnet-hai… #AIAgents #LLMOps #CostOptimization
Most AI agents are amnesiacs. They learned not to call your deprecated endpoint on Tuesday, but they'll cheerfully do it again on Wednesday. That architecture just got obsoleted. Production memory has stopped looking like a vector database and started looking like a filesystem. If you're building long-running agents, here is the architectural playbook for the two new primitives: Memory and "Dreaming" (Sleep-time Compute). #AI #Agent #AgentMemory #AIAgent Full deep dive below 👇 medium.com/p/goldfish-at-sca…

Most multi-vertical AI agents fail because specialization is treated as a prompt problem. What worked for us instead:
- template-scoped personas
- composable prompt fragments
- scaffold-aware reasoning
- persona-specific tool gating
One agent implementation. Multiple specialized behaviors. No prompt spaghetti. “Agent Specialization Through Templatized Prompt Architectures” #AI #LLM #AIAgents medium.com/p/agent-specializ…
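A minimal sketch of two of the ideas above, composable prompt fragments and persona-specific tool gating, not the article's actual implementation. Every name here (`FRAGMENTS`, `Persona`, `build_prompt`, `gate_tools`, the fragment texts, and the tool names) is hypothetical and chosen only for illustration; scaffold-aware reasoning is left out.

```python
from dataclasses import dataclass

# Hypothetical fragment library: small reusable prompt pieces keyed by name.
FRAGMENTS = {
    "base": "You are a careful assistant.",
    "cite_sources": "Cite a source for every factual claim.",
    "terse": "Answer in at most three sentences.",
}


@dataclass(frozen=True)
class Persona:
    """A persona is just a selection of fragments plus a tool allowlist."""
    name: str
    fragments: tuple
    allowed_tools: frozenset


def build_prompt(persona: Persona) -> str:
    # Compose the system prompt by joining the persona's fragments in order.
    return "\n".join(FRAGMENTS[key] for key in persona.fragments)


def gate_tools(persona: Persona, available: dict) -> dict:
    # Expose only the tools this persona is allowed to call.
    return {name: fn for name, fn in available.items()
            if name in persona.allowed_tools}


# Two illustrative personas sharing one agent implementation.
SUPPORT = Persona("support", ("base", "terse"), frozenset({"search_docs"}))
ANALYST = Persona("analyst", ("base", "cite_sources"),
                  frozenset({"search_docs", "run_sql"}))

# Placeholder tool registry; the lambdas stand in for real tool functions.
TOOLS = {"search_docs": lambda q: [], "run_sql": lambda q: []}
```

With this shape, adding a vertical means adding a `Persona` entry rather than forking the prompt, which is one way to read "no prompt spaghetti".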
The MCP ecosystem just hit lightspeed. 🚀 Between Sept '25 and April '26, the game changed:
🔹 Progressive Tool Discovery (85% token reduction)
🔹 Structured Output splits
🔹 Code Execution as orchestration
🔹 The rise of Skills
#MCP #AI #AgenticAI #Anthropic Part 2 of my Production MCP series is live: medium.com/p/production-mcp-…
The most important reader of your documentation isn't human anymore; it's an LLM. It doesn't scroll. It doesn't care about your JavaScript. And if you're serving it heavy HTML, you're bleeding context tokens. Anthropic already shipped the fix: the llms.txt file. Here is how to rebuild your retrieval architecture for the AI age 👇 #SEO #LLM #WebDev #TechDocs #ArtificialIntelligence #Anthropic #DeveloperExperience medium.com/p/write-for-the-f…
Flipping stream: true on Open WebUI works for a local demo, but crumbles in production. Scaling requires deliberate architectural choices, from fixing nginx buffering to isolating task models and securing plugins. Here are the 7 decisions you need to make: medium.com/p/streaming-llm-a… #OpenWebUI #LLMs #SystemArchitecture #DevOps
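One common way to address the nginx buffering point, sketched under assumptions and not necessarily what the article recommends: the upstream app marks its streaming responses with the `X-Accel-Buffering: no` header, which nginx honors by disabling proxy buffering for that response (setting `proxy_buffering off;` in the nginx location block is the config-side alternative). The helper names and the `[DONE]` end marker below are illustrative conventions, not part of Open WebUI.

```python
import json


def sse_headers() -> dict:
    # Response headers for a server-sent events stream behind nginx.
    return {
        "Content-Type": "text/event-stream",
        "Cache-Control": "no-cache",
        # Tells nginx not to buffer this response, so tokens flush immediately.
        "X-Accel-Buffering": "no",
    }


def sse_event(payload: dict) -> str:
    # One SSE frame: a "data:" line followed by a blank line.
    return f"data: {json.dumps(payload)}\n\n"


def stream_tokens(tokens):
    # Yield one frame per token, then an end-of-stream marker (assumed convention).
    for token in tokens:
        yield sse_event({"token": token})
    yield "data: [DONE]\n\n"
```

Any web framework that accepts a generator body and a header mapping can serve `stream_tokens(...)` with `sse_headers()`; without the header or the nginx setting, the proxy may hold tokens back and deliver them in bursts.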
We built an AI security review system with zero application code. Just Markdown files. No frameworks, no pipelines, no containers. For @GoDaddy's "Compress the Cycle" hackathon, our team created a multi-agent system that catches security issues before code is even committed, not after it hits CI. Specialized AI agents run in parallel across SAST, logic review, IaC, and policy compliance, each challenged by a dedicated Validator whose job is to prove the finding wrong. The result: fewer false positives, higher developer trust, and a system where anyone who can write clearly can improve the security review, no coding required. Proud that our team was selected. Full technical deep dive on the GoDaddy blog: godaddy.com/resources/news/t… #AgenticAI #SecurityEngineering #ShiftLeft #GoDaddy
Building AI apps? You've probably hit the LLM streaming wall: you need streaming so users don't stare at a spinner, but partial JSON is syntactically invalid. 🧱 BAML from @boundaryML solves this beautifully by parsing broken LLM streams into typed, partial objects in real-time. Stop waiting for the final closing brace to render your UI! 🛠️✨ #SoftwareDevelopment #AI #LLMs #WebDev #Engineering #Tech medium.com/p/streaming-struc…
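To make the "partial JSON" problem concrete, here is a minimal best-effort sketch in plain Python. It is not BAML's parser or API; it swaps in a much simpler technique (close any open string, array, or object, then try `json.loads`), and the function name `parse_partial_json` is hypothetical. It returns `None` when the prefix cannot be repaired yet, for example when the stream stops in the middle of a key.

```python
import json


def parse_partial_json(fragment: str):
    """Best-effort parse of a truncated JSON prefix; None if not yet parseable."""
    closers = []        # closing brackets still owed, innermost last
    in_string = False
    escaped = False
    for ch in fragment:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch == "{":
            closers.append("}")
        elif ch == "[":
            closers.append("]")
        elif ch in "}]" and closers:
            closers.pop()

    candidate = fragment
    if in_string:
        if escaped:
            candidate = candidate[:-1]   # drop a dangling backslash
        candidate += '"'                 # close the open string
    else:
        candidate = candidate.rstrip().rstrip(",")  # drop a trailing comma
    candidate += "".join(reversed(closers))
    try:
        return json.loads(candidate)
    except json.JSONDecodeError:
        return None
```

A UI loop would call this on the accumulated buffer after each chunk and keep the last non-`None` result, so fields render as they arrive. A typed, schema-aware parser handles many cases this sketch gives up on.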
Stop finding security vulnerabilities after you’ve already pushed code. 🛑 Most "shift-left" security tools actually shift-middle—they trigger in CI, forcing you to context-switch back to old code. The fix? A prompt-native multi-agent team running pre-commit. The wild part? This architecture has almost zero application code. It’s built entirely in Markdown. 🧵
Check out how we moved security feedback from "post-push" to "pre-commit" using a hub-and-spoke agent architecture.