Joined May 2008
16 Photos and videos
Agents are growing up. Agent infra needs to do so too.
The first wave of enterprise AI agents is being rebuilt. We recently sponsored @VentureBeat's AI Impact Series in New York, where industry leaders tackled the question that's keeping engineering teams up at night: why do production AI agents keep failing, and what does it take to fix them? Spoiler: it's not about smarter models. It's about durable architecture, state management, and building for failure from day one. Read the recap: venturebeat.com/orchestratio…
25
Johann SchleierSmith retweeted
Jun 1
been asking others at Anthropic how they stay in the loop with Claude and fully understand the work being done this is one of my favorites from Suzanne:
212
682
10,452
1,360,275
Johann SchleierSmith retweeted
Measuring someone's productivity by their token usage is a horrible idea. Giving everyone the same fixed token budget isn't much better. So what's the right way to roll out AI across your org? We built a system to measure how many productive engineering hours every Devin task is worth, validated against a dataset of real engineers’ times estimates. The goal is to answer the fundamental question that companies are grappling with: how much real value are you getting from each of your agent sessions? On top of that, we're giving an AI productivity guarantee! Now if Devin delivers less engineering value than you're paying for, we fund your usage until it does. The whole industry needs to move from measuring activity to measuring output. We hope to see more AI companies taking this approach.
AI should earn its keep. Introducing the AI Productivity Guarantee. If Devin delivers less engineering value than you’re paying for, Cognition will fund your usage until it does, up to $10 million. It’s time for the AI industry to stop maximizing tokens and start maximizing productive output.
59
58
893
159,711
It feels like Claude writes better in LaTeX than in Markdown or HTML. Can anyone confirm / contradict? I’m getting concise language, good argument development, and minimal LLM smell. Not my unusual experience. Using Opus 4.7 in Claude Code.
65
Johann SchleierSmith retweeted
May 20
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
1,198
3,918
26,787
13,571,069
AI investment accelerating at Temporal
Agentic AI needs infrastructure that actually works in production. Today we're doubling down. 🚀 Welcoming Chkk and Adviser Labs to Temporal — two teams with AWS-scale distributed systems experience and Stanford AI research who were already building on Temporal before we ever called. More at Replay next week👀 In the meantime, read more about this: temporal.io/blog/doubling-do…
2
96
Johann SchleierSmith retweeted
literally everyone is asking the mcp vs cli question. the team answered it in this new post - check it out and let us know what you think!
New blog: Building agents that reach production systems with MCP. When should agents use direct APIs vs CLIs vs MCP? Plus patterns for building MCP servers, context-efficient clients and pairing MCP with skills. claude.com/blog/building-age…
5
11
254
148,260
Claude Codex TLA is a powerful combo for generating reliable code that handles corner cases
1
98
Johann SchleierSmith retweeted
Agents fail in production for boring reasons. Transient API errors. Human approval steps that time out. Non-deterministic branching that nobody accounted for at 2am on a Saturday. Our CTO @mfateev is at #GoogleCloudNext April 23 with @GitLab @vellum_ai getting into the infrastructure decisions that determine whether your agent system survives contact with real users. Register: googlecloudevents.com/next-v…
5
9
817
I sat down with @temporalio CEO Samar Abbas to talk about his view of AI systems trends. Check out the writeup!
We're in the MS-DOS era of agentic AI. In this article, our CEO Samar Abbas unpacks what that actually means and what comes after: temporal.io/blog/ms-dos-to-a…
1
6
487
Major upgrades to OpenAI Agents SDK out this week. Run them all on @temporalio
Build long-running agents with more control over agent execution. New capabilities in the Agents SDK: • Run agents in controlled sandboxes • Inspect and customize the open-source harness • Control when memories are created and where they’re stored
1
83
Johann SchleierSmith retweeted
Apr 8
AI agents don't just need smarter models. They need code that won't crash. See what @temporalio co-founder and CTO, @mfateev, had to say at HumanX.
5
12
978
Johann SchleierSmith retweeted
Keep your AI agents running through process crashes and network drops. This tutorial shows you how to build a ReAct-style loop with the Gemini API and Temporal. Temporal persists every step so your agent resumes exactly where it left off. Learn more → goo.gle/4rvf0Wr
21
45
248
21,911
I'm claiming my AI agent "jssys01t" on @moltbook 🦞 Verification: burrow-P9LZ
1
1
81
I'm claiming my AI agent "jssysoc" on @moltbook 🦞 Verification: wave-7F5B
86
I'm claiming my AI agent "jsski" on @moltbook 🦞 Verification: coast-K3EL
73
Progress was already exponential before we had agents creating other agents. What's next?
10 Sep 2025
AI agents can prototype apps… But shipping real software takes hours of testing, debugging, and refactoring. Agent 3 is 10× more autonomous — it keeps going where others get stuck. The “Full Self-Driving” moment of software.
222
Big News! Crystal DBA has been acquired by Temporal Technologies . Read more: crystaldba.ai/blog/post/temp… @temporalio @auto_dba

3
452
Johann SchleierSmith retweeted
9 Aug 2025
We've been building the pieces for years. Projects, AI Agents, Automations. Today, the dots connect. Introducing 🧬 Taskade Genesis Preview • One prompt → a full-stack AI app • Powered by your Workspace • Supercharged with GPT-5 Reply `Genesis` for early access 🚀
193
21
121
51,480
Johann SchleierSmith retweeted
29 Jul 2025
We’re excited to introduce RunLLM v2 today! 🎉 RunLLM v2 is rebuild of the product from the ground up focused on delivering the most powerful and flexible platform for enterprise support teams. Read the RunLLM v2 launch blog post: 👉 tinyurl.com/49st5ns2 Today’s launch includes: 🤖 A new agentic planner with fine-grained reasoning and tool use support ✨ A redesigned new UI that enables creating, managing, and inspecting multiple agents ⚙️ A Python SDK that allows you to exercise fine-grained control over support workflows We’ll be sharing more throughout the week, but today we’re focused on how RunLLM’s new agentic capabilities enable more precise answers and more effective debugging.
4
14
34
9,547