Staff Engineer at @trydaily. Loves coffee, music, family and frontend. Previously @stickermule Namics @lyskaio Tweets are mine.

Joined October 2009
64 Photos and videos
Christian Stuff retweeted
17 Mar 2025
Introducing Pipecat Cloud, infrastructure for open source voice AI agents. If you're building voice AI agents with @pipecat_ai, you have lots of options for hosting your agents: anywhere you can run a Python process and terminate WebSocket or WebRTC connections. But managing agents in production, on rock solid infrastructure, with observability, autoscaling, blue-green deployments, and everything else needed for real usage at scale is not trivial. Devops is a big category of questions and conversation threads in the Pipecat Discord. So we built a platform specifically for voice AI. I've been describing Pipecat Cloud as ➡️ a "Kubernetes wrapper," or ➡️ "Heroku for voice agents, or ➡️ "you just push us a Docker container and we do the rest." If you're building voice agents with Pipecat, take a look at Pipecat Cloud and tell us what you think.
20
40
294
37,890
Christian Stuff retweeted
Correctly identifying guitar sounds with Google Gemini 2 voice-to-voice
Great to have this repo! thanks @kwindla! Gemini 2 Voice-to-Voice = the fastest voice interface? Pipecat means fast transport. It's clearly hearing non-verbal audio but struggles to describe it, and can't sing or laugh. Great for practical apps. This repo rocks!
4
6
54
4,943
Christian Stuff retweeted
Build real-time voice agents with Gemini, Pipecat, and Daily! Read more about our partnership with Google to build Multimodal Live API support into @pipecat_ai as part of the Gemini 2.0 launch
11 Dec 2024
Gemini 2.0 launched today. Amazing multimodal capabilities, long context windows, fast response times, built-in tools, and top-of-the-leaderboards reasoning capabilities. Plus a new API — the Multimodal Live API — for conversational AI applications, like voice agents and multimodal copilots. @Google and Daily have partnered to build Multimodal Live API support into the @pipecat_ai Open Source SDKs for Web, Android, iOS and C . The Pipecat SDKs come with echo cancellation and noise reduction, device management, event abstractions, React hooks, and more. They support both direct connections to the Gemini WebSocket API, and WebRTC routing on Daily's global ultra-low latency network. Build realtime voice agents with Gemini, Pipecat, and Daily. Links to docs and starter kits in the thread below (1/4)...
2
5
231
Christian Stuff retweeted
11 Dec 2024
Big day today for conversational AI! A new Gemini 2.0 model and a new voice-to-voice (plus video input) API from @Google. 🔊📹🤖⚡️🔊😀 See the thread below for links to: ➡️ Open Source @pipecat_ai clients for Web, React, Android, iOS, and C . Echo cancellation and noise reduction, hooks for function calling and tool use, support for both WebSocket and WebRTC network transport. ➡️ A Pipecat service that brings the Multimodal Live API features into the Pipecat Open Source ecosystem. Use this model in combination with your existing voice agent workflows, for example. ➡️ Bite-sized sample code demos. ➡️ A full-blown multimodal chat app starter kit project.
11 Dec 2024
Gemini 2.0 launched today. Amazing multimodal capabilities, long context windows, fast response times, built-in tools, and top-of-the-leaderboards reasoning capabilities. Plus a new API — the Multimodal Live API — for conversational AI applications, like voice agents and multimodal copilots. @Google and Daily have partnered to build Multimodal Live API support into the @pipecat_ai Open Source SDKs for Web, Android, iOS and C . The Pipecat SDKs come with echo cancellation and noise reduction, device management, event abstractions, React hooks, and more. They support both direct connections to the Gemini WebSocket API, and WebRTC routing on Daily's global ultra-low latency network. Build realtime voice agents with Gemini, Pipecat, and Daily. Links to docs and starter kits in the thread below (1/4)...
1
20
85
15,929
Christian Stuff retweeted
11 Dec 2024
Gemini 2.0 launched today. Amazing multimodal capabilities, long context windows, fast response times, built-in tools, and top-of-the-leaderboards reasoning capabilities. Plus a new API — the Multimodal Live API — for conversational AI applications, like voice agents and multimodal copilots. @Google and Daily have partnered to build Multimodal Live API support into the @pipecat_ai Open Source SDKs for Web, Android, iOS and C . The Pipecat SDKs come with echo cancellation and noise reduction, device management, event abstractions, React hooks, and more. They support both direct connections to the Gemini WebSocket API, and WebRTC routing on Daily's global ultra-low latency network. Build realtime voice agents with Gemini, Pipecat, and Daily. Links to docs and starter kits in the thread below (1/4)...
4
11
38
18,991
Christian Stuff retweeted
Developers now can build voice-to-voice AI with Llama models! We’re excited to partner with @trydaily to bring the power of Llama 3.1 models to ultra low latency voice and video agents.
20 Aug 2024
Today we’re launching Daily Bots, the ultra low latency Open Source cloud for voice, vision, and video AI. Build voice-to-voice with any LLM, at conversational latencies as low as 500ms. With Daily Bots, developers can: *️⃣  build with Open Source SDKs *️⃣  mix and match the best Generative AI models for specific use cases *️⃣  run at scale on Daily’s real-time global infrastructure We've partnered with @AnthropicAI, @cartesia, @DeepgramAI, and @togethercompute for this launch. Our goal is to combine the best tools, best developer ergonomics, and best infrastructure for real-time AI into a single platform. Daily Bots is the culmination of the last 18 months of work we've done with customers and partners, pioneering real-time and voice-to-voice AI in production. The two fastest growing Open Source real-time AI projects came out of this work: @pipecat_ai and RTVI. *️⃣  Daily Bots apps are built using the RTVI Open Source SDKs for the Web, iOS, and Android. Your Daily Bots code will run anywhere that supports the RTVI standard. (Or you can run your own infrastructure.) *️⃣  Your Daily Bots can also answer the phone. (You can buy a phone number from us with a single curl command.) *️⃣  Bring your own API Keys and use any inference provider that supports OpenAI-compatible APIs. Or run your own models and connect your bots to your infrastructure. For a highly non-serious take on the kinds of things we’ve been creating for ourselves, as we’ve worked on Daily Bots, check out the video. We'd love to hear what multi-modal, real-time AI directions are most interesting to you. We can't wait to see what you build.
1
9
43
4,863
Christian Stuff retweeted
20 Aug 2024
🤖 Daily Bots launch day at @trydaily! 🤖 Voice-to-voice with any LLM. (And vision and video, of course.) We've had a ton of fun building this hosted service for real-time AI on top of the Open Source projects that we've been contributing to for the last year. I'm all in on the potential of multi-modal AI. PS: if you've seen the 500ms voice-to-voice demos I've been posting, you can build that kind of thing yourself now in ~18 lines of code. 🏎️ Depending on the LLM you choose and how you prompt it, you can optimize for latency, or for features like function calling and vision input, or for experimenting with new capabilities like @AnthropicAI's incredibly useful new prompt caching beta.
20 Aug 2024
Today we’re launching Daily Bots, the ultra low latency Open Source cloud for voice, vision, and video AI. Build voice-to-voice with any LLM, at conversational latencies as low as 500ms. With Daily Bots, developers can: *️⃣  build with Open Source SDKs *️⃣  mix and match the best Generative AI models for specific use cases *️⃣  run at scale on Daily’s real-time global infrastructure We've partnered with @AnthropicAI, @cartesia, @DeepgramAI, and @togethercompute for this launch. Our goal is to combine the best tools, best developer ergonomics, and best infrastructure for real-time AI into a single platform. Daily Bots is the culmination of the last 18 months of work we've done with customers and partners, pioneering real-time and voice-to-voice AI in production. The two fastest growing Open Source real-time AI projects came out of this work: @pipecat_ai and RTVI. *️⃣  Daily Bots apps are built using the RTVI Open Source SDKs for the Web, iOS, and Android. Your Daily Bots code will run anywhere that supports the RTVI standard. (Or you can run your own infrastructure.) *️⃣  Your Daily Bots can also answer the phone. (You can buy a phone number from us with a single curl command.) *️⃣  Bring your own API Keys and use any inference provider that supports OpenAI-compatible APIs. Or run your own models and connect your bots to your infrastructure. For a highly non-serious take on the kinds of things we’ve been creating for ourselves, as we’ve worked on Daily Bots, check out the video. We'd love to hear what multi-modal, real-time AI directions are most interesting to you. We can't wait to see what you build.
3
12
75
6,206
Christian Stuff retweeted
20 Aug 2024
Today we’re launching Daily Bots, the ultra low latency Open Source cloud for voice, vision, and video AI. Build voice-to-voice with any LLM, at conversational latencies as low as 500ms. With Daily Bots, developers can: *️⃣  build with Open Source SDKs *️⃣  mix and match the best Generative AI models for specific use cases *️⃣  run at scale on Daily’s real-time global infrastructure We've partnered with @AnthropicAI, @cartesia, @DeepgramAI, and @togethercompute for this launch. Our goal is to combine the best tools, best developer ergonomics, and best infrastructure for real-time AI into a single platform. Daily Bots is the culmination of the last 18 months of work we've done with customers and partners, pioneering real-time and voice-to-voice AI in production. The two fastest growing Open Source real-time AI projects came out of this work: @pipecat_ai and RTVI. *️⃣  Daily Bots apps are built using the RTVI Open Source SDKs for the Web, iOS, and Android. Your Daily Bots code will run anywhere that supports the RTVI standard. (Or you can run your own infrastructure.) *️⃣  Your Daily Bots can also answer the phone. (You can buy a phone number from us with a single curl command.) *️⃣  Bring your own API Keys and use any inference provider that supports OpenAI-compatible APIs. Or run your own models and connect your bots to your infrastructure. For a highly non-serious take on the kinds of things we’ve been creating for ourselves, as we’ve worked on Daily Bots, check out the video. We'd love to hear what multi-modal, real-time AI directions are most interesting to you. We can't wait to see what you build.
11
32
139
31,488
Christian Stuff retweeted
23 Jul 2024
Very, very fast voice bots. Llama 3.1 running on @GroqInc. 🚀 500ms voice-to-voice response times
81
442
3,898
386,492
Christian Stuff retweeted
12 Mar 2024
Function calling, interruptibility, fast responses. This is a nice example of where real-world voice interfaces are headed.
3
3
65
9,593
People who know me know I am a big fan and defendant of open web standards. The list outlined in @kwindla‘s blog post is probably one of the most important lists to look at as the year comes to an end (I might be exaggerating if I said „more important than Santa‘s list“).
19 Dec 2023
Motivated by Twilio's announcement that Twilio Video is going away, I've been spending some time digging into what the latest version of Zoom's Web SDK can (and can't) do. It's definitely getting better, but it's still much less performant than native WebRTC. And it's still missing a lot of things that web video apps need. daily.co/blog/zoom-web-sdk-t…
1
2
91
It‘s not just about the video platform itself, but about the documentation, support and the SDKs and API design that set the ease of a migration path. Personally I care a lot about providing a very good entry point into Daily‘s video platform for React developers.
1
37
I am a big fan and defendant of open web standards.
26
Christian Stuff retweeted
12 Dec 2023
Putting this out there for search's sake (lol, twitter search): If you've built something on @twilio's video product and you're trying to figure out what to do about the EOL, you should check out Daily. We've even got a post showing you how to migrate: daily.co/blog/migrating-a-tw…
1
4
237
It was an absolute pleasure to be part of this event! So many amazing people, great talks and projects and a fantastic organization team! 🤩
Thank you, everyone, for the amazing conference🌟 Speakers, sponsors, volunteers, attendees, partners, artists, MCs, program committee ♥️ You are awesome!
1
6
335
You’ve heard of elf on a shelf, but here’s
1
77
1
32
25