kwindla

kwindla

64 Photos and videos

Tweets

Christian Stuff retweeted

kwindla

@kwindla

17 Mar 2025

Introducing Pipecat Cloud, infrastructure for open source voice AI agents. If you're building voice AI agents with @pipecat_ai, you have lots of options for hosting your agents: anywhere you can run a Python process and terminate WebSocket or WebRTC connections. But managing agents in production, on rock solid infrastructure, with observability, autoscaling, blue-green deployments, and everything else needed for real usage at scale is not trivial. Devops is a big category of questions and conversation threads in the Pipecat Discord. So we built a platform specifically for voice AI. I've been describing Pipecat Cloud as ➡️ a "Kubernetes wrapper," or ➡️ "Heroku for voice agents, or ➡️ "you just push us a Docker container and we do the rest." If you're building voice agents with Pipecat, take a look at Pipecat Cloud and tell us what you think.

294

37,890

Joe Heitzeberg

Christian Stuff retweeted

Joe Heitzeberg

@jheitzeb

11 Dec 2024

Correctly identifying guitar sounds with Google Gemini 2 voice-to-voice

1:09

Joe Heitzeberg

@jheitzeb

11 Dec 2024

Great to have this repo! thanks @kwindla! Gemini 2 Voice-to-Voice = the fastest voice interface? Pipecat means fast transport. It's clearly hearing non-verbal audio but struggles to describe it, and can't sing or laugh. Great for practical apps. This repo rocks!

4,943

Ramya Venkat

Christian Stuff retweeted

Ramya Venkat @ramyavmani

11 Dec 2024

Build real-time voice agents with Gemini, Pipecat, and Daily! Read more about our partnership with Google to build Multimodal Live API support into @pipecat_ai as part of the Gemini 2.0 launch

Daily

@trydaily

11 Dec 2024

Gemini 2.0 launched today. Amazing multimodal capabilities, long context windows, fast response times, built-in tools, and top-of-the-leaderboards reasoning capabilities. Plus a new API — the Multimodal Live API — for conversational AI applications, like voice agents and multimodal copilots. @Google and Daily have partnered to build Multimodal Live API support into the @pipecat_ai Open Source SDKs for Web, Android, iOS and C . The Pipecat SDKs come with echo cancellation and noise reduction, device management, event abstractions, React hooks, and more. They support both direct connections to the Gemini WebSocket API, and WebRTC routing on Daily's global ultra-low latency network. Build realtime voice agents with Gemini, Pipecat, and Daily. Links to docs and starter kits in the thread below (1/4)...

231

kwindla

Christian Stuff retweeted

kwindla

@kwindla

11 Dec 2024

Big day today for conversational AI! A new Gemini 2.0 model and a new voice-to-voice (plus video input) API from @Google. 🔊📹🤖⚡️🔊😀 See the thread below for links to: ➡️ Open Source @pipecat_ai clients for Web, React, Android, iOS, and C . Echo cancellation and noise reduction, hooks for function calling and tool use, support for both WebSocket and WebRTC network transport. ➡️ A Pipecat service that brings the Multimodal Live API features into the Pipecat Open Source ecosystem. Use this model in combination with your existing voice agent workflows, for example. ➡️ Bite-sized sample code demos. ➡️ A full-blown multimodal chat app starter kit project.

Daily

@trydaily

11 Dec 2024

15,929

Daily

Christian Stuff retweeted

Daily

@trydaily

11 Dec 2024

18,991

Together AI

Christian Stuff retweeted

Together AI

@togethercompute

20 Aug 2024

Developers now can build voice-to-voice AI with Llama models! We’re excited to partner with @trydaily to bring the power of Llama 3.1 models to ultra low latency voice and video agents.

Daily

@trydaily

20 Aug 2024

Today we’re launching Daily Bots, the ultra low latency Open Source cloud for voice, vision, and video AI. Build voice-to-voice with any LLM, at conversational latencies as low as 500ms. With Daily Bots, developers can: *️⃣ build with Open Source SDKs *️⃣ mix and match the best Generative AI models for specific use cases *️⃣ run at scale on Daily’s real-time global infrastructure We've partnered with @AnthropicAI, @cartesia, @DeepgramAI, and @togethercompute for this launch. Our goal is to combine the best tools, best developer ergonomics, and best infrastructure for real-time AI into a single platform. Daily Bots is the culmination of the last 18 months of work we've done with customers and partners, pioneering real-time and voice-to-voice AI in production. The two fastest growing Open Source real-time AI projects came out of this work: @pipecat_ai and RTVI. *️⃣ Daily Bots apps are built using the RTVI Open Source SDKs for the Web, iOS, and Android. Your Daily Bots code will run anywhere that supports the RTVI standard. (Or you can run your own infrastructure.) *️⃣ Your Daily Bots can also answer the phone. (You can buy a phone number from us with a single curl command.) *️⃣ Bring your own API Keys and use any inference provider that supports OpenAI-compatible APIs. Or run your own models and connect your bots to your infrastructure. For a highly non-serious take on the kinds of things we’ve been creating for ourselves, as we’ve worked on Daily Bots, check out the video. We'd love to hear what multi-modal, real-time AI directions are most interesting to you. We can't wait to see what you build.

3:58

4,863

kwindla

Christian Stuff retweeted

kwindla

@kwindla

20 Aug 2024

🤖 Daily Bots launch day at @trydaily! 🤖 Voice-to-voice with any LLM. (And vision and video, of course.) We've had a ton of fun building this hosted service for real-time AI on top of the Open Source projects that we've been contributing to for the last year. I'm all in on the potential of multi-modal AI. PS: if you've seen the 500ms voice-to-voice demos I've been posting, you can build that kind of thing yourself now in ~18 lines of code. 🏎️ Depending on the LLM you choose and how you prompt it, you can optimize for latency, or for features like function calling and vision input, or for experimenting with new capabilities like @AnthropicAI's incredibly useful new prompt caching beta.

Daily

@trydaily

20 Aug 2024

3:58

6,206

Daily

Christian Stuff retweeted

Daily

@trydaily

20 Aug 2024

3:58

139

31,488

kwindla

Christian Stuff retweeted

kwindla

@kwindla

23 Jul 2024

Very, very fast voice bots. Llama 3.1 running on @GroqInc. 🚀 500ms voice-to-voice response times

1:52

442

3,898

386,492

kwindla

Christian Stuff retweeted

kwindla

@kwindla

12 Mar 2024

Function calling, interruptibility, fast responses. This is a nice example of where real-world voice interfaces are headed.

3:15

9,593

Christian Stuff

Christian Stuff @Regaddi

19 Dec 2023

People who know me know I am a big fan and defendant of open web standards. The list outlined in @kwindla‘s blog post is probably one of the most important lists to look at as the year comes to an end (I might be exaggerating if I said „more important than Santa‘s list“).

kwindla

@kwindla

19 Dec 2023

Motivated by Twilio's announcement that Twilio Video is going away, I've been spending some time digging into what the latest version of Zoom's Web SDK can (and can't) do. It's definitely getting better, but it's still much less performant than native WebRTC. And it's still missing a lot of things that web video apps need. daily.co/blog/zoom-web-sdk-t…

more replies

Christian Stuff

Christian Stuff @Regaddi

19 Dec 2023

It‘s not just about the video platform itself, but about the documentation, support and the SDKs and API design that set the ease of a migration path. Personally I care a lot about providing a very good entry point into Daily‘s video platform for React developers.

Christian Stuff

Christian Stuff @Regaddi

19 Dec 2023

I am a big fan and defendant of open web standards.

Chad

Christian Stuff retweeted

Chad

@chadbailey59

12 Dec 2023

Putting this out there for search's sake (lol, twitter search): If you've built something on @twilio's video product and you're trying to figure out what to do about the EOL, you should check out Daily. We've even got a post showing you how to migrate: daily.co/blog/migrating-a-tw…

Migrating a Twilio video demo to Daily

A practical migration of a Twilio application to Daily’s video API

daily.co

237

Christian Stuff

Christian Stuff @Regaddi

10 Dec 2023

It was an absolute pleasure to be part of this event! So many amazing people, great talks and projects and a fantastic organization team! 🤩

React Day Berlin

@reactdayberlin

8 Dec 2023

Thank you, everyone, for the amazing conference🌟 Speakers, sponsors, volunteers, attendees, partners, artists, MCs, program committee ♥️ You are awesome!

335

Christian Stuff

Christian Stuff @Regaddi

7 Dec 2023

You’ve heard of elf on a shelf, but here’s

more replies

Christian Stuff

Christian Stuff @Regaddi

7 Dec 2023

Christian Stuff

Christian Stuff @Regaddi

7 Dec 2023