Real-time AI, voice and video infrastructure for developers #videosdk #videoapi #webrtc

Joined March 2021
152 Photos and videos
Pinned Tweet
You don’t like the voice of your realtime model. But you can’t change it. Can’t fix pronunciation. Can’t clean transcripts before LLM. That’s where voice agents fail. So we rebuilt the pipeline. Introducing VideoSDK Agents v1 - Prism Explore Agent v1: dub.sh/U887mrY
1
1
3
343
VideoSDK retweeted
Anam interactive avatars are now officially on @Video_SDK. Our integration brings real-time avatars into VideoSDK agent pipelines. Video > voice > text: 70% user preference over voice-only across customer deployments.
1
1
2
166
Docs shouldn’t be searched. They should answer. So we built an MCP Server for VideoSDK Agents Doc🚀 Ask anything. Get exactly what you need. Docs that respond > docs you search. 🔗Explore - dub.sh/FeVyHkW
1
3
166
VideoSDK retweeted
You don’t like the voice of your realtime model. But you can’t change it. Can’t fix pronunciation. Can’t clean transcripts before LLM. That’s where voice agents fail. So we rebuilt the pipeline.
1
1
5
226
You don’t like the voice of your realtime model. But you can’t change it. Can’t fix pronunciation. Can’t clean transcripts before LLM. That’s where voice agents fail. So we rebuilt the pipeline. Introducing VideoSDK Agents v1 - Prism Explore Agent v1: dub.sh/U887mrY
1
1
3
343
• Unified Pipeline • pipeline hooks - control any stage (audio, text, turns) • Hybrid Pipeline - Mix any stack - STT, LLM, TTS, knowledge base • Compose anything - STT , LLM-only, full voice, realtime • Built-in observability - metrics, logs, traces • Avatar support plug
1
75
Which means the same system can be used to build: • transcription pipelines • voice copilots • chat voice agents • fully autonomous voice agents • realtime agents Read more about Agent v1 here : dub.sh/U887mrY
1
60
VideoSDK retweeted
🚀 𝗚𝗲𝗺𝗶𝗻𝗶 𝟯.𝟭 𝗙𝗹𝗮𝘀𝗵 𝗟𝗶𝘃𝗲 is now supported on VideoSDK AI Voice Agents! @Google just launched their most capable real-time voice model yet and you can start building with it on VideoSDK today. Check out the Docs now: docs.videosdk.live/ai_agents…
1
2
2
159
VideoSDK retweeted
We’re excited to introduce @Anam__ai integration with VideoSDK AI Voice Agents 🚀 You can now add real-time, expressive AI avatars to your voice agents  with natural lip sync and sub-second responses. 👉 𝗘𝘅𝗽𝗹𝗼𝗿𝗲 𝘁𝗵𝗶𝘀 𝗶𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗶𝗼𝗻 : dub.sh/3WVOSxj
1
2
1
146
We’re excited to introduce @Anam__ai integration with VideoSDK AI Voice Agents 🚀 You can now add real-time, expressive AI avatars to your voice agents  with natural lip sync and sub-second responses. 👉 𝗘𝘅𝗽𝗹𝗼𝗿𝗲 𝘁𝗵𝗶𝘀 𝗶𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗶𝗼𝗻 : dub.sh/3WVOSxj
1
2
1
146
𝗪𝗵𝘆 𝘁𝗵𝗶𝘀 𝗺𝗮𝘁𝘁𝗲𝗿𝘀: - Higher engagement with video-first experiences - Natural, real-time conversations with low latency - Best-in-class realism powered by Anam’s CARA model - bring your own LLM, customize personas, clone voices and support 50 languages
1
48
🚀 𝗚𝗲𝗺𝗶𝗻𝗶 𝟯.𝟭 𝗙𝗹𝗮𝘀𝗵 𝗟𝗶𝘃𝗲 is now supported on VideoSDK AI Voice Agents! @Google just launched their most capable real-time voice model yet and you can start building with it on VideoSDK today. Check out the Docs now: docs.videosdk.live/ai_agents…
1
2
2
159
𝗚𝗲𝗺𝗶𝗻𝗶 𝟯.𝟭 𝗙𝗹𝗮𝘀𝗵 𝗟𝗶𝘃𝗲 𝗯𝗿𝗶𝗻𝗴𝘀: - Ultra-low latency audio-to-audio responses - Acoustic nuance detection (pitch, pace, tone) - 70 languages supported in real time - Improved tool calling - 2x longer conversation memory
1
1
42
Whether you're building customer support bots, AI meeting assistants, or multilingual voice apps - @video_sdk AI Voice Agents gives you the fastest path from idea to a live voice experience. And now it runs on the most powerful real-time model @Google has ever shipped.
28
VideoSDK retweeted
Anam interactive avatars are now natively supported on @Video_SDK. Add real-time video avatars to VideoSDK agent pipelines. Under 10 lines of Python. Sub-second response times. Works with your existing STT, LLM, and TTS. 70% user preference over voice-only. CARA model leads all tested providers on visual quality, lip sync, and overall experience (avatarbenchmark.com). anam.ai/interactive-avatars/… @SagarKava_ @Arjun_Kava
4
10
320
VideoSDK retweeted
1/ Introducing AI Voice Agent Cloud Deployments via CLI Dashboard ⚡️ Deploying voice agents shouldn’t feel complicated.
4
1
6
406
Announcing VideoSDK Inference: One Magic API for Every Voice AI Model 🎉 Maintaining multiple accounts for speech recognition, language models, and speech synthesis, each with its own keys, quotas, billing, and APIs 👉 Explore VideoSDK Inference : dub.sh/uIsIJu9
1
1
7
543
- No multiple provider accounts to manage - Works across telephony, web, mobile, and IoT - Built for true real-time, low-latency conversations - Supported models - gemini, sarvam AI, Deepgram, Cartesia
145
🚀 Introducing 𝗩𝗶𝗱𝗲𝗼𝗦𝗗𝗞 𝗣𝗵𝗼𝗻𝗲 𝗡𝘂𝗺𝗯𝗲𝗿𝘀 We’re excited to announce VideoSDK Phone Numbers, first-party telephony service that lets you connect voice agents directly to the phone network 👉 Read the full announcement here : dub.sh/6dXwMOR
1
2
195
𝗪𝗵𝗮𝘁’𝘀 𝗶𝗻𝗰𝗹𝘂𝗱𝗲𝗱 - 𝗙𝗶𝗿𝘀𝘁-𝗽𝗮𝗿𝘁𝘆 𝘁𝗲𝗹𝗲𝗽𝗵𝗼𝗻𝘆 : Purchase US local or toll-free numbers directly from the VideoSDK dashboard - 𝗟𝗼𝘄𝗲𝗿 𝗹𝗮𝘁𝗲𝗻𝗰𝘆, 𝗯𝗲𝘁𝘁𝗲𝗿 𝗾𝘂𝗮𝗹𝗶𝘁𝘆 : Fewer network hops mean more reliable calls, crisper audio.
1
2
118
- 𝗦𝗶𝗺𝗽𝗹𝗲 𝗱𝗶𝘀𝗽𝗮𝘁𝗰𝗵 𝗿𝘂𝗹𝗲𝘀 : Attach a number to a voice agent and go live instantly. Manage everything in one place.
2
97