This week in Voice AI may have been quieter, but it still brought some noteworthy updates 🤗 !
@AgoraIO has released its new ConvoAI Studio providing a no-code deployment tool to configure, test, & deploy Voice AI agents, along with targeted solutions for Customer Support and Sales & Marketing. Additionally, a new open-source solution called Dograh has emerged, which mimics VAPI or Retell AI solutions.
@togethercompute has also enhanced its offerings by adding hosted for speech-to-text (STT) and text-to-speech (TTS) models to deploy very low latency agents in a similar way to Cloudflare approach.
Two opensource TTS models coming from
@hume_ai and
@FishAudio promising zero content hallucinations with a novel text-audio sync approach in the first case and promising high level of control and emotions in the second case.
As usual, upgrades for all the major orchestration frameworks
@livekit,
@pipecat_ai and
@TenFramework and found specially interesting the approach to split instructions by modality added to LiveKit that we have been using at
@livetok_ai for a while too, and the interesting blog post from Daily showing the performance of the new
@nvidia Nemotron model for Voice AI use cases.
Full update with more news, details and link below 👇
Have a nice week!