engineering @ Elyos. ex @GoCardless, @bcgdv. pilot. redhead.

Joined March 2010
402 Photos and videos
This is the classic min/max tweet that adds nothing to the conversation about voice agents. The real headline: NVIDIA released an iteration on speech to speech agents. We're still a long way from these being useful in production. The voices suck, tool calling with be bad, etc
The part most people will skip: NVIDIA just made every voice AI API a commodity. OpenAI charges $0.06/min input and $0.24/min output for Realtime API. Gemini Live bills 25 tokens/second of audio. Every startup building voice agents is hemorrhaging cash on per-minute API fees to run what is fundamentally a pipeline problem: ASR → LLM → TTS, three models stitched together with latency at every seam. PersonaPlex replaces that entire pipeline with one 7B model. Runs on a single A100. Open weights, MIT license, commercial use permitted. Response latency: 0.170 seconds for turn-taking, 0.240 seconds for interruptions. It scores higher on dialog naturalness than Gemini (2.95 vs 2.80 MOS) and handles interruptions better than every commercial system they benchmarked. This tells you everything about NVIDIA’s playbook. They don’t need to charge for the model. They need you to buy the GPU. Every company that self-hosts PersonaPlex instead of paying OpenAI per-minute is another A100/H100 sale. Every voice agent startup that drops their API dependency is another enterprise GPU contract. NVIDIA open-sourced the fishing rod because they sell the lake. Built on the Moshi architecture from Kyutai, fine-tuned with under 5,000 hours of data. The voice AI margin is migrating from the application layer to the hardware layer. And NVIDIA is the only company that profits no matter which model wins. 330,000 downloads in the first month. That’s infrastructure capture disguised as generosity.
48
I wish claude code had a way to have tools emit progress bars..
1
49
Real world benchmarking … @DeepgramAI @covaldev @cekuraAi @livekit
At @Elyos_AI We benchmarked 13 STT providers on 100 real customer calls from the trades businesses. Not synthetic lab data. Real calls with: - Background noise & multiple speakers - UK postcodes & addresses - Regional accents (England, Scotland, Ireland) - Short confirmations to long explanations Top performers: 🥇 @DeepgramAI Flux - 15.9% WER 🥈 @soniox_ai - 16.9% WER 🥉 @Speechmatics - 17.7% WER @OpenAI Whisper? 39.8% WER - wouldn't recommend for production voice AI. What's your experience with STT models? Are we there yet?
71
This is exactly what we’re doing @Elyos_AI
i know a small team in Texas making more than most “AI startups” just by fixing one boring problem for local contractors they noticed something simple: contractors can handle the work but the admin part drains their entire week quoting takes too long scheduling gets messy invoices pile up follow-ups never happen so they built a system that cleans up the admin headache: • pulls job requests from text, email, and WhatsApp • turns them into structured job details • drafts the quote • books the slot on the calendar • sends reminders • generates the invoice • collects payment • pushes everything to the accounting tool all with off-the-shelf tools stitched together cleanly they charge depending on the size of the business people try it for a month, realise they’re saving hours every week, and then they stick around now they manage the backend for 40 trades businesses solving a painful, recurring, unsexy operational problem that owners will happily pay to make disappear everyone wants to build AI copilots for the Fortune 500 meanwhile, the people printing money are the ones automating inbox chaos for local businesses
73
If the outcome of this is more people get comfortable in a terminal environment… that can only be a good thing.
Every single dev and product team I speak to in the last 30 days has moved from Cursor to Claude Code. 1. Is this permanent? 2. If so, what happens to Cursor?
99
28 Nov 2025
Yo @sama how do we get OpenAI support to respond to our support requests :( we’re being left on read.
1
32
matt brown retweeted
10 Feb 2025
Getting real IBM Watson vibes from all these Salesforce AgentForce ads
15
11
384
28,439
3 Feb 2025
Why is iOS speech to text so much better at numbers, addresses and postcodes than @DeepgramAI, @OpenAI whisper, etc? It nails them - the rest are mediocre at best.
126
matt brown retweeted
What's your favourite compact/pancake lens for a Fujifilm X-T4? I want to go out more with my camera without having to fight with a bulky lens.
1
1
1
149
25 Oct 2024
Warehouse energy usage as 8 bit art.. yellow=solar exports green=solar consumed black=grid imports
2
109
30 Aug 2024
Software engineering is making assumptions, and then figuring out N weeks later that assumption was completely wrong.
1
112
matt brown retweeted
this from @forrestbrazeal is just wonderful. i properly laughed out loud
4
95
329
49,212
matt brown retweeted
Rails is dead, also we sell half a million dollars worth of conference tickets in 20 minutes.
6
23
339
22,649
25 Apr 2024
GCP DataStream....... GCP's easy to use data replication product that just requires you to follow 15 complex steps, and do a small dance, to set up.
1
1
211
25 Mar 2024
Data viz on the web always turns into a battle against leaky abstractions…
112
22 Mar 2024
Anybody have solar that's exporting with a FIT, or via the SEG? Did you get given a second meter when you did the install?
1
213
8 Mar 2024
I remember watching a video once about structuring job queues using SLOs (you have a 5 minute queue, a 30 minute queue, etc). If jobs on the 5min queue we're delayed by more than 5mins you'd scale the workers. Anybody recollect this? GCers? @ghaidar0 @lawrjones?
3
607