Varun Singh

Varun Singh

344 Photos and videos

Tweets

Varun Singh

@vr000m

Jun 12

A day with Fable 5 running a task and I started to hear my computer speak and play tones. I had asked it to debug an audio capture issue. In the past, Opus would either ask me to run the Smoke test wire up the devices since it required a human to test. And if this was repetitive, I would ask it to make a script or similar to do the test. But this I saw it take the initiative to do the analysis by itself. This was on a new project and I heard it say "Samantha" then a few tones, a period of silence and this continued for an hour. I spun up another claude to ask it to analyse the JSONL and understand why it was playing tones. It said: "to prove the Core Audio tap was actually capturing system audio, the test played a known audible signal and checked the captured PCM peak amplitude." This was a pipecat example with a menu bar item and nemotron-3.5-asr capturing mic and system audio. Fable wrote the menubar, coreaudio bindings, pipecat reads from these sockets and sends it to the ASR and SmartTurnModel to write full semantic sentences into a VTT-style file. One more step in loop engineering unlocked, on to the next.

106

Varun Singh

Varun Singh

@vr000m

Jun 12

The code is here github.com/vr000m/onoats-bot github.com/vr000m/pipecat-lo… @pipecat_ai

GitHub - vr000m/onoats-bot

Contribute to vr000m/onoats-bot development by creating an account on GitHub.

github.com

Varun Singh

Varun Singh

@vr000m

Jun 12

We all want to run a personal assistant somewhere. @signalgaining invited me to do a quick demo of @pipecat_ai on a jetson nano and running local model from voxtral and kokoro. Cant wait to see pipecat and voice ai in the robots!

Wendy

@wendylabsinc

Jun 10

Want to build your own Alexa, Siri, or Google with an NVIDIA Jetson? Get started today with `wendy init --template` wendy.dev/blog/voice-ai-agen…

7:09

167

Peter Steinberger 🦞

Varun Singh retweeted

Peter Steinberger 🦞

@steipete

Jun 11

Here's a simple loop: Tell codex to maintain your repos, wake up every 5 minutes and direct work to threads. That makes it easy to parallelize steer work as needed. I use a orchestrator skill combined with my triage autoreview computer use skills, so some work can land autonomously. github.com/steipete/agent-sc… github.com/steipete/agent-sc…

200

428

5,094

508,882

Varun Singh

Varun Singh

@vr000m

Jun 10

Ah back channeling is a such a good trait. And one I’d the reasons, developers even consider speech models. This adds to the naturalness of speech. Love this update, can’t wait to try it. Perhaps a small model like Smart Tutn could consider back channeling hints for the cascade pipeline to consider.

kyutai @kyutai_labs

Jun 10

New paper: Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models We use RL to post-train speech models (Moshi and PersonaPlex) to talk more like a human: to know when to respond, when to wait, and when to nod along with “yeah”s and “okay”s when listening.

0:40

Tim Sneath

Varun Singh retweeted

Tim Sneath @timsneath

Jun 9

One of my personal favorite features announced at WWDC will I suspect be a sleeper hit: container machines, allowing your Mac to run a lightweight, persistent Linux environment with your home directory and repos automatically mounted: github.com/apple/container/b…

container/docs/container-machine.md at main · apple/container

A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon. - apple/container

github.com

227

815

9,698

729,544

Claude

Varun Singh retweeted

Claude

@claudeai

Jun 9

Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.

Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

ALT Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

512

1,791

15,546

5,465,061

Varun Singh

Varun Singh

@vr000m

Jun 9

Lots of interesting projects built at the multi-agent hackathon. Spam call detection and advisory, MIDI generation, auto bug or vulnerability detection with proper disclosure for bug bounties, an auto training based on loss spike detection and many more projects built in 24h! Thanks @altryne for the invitation and opportunity to judge the work!

👩‍💻 Paige Bailey

Varun Singh retweeted

👩‍💻 Paige Bailey

@DynamicWebPaige

Jun 8

Replying to @altryne @googlemaps @priceline

thank you for having us, such a good crew!! ❤️

274

swyx

Varun Singh retweeted

swyx

@swyx

Jun 5

3 weeks left til @aidotengineer world's fair! if you want to get on this year's map of top ai engineering companies, theres a few spots left we are sold out of: - presenting sponsors - model lab sponsors - platinum sponsors - gold sponsors the big spots left are for the official afterparties - welcome reception, networking night, and world cup quarterfinal bundles if interested - drop a note to sponsorships@ai.engineer detailing size/scale of interest (below is 2025, we are recruiting 2026 now) if you are attending - BOOK YOUR HOTELS BY TMR AS THE ROOM BLOCK DISCOUNT EXPIRES TMR

22,283

Varun Singh

Varun Singh

@vr000m

Jun 5

Come hang out and build voice agents, I will be there both days in pipecat swag

Alex Volkov

@altryne

Jun 1

This weekend, join us in SF for our 4th WeaveHacks hackathon! Sponsored by @OpenAIDevs for the first time ( @dkundel judging!), @cursor_ai ,@Redisinc and @CopilotKit , Hackers will get over $150 in credits to build multi-agent orchestration systems Over $15K in prizes!

ALT lu.ma/weavehacks

3,663

Prince Canuma

Varun Singh retweeted

Prince Canuma

@Prince_Canuma

Jun 3

🚀 Gemma 4 12B is here! We partnered with @GoogleDeepMind to bring and optimize their new dense and unifed multimodal model for Apple Silicon. ◈ 12B dense · 256K context ◈ Thinking mode (built-in reasoning) ◈ Vision: dynamic res, OCR, UI charts ◈ Native audio: ASR speech translation ◈ Function calling for agents ◈ Text image audio, interleaved Runs local. Get started now ⚡ > uv pip install -U mlx-vlm github.com/Blaizzy/mlx-vlm

Google Gemma

@googlegemma

Jun 3

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

143

1,409

178,144

Harsha from Slashy

Varun Singh retweeted

Harsha from Slashy

@GaddipatiHarsha

Jun 2

Excited to announce Slashy The first email client that works for you. The real cost of email isn't the time. It's the mental load of constantly checking it, just in case something needs you. Slashy kills that. You never need to open your inbox unless Slashy tells you. Try it out at slashy.com

0:25

264

57,530

Basil Chatha

Varun Singh retweeted

Basil Chatha

@realbasilchatha

May 29

We're hosting a fireside chat on Voice AI with Basia Sudol (Head of Enterprise Solutions at @DecagonAI), Sudarshan Kamath (Founder at @smallest_AI ), Varun Singh (CPTO at @trydaily), Steven Diaz (FDE Manager at Vapi), and Tyler D'Silva (Founding FDE at @retellai) next Thursday, June 4th! Don't miss it: luma.com/fmpyy1b6

1,418

swyx

Varun Singh retweeted

swyx

@swyx

May 28

"Developers can update Claude’s instructions mid-task without breaking the prompt cache or routing the update through a user turn" wtf? how??

Claude

@claudeai

May 28

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

ALT Benchmark table showing how Claude Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks.

846

138,821

Varun Singh

Varun Singh

@vr000m

May 18

Question for Hermes builders: if a workflow needs multiple subagents and by extension different models (e.g. Gemini vs Claude vs Codex, or different ‘thinking levels’ like Sonnet vs Opus, x-high vs low), is the trivial approach to spin up separate Hermes agents per model? I’ve embedded model hints in skills, but to swap providers inside Hermes I still need separate Hermes instances, right? Or asked in another way -- if I want to mix providers within Hermes, does that still imply multiple Hermes instances, or is there a better abstraction?

111

antirez

Varun Singh retweeted

antirez @antirez

May 18

Imagine a local agent where cache misses don't exist, tools don't need translations, you see progress for prefill, tokens are emitted ASAP.

0:43

436

41,450

xAI

Varun Singh retweeted

xAI

@xai

May 15

You can now use your @grok subscription inside @NousResearch Hermes Agent. x.ai/news/grok-hermes

559

595

5,539

3,966,882

Aditya Mishra

Varun Singh retweeted

Aditya Mishra

@adi_myth

May 17

Marking this as a moment convincing @swyx to bring @aiDotEngineer to India next year with @sanjeed_i @udayan_w Exciting times!! 🥳

40,483