Rowan Zellers

Rowan Zellers

40 Photos and videos

Tweets

Pinned Tweet

Rowan Zellers

@rown

May 11

We are so back!

Thinking Machines

@thinkymachines

May 11

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/int…

2:15

542

53,238

Rowan Zellers

Rowan Zellers

@rown

May 19

Grants for research on interactivity, realtime video/audio full duplex evals, and safety

Thinking Machines

@thinkymachines

May 19

We are offering grants of $100,000 Tinker credits to researchers advancing the field of human-AI interactivity. Submit your proposals by June 19th! thinkingmachines.ai/news/int…

111

25,742

MichiganAI

Rowan Zellers retweeted

MichiganAI @michigan_AI

May 16

Big congratulations to Dr. @ziqiao_ma, well deserved! 🎉👏 Excited for your new chapter at @thinkymachines!

Martin Ziqiao Ma

@ziqiao_ma

May 15

PhDone :)

5,139

Martin Ziqiao Ma

Rowan Zellers retweeted

Martin Ziqiao Ma

@ziqiao_ma

May 11

P.S. The demo is basically my life at thinky: I start to cut coffee, @liliyu_lili is visually prompt-injecting my human intelligence with sweet snack every day, and I've gained weight since joining TML.

Thinking Machines

@thinkymachines

May 11

Replying to @thinkymachines

Lili and Martin get some help controlling themselves.

1:00

136

19,811

Zixian Ma@CVPR

Rowan Zellers retweeted

Zixian Ma@CVPR

@zixianma02

May 12

Congrats Rowan and Thinky team on the cool release! I remember you mentioned having a v different vision of multimodal interactions a few weeks ago @rown so this is what that looks like! 🆒 It’s exciting to see this release going beyond just a single model, showcasing truly different native multimodal interactions too. A couple things from the nicely written blog really resonate with me: 1. people are most effective when they can collaborate with AI the same way they do with other people 2. existing interfaces limit human inputs (esp multimodal ones) to the model, and this input limit needs to be lifted to unlock much better interactivity The blog also reminds me of the fun and challenging discussions with @shannonzshen and others on what “scaling collaboration” can look like. we made an initial attempt describing our vision: arxiv.org/pdf/2510.25744 It’d be great to see more human centric evaluations of the model/system/interface too — looking forward to it🥂

Rowan Zellers

@rown

May 11

We are so back!

7,176

Mira Murati

Rowan Zellers retweeted

Mira Murati

@miramurati

May 11

We started Thinking Machines to advance human-AI collaboration, and this is our first bet on what that looks like. Most labs treat autonomy as the goal and interactivity as scaffolding around a turn-based core. We think the way we work with AI matters as much as how smart it is. Interactivity has to be in the model, and it has to scale with intelligence rather than trail behind it. thinkingmachines.ai/blog/int…

Interaction Models: A Scalable Approach to Human-AI Collaboration

Interaction models move beyond turn-based AI interfaces by handling multimodal, real-time collaboration natively across audio, video, and text.

thinkingmachines.ai

811

58,672

Lilian Weng

Rowan Zellers retweeted

Lilian Weng

@lilianweng

May 11

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions ( many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

Thinking Machines

@thinkymachines

May 11

2:15

947

179,887

Aurick Qiao

Rowan Zellers retweeted

Aurick Qiao

@aurickq

May 11

Very excited to share a preview of what we’ve been working on!

Thinking Machines

@thinkymachines

May 11

2:15

1,672

Long Lian

Rowan Zellers retweeted

Long Lian

@LongTonyLian

May 12

Thinky’s new interaction models perform search in the background when listening and responding so you don’t notice! Also per request: Spoiler Alert 🚨

Thinking Machines

@thinkymachines

May 11

Replying to @thinkymachines

The model can multi-task! Long thinks the model knows everything, but the model actually searched while listening and responding to him so he didn't notice.

0:58

2,884

Mu Cai

Rowan Zellers retweeted

Mu Cai

@MuCai7

May 11

My first share since joining @thinkymachines. Fun working with this team on real-time multimodal interaction. Vision in turn-based models felt like flipping through photos — continuous video is a different problem. Visual proactivity is essential — grateful to have worked on this alongside @liliyu_lili, @rown , and the rest of the team!

Thinking Machines

@thinkymachines

May 11

2:15

157

10,568

Brandon Trabucco

Rowan Zellers retweeted

Brandon Trabucco @brandontrabucco

May 11

I'm excited to share some of our work at @thinkymachines. As models get more intelligent, the bottleneck is increasingly how quickly and seamlessly we can access their intelligence, and today we are sharing a preview of how we think about human-AI collaboration.

Thinking Machines

@thinkymachines

May 11

2:15

5,119

Rowan Zellers

Rowan Zellers

@rown

May 11

Our interaction model is the first general video speech model that's visually proactive. It was super fun working on this with @liliyu_lili / @saurabh_garg67 / @AndreaMadotto and others - after countless versions it was amazing when visual interruptions suddenly worked!

Lili Yu

@liliyu_lili

May 11

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

135

11,259

Rowan Zellers

Rowan Zellers

@rown

May 11

If you're interested in working on realtime video speech specifically, or human AI collaboration more generally, please reach out!

1,103

Lili Yu

Rowan Zellers retweeted

Lili Yu

@liliyu_lili

May 11

Thinking Machines

@thinkymachines

May 11

Replying to @thinkymachines

Tessa's quality of life has improved a lot with some nagging.

0:47

16,405

Thinking Machines

Rowan Zellers retweeted

Thinking Machines

@thinkymachines

May 11

Lili and Martin get some help controlling themselves.

1:00

613

168,736

Thinking Machines

Rowan Zellers retweeted

Thinking Machines

@thinkymachines

May 11

2:15

464

1,959

15,789

7,750,163

Saining Xie

Rowan Zellers retweeted

Saining Xie

@sainingxie

Apr 23

vision🍌 is here vision-banana.github.io/ if you got into computer vision the way I did, starting with pixel-level labeling tasks like segmentation, edges, depth, or surface normals, you’ll probably feel the same seeing these results -- something big has quietly shifted, and it’s going to change how we approach these problems for good 🧵

0:12

112

789

65,978

Jacob van Gogh

Rowan Zellers retweeted

Jacob van Gogh @JayArrVeeGee

Apr 22

me: Make me the most AI slop image that ever AI slopped. The pinnacle of slop. A seminal work on AI slop. ChatGPT Images 2.0:

209

199

2,591

931,292

Rowan Zellers

Rowan Zellers

@rown

Apr 22

Feels like the SOTA way to download files on google drive in 2026 is to sic claude code/codex at the directory and have it figure it out 🥲

4,391

Rowan Zellers

Rowan Zellers

@rown

Apr 21

welcome Weiyao!!

weiyaow

@weiyaow1

Apr 21

After 8 years at Meta (FAIR/MSL) working on multi-modal perception and generations — Gradient-Blending, UVO, SAM3D — I've joined @thinkymachines this week to keep working on multi-modal. Excited for what's ahead.

6,217