Ben Duffy

Tweets

Pinned Tweet

Ben Duffy @benduffyMMM

13 Apr 2025

Releasing: Johnny the humanoid vs. the socks youtube.com/watch?v=Di5hI802…

Johnny's battle against the socks and an unplanned challenge at the...

youtube.com

Ben Duffy retweeted

Robotic Systems Lab @leggedrobotics

Jun 10

PPO has long dominated robot locomotion training in simulation. SAC, despite its sample efficiency, couldn't keep up. We analyze why: 🔗sabagian.github.io/sac_relea… 🔥Integrated into RSL-RL, our approach requires only minimal changes, making SAC a drop-in alternative out of the box.

Ben Duffy @benduffyMMM

Apr 24

Claude Opus 4.7 likes: Phases, tasks, gates, stages, rungs

Ben Duffy @benduffyMMM

Apr 6

A year ago, I asked all the LLMs of the time (Claude 3.7, Grok 3, DeepSeek V3, Gemini 2.5 Deep Research) to predict the next 5 years of progress, and partly to assess the plausibility of the AI 2027 report. One thing the report got right is that all the labs are focusing on recursive improvement, e.g. with Codex 5.3 helping create ChatGPT 5.4 and so on, i.e. "closing the loop". Anyway, this year: new prompt and new models. Getting more quantitative, and then I will ask ChatGPT 5.4 to summarise all the answers and compare. It's getting a bit meta to ask multiple AI agents to predict the future progress of AI and compare previous forecasts. Grok is supposed to be optimised for forecasting accuracy! Summary from ChatGPT of the answers below from 5.4, Gemini, Grok and Claude 4.6 Sonnet:

Ben Duffy @benduffyMMM

6 Apr 2025

Right, starting now, for shits and giggles, I will ask the top ~5 models every year on April 6th to: "predict the next 5 years of AI and AGI progress" Then we can compare over the years: 1. How right/wrong this forecast report got it /th

Ben Duffy @benduffyMMM

Apr 6

grok.com/share/c2hhcmQtMg_e5…

Ben Duffy @benduffyMMM

Apr 6

claude.ai/share/f7231e0e-29a…

Ben Duffy @benduffyMMM

Apr 3

I love talking to mini AGIs about what a true AGI will be like

Ben Duffy @benduffyMMM

Feb 13

I love robots. But sometimes they don't love me... 🥲

Ben Duffy @benduffyMMM

Feb 13

Humanoids. Built in our own image... THE HUBRIS!!! I love it!

Ben Duffy @benduffyMMM

Jan 14

Software development has changed so rapidly, even over the last month... Gonna listen to their book. youtube.com/watch?v=zuJyJP51…

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What...

Note: Steve and Gene’s talk on Vibe Coding and the post IDE world w...

youtube.com

Ben Duffy @benduffyMMM

Jan 10

omg, we live in the future: Claude is taking control of my browser to list my dishwasher and 15 other items as ads on eBay (Kleinanzeigen) and Facebook Marketplace.

Ben Duffy @benduffyMMM

Jan 10

Missed this one. 2026/2027 is gonna be the year of AI and science combined.

Ben Duffy @benduffyMMM

Jan 3

Cursor's composer-1 frontier LLM is super fast and accurate. Highly underrated!

Ben Duffy retweeted

Andrej Karpathy @karpathy

31 Dec 2025

The first 100% autonomous coast-to-coast drive on Tesla FSD V14.2! 2 days 20 hours, 2732 miles, zero interventions. This one is special because the coast-to-coast drive was a major goal for the autopilot team from the start. A lot of hours were spent in marathon clip review sessions late into the night looking over interventions as we attempted legs of the drive over time - triaging, categorizing, planning out all the projects to close the gap and bring the number of interventions to zero. Amazing to see the system actually get there and huge congrats to the team!

David Moss @DavidMoss

31 Dec 2025

I am proud to announce that I have successfully completed the world’s first USA coast to coast fully autonomous drive! I left the Tesla Diner in Los Angeles 2 days & 20 hours ago, and now have ended in Myrtle Beach, SC (2,732.4 miles) This was accomplished with Tesla FSD V14.2 with absolutely 0 disengagements of any kind even for all parking including at Tesla Superchargers.

Ben Duffy retweeted

Chris Offner @chrisoffner3d

20 Nov 2025

Paper naming conventions are reaching a climax.

Ben Duffy @benduffyMMM

12 Nov 2025

Somehow missed this. Always love Minecraft/open-ended papers! Voyager paper blew my mind, but used code-gen on the Mineflayer API! 2022 pixel-to-action papers below are similar but used fine tuning. But this is with only offline data! I think very relevant for robotics.

Ben Duffy @benduffyMMM

12 Nov 2025

x.com/ThomasMiconi/status/15…

Thomas Miconi @ThomasMiconi

27 Jun 2022

I'd like to look back at the two mega-papers on Minecraft RL that just came out, from @OpenAI and @nvidia. They both rely on diabolically clever ideas... but in completely different directions.

Ben Duffy retweeted

Generalist @GeneralistAI

4 Nov 2025

Introducing GEN-0, our latest 10B foundation model for robots
⏱️ built on Harmonic Reasoning, new architecture that can think & act seamlessly
📈 strong scaling laws: more pretraining & model size = better
🌍 unprecedented corpus of 270,000 hrs of dexterous data
Read more 👇

Ben Duffy @benduffyMMM

30 Sep 2025

Whole #corl2025 conference is just a bunch of nerds who built cool intelligent things with advanced lego (hardware and/or software) and then say "check what I built". I love it.
