Joined April 2016
170 Photos and videos
Ben Duffy retweeted
PPO has long dominated robot locomotion training in simulation. SAC, despite its sample efficiency, couldn't keep up. We analyze why: 🔗sabagian.github.io/sac_relea… 🔥Integrated into RSL-RL, our approach requires only minimal changes, making SAC a drop-in alternative out of the box.
5
41
336
41,365
Claude Opus 4.7 likes: Phases, tasks, gates, stages, rungs
1
36
A year ago, I asked all LLMs back then (claude 3.7 grok 3 deepseek v3 gemini 2.5 deep research) to predict the next 5 years of progress and partially to assess the plausability of the AI 2027 report. One thing the report got is that all the labs are focusing on recursive improvement e.g. with Codex 5.3 helping created Chat Gpt 5.4 and so on i.e. "closing the loop". Anyway, this year, new prompt and new models. Getting more quantitative and then will ask chatgpt 5.4 to summarise all answers and compare. Getting a bit meta to ask multiple AI agents to predict future progress of AI and compare previous forecasts. Grok is supposed to be optimised on forecasting accuracy! Summary from ChatGPT of below answers from 5.4, gemini, grok and claude 4.6 sonnet:
Right, starting now, for shits and giggles, I will ask the top ~5 models every year on April 6th to: "predict the next 5 years of AI and AGI progress" Then we can compare over the years: 1. How right/wrong this forecast report got it /th
1
1
75
I love talking to mini AGIs about what a true AGI will be like
24
I love robots But sometimes they don't love me... 🥲
54
Humanoids. Built in our own image... THE HUBRIS!!! I love it!
1
44
omg, we live in the future, claude is taking control of my browser to add my dishwasher and 15 other items as ads in ebay (kleinanzeigen) and facebook marketplace.
200
Missed this one. 2026/2027 is gonna be the year of AI and science combined.
34
Cursor's composer-1 frontier LLM is super fast and accurate highly underated!
29
Ben Duffy retweeted
The first 100% autonomous coast-to-coast drive on Tesla FSD V14.2! 2 days 20 hours, 2732 miles, zero interventions. This one is special because the coast-to-coast drive was a major goal for the autopilot team from the start. A lot of hours were spent in marathon clip review sessions late into the night looking over interventions as we attempted legs of the drive over time - triaging, categorizing, planning out all the projects to close the gap and bring the number of interventions to zero. Amazing to see the system actually get there and huge congrats to the team!
31 Dec 2025
I am proud to announce that I have successfully completed the world’s first USA coast to coast fully autonomous drive! I left the Tesla Diner in Los Angeles 2 days & 20 hours ago, and now have ended in Myrtle Beach, SC (2,732.4 miles) This was accomplished with Tesla FSD V14.2 with absolutely 0 disengagements of any kind even for all parking including at Tesla Superchargers.
310
972
14,071
1,075,334
Ben Duffy retweeted
Paper naming conventions are reaching a climax.
4
14
106
14,663
12 Nov 2025
Somehow missed this. Always love Minecraft/open-ended papers! Voyager paper blew my mind, but used code-gen on the Mineflayer API! 2022 pixel-to-action papers below are similar but used fine tuning. But this is with only offline data! I think very relevant for robotics.
1
60
12 Nov 2025

I'd like to look back at the two mega-papers on Minecraft RL that just came out, from @OpenAI and @nvidia. They both rely on diabolically clever ideas... but in completely different directions.
38
Ben Duffy retweeted
Introducing GEN-0, our latest 10B foundation model for robots ⏱️ built on Harmonic Reasoning, new architecture that can think & act seamlessly 📈 strong scaling laws: more pretraining & model size = better 🌍 unprecedented corpus of 270,000 hrs of dexterous data Read more 👇
47
281
1,487
483,300
30 Sep 2025
Whole #corl2025 conference is just a bunch of nerds who built cool intelligent things with advanced lego (hardware and/or software) and then say "check what I built". I love it.
2
139