Anian Ruoss

Anian Ruoss

26 Photos and videos

Tweets

Fabio Pardo retweeted

Anian Ruoss @anianruoss

15 Jul 2025

Our LMAct benchmark evaluates LMs in dynamic environments (Atari, chess, DM Control) in the zero-to-many-shot regime (up to 1M tokens) With @PardoFab, @SirrahChan, @bonniesjli, Volodymyr Mnih, and Tim Genewein 📅 Tue 15 July ⏰ 11:00 – 13:30 📍East Exhibition Hall A-B #E-1804

873

Fabio Pardo

Fabio Pardo @PardoFab

10 Dec 2024

Thrilled to be at #NeurIPS this week! Excited to reconnect with friends and colleagues I haven’t seen in a while. If you’d like to meet up, don’t hesitate to reach out, I’ll be stopping by the @GoogleDeepMind booth occasionally.

1,104

Fabio Pardo

Fabio Pardo @PardoFab

5 Dec 2024

Genie 2 🧞‍♂️ is a very impressive model and the perspective of generating endless, unique worlds is incredibly exciting! Collaborating with the team has been a truly fun and inspiring experience 😁

Jack Parker-Holder

@jparkerholder

4 Dec 2024

Introducing 🧞Genie 2 🧞 - our most capable large-scale foundation world model, which can generate a diverse array of consistent worlds, playable for up to a minute. We believe Genie 2 could unlock the next wave of capabilities for embodied agents 🧠.

0:35

3,420

Fabio Pardo

Fabio Pardo @PardoFab

3 Dec 2024

We explored how LLMs tackle low-level control tasks across diverse domains. While there's room to grow, in-context learning and existing image/text tokens can handle diverse observation and action spaces. No special tokens or adapters are needed to play games or control robots!

Anian Ruoss @anianruoss

3 Dec 2024

Ever wonder how well frontier models (Claude 3.5 Sonnet, Gemini 1.5 Flash & Pro, GPT-4o, o1-mini & o1-preview) play Atari, chess, or tic-tac-toe? We present LMAct, an in-context imitation learning benchmark with long multimodal demonstrations (arxiv.org/abs/2412.01441). 🧵 1/N

1,608

Fabio Pardo

Fabio Pardo @PardoFab

20 Jul 2024

The General Agents team at @GoogleDeepMind Toronto is looking for a new Research Scientist. Apply if you have the right skills and want to work with us on building generalist agents that can interact with simulated and real world environments! boards.greenhouse.io/deepmin…

DeepMind

job-boards.greenhouse.io

9,040

AK

Fabio Pardo retweeted

@_akhaliq

15 Dec 2023

Google DeepMind announces Vision-Language Models as a Source of Rewards paper page: huggingface.co/papers/2312.0… Building generalist agents that can accomplish many goals in rich open-ended environments is one of the research frontiers for reinforcement learning. A key limiting factor for building generalist agents with RL has been the need for a large number of reward functions for achieving different goals. We investigate the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of rewards for reinforcement learning agents. We show how rewards for visual achievement of a variety of language goals can be derived from the CLIP family of models, and used to train RL agents that can achieve a variety of language goals. We showcase this approach in two distinct visual domains and present a scaling trend showing how larger VLMs lead to more accurate rewards for visual goal achievement, which in turn produces more capable RL agents.

126

555

64,928

Fabio Pardo

Fabio Pardo @PardoFab

11 Dec 2023

After getting a glimpse of the incredible New Orleans, I'm excited to be at @NeurIPSConf all week! Let me know if you'd like to chat, or come and say hi at the @GoogleDeepMind booth on Tuesday or Wednesday afternoon. I'll be with the #GeminiAI team.

1,115

Fabio Pardo

Fabio Pardo @PardoFab

7 Dec 2023

Gemini is a suite of incredibly powerful and general models that push the limits of artificial intelligence. Participating in this massive effort made me feel like I was part of history in the making and I can't wait to see what lies ahead! #GeminiAI blog.google/technology/ai/go…

Introducing Gemini: our largest and most capable AI model

Gemini is our most capable and general model, built to be multimodal and optimized for three different sizes: Ultra, Pro and Nano.

blog.google

2,982

Demis Hassabis

Fabio Pardo retweeted

Demis Hassabis

@demishassabis

6 Dec 2023

The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes each best-in-class & optimized for different uses blog.google/technology/ai/go…

381

1,843

10,752

3,151,242

Jeff Dean

Fabio Pardo retweeted

Jeff Dean

@JeffDean

6 Dec 2023

I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks, including 10 of 12 popular text and reasoning benchmarks, 9 of 9 image understanding benchmarks, 6 of 6 video understanding benchmarks, and 5 of 5 speech recognition and speech translation benchmarks. Gemini Ultra is the first model to achieve human-expert performance on MMLU across 57 subjects with a score above 90%. It also achieves a new state-of-the-art score of 62.4% on the new MMMU multimodal reasoning benchmark, outperforming the previous best model by more than 5 percentage points. Gemini was built by an awesome team of people from @GoogleDeepMind, @GoogleResearch, and elsewhere at @Google, and is one of the largest science and engineering efforts we’ve ever undertaken. As one of the two overall technical leads of the Gemini effort, along with my colleague @OriolVinyalsML, I am incredibly proud of the whole team, and we’re so excited to be sharing our work with you today! There’s quite a lot of different material about Gemini available, starting with: Main blog post: blog.google/technology/ai/go… 60-page technical report authored by th Gemini Team: deepmind.google/gemini/gemin… In this thread, I’ll walk you through some of the highlights.

240

2,360

12,573

3,903,043

Fabio Pardo

Fabio Pardo @PardoFab

10 Oct 2022

I am incredibly happy to announce that I am joining DeepMind Toronto as a Research Scientist 🎉🇨🇦 Working at @DeepMind on the team led by @VladMnih is a great honor for me. I can't wait to get started!

417

Fabio Pardo

Fabio Pardo @PardoFab

9 Aug 2022

I am very happy to share that I successfully passed my PhD viva today! 🥳 Many thanks to my thesis supervisor Petar Kormushev @Petar_Kormushev and to my two amazing reviewers Deepak Pathak @pathak2206 and Antoine Cully @CULLYAntoine. This is the end of an incredible journey!

Fabio Pardo

Fabio Pardo @PardoFab

15 Dec 2021

We are releasing OstrichRL 🎉 The repository contains a musculoskeletal model of an ostrich in MuJoCo, a set of dm_control tasks for reinforcement learning, and motion capture data. GitHub: github.com/vittorione94/ostr… Paper: arxiv.org/abs/2112.06061

1:07

229

more replies

Fabio Pardo

Fabio Pardo @PardoFab

15 Dec 2021

We used the Tonic RL library in every stage of the project: to design the tasks, to train, and to evaluate agents. GitHub: github.com/fabiopardo/tonic

Fabio Pardo

Fabio Pardo @PardoFab

15 Dec 2021

This work is the result of an amazing collaboration with Vittorio La Barbera @VittorioLaBarb2 (equal contribution), Yuval Tassa @yuvaltassa, Monica Daley @birdBiomech, Chris Richards @PropPhysiology, Petar Kormushev @Petar_Kormushev, and John Hutchinson @JohnRHutchinson 😁

Fabio Pardo

Fabio Pardo @PardoFab

8 Jun 2021

I am so happy to start my second research internship at @DeepMind with @andre_s_barreto in the reinforcement learning team. This is going to be fun! 😁🤖

154

Unify

Fabio Pardo retweeted

Unify

@letsunifyai

6 Apr 2021

Introduing Ivy Modules! Define your models and optimizers in Ivy, and then train using any of: PyTorch, TensorFlow, Jax or MXNet! Ivy does not wrap classes from these frameworks. Ivy classes build directly on Ivy's simple functional API. Find more at: ivy-dl.org/