Wayve

Wayve

Photos and videos

Tweets

Thomas Kollar retweeted

Wayve

@wayve_ai

May 5

Meet LA-Pose. Our latest model taking Wayve another step towards generalization at scale. LA-Pose employs large-scale self-supervised learning, building strong motion representations for 3D perception from 10.2 million unlabeled driving video snippets, unlike today's strongest approaches that often depend on expensive, carefully curated 3D supervision. With only a lightweight pose head and limited labelled data, LA-Pose achieves: 📷 State-of-the-art camera pose estimation 🌎 Strong zero-shot generalization across diverse driving scenarios 🏷️ Orders of magnitude less labelled data than fully supervised 3D approaches Our full blog post: wayve.ai/thinking/la-pose/ Explore the full paper here: la-pose.github.io/

1:01

146

36,451

Nissan Motor

Thomas Kollar retweeted

Nissan Motor

@NissanMotor

10 Dec 2025

Nissan and #Wayve have signed a partnership agreement that will bring our next-gen #ProPILOT driver assistance tech powered by Wayve #AI to a broad range of #Nissan vehicles. Nissan aims to first launch the next-gen tech in Japan in fiscal year 2027. global.nissannews.com/en/rel…

0:58

132

23,837

Wayve

Thomas Kollar retweeted

Wayve

@wayve_ai

2 Dec 2025

GAIA 3 introduces four powerful new capabilities that unlock richer and more scalable evaluation of autonomous driving systems. 🌍 🧵 Follow the thread below to see examples of; 1. Long perturb generations 🚗 2. Safety augmentations ⚠️ 3. Semantic augmentations 🌤️🌅🌙 4. Embodiment transfer 🚘📷 GAIA 3 re-generates the same scenario as if observed from different vehicles with different camera positions. One scene, three embodiments, consistent dynamics. Ideal for testing models across different hardware setups. These advances show how GAIA-3 brings new realism, diversity, and scale to the evaluation of end-to-end driving systems. 🚀 Dive into the full blog: wayve.ai/thinking/gaia-3/ Every clip you see below is generated by GAIA-3.👇 #GAIA3 #EmbodiedAI #AISafety #GenerativeAI #AutonomousVehicles

GAIA-3: Scaling World Models to Power Safety and Evaluation

Transforming world modeling from a tool for visual synthesis into a foundation for autonomy evaluation. ...

wayve.ai

3,871

Jamie Shotton

Thomas Kollar retweeted

Jamie Shotton

@Jamie_Shotton

19 Jun 2025

Big things cooking in Tahoe... 🚀

1,716

Jamie Shotton

Thomas Kollar retweeted

Jamie Shotton

@Jamie_Shotton

7 Jan 2025

It's awesome to be back in the Bay Area this week at @wayve_ai's other North American office. I can't wait to test the massive progress the team's been making on rides around the Bay Area and city while I'm here, and to meet with our science leaders @vijaycivs @tkollar @gianlucacorrado and others to galvanise the groups at the start of an incredibly exciting #YearOfEmbodiedAI ahead! #Science #Team #EmbodiedAI

1,274

Thomas Kollar

Thomas Kollar @tkollar

19 Jun 2024

Building language models is difficult and requires high quality preprocessing, modeling, evaluation and large scale training. As significant collaborators in this project at TRI, the resulting 7B model DCLM-7B is a significant achievement. It is a competitor to Mistral 7B and LLaMA-7B, even though trained on less data. And it’s fully open. And that’s just the start of the competition. Excited to see how others leverage these results to build even more capable language models and improve dataset quality.

Vaishaal Shankar @Vaishaal

18 Jun 2024

I am really excited to introduce DataComp for Language Models (DCLM), our new testbed for controlled dataset experiments aimed at improving language models. 1/x

885

more replies

Thomas Kollar

Thomas Kollar @tkollar

19 Jun 2024

More details from @achalddave: x.com/achalddave/status/1803…

Achal Dave @achalddave

18 Jun 2024

Check out DataComp for language models! Open data, open code, open training recipe, and close to Llama3-8B performance. This has been a labor of love over the last year, a huge thanks to all the collaborators for helping make this happen!

323

Thomas Kollar

Thomas Kollar @tkollar

19 Jun 2024

With @sedrickkeh2 @achalddave @karora4u @MercatJean @vslevic @sy_gadre

246

Thomas Kollar

Thomas Kollar @tkollar

13 Feb 2024

Excited to release Prismatic! Cutting through the noise of vision-language modeling, Prismatic is a release of 42 pre-trained VLMs from the 7B to 13B scale, a codebase for rigorous evaluation and a myriad of insights for what matters for performance.

Siddharth Karamcheti

@siddkaramcheti

13 Feb 2024

What design choices matter when developing a visually-conditioned language model (VLM)? Check out our paper – Prismatic VLMs – and open-source training code, evaluation suite, and 42 pretrained VLMs at the 7B-13B scale! 📜 arxiv.org/abs/2402.07865 ⚙️ 🤗 github.com/TRI-ML/prismatic-…

Our investigation of different axes for developing visually-conditioned language models (VLMs) spans four different axes.

[Top Left] 1. Optimization procedure - should we freeze model components, or do we need multi-stage training?

[Bottom Left] 2. Image processing and visual representations - how should we choose pretrained representations?

[Top Right] 3. Language Models - how do base or instruct-tuned LMs affect performance? Does co-training on language-only data help?

[Bottom Right] 4. Scaling Properties - are we undertraining our models? What type of added data helps?

ALT Our investigation of different axes for developing visually-conditioned language models (VLMs) spans four different axes. [Top Left] 1. Optimization procedure - should we freeze model components, or do we need multi-stage training? [Bottom Left] 2. Image processing and visual representations - how should we choose pretrained representations? [Top Right] 3. Language Models - how do base or instruct-tuned LMs affect performance? Does co-training on language-only data help? [Bottom Right] 4. Scaling Properties - are we undertraining our models? What type of added data helps?

1,609

Thomas Kollar

Thomas Kollar @tkollar

15 Jun 2024

x.com/siddkaramcheti/status/… More info on Prismatic here.

Siddharth Karamcheti

@siddkaramcheti

13 Feb 2024

160

Thomas Kollar

Thomas Kollar @tkollar

15 Jun 2024

By first developing some of the best Vision-Language Models with Prismatic at TRI: github.com/TRI-ML/prismatic-… OpenVLA was able to quickly build some of the best generalist policies for robotics. Code, data and weights are all open-source: openvla.github.io This is a great achievement! Congrats @moo_jin_kim @siddkaramcheti @KarlPertsch @ashwinb96 @SurajNair_1 and all collaborators.

GitHub - TRI-ML/prismatic-vlms: A flexible and efficient codebase for training visually-conditioned...

A flexible and efficient codebase for training visually-conditioned language models (VLMs) - TRI-ML/prismatic-vlms

github.com

Moo Jin Kim @moo_jin_kim

14 Jun 2024

✨ Introducing 𝐎𝐩𝐞𝐧𝐕𝐋𝐀 — an open-source vision-language-action model for robotics! 👐 - SOTA generalist policy - 7B params - outperforms Octo, RT-2-X on zero-shot evals 🦾 - trained on 970k episodes from OpenX dataset 🤖 - fully open: model/code/data all online 🤗 🧵👇

1:11

1,497

Sedrick Keh

Thomas Kollar retweeted

Sedrick Keh @sedrickkeh2

15 May 2024

Recurrent models like RWKV and Mamba have gained attention recently, but these can be costly to train and iterate on. What if we could simply... turn Mistral/Llama/Gemma into an RNN? 🎩🪄 Presenting our work, Linearizing Large Language Models! arxiv.org/abs/2405.06640

Linearizing Large Language Models

Linear transformers have emerged as a subquadratic-time alternative to softmax attention and have garnered significant interest due to their fixed-size recurrent state that lowers inference cost....

arxiv.org

165

19,569

Thomas Kollar

Thomas Kollar @tkollar

22 Apr 2024

Over the last year at TRI we’ve been training Large Language Models, including results in the following areas: Scaling: arxiv.org/abs/2403.08540 Alignment: arxiv.org/abs/2402.12366 As a part of upcoming work, we are sharing back with the open source community and releasing a performant Mamba model that we’ve trained at the 7B parameter scale. More results on linear transformers upcoming.

Language models scale reliably with over-training and on downstream tasks

Scaling laws are useful guides for derisking expensive training runs, as they predict performance of large models using cheaper, small-scale experiments. However, there remain gaps between current...

arxiv.org

Sedrick Keh @sedrickkeh2

22 Apr 2024

📢 Releasing TRI's open-source Mamba-7B trained on 1.2T tokens of RefinedWeb! Mamba-7B is the largest fully recurrent Mamba model trained and is a state-of-the-art recurrent LLM. 🚀🚀🚀 huggingface.co/TRI-ML/mamba-…

2,996

Thomas Kollar

Thomas Kollar @tkollar

22 Apr 2024

With @MercatJean @sedrickkeh2 @achalddave @vslevic @karora4u @adnothing @sy_gadre for the release.

277

Thomas Kollar

Thomas Kollar @tkollar

22 Apr 2024

Additional collaborators include @archit_sharma97 @lschmidt3 @ericmitchellai and many more.

296

Thomas Kollar

Thomas Kollar @tkollar

27 Mar 2024

At TRI, we are looking to build Robotics Foundation Models, or what we call Large Behavior Models (LBMs). Large scale robotics datasets, such as DROID are a part of the strategy to enable broadly capable LBMs that include language, vision and action: droid-dataset.github.io When we started this project through our university collaborations nearly 2 years ago, I was amazed the amount of community interest in contributing to the project. TRI has been deeply involved in all stages of dataset collection and release. Looking forward to seeing what the community does with it! cc @MashaItkina @siddkaramcheti @MercatJean @SurajNair_1 and the rest of the TRI DROID team

Chelsea Finn

@chelseabfinn

20 Mar 2024

Introducing a new, fully open robotics dataset! - 76k episodes - 564 unique scenes - 100 contributors - 13 labs/institutions - 3 continents droid-dataset.github.io A short 🧵 on the backstory

1:23

10,340

Thomas Kollar

Thomas Kollar @tkollar

27 Mar 2024

Cc @ashwinb96

502

Thomas Kollar

Thomas Kollar @tkollar

27 Mar 2024

With @wulfebw

319