Blake Richards

Blake Richards

542 Photos and videos

Tweets

Pinned Tweet

Blake Richards @tyrell_turing

26 Oct 2023

Check out this new paper: Led by @mehdiazabou and @evadyer, we show that it is possible to get SOTA brain decoding with transfer across individuals and tasks! The key is a clever way to tokenize spiking data for transformers. #brain #neurotech #NeurIPS2023

Mehdi Azabou @ NeurIPS @mehdiazabou

25 Oct 2023

Is a universal brain decoder possible? Can we train a decoding system that easily transfers to new individuals/tasks? Check out our #NeurIPS2023 paper where we show that it’s possible to transfer from a large pretrained model to achieve SOTA 🧠! Link: poyo-brain.github.io/ 🧵

0:25

146

31,707

Raymond Chua

Blake Richards retweeted

Raymond Chua @RaymondRChua

Jun 8

Excited to share our new paper accepted at ICML 2026 with @tyrell_turing and Doina Precup! 🇰🇷 See you in Seoul. A major challenge in continual reinforcement learning is balancing: • plasticity (learning new things) • stability (not forgetting old ones) 🧵 1/15

3,636

Sonia Joseph

Blake Richards retweeted

Sonia Joseph

@soniajoseph_

Apr 26

Interpretability is built on a few core assumptions. Two of our ICLR 2026 @iclr_conf papers suggest some of those assumptions are wrong (or at least highly incomplete). 1. Sparse CLIP: Co-Optimizing Interpretability and Performance in Contrastive Learning arxiv.org/abs/2601.20075 much of the field has internalized an interpretability–accuracy trade-off: if you want cleaner, more human-understandable features, you sacrifice performance. however, we find that this trade-off is not fundamental. instead of relying on post-hoc methods (e.g. sparse autoencoders trained on frozen representations), we incorporate sparsity directly into CLIP training. surprisingly, this produces features that are significantly more interpretable while preserving downstream performance. this result made me more optimistic about intrinsically interpretable models, a direction that was imo written off too early. - 2. Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry arxiv.org/abs/2510.08638 a lot of interpretability work implicitly assumes that vision representations behave like language: sparse, linear, and decomposable into independent features. we find that this assumption is often misleading. instead, vision representations appear partially dense and geometrically structured. we propose the Minkowski Representation Hypothesis: tokens live in sums of convex regions formed from a small set of “archetypes,” rather than isolated features along linear directions. this reframes how different tasks (classification, segmentation, depth) recruit and organize concepts. it also suggests that many current interpretability tools are mismatched to the actual structure of vision data. -- tldr; interpretability can be built into training with surprisingly simple tweaks, and that different modalities have different sparsities/geometries. Tailoring the interp method to the modality is super impt!

481

34,698

Roy Eyono

Blake Richards retweeted

Roy Eyono @RoyEyono

Mar 19

How do neural circuits in the brain implement normalization? 🧠 In our new paper, we show that just normalizing sensory input isn't enough. Crucially, we must also normalize the error signals! 🧵👇 Paper: arxiv.org/abs/2603.17676

133

8,924

Sonia Joseph

Blake Richards retweeted

Sonia Joseph

@soniajoseph_

Feb 25

Today we release a new paper from Meta @AIatMeta: "Interpreting Physics in Video World Models," one of the first interpretability studies of video encoders. V-JEPA 2 shows rich, counterintuitive behaviors, including brain-like population codes and high-dimensional steering.

ALT The Physics Emergence Zone emerges one-third through the net on the intuitive physics task.

629

80,924

Blake Richards

Blake Richards @tyrell_turing

Feb 13

A big thank you to the @foresightinst for supporting our research on neuro-foundation models!

Foresight Institute

@foresightinst

Feb 13

We’re excited to support @evadyer and @tyrell_turing as they combine different ways of measuring neural activity to better model how the brain works. They will explore the development of a general-purpose, multiscale, multimodal model of human brain activity that learns shared representations across invasive (e.g. intracranial EEG) and non-invasive (e.g. scalp EEG) recordings. The goal is to build a foundation for simulating, decoding, and interacting with brain dynamics in ways that advance both neuroscience and the development of more interpretable, brain-aligned AI systems.

2,970

Mila - Institut québécois d'IA

Blake Richards retweeted

Mila - Institut québécois d'IA

@Mila_Quebec

Feb 10

Quels domaines sont les plus prometteurs pour l'avenir de la recherche en IA ? Cette question a donné le ton de la première conférence annuelle de Mila, au cours de laquelle la communauté a exploré les mystères qui définiront la recherche de demain. Mention spéciale à nos chercheur·euse·s @hugo_larochelle, @tyrell_turing, @AaronCourville, et @tegan_maharaj pour avoir relevé le défi "Hot Ones" ! mila.quebec/fr/nouvelle/myst…

980

Mila - Institut québécois d'IA

Blake Richards retweeted

Mila - Institut québécois d'IA

@Mila_Quebec

Feb 10

Which fields hold the most promise for the future of AI research? This question set the tone for Mila's first annual conference, where the community explored the mysteries that will define tomorrow's research. Special mention to our researchers @hugo_larochelle, @tyrell_turing, @AaronCourville, and @tegan_maharaj for braving the 'Hot Ones' challenge! mila.quebec/en/news/mysterie…

1,334

Seijin Kobayashi

Blake Richards retweeted

Seijin Kobayashi @SeijinKobayashi

Jan 6

Standard reinforcement learning in raw tokens is a disaster for sparse rewards! Here, we propose 𝗜𝗻𝘁𝗲𝗿𝗻𝗮𝗹 𝗥𝗟: acting on abstract actions emerging in the residual stream representation. A paradigm shift in using pretrained models to solve hard, long-horizon tasks! 🧵

122

936

254,363

Blake Richards

Blake Richards @tyrell_turing

Jan 6

Another bery cool RL result from our Paradigms of Intelligence team! tl;dr: You can get effective hierarchical RL by learning a policy on the latent representations in an autoregressive sequence model.

Seijin Kobayashi @SeijinKobayashi

Jan 6

1,734

Kording Lab 🦖

Blake Richards retweeted

Kording Lab 🦖@KordingLab

2 Dec 2025

Awesome encoding of neural activities.

Vinam Arora @vinam_arora

2 Dec 2025

Excited to share our #NeurIPS2025 work: NuCLR, a framework for learning neuron-level representations 🧠 These embeddings capture the biological identity of neurons and work out-of-the-box on new animals; no finetuning needed 💃 This offers some of the first evidence that large-scale neuroscience models can truly generalize across animals. Paper: arxiv.org/abs/2512.01199 Code: github.com/nerdslab/nuclr If you are at NeurIPS in San Diego, come find us at Poster Session 5 (11am-3pm PT, Exhibit Hall C,D,E, # 2107) 🎉 1/x 🧵

0:14

12,306

Mehdi Azabou @ NeurIPS

Blake Richards retweeted

Mehdi Azabou @ NeurIPS @mehdiazabou

5 Dec 2025

Come by our poster this morning to learn more about NuCLR! This is the beginning of what I believe is needed to unlock zero-shot BCI 🧠🤖 The key insights? 1. Observe neurons for longer (not just sub-second context windows) and 2. Observe how they activate relative to the rest of the population. Poster No. 2107 #NeurIPS2025

Vinam Arora @vinam_arora

2 Dec 2025

0:14

1,977

Arna Ghosh

Blake Richards retweeted

Arna Ghosh @arna_ghosh

5 Dec 2025

In San Diego attending #NeurIPS2025? Come to our poster to talk more about representation geometry in LLMs. 😃 🗓️ Friday 4:30-7:30 pm session 📍 Exhibit Hall C, D, E 🏁 Poster # 2502

@kumarkagrawal

30 Oct 2025

Autoregressive language models learn to compress data by mapping sequences to high-dimensional representations and decoding one token at a time. The quality of compression, as defined by the ability to predict the next token given a prompt, progressively improves (as measured by negative log-likelihood) during training. We find that complexity of the representation manifold however, evolves non-mononitically in distinct phases across pretraining and post-training. Excited to share our #NeurIPS2025 📄 led by our amazing undergrad @melody_zixuan where we study the complexity dynamics of LLMs, and how distinct phases relate to specific behaviors. 🧵👇

5,761

Mehdi Azabou @ NeurIPS

Blake Richards retweeted

Mehdi Azabou @ NeurIPS @mehdiazabou

1 Dec 2025

The Foundation Models for the Brain and Body workshop is happening this week at #NeurIPS2025 🏝️🧠 We have an amazing lineup of keynote speakers, spotlight talks, posters and demos. We can’t wait to welcome everyone on Saturday!

4,935

Blake Richards

Blake Richards @tyrell_turing

3 Dec 2025

2/ Most algorithms rely on decoupled agency—treating agents as separate from the environment. But in multi-agent settings, you are part of the world that others are modeling! We show how this insight, coupled with predictive models, can resolve social dilemmas in RL.

1,081

more replies

Blake Richards

Blake Richards @tyrell_turing

3 Dec 2025

19/ This work was spearheaded by Alexander Meulemans, Rajai Nasser, Rif A. Saurous and Joao Sacramento, with help from other members (e.g. @g_lajoie_ ) of the Google Paradigms of Intelligence team, led by @blaiseaguera and James Manyika.

707

Blake Richards

Blake Richards @tyrell_turing

3 Dec 2025

20/ I consider myself very lucky to be working with this team, and it's great to see this paper out!!! 🎉🎉🎉

621