Joined November 2016
48 Photos and videos
Pinned Tweet
We've got really good at utilizing data. But methods for acquiring that data are often still rudimentary. Our new review paper shows how Bayesian experimental design has recently transformed to now provide a powerful mechanism to acquire data intelligently arxiv.org/abs/2302.14545
2
30
107
15,190
Tom Rainforth retweeted
Active testing enables label-efficient model evals but can be computationally expensive. We show how to reduce costs and scale up to LLMs. arxiv.org/abs/2508.09093 Work led by Gabrielle Berrada. Find her at EurIPS, or @janundnik and me at NeurIPS in San Diego.
3
7
14
7,298
Tom Rainforth retweeted
Apple and Oxford just made AI 6.5x better at problem-solving. The secret: it teaches AI agents to ask perfect questions. This rockets success rates from 14% to 91%. No need for fine-tuning or retraining. It runs on current models. Here's how it works: It's a strategic loop designed for multi-turn conversations. At every step, the agent works to find the shortest path to the right answer. Hypothesize: The agent creates an internal list of all possible solutions to the problem. Score Questions: It simulates asking various questions and scores each one on "Expected Information Gain" (EIG). This number represents how much a question is mathematically likely to shrink the list of possibilities. Ask the Best Question: It asks the user only the single, highest-scoring question. Update & Repeat: Based on the answer, it filters its list of hypotheses, getting smarter with each interaction, and then begins the loop again for the next turn. Why this matters for your AI strategy: This marks a shift from building passive "oracles" to proactive, question-asking agents Business Leaders: A 6.5x multiplier on task success is a lever for efficiency. This translates to fewer failed customer interactions, faster diagnostics, and more accurate personalization, a clear ROI on smarter AI. Practitioners: This is a deployment-time framework, not a new model. You can build this agent on top of existing LLMs today. It provides a principled way to overcome common multi-turn issues like inconsistency and context loss without fine-tuning or retraining. Researchers: This paper is a victory for information theory. It proves that a full EIG calculation is superior to heuristics like predictive entropy. It sets a new standard for how to build intelligent information-seeking agents.
23
136
830
89,397
I have an opening for a 2-year postdoc in probabilistic machine learning and/or experimental design. The application deadline is the 3rd of September. See here for details and how to apply: tinyurl.com/rainmlpostdoc202…

11
39
3,229
I have an opening for a 2.5-year postdoc position in the RainML lab as part of my ERC grant on probabilistic machine learning and intelligent data acquisition. Application deadline 10th July 2024. See here for details and to apply: tinyurl.com/rainmlpostdoc

1
21
53
11,491
I'm delighted to announce that from September I will officially be an Associate Professor (remaining at the Oxford stats department)
17
2
171
16,235
Our new #ICLR2024 paper shows how LLMs can successfully check their own change of thought reasoning without any fine-tuning or even examples, using an approach we call SelfCheck. Join me at poster 125 this afternoon to learn more Paper: openreview.net/forum?id=pTHf…
1
4
26
3,710
All credit goes to my fantastic coauthors @nmstatistics and @yeewhye
1
588
In-context learning can learn novel input-output relationships beyond what can be picked up from input context alone, but doesn't behave like conventional learning algorithm. Find out more at our ICLR poster #129 this afternoon. Paper: openreview.net/forum?id=YPIA…, led by @janundnik
1
12
1,441
Tom Rainforth retweeted
Are you at ICLR? Have you heard that In-Context Learning in LLMs does not learn label relationships? Well that's not true. Visit our poster TODAY to find out how LLMs incorporate label information. Spoiler: it's not Bayesian inference. Poster #129, May 7, 4.30 pm
1
9
35
5,609
Tom Rainforth retweeted
I will be presenting our work on "Beyond Bayesian Model Averaging over Paths in Probabilistic Programs with Stochastic Support" at AISTATS in Valencia tomorrow (details in thread below). If you are interested in probabilistic programming, come and say hi at poster session 1!
Our new paper (arxiv.org/abs/2310.14888) shows that probabilistic programs with stochastic support are implicitly Bayesian model averages (BMA) which leads to issues if we assume our model is misspecified! w/ @TimReichelt3 and Luke Ong A thread (1/5)
1
20
1,404
Tom Rainforth retweeted
It is @NeurIPS time again! I am excited to present our trans-dimensional jump diffusion work with @AndrewC_ML @willarvey @ValentinDeBort1 @tom_rainforth and @ArnaudDoucet1 ! Come over on Thursday 2nd poster session, neurips.cc/virtual/2023/post…. arxiv.org/abs/2305.16261 #NeurIPS2023

1
5
16
2,230
Tom Rainforth retweeted
13 Dec 2023
Interested in large language models? Worried about impacts of climate change? Come join us @oxcsml @NatureRecovery @UniofOxford in pushing the frontiers in #LLMs and at the same time help #NatureRecovery and address the impacts of #ClimateChange! bit.ly/4750IBO

1
23
54
25,400
Tom Rainforth retweeted
12 Dec 2023
We construct neural processes by iteratively transforming a simple stochastic process into an expressive one, similar to flow/diffusion-based models, but in function space! Join us at our #NeurIPS2023 poster session: neurips.cc/virtual/2023/post… on Wednesday morning!
2
8
44
5,174
Incredibly well deserved. Congratulations Adam!
Wow. Just heard I've been awarded the Corcoran Memorial Prize for my DPhil thesis by @OxfordStats ! It's wonderful and unexpected news :) A double thank you to my supervisors @tom_rainforth @yeewhye for four years terrific guidance and also for putting me forward for the award
7
1,595
Our new paper (arxiv.org/abs/2310.14888) shows that probabilistic programs with stochastic support are implicitly Bayesian model averages (BMA) which leads to issues if we assume our model is misspecified! w/ @TimReichelt3 and Luke Ong A thread (1/5)

1
10
50
7,166
For probabilistic programs, one damning failure case is that the SLP weights tend to be overconfident, i.e. overly collapsing onto a single SLP. This means inferences might change drastically based on nominal changes in the data and will lead to sub-optimal predictions. (4/5)
1
2
409
We propose alternative schemes that set the weights by optimizing a predictive objective, leveraging ideas from stacking and PAC-Bayes. They can be implemented as cheap post-processing steps on top of existing inferences and lead to more robust weights & better predictions. (5/5)
1
359