PhD Student @EdinburghUni; studying temporally extended behaviours in both single and multi-agent RL

Joined August 2019
Pinned Tweet
Really excited to present our recent work at #ICLR2026 this week! We discover highly coordinated joint behaviours and integrate them into the skill sets of MARL agents, accelerating the search for effective joint strategies in downstream tasks.🧵 Paper: raulsteleac.github.io/iaro
1
4
16
1,529
Finally, we use this multi-dimensional n-distance as a state representation for eigenoption discovery, leading to coordinated alignment patterns that are effective in aiding teams of agents in multiple downstream tasks. Also works with heterogeneous agent state spaces.
1
1
2
70
Work done under the supervision of Mohan Sridharan and @dabelcs (many thanks)! If you’re at #ICLR2026 in Rio and want to chat, come find me during Poster Session 2! 🔥
1
2
99
Raul Steleac retweeted
Nice! (could not resist)
🚨 Are neural implicit representations applicable for larger-scale SLAM? Check out our NICE-SLAM👍! #CVPR2022 NICE website: pengsongyou.github.io/nice-s… NICE code: github.com/cvg/nice-slam NICE collaborations w/ Zihan Zhu (undergrad) @visionviktor @Martin_R_Oswald @mapo1 et al. 1/6
6
49
Raul Steleac retweeted
20 Apr 2022
A lot of people complain that RL doesn't work and RL researchers are still playing games. While this criticism is true to some extent, there's been a new trend of applying RL for real-life problems. This is a thread of notable papers split by the topic. 1/n

24
140
746
Raul Steleac retweeted
14 Jun 2021
23 and 24 year olds able to book their NHS vaccine appointments from tomorrow.
12
54
240
Raul Steleac retweeted
Discover how WaveNet has evolved from research concept to advanced real-world system that creates more natural-sounding speech and helps @Google unblock communication barriers for millions of people around the world: dpmd.ai/wavenet
4
62
208
Raul Steleac retweeted
14 May 2021
Chongus
17
24
2,856
Raul Steleac retweeted
Diffusion Models Beat GANs on Image Synthesis. Achieves 3.85 FID on ImageNet 512×512 and matches BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. arxiv.org/abs/2105.05233
8
106
581
Raul Steleac retweeted
Today @iclr_conf - Women in Machine Learning (@WIML) at 2PM - Philosophy and AGI at 5PM with @dabelcs, @clarelyle and @jakeABeck (@UniOfOxford) There are also various poster sessions happening today from 5PM - see the full schedule here: dpmd.ai/ICLR21 #ICLR2021
20
120
Raul Steleac retweeted
In addition to MT-Opt, we are releasing Actionable Models, which addresses the problem of defining tasks (which becomes quite cumbersome at scale). This work uses the dataset collected by MT-Opt but uses goal-conditioned offline Q-learning to learn a general goal-reaching policy.
Excited to present our new work on Actionable Models, an approach for learning functional understanding of the world via goal-conditioned Q-functions in a fully-offline setting! paper: arxiv.org/abs/2104.07749 website: actionable-models.github.io youtube.com/watch?v=S3SCR7iY…
1
14
30
Raul Steleac retweeted
Most RL agents assume that rewards are caused by recent actions, and learn slowly when this isn't true. This new method speeds up learning in tasks with delayed reward by learning to link related events - regardless of how much time separates them. dpmd.ai/12425
10
142
677
Raul Steleac retweeted
Thrilled to announce our first major breakthrough in applying AI to a grand challenge in science. #AlphaFold has been validated as a solution to the ‘protein folding problem’ & we hope it will have a big impact on disease understanding and drug discovery: deepmind.com/alphafold-blog

149
1,813
7,618
Raul Steleac retweeted
The break conundrum
28 Nov 2020
1
9
Raul Steleac retweeted
14 Aug 2020
Everyone has heard about fast.ai or CS231n (for a good reason), but did you know you can access Stanford’s CS224w ML with Graphs or download the book Elements of Causal Inference for free? Thread on underappreciated ML resources 📚🎥 that deserve more love 👇 /1
29
903
3,561