Joined April 2008
1 Photos and videos
Guy Van den Broeck retweeted
Congratulations @itisalex3 on acceptance to #ACL2026 🎉 If you are interested in KV cache compression, I highly recommend reading this paper.
16 Oct 2025
What happens when we compress the KV cache of prompts with multiple instructions? 🤔 Existing compression methods can lead to some instructions being ignored. 🙀 We propose simple changes to KV cache eviction that fix this problem alongside other pitfalls to be aware of. 💯
3
27
2,323
Guy Van den Broeck retweeted
Very excited to be leading this research direction, employing probabilistic programming to improve LLM inference for code generation.
When you prompt an LLM for code, you get one deterministic program. However, the LLM actually defines a distribution over many programs, and existing methods discard it‼️ PPoT uses this distribution to extract free performance and efficiency gains. 🧵👇
1
4
44
7,212
Guy Van den Broeck retweeted
When you prompt an LLM for code, you get one deterministic program. However, the LLM actually defines a distribution over many programs, and existing methods discard it‼️ PPoT uses this distribution to extract free performance and efficiency gains. 🧵👇
1
17
66
11,424
Guy Van den Broeck retweeted
We just implemented trai (try AI) (github.com/binpash/trai), a Claude plugin that can help you isolate 🫷 changes done in your file system by tool calls (like pip install), and only commit them if they are intended ☺️. Try it out (pun intended) and share your feedback!
2
2
8
710
We put probabilistic circuits into diffusion language models and got a big boost in reasoning performance!
One of the biggest promises of Diffusion LLMs is parallel generation: predicting multiple tokens at once to bypass the sequential bottleneck of autoregressive models. However, parallel generation comes with a price. For example: Should the sentence “He is from [MASK] [MASK]” be filled with [New] [York] or [San] [Diego]? If a diffusion model predicts both at the exact same time, it assumes independence and may produce... [San] [York]. 🤦‍♂️ We argue this arises from a structural misspecification: models are restricted to fully factorized outputs because parameterizing the full joint distribution would require a prohibitively massive output head. This is the Factorization Barrier crippling parallel generation. Here is how we broke it with CoDD.
12
1,147
Guy Van den Broeck retweeted
Check out our most recent work on dLLM!
One of the biggest promises of Diffusion LLMs is parallel generation: predicting multiple tokens at once to bypass the sequential bottleneck of autoregressive models. However, parallel generation comes with a price. For example: Should the sentence “He is from [MASK] [MASK]” be filled with [New] [York] or [San] [Diego]? If a diffusion model predicts both at the exact same time, it assumes independence and may produce... [San] [York]. 🤦‍♂️ We argue this arises from a structural misspecification: models are restricted to fully factorized outputs because parameterizing the full joint distribution would require a prohibitively massive output head. This is the Factorization Barrier crippling parallel generation. Here is how we broke it with CoDD.
3
10
1,306
Guy Van den Broeck retweeted
Check out our recent work offering a principled way to perform parallel prediction (few-step generation) in Diffusion LLMs with minimal performance degradation!
One of the biggest promises of Diffusion LLMs is parallel generation: predicting multiple tokens at once to bypass the sequential bottleneck of autoregressive models. However, parallel generation comes with a price. For example: Should the sentence “He is from [MASK] [MASK]” be filled with [New] [York] or [San] [Diego]? If a diffusion model predicts both at the exact same time, it assumes independence and may produce... [San] [York]. 🤦‍♂️ We argue this arises from a structural misspecification: models are restricted to fully factorized outputs because parameterizing the full joint distribution would require a prohibitively massive output head. This is the Factorization Barrier crippling parallel generation. Here is how we broke it with CoDD.
1
10
1,051
Guy Van den Broeck retweeted
One of the biggest promises of Diffusion LLMs is parallel generation: predicting multiple tokens at once to bypass the sequential bottleneck of autoregressive models. However, parallel generation comes with a price. For example: Should the sentence “He is from [MASK] [MASK]” be filled with [New] [York] or [San] [Diego]? If a diffusion model predicts both at the exact same time, it assumes independence and may produce... [San] [York]. 🤦‍♂️ We argue this arises from a structural misspecification: models are restricted to fully factorized outputs because parameterizing the full joint distribution would require a prohibitively massive output head. This is the Factorization Barrier crippling parallel generation. Here is how we broke it with CoDD.
8
30
311
22,806
Guy Van den Broeck retweeted
Join our reading group next Monday! Paper: Planned Diffusion Presenters: Daniel Israel (@danielmisrael), Tian Jin (@jintian)
📢Feb 2 (Mon): Planned Diffusion 🙅Diffusion language models are capable of parallelizing text generation but can struggle with coherence in low time-step regimes. 💡Planned Diffusion unlocks a new axis of parallelism: Token-level parallelism ➡️ semantic parallelism ✍️Planned diffusion first generates a structured plan, then diffuses semantically independent spans of text in parallel according to the plan. This Monday, Daniel Israel (UCLA) (@danielmisrael) and Tian Jin (MIT) (@jintian) will discuss their exciting Planned Diffusion paper as joint first authors. Collaborators: Ellie Cheng (elliecheng.com/), Guy Van den Broeck (@guyvdb), Aditya Grover (@adityagrover_), Suvinay Subramanian (@suvinay), Michael Carbin (@mcarbin) Paper link: arxiv.org/abs/2510.18087
4
17
2,587
Guy Van den Broeck retweeted
📢Feb 2 (Mon): Planned Diffusion 🙅Diffusion language models are capable of parallelizing text generation but can struggle with coherence in low time-step regimes. 💡Planned Diffusion unlocks a new axis of parallelism: Token-level parallelism ➡️ semantic parallelism ✍️Planned diffusion first generates a structured plan, then diffuses semantically independent spans of text in parallel according to the plan. This Monday, Daniel Israel (UCLA) (@danielmisrael) and Tian Jin (MIT) (@jintian) will discuss their exciting Planned Diffusion paper as joint first authors. Collaborators: Ellie Cheng (elliecheng.com/), Guy Van den Broeck (@guyvdb), Aditya Grover (@adityagrover_), Suvinay Subramanian (@suvinay), Michael Carbin (@mcarbin) Paper link: arxiv.org/abs/2510.18087
1
9
47
10,365
Guy Van den Broeck retweeted
19 Dec 2025
@RealAAAI⁩ AAAI2021Conference - Neuro-Symbolic AI Panel, during the COVID-19 crisis. With ⁦@kerstingAIML⁩ ⁦@guyvdb⁩ ⁦@mattbotvinick⁩ Marta Kwiatkowska, Leslie Pack Kaelbling. Just a picture 🙂
2
8
593
Guy Van den Broeck retweeted
3 Nov 2025
Recordings of the NeSy 2025 keynotes are now available! 🎥 Check out insightful talks from @guyvdb , @tkipf and @dlmcguinness on our new Youtube channel. Topics include using symbolic reasoning for LLM, and object-centric representations youtube.com/@NeSyconference
5
14
2,994
I gave a keynote at @nesyconf on "Symbolic Reasoning in the Age of Large Language Models" Check out the recording if you are curious about neurosymbolic generative AI: youtube.com/watch?v=OtmiJRTl…
7
42
4,188
Guy Van den Broeck retweeted
16 Oct 2025
What happens when we compress the KV cache of prompts with multiple instructions? 🤔 Existing compression methods can lead to some instructions being ignored. 🙀 We propose simple changes to KV cache eviction that fix this problem alongside other pitfalls to be aware of. 💯
2
3
25
5,078
Guy Van den Broeck retweeted
22 Oct 2025
Plan autoregressively, denoise in parallel!
"An hour of planning can save you 10 hours of doing." ✨📝 Planned Diffusion 📝 ✨ makes a plan before parallel dLLM generation. Planned Diffusion runs 1.2-1.8× faster than autoregressive and an order of magnitude faster than diffusion, while staying within 0.9–5% AR quality.
2
6
1,404
Guy Van den Broeck retweeted
22 Oct 2025
Diffusion 🤝 Autoregressive Fast high-quality generation
"An hour of planning can save you 10 hours of doing." ✨📝 Planned Diffusion 📝 ✨ makes a plan before parallel dLLM generation. Planned Diffusion runs 1.2-1.8× faster than autoregressive and an order of magnitude faster than diffusion, while staying within 0.9–5% AR quality.
2
2
648
Guy Van den Broeck retweeted
"An hour of planning can save you 10 hours of doing." ✨📝 Planned Diffusion 📝 ✨ makes a plan before parallel dLLM generation. Planned Diffusion runs 1.2-1.8× faster than autoregressive and an order of magnitude faster than diffusion, while staying within 0.9–5% AR quality.
7
47
317
38,722
Guy Van den Broeck retweeted
🔦Adaptive Parallel Decoding (APD) has been accepted as a spotlight paper at @NeurIPSConf ! I thank my collaborators, reviewers, and program organizers for this honor. A thread for those interested 🧵 (1/n)
12
24
172
17,045
Guy Van den Broeck retweeted
8 Sep 2025
Replying to @e_giunchiglia
How can reverend Bayes help us to incorporate constraints? With NeSy of course 👀 With applications in non-toxic LLM generation and safe AI driving! @guyvdb
1
1
6
492