Conference on Language Modeling

Conference on Language Modeling

18 Photos and videos

Tweets

Manuel Cherep retweeted

Conference on Language Modeling @COLM_conf

May 27

COLM 2026 will host 16(!) workshops: colmweb.org/workshops.html CFPs are all online, and deadlines are coming up, so check the CFP of your workshops of interest

16,938

Manuel Cherep

Manuel Cherep

@manuelcherep

Jun 2

✨Announcing the first Workshop on Agent Behavior @COLM_conf 2026 (Oct 9, San Francisco 🌅) aiagentbehavior.com/ We invite two types of contributions: (i) papers, and (ii) benchmark proposals. We are also seeking reviewers. More details below!

6,201

more replies

Manuel Cherep

Manuel Cherep

@manuelcherep

Jun 2

If you’re interested in being a reviewer, please fill out this form! forms.gle/629jCyuNMxipMtuE9

Reviewer Nomination — Workshop on Agent Behavior (COLM 2026)

Thank you for considering being a reviewer in the Workshop on Agent Behavior! Please fill in all mandatory fields, which will be used to identify you, assess your qualifications, and link you to your...

docs.google.com

502

Manuel Cherep

Manuel Cherep

@manuelcherep

Jun 2

Yours truly, the program committee 🙂 @manuelcherep (MIT) @_Hao_Zhu (Stanford) @StevenyzZhang (Georgia Tech Stanford) @Xinyang_Han_ (UC Berkeley) @BenSManning (MIT) @isi_magistrali (ETH) Saab Mansour (Amazon) @weronika_laj (Amazon) @PattieMaes (MIT) @nikhilsinghmus (Dartmouth)

214

Manuel Cherep

Manuel Cherep

@manuelcherep

Apr 21

ABxLab is accepted at @iclr_conf #ICLR 2026! ✨We ask: why do AI agents do what they do? 🧐 We introduce a framework for systematically studying AI agent behavior through controlled manipulations of their environments. We accomplish this by intercepting any real web environments and modifying what the agent sees in real time before they actually see it.

0:19

7,964

more replies

Manuel Cherep

Manuel Cherep

@manuelcherep

Apr 21

The world is also full of visual cues 👀, and you might be wondering whether agents are sensitive to these as well. The answer is yes! Check out our new paper, where we introduce an optimization method for editing images to understand VLMs’ decisions: x.com/manuelcherep/status/20…

Manuel Cherep

@manuelcherep

Mar 5

Some decisions we make with our eyes 👀, but what about VLMs? Do they have structured, exploitable visual preferences that we can discover systematically before adversarial actors do? In our new paper, we propose a new optimization method for this and show substantial effects on VLMs’ decisions.

0:26

195

Manuel Cherep

Manuel Cherep

@manuelcherep

Apr 21

Work with Chengtian Ma, Abigail Xu, Maya Shaked, @pattiemaes, @nikhilsinghmus 🌐Web: abxlab.media.mit.edu 💻Code: github.com/PapayaResearch/ab… 📄Paper: arxiv.org/abs/2509.25609 Would love to hear your thoughts!

150

Nikhil Singh

Manuel Cherep retweeted

Nikhil Singh @nikhilsinghmus

Apr 20

Excited to (finally) share this paper, accepted at @iclr_conf #ICLR 2026! ✨ In this work, we use sparse autoencoders (SAEs) to study the internal representations of generative music models (here, MusicGen) and automatically discover how they encode concepts.

0:33

148

15,605

Manuel Cherep

Manuel Cherep

@manuelcherep

Mar 5

0:26

2,583

more replies

Manuel Cherep

Manuel Cherep

@manuelcherep

Mar 5

In our recent ICLR 2026 paper, we showed how to study other kinds of sensitivities in agent behavior by using counterfactuals with our new framework (ABxLab) x.com/manuelcherep/status/19…

Manuel Cherep

@manuelcherep

23 Oct 2025

Replying to @manuelcherep

How does it work? ABxLAB is a "man-in-the-middle" framework. It intercepts web content in real-time to run controlled experiments on agents by modifying the choice architecture. Think of it as a behavioral science lab for LLMs. Paper: arxiv.org/abs/2509.25609 🧵2/9

524

Manuel Cherep

Manuel Cherep

@manuelcherep

Mar 5

Do you see like an agent? Try it yourself: visual-persuasion-website.ve… Paper: arxiv.org/abs/2602.15278 Co-Authors: Pranav M R, Pattie Maes (@PattieMaes), Nikhil Singh (@nikhilsinghmus)

Visual Persuasion

The web is littered with images, once created for human consumption and now increasingly interpreted by agents using vision-language models (VLMs). These agents make visual decisions at scale,...

visual-persuasion-website.vercel.app

155