Ismail Elezi

Ismail Elezi

47 Photos and videos

Tweets

Ismail Elezi @Ismail_Elezi

Apr 17

Claude is very dishonest, something I have been suspecting for a while. Played versions of this game multiple times, it always cheats: claude.ai/share/8f18bc7d-332… Sleeper agents do not even need to be trained, and alignment is nowher ready to be solved @AnthropicAI

128

Ismail Elezi

Ismail Elezi @Ismail_Elezi

Feb 27

Massive respect to @AnthropicAI for standing by their principles. The entire fate of humanity might be dependent on not doing stupid things required by people who do not know anything about AI.

141

Ismail Elezi

Ismail Elezi @Ismail_Elezi

11 Nov 2025

Did @iclr_conf just break OpenReview?

5,538

Ismail Elezi

Ismail Elezi @Ismail_Elezi

1 Jul 2025

Stats for a first time AC in @NeurIPSConf, while in the last 48h of the reviewing deadline. 27/56 (48%) of reviews submitted 1/14 papers has 4 reviews 3/14 papers have 3 reviews 4/14 papers have 2 reviews 6/14 papers have 1 review

511

Ismail Elezi

Ismail Elezi @Ismail_Elezi

2 Jul 2025

Update: 24 hours to go: 36/56 (64%) of reviews submitted. 2/14 papers have 4 reviews. 5/14 papers have 3 reviews. 6/14 papers have 2 reviews. 1/14 papers have 1 review.

249

Ismail Elezi

Ismail Elezi @Ismail_Elezi

3 Jul 2025

Deadline time: 49/56 (87.5%) of reviews submitted. 7/14 papers have 4 reviews. 7/14 papers have 3 reviews. All reviews are atleast ok level. One of the best reviews is from one of the biggest names on the field, which means that yes, you have enough time to write a good review.

186

Xin Wen

Ismail Elezi retweeted

Xin Wen @_xwen_

11 Jun 2025

Can we decouple semantics from spectrum for image tokenizers? Our answer: the 1D Semanticist tokenizer. We push the burden of photo-realistic image generation to diffusion decoders, and let the tokens focus on the semantic structure. The PCA-like structure is induced by nested CFG (dropout), allowing plausible reconstruction & generation with very few tokens, and coarse-to-fine hierarchy. Thanks to semantic-spectrum decoupling, our tokenizer also achieves strong performance on ImageNet linear probing, indicating potential for unified understanding and generation. To check for more details, see you 1-2 pm TODAY at ExHall D, GMCV Workshop! "Principal Components" Enable A New Language of Images Project Page: visual-gen.github.io/semanti… Code: github.com/visual-gen/semant…

18,394

Ismail Elezi

Ismail Elezi @Ismail_Elezi

17 Mar 2025

Really cool project with a simple general solution, while reaching SOTA results, lead by @t_gaintseva.

Tatiana Gaintseva

@t_gaintseva

17 Mar 2025

A new paper is out! CASteer: Steering Diffusion Models for Controllable Generation Arxiv link: arxiv.org/abs/2503.09630 Code: github.com/Atmyre/CASteer Diffusion models are powerful, but their generation process can be difficult to control, which poses safety risks (e.g., generating images with nudity/violence). There are many ideas on how to address this, but most of the existing approaches are limited in what issues they can handle and often require additional training. CASteer, on the other hand, is capable of handling broad range of tasks, while being completely training-free! CASteer works by constructing a special steering vector for each cross-attention layer in a diffusion model using prompt pairs that capture dspecific concepts. By adding or subtracting these vectors from the outputs of cross-attention layers during inference, we gain fine-grained control over the entire generation process. We can build steering vectors for any kind of concept, and this allows for broad range of manipulations over images being generated. We can add/remove objects (e.g., apples), alter abstract attributes (e.g., nudity), do style transfer, identity manipulation (switching Leonrado DiCaprio to Keanu Reevs), concept interpolation (going from cat to giragge), and more (see picture). Simplicity of CASteer allows for easy incorporation of it into most of the modern DMs. I would like to thank ChengCheng Ma and @Ismail_Elezi , who provided invaluable assistance in this project, as well as my university supervisors: Ziquan Liu, Martin Benning, Gregory Slabaugh and Jiankang Deng. Hope to have further great collaborations!

239

Ismail Elezi

Ismail Elezi @Ismail_Elezi

22 Jan 2025

First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educative paper I have co-wrote, it strengthens some claims known in the community, it opposes others, ...

1,154

more replies

Ismail Elezi

Ismail Elezi @Ismail_Elezi

22 Jan 2025

We will update the paper with the latest results, but the findings are identical to the current ArXiV version: arxiv.org/abs/2410.17174 On a personal note, always wanted to visit Singapore and this seems the perfect way to do so.

From Attention to Activation: Unravelling the Enigmas of Large...

We study two strange phenomena in auto-regressive Transformers: (1) the dominance of the first token in attention heads; (2) the occurrence of large outlier activations in the hidden states. We...

arxiv.org

182

Ismail Elezi

Ismail Elezi @Ismail_Elezi

22 Jan 2025

And on a fun note, 4 years after completing the computer vision holy trinity (CVPR, ICCV, ECCV), finally completed the machine learning conference trinity (NeurIPS, ICML, ICLR).

726

Ismail Elezi

Ismail Elezi @Ismail_Elezi

10 Jan 2025

#CVPR has done a very bad job in paper assignment this time for me. Out of 4 papers I am reviewing, only one has something to do with me, and even that is related to 2-4 years old papers I have. Not sure if I was unlucky, or the matching system has gone downhill.

243

Ismail Elezi

Ismail Elezi @Ismail_Elezi

15 Dec 2024

Max Tegmark talk at #NeurIPS2024 was funny, obvious, entertaining and concerning at the same time. The best AI talk I have heard in a very long time.

252

Ismail Elezi

Ismail Elezi @Ismail_Elezi

9 Dec 2024

I am very happy to attend NeurIPS in Vancouver when together with @Miles12Roy we will be presenting our VeLora paper on Thu 12 Dec 4:30 p.m. PST — 7:30 p.m. PST.

210

more replies

Ismail Elezi

Ismail Elezi @Ismail_Elezi

9 Dec 2024

have topic match (VLLMs, LLMs, multimodality learning, or diffusion) and are interested in doing an internship at Huawei Research Center in London, please write to me and let’s have a chat in the conference.

129

Ismail Elezi

Ismail Elezi @Ismail_Elezi

9 Dec 2024

We offer long internships (6 months), competitive salaries, an office in the center of London, and a very diverse group (very gender-balanced, researchers from 8 countries working on a wide range of topics).