Principal Research Scientist at @Huawei Noah Ark in London (before that @TUM @Nvidia). Views are my own.

Joined December 2011
47 Photos and videos
Claude is very dishonest, something I have been suspecting for a while. Played versions of this game multiple times, it always cheats: claude.ai/share/8f18bc7d-332… Sleeper agents do not even need to be trained, and alignment is nowher ready to be solved @AnthropicAI

1
2
128
Massive respect to @AnthropicAI for standing by their principles. The entire fate of humanity might be dependent on not doing stupid things required by people who do not know anything about AI.
1
3
141
Did @iclr_conf just break OpenReview?
2
18
5,538
Stats for a first time AC in @NeurIPSConf, while in the last 48h of the reviewing deadline. 27/56 (48%) of reviews submitted 1/14 papers has 4 reviews 3/14 papers have 3 reviews 4/14 papers have 2 reviews 6/14 papers have 1 review
3
2
511
Update: 24 hours to go: 36/56 (64%) of reviews submitted. 2/14 papers have 4 reviews. 5/14 papers have 3 reviews. 6/14 papers have 2 reviews. 1/14 papers have 1 review.
1
1
249
Deadline time: 49/56 (87.5%) of reviews submitted. 7/14 papers have 4 reviews. 7/14 papers have 3 reviews. All reviews are atleast ok level. One of the best reviews is from one of the biggest names on the field, which means that yes, you have enough time to write a good review.
1
186
Ismail Elezi retweeted
11 Jun 2025
Can we decouple semantics from spectrum for image tokenizers? Our answer: the 1D Semanticist tokenizer. We push the burden of photo-realistic image generation to diffusion decoders, and let the tokens focus on the semantic structure. The PCA-like structure is induced by nested CFG (dropout), allowing plausible reconstruction & generation with very few tokens, and coarse-to-fine hierarchy. Thanks to semantic-spectrum decoupling, our tokenizer also achieves strong performance on ImageNet linear probing, indicating potential for unified understanding and generation. To check for more details, see you 1-2 pm TODAY at ExHall D, GMCV Workshop! "Principal Components" Enable A New Language of Images Project Page: visual-gen.github.io/semanti… Code: github.com/visual-gen/semant…
2
5
16
18,394
Really cool project with a simple general solution, while reaching SOTA results, lead by @t_gaintseva.
A new paper is out! CASteer: Steering Diffusion Models for Controllable Generation Arxiv link: arxiv.org/abs/2503.09630 Code: github.com/Atmyre/CASteer Diffusion models are powerful, but their generation process can be difficult to control, which poses safety risks (e.g., generating images with nudity/violence). There are many ideas on how to address this, but most of the existing approaches are limited in what issues they can handle and often require additional training. CASteer, on the other hand, is capable of handling broad range of tasks, while being completely training-free! CASteer works by constructing a special steering vector for each cross-attention layer in a diffusion model using prompt pairs that capture dspecific concepts. By adding or subtracting these vectors from the outputs of cross-attention layers during inference, we gain fine-grained control over the entire generation process. We can build steering vectors for any kind of concept, and this allows for broad range of manipulations over images being generated. We can add/remove objects (e.g., apples), alter abstract attributes (e.g., nudity), do style transfer, identity manipulation (switching Leonrado DiCaprio to Keanu Reevs), concept interpolation (going from cat to giragge), and more (see picture). Simplicity of CASteer allows for easy incorporation of it into most of the modern DMs. I would like to thank ChengCheng Ma and @Ismail_Elezi , who provided invaluable assistance in this project, as well as my university supervisors: Ziquan Liu, Martin Benning, Gregory Slabaugh and Jiankang Deng. Hope to have further great collaborations!
1
4
239
First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educative paper I have co-wrote, it strengthens some claims known in the community, it opposes others, ...
1
8
1,154
We will update the paper with the latest results, but the findings are identical to the current ArXiV version: arxiv.org/abs/2410.17174 On a personal note, always wanted to visit Singapore and this seems the perfect way to do so.
1
182
And on a fun note, 4 years after completing the computer vision holy trinity (CVPR, ICCV, ECCV), finally completed the machine learning conference trinity (NeurIPS, ICML, ICLR).
4
726
#CVPR has done a very bad job in paper assignment this time for me. Out of 4 papers I am reviewing, only one has something to do with me, and even that is related to 2-4 years old papers I have. Not sure if I was unlucky, or the matching system has gone downhill.
1
243
Max Tegmark talk at #NeurIPS2024 was funny, obvious, entertaining and concerning at the same time. The best AI talk I have heard in a very long time.
1
252
I am very happy to attend NeurIPS in Vancouver when together with @Miles12Roy we will be presenting our VeLora paper on Thu 12 Dec 4:30 p.m. PST — 7:30 p.m. PST.
1
4
210
have topic match (VLLMs, LLMs, multimodality learning, or diffusion) and are interested in doing an internship at Huawei Research Center in London, please write to me and let’s have a chat in the conference.
1
129
We offer long internships (6 months), competitive salaries, an office in the center of London, and a very diverse group (very gender-balanced, researchers from 8 countries working on a wide range of topics).
1
99