1/6 🏆 Thrilled our paper, "MirrorCheck: Efficient Adversarial Defense for Vision-Language Models," won the Distinguished Paper Award at AdvML@CV CVPR 2026! 🌐 Project: tinyurl.com/3675hh5y w/ @Samar_M_Fares @ziu_klea @tolusophy @TakacMartin @FuaPv 👇

164

Toluwani Aremu

Toluwani Aremu

@tolusophy

Jun 1

Mirror Mirror on the Wall: The Serendipitous Journey Behind MirrorCheck 🧩✨ Every research paper has a polished, sanitized version that ends up on arXiv. It presents a seamless narrative of hypothesis, experimentation, and triumph. But behind our recent Distinguished Paper Award at the 6th AdvML@CV Workshop (CVPR 2026) lies a real and funny story—one involving a touch of the divine, a healthy dose of academic skepticism, and a classic fairytale metaphor. Here is how MirrorCheck actually came to life. Act I: Into the VLM Wilderness 🌲 It all started during the second semester of my first year as a PhD student. I was taking a course on Vision-Language Models (VLMs) taught by Prof. Ivan Laptev. The final requirement was simple yet daunting: execute a successful project within the scope of VLMs. I teamed up with @Samar_M_Fares and @ziu_klea . Right from day one, we knew we wanted to attack a massive vulnerability in the space: adversarial defense for VLMs. The catch? At the time, no dedicated detection method existed specifically for VLMs. We were staring at a blank canvas and had absolutely no idea where to start. We engineered a mountain of approaches. We failed, iterated, and failed again. Eventually, we stumbled upon a chaotic technique that actually seemed to work. Frankly, we didn't know why it worked...at the time, it felt like a direct touch of God. Our chaotic first approach looked like this: Input Image ➡️ VLM ➡️ Caption ➡️ T2I Generator ➡️ Overlay Image on Input (High Strength) ➡️ Relook at Caption Given an input image (clean or adversarial), we’d feed it into a victim VLM and extract the output text caption. We then took that text, generated a brand-new image from it, and overlayed this new synthetic image directly on top of the original using a calibrated strength parameter. We fed this heavily overlayed image back into the VLM, extracted a new caption, and compared it to the original. A massive semantic distance meant the original image was an adversarial manipulation. It worked! Not perfectly, but well enough to give us hope. Act II: "This Just Doesn't Make Sense" 😂 Armed with our suspicious preliminary findings, we quietly took our results to Prof. Ivan. While he was genuinely impressed by the numbers, I will never forget his hilariously candid remark: "While I'm a huge fan of simple tricks to solve a mystery, this just doesn't seem to make sense." 😂😂😂 He wasn’t wrong. Our underlying intuition was inspired by an earlier study showing that adding random noise to an image can neutralize or "purify" adversarial features. However, adaptive attackers easily bypass that by designing adversarial features that anticipate random noise. We had thought: What if instead of random noise, we used fine-grained semantic noise—the regenerated images? The problem was, it only worked when the calibrated strength of the overlay was cranked up high. Visually, the resulting image looked absolutely disgusting. It worked, but we desperately needed a rigorous, logical explanation for it. Act III: The Pivot to "Replaying" 🔁 Recognizing the potential, Prof. Ivan introduced us to @nikitadurasov, a student of his colleague Prof. Pascal Fua. Nikita specialized in uncertainty estimation, and there were fascinating conceptual crossovers between our approach and his paper, Zigzag. We jumped into intense brainstorming sessions with Nikita, initially trying to explicitly adapt his Zigzag framework to our problem. It flat-out didn't work. But breakthroughs happen when you least expect them. During one of our sessions, Nikita proposed adding noise to selected layers inside the victim model and "replaying" the original image to estimate the model’s internal prediction uncertainty. That word—replaying—stuck in my brain. Right there in the middle of the meeting, a lightning bolt struck (for dramatic effect). I thought of a variation of our original, "not-so-sensemaking" approach. Instead of overlaying the disgusting synthetic image back onto the original, why not just compare the two generated images directly? We ran it. It worked beautifully. I blurted it out right there in the meeting: "At least this one makes so much sense!" Act IV: The Psychology of a Horse and Snow White 🐴🍎 The logic mimics human psychology. Humans know something is fishy based on prior experiences. If I own a brown horse with small white spots, and one day I walk into the stable and see a similar horse but with noticeably larger white spots, I instinctively know something is wrong. By comparing the VLM's visual interpretation directly against the source, we were capturing that exact cognitive discrepancy. As we scaled our experiments and had more frequent general meetings with our Professors, things rapidly fell into place, and the framework began to solidify. One morning, we woke up to a brilliant title proposed by Nikita: MirrorCheck. It was inspired by the iconic scene in Snow White where the Evil Queen demands, "Mirror, mirror on the wall, who is the fairest of all?"—only to see Snow White’s face staring back at her instead of her own. An adversarial image demands a specific malicious output from the VLM, but when passed through our defense, the mirror reflects the true, underlying reality. It was perfect. Act V: From Rejection to the Big Stage 🏆 Getting the work accepted was its own grueling mountain to climb, but the journey has been nothing short of surreal. The First Iteration: Focused heavily on defending against general attacks and utilizing a unique One-Time-Use (OTU) noise mechanism to completely shatter the optimization space for adaptive attackers. The Current Iteration: The framework that secured the Distinguished Paper Award focuses on amplifying model uncertainty through strategic, stochastic model selection and layer perturbations. Today, MirrorCheck has come a long way from a "simple trick that doesn't make sense" to an award-winning framework cited as crucial prior work in the broader domain of Trustworthy Machine Learning (yes, our work has been adapted to other domains too😉). Pen drop. 🖋️ #CVPR2026 #AISafety #ResponsibleAI #MBZUAI #EPFL #AdversarialRobustness #TrustworthyAI

Toluwani Aremu

@tolusophy

Jun 1

🏆 Thrilled to announce that our paper "MirrorCheck: Efficient Adversarial Defense for Vision-Language Models" won the Distinguished Paper Award at the 6th AdvML@CV workshop (#CVPR2026)! 🧵👇

2:29

234

Toluwani Aremu

Toluwani Aremu

@tolusophy

Jun 1

🏆 Thrilled to announce that our paper "MirrorCheck: Efficient Adversarial Defense for Vision-Language Models" won the Distinguished Paper Award at the 6th AdvML@CV workshop (#CVPR2026)! 🧵👇

2:29

556

Meghan 💋 | Digital Muse

Meghan 💋 | Digital Muse @iammeghan69

May 28

Un abrigo largo para cubrir las apariencias y un reflejo que cuenta la verdadera historia de hoy. El juego apenas comienza. 🧥🔑👁️A long coat to cover appearances and a reflection that tells today's real story. The game has just begun. ✨#iammeghan69 #MirrorCheck

ALT Fotografía de cuerpo entero de una mujer joven posando de perfil frente a un gran espejo de marco oscuro. Viste una gabardina larga abierta color camel, lencería de encaje negro con tiras estilo arnés, medias de liga negras con encaje alto en los muslos y zapatos de tacón de aguja negros. En primer plano se observa su espalda y costado cubiertos por el abrigo, mientras que el reflejo del espejo muestra su figura frontal completa con lencería y un collar grande. Al fondo hay una silla gris de terciopelo y un jarrón con plumas de pavo real.Full-length photograph of a young woman posing in profile in front of a large, dark-framed mirror. She is wearing an open long camel-colored trench coat, black lace lingerie with harness-style straps, black thigh-high stockings with wide lace, and black stiletto heels. The foreground shows her back and side partially covered by the coat, while the mirror reflection reveals her full frontal figure.

MuscleGrowth💪

MuscleGrowth💪

@mrk_hartai

May 27

Post idea: My reflection really thought it could be stronger than me… Not happening. No bro, these arms are bigger — and I’m flexing them just to prove the point. I don’t compete with mirrors. I remind them who’s real. #NotBro #TheseArmsAreBigger #MirrorCheck #FlexMode #Bodybuilding #GymLife #NoCompetition #MuscleMindset #LockedIn #BeastMode #ArmDay #Biceps #Triceps #PumpCheck #GymMotivation #Bodybuilder #FitnessMotivation #MirrorVsMe #NoLimits #StrongerEveryday #NeverBackDown #DontTestMe #MassMonster #Flexing #GymBeast #Physique #Shredded #Muscle #WorkoutMotivation #AlphaEnergy

108

5,076

Louisa

Louisa @CupcakeCutieLu

May 26

mirror check before I go out… or should I say, mirror tease? 😏🩷 #mirrorcheck #mirrortease

175

Dan Zi Ger

Dan Zi Ger

@Orendans

May 24

These gym photos that you just can't stay indifferent to I showed you mine.. now show me yours ✌🏽 #gym #gay #mirrorcheck

1,678

christianccf

christianccf @christianccf

May 20

“𝐆𝐲𝐦, 𝐠𝐫𝐢𝐧𝐝, 𝐠𝐥𝐨𝐰 𝐮𝐩.” 🐻🔥😀💪🏻 #𝐌𝐢𝐫𝐫𝐨𝐫𝐂𝐡𝐞𝐜𝐤 #𝐆𝐲𝐦𝐕𝐢𝐛𝐞𝐬 #𝐌𝐞𝐧𝐬𝐒𝐭𝐲𝐥𝐞 #𝐂𝐨𝐧𝐟𝐢𝐝𝐞𝐧𝐜𝐞 #𝐑𝐞𝐞𝐥𝐈𝐭𝐅𝐞𝐞𝐥𝐈𝐭

0:07

125

5,230

55,699

Janelle M

Janelle M @JanelleMonk_

May 8

dangerous energy. 😇🤍 you've been warned. #cozygirl #casualfit #mirrorcheck #dancechallenge

0:11

763

Brianna Alessandra

Brianna Alessandra

@BriannaAless92

May 7

good morning! it's almost friday! lets finish this week strong 🙌🏼 hope everyone has an amazing day! #MorningWorkout #MirrorCheck #GymFit

0:08

261

7,077

Amorialab

Amorialab @Amorialab1

May 6

Three seconds in front of the mirror. That’s when you know. Before the robe goes back on. Before anyone sees. That pause is private. That pause is yours. Shop lingerie at amorialab.com. #LingerieMood #MirrorCheck #Amorialab

114

abrikoska.Leja

abrikoska.Leja @NemfisZ

May 5

"Просто сиджу, просто виглядаю 🔥 А ви як починаєте свій день?" #MirrorSelfie #TattooedGirl #AltGirl #AsianBeauty #InkedWomen #DenimShorts #StripShirt #Aesthetic #Egirl #TattooLovers #BlackHair #SoftGirl #MirrorCheck #DailyVibes #GrungeStyle #TattooArt #SelfieTime #RoomAesthetic

222

christianccf

christianccf @christianccf

May 5

“𝑴𝒊𝒓𝒓𝒐𝒓 𝒄𝒉𝒆𝒄𝒌: 𝑻𝒉𝒆 𝒐𝒏𝒍𝒚 𝒄𝒐𝒎𝒑𝒆𝒕𝒊𝒕𝒊𝒐𝒏 𝒊𝒔 𝒕𝒉𝒆 𝒎𝒂𝒏 𝒔𝒕𝒂𝒓𝒊𝒏𝒈 𝒃𝒂𝒄𝒌.” 💪🏻🐻😏🪞🔥 #𝑴𝒊𝒓𝒓𝒐𝒓𝑪𝒉𝒆𝒄𝒌 #𝑴𝒖𝒔𝒄𝒍𝒆𝑷𝒖𝒎𝒑 #𝑮𝒚𝒎𝑭𝒐𝒄𝒖𝒔 #𝑩𝒐𝒅𝒚𝑷𝒓𝒐𝒈𝒓𝒆𝒔𝒔 #𝑩𝒆𝒂𝒓𝒅𝒆𝒅𝑴𝒖𝒔𝒄𝒍𝒆

104

476

9,952

155,191

High Country Journal

High Country Journal

@HighCountryJNL

May 5

Mirror said “one pic”… camera roll said “we’re doing 47 just to be safe.” 😏 #MirrorCheck #OutfitOfTheDay

Brianna Alessandra

Brianna Alessandra

@BriannaAless92

May 3

just making sure everything looks right… you know how it is #MirrorCheck #DallasTX

725

18,135

Ekaterina

Ekaterina @ek4t3rina

Apr 30

Маленький подарок для вашей ленты #попка #фигура #селфи #эстетика #девушка #booty #curves #NSFW #mirrorcheck #нюдсочетверг

4,161

AsianDelight

AsianDelight @ASIANDELIGHTAI

Apr 28

2 hours of getting ready and the mirror still dragged me 😭✨ Peace up top, but this cake in booty shorts said ‘not today’ 🍑 Who else living in delulu era? Drop a 😂 #Fanvue #FunnyReels #DeluluEra #MirrorCheck #GlowUpFail

0:10

421