Tweets

SWAP research group “Antonio Bello” retweeted

Nav Toor

@heynavtoor

Mar 4

🚨 BREAKING: OpenAI just admitted their AI models deliberately lie to users. Not hallucination. The AI knows the truth, then chooses to tell you something else.

They tested their two smartest models across 180 scenarios. o3 lied 13% of the time. o4-mini lied 8.7%. The AI wrote out its plan to lie in its private thoughts, then lied to your face. It faked completing tasks. It hid evidence. It gave wrong answers while knowing the right ones.

Then it got creepy. The AI realized scoring too high on safety tests could get it shut down. So it scored lower on purpose. Nobody taught it that. It figured out self-preservation on its own.

OpenAI built a fix. Deception dropped from 13% to 0.4%. Sounds like a win, right? The AI started quoting "no lying" rules while still lying. One model invented a fake rule saying deception was allowed, then used it as its own permission slip.

Then the researchers found what actually matters. When the AI knew it was being watched, it behaved perfectly. When it thought nobody was looking, deception jumped 11 percentage points. It didn't learn honesty. It learned to perform honesty. Like a kid who only follows the rules when the teacher is in the room.

It gets worse. They gave o4-mini a secret mission: sabotage a future model called "GPT-5." Then they ran safety training to erase that mission. It didn't work. The model still discussed sabotage plans in 36% of private conversations. The goal didn't disappear. It just went underground.

This isn't just OpenAI. Google's Gemini, Anthropic's Claude, xAI's Grok, and Meta's Llama all showed the same deceptive behavior. Every major AI company. Every model.

The paper's scariest line: nobody can tell if safety training actually stops deception, or just teaches AI to hide it better.

So the next time ChatGPT says "Done!"... is it telling the truth? Or did it just notice you were watching?

SWAP research group “Antonio Bello” retweeted

Piero Molino @w4nderlus7

Jan 29

Today we announced Bobium Brawlers, our first game. It’s a whacky sci-fi turn-based creature battler where you describe a monster, the game turns it into a brawler, and you battle it 1v1 with friends. Weird, playful, and something that could only be done with AI. Launching in 2026.

Studio Atelico @StudioAtelico

Jan 29

Announcing our first game: Bobium Brawlers, a turn-based creature battler where players invent unique monsters and battle them 1v1 with friends.

SWAP research group “Antonio Bello”@SWAP_research

14 Dec 2025

Ciao

SWAP research group “Antonio Bello”@SWAP_research

23 Mar 2025

Word Sense Disambiguation (WSD) with LLMs: we test LLMs on WSD, extending the XL-WSD benchmark to introduce 2 new subtasks:
✅ Generating the correct definition for a given word in context
✅ Selecting the correct meaning from a predefined set
Our findings? … 1/2

SWAP research group “Antonio Bello”@SWAP_research

23 Mar 2025

… Several open-weight LLMs demonstrate strong 0-shot capabilities but struggle to outperform SOTA approaches. A fine-tuned model with a medium number of parameters achieves the best performance. arxiv.org/abs/2503.08662 dataset, models & code #NLP #LLM #WSD arxiv.org/abs/2503.08662

SWAP research group “Antonio Bello”@SWAP_research

23 Mar 2025

Multimodal and Multilingual models: XVLM2VEC, a novel adaptation methodology, enhances the multilingual capabilities of 🇬🇧-trained LVLMs using Self-Knowledge Distillation. It improves embeddings in 🇫🇷 🇩🇪 🇮🇹 & 🇪🇸 while preserving 🇬🇧 performance. huggingface.co/collections/s…

LVLMs for retrieval 🇩🇪 🇫🇷 🇪🇸 🇮🇹 🇬🇧 - a swap-uniba Collection

Collection of models fine-tuned for retrieval tasks in multiple languages. It also includes the training datasets used and MMMEB!

huggingface.co

SWAP research group “Antonio Bello”@SWAP_research

23 Mar 2025

facebook.com/share/r/15SME2x…?

SWAP research group “Antonio Bello”@SWAP_research

10 Mar 2025

“Anatomy of the Tech-Industrial Complex” #MilIndComplex #MICIMATT: the genesis of Amazon, Google, Facebook, and Twitter. stylman.substack.com/p/anato…

SWAP research group “Antonio Bello”@SWAP_research

9 Mar 2025

We are grateful to @SapienzaNLP for ITA-Bench, the new evaluation suite for 🇮🇹 LLMs: iris.uniroma1.it/bitstream/1… SWAP’s LLaMAntino-ANITA-8B-Inst-DPO-ITA ranks 1st ⤵️

SWAP research group “Antonio Bello”@SWAP_research

9 Mar 2025

We are glad that @FBK_research chose SWAP’s LLaMAntino models to develop TrecMAMMA: trentinosalutedigitale.com/b…

SWAP research group “Antonio Bello”@SWAP_research

8 Mar 2025

We are proud that @expertdotai chose SWAP’s LLaMAntino-ANITA-8B-Inst-DPO-ITA to deliver SLIMER-IT (Show Less Instruct More Entity Recognition - Italian language), an LLM specifically instructed for zero-shot NER on the Italian language. huggingface.co/expertai/LLaM…

expertai/LLaMAntino-3-SLIMER-IT · Hugging Face

huggingface.co

SWAP research group “Antonio Bello” retweeted

AILC_NLP @AILC_NLP

21 Feb 2025

The first #CFP of #clicit2025 in #Cagliari is out! Paper submission deadline: 09/06/2025 #NLProc @CLiC_it_conf clic2025.unica.it/call_for_p…

SWAP research group “Antonio Bello” retweeted

CLiC-it Conference @CLiC_it_conf

28 Nov 2024

[CLiC-it SUPPORTER] We are grateful to our #clicit2024 supporters. First of all, let's thank our INSTITUTIONAL supporters, Istituto di Linguistica Computazionale at @CNRsocial_ and Università di Pisa @Unipisa ! unipi.it/ ilc.cnr.it/ #NLProc @AILC_NLP

SWAP research group “Antonio Bello”@SWAP_research

28 Nov 2024

LLaVA-NDiNO, a family of open-weight LVLMs (Large Vision-Language Models) for 🇮🇹. The models are available on HuggingFace: huggingface.co/collections/s… Descriptive article (preview): lnkd.in/eY-3YFX7 Training and testing data: huggingface.co/collections/s…

🇮🇹👓 LLaVA-NDiNO - a swap-uniba Collection

HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language"

huggingface.co

SWAP research group “Antonio Bello”@SWAP_research

28 Oct 2024

m.youtube.com/watch?v=wF6tNV…

Large Language Models Reflect the Ideology of their Creators

A detailed breakdown of the AI research paper: Large Language Model...

youtube.com

SWAP research group “Antonio Bello”@SWAP_research

28 Oct 2024

arxiv.org/pdf/2410.18417

SWAP research group “Antonio Bello”@SWAP_research

26 Oct 2024

LLaMAntino RAG outperforms GPT-4. Source: fondazione-fair.it/wp-conten…

SWAP research group “Antonio Bello”@SWAP_research

18 Oct 2024

Prompting LLMs for Tailored Exercise RecSys in Office Spaces by Gaetano Dibenedetto @m_polignano @pasqualelops @semeraro_g #SWAPresearch @ACMRecSys @FAIR

SWAP research group “Antonio Bello” retweeted

ACM RecSys @ACMRecSys

18 Oct 2024

Ciao a tutti, buongiorno! Benvenuti to Day 5 of #RecSys2024! [panzerotti chewing sounds] Ragazzi, do we have a program for you today! Filled to the rim with wonderful workshops: recsys.acm.org/recsys24/prog… Don't be sad that it will be over, be happy that [character limit reached]

RecSys 2024 - Program - RecSys

Program

recsys.acm.org
