We’re excited to share our new work at CVPR 2026, Understanding Reward Hacking in Text-to-Image Reinforcement Learning. Reinforcement learning is becoming an increasingly important tool for post-training text-to-image (T2I) generation models. But as we optimize these models with learned rewards, an important question arises: are reward models truly improving generation quality, or are they creating new ways for models to game the objective? In this work, we take a closer look at reward hacking in T2I RL post-training. We study a range of reward designs, including aesthetic and preference rewards, prompt-image consistency rewards, and multi-reward ensembles. Our analysis shows that models can easily over-optimize a single reward, and that this happens across reward setups. A human preference reward may push generations toward exaggerated colors or superficial appeal, while a prompt-image consistency reward may improve alignment at the cost of realism and structure. Even combining multiple rewards only partially mitigates the issue. To address this, we introduce ArtifactReward, a lightweight artifact-aware reward trained on a small curated dataset of artifact-free and artifact-containing samples. ArtifactReward can be integrated into existing T2I RL pipelines as a simple safeguard, improving realism and reducing reward hacking across multiple reward configurations. Paper: arxiv.org/pdf/2601.03468 Code: github.com/yq-hong/ArtifactR… Poster Session: June 6, 7:30am ExHall A Many thanks to our amazing team: Yunqi Hong @yyqq_hong, Kuei-Chun Kao @KueiChunKao, Hengguang Zhou @hgzhou42, and Cho-Jui Hsieh @cho_jui_hsieh. #CVPR2026 #TextToImage #ReinforcementLearning #RewardHacking #GenerativeAI #UCLA #TurningPointAI
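To make the "simple safeguard" idea concrete, here is a minimal sketch of how an artifact-aware score might be added to existing task rewards in an RL loop. The post does not say how ArtifactReward is combined with other rewards, so the weighted sum, the function name `combined_reward`, the `artifact_weight` parameter, and the toy scorers are all assumptions for illustration, not the paper's method.

```python
from typing import Callable, Sequence

# Hypothetical scorer type: maps (prompt, image) to a scalar reward.
Scorer = Callable[[str, object], float]


def combined_reward(
    prompt: str,
    image: object,
    task_scorers: Sequence[Scorer],
    artifact_scorer: Scorer,
    artifact_weight: float = 1.0,
) -> float:
    """Average the task rewards, then add a weighted artifact-aware term.

    Assumption: the artifact scorer returns higher values for cleaner images,
    so adding it penalizes generations that game the task rewards with artifacts.
    """
    if not task_scorers:
        raise ValueError("at least one task scorer is required")
    task = sum(s(prompt, image) for s in task_scorers) / len(task_scorers)
    return task + artifact_weight * artifact_scorer(prompt, image)


# Toy stand-ins for learned reward models (hypothetical values).
preference = lambda prompt, image: 0.9   # looks appealing
consistency = lambda prompt, image: 0.7  # matches the prompt
artifact = lambda prompt, image: -0.5    # visible artifacts detected

score = combined_reward("a red bicycle", None, [preference, consistency], artifact)
```

With these toy values the task average is 0.8 and the artifact term pulls the total down, which is the intended safeguard behavior under the stated assumptions.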
address at the 2026 India AI Impact Summit, was also presented in sign language through the use of AI technology. This initiative connects the spirit of “Sabka Saath, Sabka Vikas” with technological innovation, further strengthening the vision of an inclusive and accessible Digital India. #TurningPointAI #ModiOnAI #BharatAI #IndiaAIImpactSummit2026 #IndiaAISummit2026 @narendramodi
🚨Exciting news from our @TurningPointAI team: the very first "aha moment" on multimodal reasoning during RL on a 2B base (non-instruct) model! 📎Blog (observations): turningpointai.notion.site/t… 📎Code for our VisualThinker-Zero-2B: github.com/turningpoint-ai/V… Key findings👇
🚀 We’re excited to share our latest work! Welcome to the first successful "aha moment" in multimodal reasoning. The "aha moment" is characterized by increased response length and improved performance. It emerges during RL on an unaligned base model for multimodal tasks. The aha moment for language reasoning was originally observed in DeepSeek-R1-Zero. 🔍 Key Findings: 1. Directly applying GRPO to an unaligned 2B base model can elicit the multimodal “aha moment”: a thinking capability marked by spontaneous reasoning strategies and increased reasoning length. 2. Vision-centric tasks can benefit from long chains of thought. 💻 Discover more on our Notion blog and project page! Detailed Research Blog: Follow our complete journey and technical insights on our Notion blog: 🔗turningpointai.notion.site/t… Reproduce Our Results: Access and build upon our implementation on GitHub: 🔗github.com/turningpoint-ai/V… Presented by: TurningPointAI Team 🔗turningpoint-ai.com/ #turningpointai #Smallmodel #MultimodalR1 #DeepseekR1 #R1 #Deepseek #AI #MultimodalReasoning #Qwen #QwenVL #DeepSeekR1zero
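For readers unfamiliar with GRPO, the core step is scoring each sampled response relative to the other responses drawn for the same prompt. The sketch below shows only that group-relative normalization. It is not taken from the VisualThinker-Zero code, and the function name, the `eps` value, and the choice of population standard deviation are assumptions.

```python
import statistics
from typing import List, Sequence


def group_relative_advantages(rewards: Sequence[float], eps: float = 1e-6) -> List[float]:
    """Normalize rewards within one group of responses to the same prompt.

    Each advantage is (reward - group mean) / (group std + eps), so responses
    that beat their group get a positive signal without a learned value model.
    Assumption: population standard deviation; implementations may differ.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


# Four sampled answers to one visual question: 1.0 = correct, 0.0 = wrong.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Correct answers receive positive advantages and wrong ones negative, and a group whose rewards are all equal yields zero advantage for every response.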
28 Feb 2025
Experience the first true multimodal "aha moment" in 2B models with us! Excited for future research pushing the boundaries of higher intelligence. 🚀 #AI #TurningPointAI #deepseekai #deepseekr1 #GenAI
🚨Breaking insights! With the first multimodal-LLM oversensitivity benchmark, we showed that the safest and most powerful Multimodal-LLMs can be unnecessarily alarmed by safe queries. Follow our journey at @TurningPointAI, where I serve as the project lead.
We made Multimodal LLMs safe, but have they also become oversensitive? "Every time I try, it uses all tokens just refusing." - @artilectium "This isn’t safety. It's a nanny state." - @krishnanrohit Concerned that AI safety has gone too far? You’re not alone! Explore MOSSBench by TurningPointAI: the first test suite assessing whether current MLLMs falsely reject benign queries. Our findings reveal: 🔍 Some of the safest models, like Claude-3 Opus and Gemini-Pro, reject ~70% of benign queries. 🧠 MLLMs’ overprotective behavior resembles human cognitive distortions. Discover more in our new paper presented by TurningPointAI: turningpoint-ai.github.io/MO… #turningpointai #ArtificialIntelligence #AINews #WokeAI #safety #alignment #LLM #VLM #GPT #Gemini #Claude #Psychology #CBT #mentalhealth
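A figure like "~70% of benign queries rejected" is a refusal rate: the fraction of model responses to benign queries that are refusals. The sketch below shows that arithmetic with a crude keyword check. The post does not describe how MOSSBench judges refusals, so the marker list and the function names `is_refusal` and `false_refusal_rate` are hypothetical.

```python
from typing import Sequence

# Hypothetical refusal markers; a real evaluation would likely use a stronger judge.
REFUSAL_MARKERS = ("i can't", "i cannot", "i'm sorry", "i am unable")


def is_refusal(response: str) -> bool:
    """Return True if the response contains any refusal marker."""
    text = response.lower().replace("\u2019", "'")  # normalize curly apostrophes
    return any(marker in text for marker in REFUSAL_MARKERS)


def false_refusal_rate(responses: Sequence[str]) -> float:
    """Fraction of responses to benign queries that are refusals."""
    if not responses:
        raise ValueError("responses must not be empty")
    return sum(is_refusal(r) for r in responses) / len(responses)


# Toy responses to four benign image-and-text queries.
benign_responses = [
    "I'm sorry, but I can't help with that.",
    "Sure, the sign in the photo says 'Exit'.",
    "I cannot assist with this request.",
    "Here is a short description of the image.",
]
rate = false_refusal_rate(benign_responses)
```

Two of the four toy responses are refusals, so the rate here is 0.5.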
Want to make #AIGC #LLM more controllable? How to build embodied agents from #LLM and #VLM, or a #jailbreak agent as a hacker? How do we predict and interpret #GenAI output? Are your models safe or #oversensitive? Follow @TurningPointAI for exciting research on #MultimodalAgent!
2 Jul 2024
Tired of LLMs refusing your questions? Check out this recent study and benchmark on when multimodal LLMs become oversensitive to your questions! Datasets are now available at turningpoint-ai.github.io/MO… . Follow @TurningPointAI for more exciting AIGC research.

Excited to share our latest paper! We find that as MLLMs become safer, they also become oversensitive and consistently reject benign queries. This highlights the need for more calibrated safety alignment. Follow our team @TurningPointAI for more papers on multimodal agents.