Salesforce AI Research

Tweets

Pinned Tweet

Salesforce AI Research

@SFResearch

11 Sep 2025

Looking for the cutting-edge of AI research? Follow Salesforce AI Research to see how we're transforming enterprise technology through advanced innovations. From world models to agentic systems, discover the future of AI before it hits the market.

Follow Salesforce AI Research

x.com

434

2,444,223

Salesforce AI Research

@SFResearch

Jun 9

Model cards are nutrition labels for AI. Now they include environmental impact. 🌱 @Salesforce is adding standardized energy and carbon metrics to its AI model cards: sforce.co/4umu8qm Salesforce AI Research worked with the Impact team to embed these estimates into the standard model evaluation workflow, so a model's footprint is measured alongside its performance. They cover energy use and emissions across pre-training, post-training, and inference, using the AI Energy Score methodology. The Environmental Impact section is live now in the model cards for First Name Match, Account Match, and TextEval. Browse them on the Salesforce Trust site: sforce.co/4eaIwMu #ResponsibleAI #Sustainability #FutureOfAI

Measuring AI’s Environmental Impact: How We’re Operationalizing Transparency Through Model Cards

Today, Salesforce is expanding its AI model cards with standardized environmental impact metrics. This update helps customers better understand the energy

salesforce.com

418

Salesforce AI Research retweeted

Ziyang Wang

@ZiyangW00

Jun 4

Excited to be at #CVPR2026 this week and to present my internship work with @SFResearch:
Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding
In this work, we study how multimodal agents can actively reason over long videos by iteratively seeking the most relevant evidence, rather than passively processing all video content at once.
If you’re attending CVPR, feel free to stop by our poster!
📍 #245, Findings Posters, ExHall A
📅 Sunday, June 7
🕢 7:30 – 9:00 AM
Project page: activevideoperception.github…
Looking forward to connecting at CVPR!
#CVPR2026 #ComputerVision #MultimodalAI #VideoUnderstanding #AIAgents

1,117

Salesforce AI Research

@SFResearch

Jun 3

(1/8) Can Language Models Remember What They Learn? LLMs learn from feedback. But most post-training is amnesiac: rollout → reward → update → forget. What if you keep the signal? Procedural Memory Distillation (PMD): learning from experience, not just feedback. 🧵

1,601

Salesforce AI Research

@SFResearch

Jun 3

(7/8) PMD doesn't give models a permanent notebook. It lets them use one while learning, absorb the useful lessons into their weights, and move on. Every training step contains signals about what works and what fails. Most methods throw it away. 📄 Paper: sforce.co/4dXlVTE

Procedural Memory Distillation

Online Reflection for Self-Improving Language Models

self-evolving-agents.salesforceresearch.ai

203

Salesforce AI Research

@SFResearch

Jun 3

(8/8) Authors: Ye Liu @YeLiu918, Srijan Bansal @SrijanBansal1, Bo Pang @bo_pang0, Yang Li, Zeyu Leo Liu @ZEYULIU10, Yifei Ming @ming5_alvin, Zixuan Ke @KeZixuan, Shafiq Joty @JotyShafiq, and Semih Yavuz @semih__yavuz. 📝 Blog: sforce.co/4dAjQOu

Can Language Models Remember What They Learn?

Post-training methods (RLVR, on-policy distillation) are episode-local
Language models are getting better at learning from feedback during post-training. In reinforcement learning with verifiable...

salesforce.com

268

Salesforce AI Research

@SFResearch

Jun 2

The 6th Multimodal Algorithmic Reasoning Workshop at #CVPR2026 is Thursday (6/4) morning 🗓️ sforce.co/4ueOT7j
Bringing together researchers across academia and industry to explore advances in multimodal reasoning, foundation models, agentic reasoning, and the future of intelligent reasoning systems.
Keynote speakers:
🔹 Juan Carlos Niebles @jcniebles, Salesforce AI Research
🔹 Jiayuan Mao, U. of Pennsylvania
🔹 Melanie Mitchell, Santa Fe Institute
🔹 Jialong Wu, Tsinghua University
Room 601, Colorado Convention Center | 8:55 AM – 12:30 PM MDT
Thanks to Honglu Zhou @zhou_honglu (Research Scientist at @SFResearch) and sponsors @merl_news and @ElorianAI

909

Salesforce AI Research

@SFResearch

May 28

Can Language Models Remember What They Learn? Introducing Procedural Memory Distillation (PMD): sforce.co/4dAjQOu PMD turns model attempts into reusable training memory, conditions a self-teacher on it, and distills the guidance into the student's weights.

Can Language Models Remember What They Learn?

Post-training methods (RLVR, on-policy distillation) are episode-local
Language models are getting better at learning from feedback during post-training. In reinforcement learning with verifiable...

salesforce.com

3,269
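The three steps named in the tweet above (store attempts as memory, condition a self-teacher on that memory, distill the teacher into the student) can be illustrated with a toy sketch. This is not the paper's algorithm: the single-step task, the `run_toy_loop` function, the dictionary memory, and the plus or minus 1 logit bonus are all assumptions made up for illustration.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def run_toy_loop(num_actions=4, target=2, steps=50, rollouts=8, lr=1.0, seed=0):
    """Toy loop: attempts -> memory -> memory-conditioned teacher -> distillation.

    Hypothetical setup: the 'task' is picking one correct action (`target`),
    and the student is a softmax policy over `num_actions` logits.
    """
    rng = random.Random(seed)
    student_logits = [0.0] * num_actions
    memory = {}  # action -> True if it ever succeeded, False if it only failed

    for _ in range(steps):
        probs = softmax(student_logits)

        # 1) Roll out the student and keep the outcomes instead of discarding them.
        for action in rng.choices(range(num_actions), weights=probs, k=rollouts):
            memory[action] = memory.get(action, False) or (action == target)

        # 2) Self-teacher: the same policy, conditioned on memory through an
        #    assumed logit bonus (+1 remembered success, -1 remembered failure).
        bonus = [
            0.0 if a not in memory else (1.0 if memory[a] else -1.0)
            for a in range(num_actions)
        ]
        teacher = softmax([z + b for z, b in zip(student_logits, bonus)])

        # 3) Distill: one gradient step on cross-entropy(teacher, student)
        #    with respect to the student logits, whose gradient is (probs - teacher).
        student_logits = [
            z + lr * (t - p) for z, t, p in zip(student_logits, teacher, probs)
        ]

    return softmax(student_logits)

final_probs = run_toy_loop()
```

In this sketch the memory is only consulted by the teacher during training; the returned student policy uses its logits alone, which mirrors the idea of absorbing lessons into weights rather than keeping a permanent notebook.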

Salesforce AI Research

@SFResearch

May 28

Accepted to #ICML2026: MFCL-Audio — a benchmark for voice agents that have to call tools. Real speech is messy. Accents, background noise, mumbling, and "wait, what did I say?" moments all break tool calls. MFCL-Audio measures how badly, across 6.2K tasks. Authors: Huanzhi Mao, Aditya Ghai, Imra Dawoodani, Tony Ginart, Shishir G. Patil, John Emmons, Joseph E. Gonzalez #FutureOfAI #EnterpriseAI #VoiceAgents

1,104

Salesforce AI Research

@SFResearch

May 26

📣 Counterparty Modeling is Not Strategy: The Limits of LLM Negotiators sforce.co/3RTTU7Q
New research finds LLM agents can model a negotiating partner's preferences accurately, but don't reliably turn that knowledge into strategic bargaining.
➡️ Asymmetric information backfires: giving sellers the buyer's preferences raised buyer utility while seller utility fell
➡️ Agents accurately read the room early on, but fail to convert this social understanding into reciprocal, multi-turn exchange
➡️ Final deals are driven by opening price anchors rather than latent utility structure
➡️ Forcing explicit give/ask trade plans doesn't close the gap, proving that a model's ability to reason about a variable doesn't mean it can execute it in interaction.
The problem isn't that models fail to read the room; they form accurate early beliefs about the opponent. The breakdown is downstream: they fail to convert social understanding into multi-turn strategic execution. Showing a capability in a reasoning trace does not mean the model can deploy it in sequential interaction.
Authors: Romain Cosentino @Rom_Cosentino, Sarath Shekkizhar @shekkizh, Adam Earle, Silvio Savarese @silviocinguetta
#FutureOfAI #EnterpriseAI #AgenticAI

877

Salesforce AI Research

@SFResearch

May 22

1/5 RLVR trains LLMs with pass/fail rewards — but every near-miss rollout is wasted. What if models could actually *learn* from their mistakes? New paper: "Learning from Language Feedback via Variational Policy Distillation" Read: sforce.co/4uv2f0k 🧵👇

14,423

Salesforce AI Research

@SFResearch

May 22

4/5 Tested on 3 model families (Qwen3-4B/8B, Llama-3.1-8B) across code generation (LiveCodeBench) and scientific reasoning (SciKnowEval):
✅ Consistent gains over GRPO and self-distillation baselines
✅ Stable training where prior methods collapse
✅ Best gains on domains with rich error signals (code, science)

485

Salesforce AI Research

@SFResearch

May 22

5/5 Binary rewards leave information on the table. Teaching models to interpret why they failed, not just that they failed, unlocks a complementary learning signal. Paper: sforce.co/4uv2f0k Authors: Yang Li @YangL95, Erik Nijkamp @erik_nijkamp, Semih Yavuz @semih__yavuz, Shafiq Joty @JotyShafiq

Learning from Language Feedback via Variational Policy Distillation

Reinforcement learning from verifiable rewards (RLVR) suffers from sparse outcome signals, creating severe exploration bottlenecks on complex reasoning tasks. Recent on-policy self-distillation...

arxiv.org

463
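One way to see why pass/fail rewards can waste near-miss rollouts, as the thread above argues, is to look at group-relative advantages of the kind GRPO uses: each reward is centered on the group mean and scaled by the group standard deviation. The sketch below is a minimal illustration of that normalization only, not the paper's method; the function name and the `eps` constant are assumptions.

```python
from statistics import mean, pstdev

def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantages: (reward - group mean) / (group std + eps)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# Four rollouts for one prompt that all fail the verifier, even if some
# were near misses: every advantage is zero, so there is no gradient signal.
all_failed = group_relative_advantages([0, 0, 0, 0])

# One pass among four rollouts: the passing rollout gets a positive
# advantage and the failures get negative ones.
mixed = group_relative_advantages([1, 0, 0, 0])
```

With a binary verifier, a group where every rollout fails contributes nothing to the update regardless of how close any attempt came, which is the gap that language feedback is meant to fill.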