Unlock work experiences of the future. Join @ServiceNowRSRCH as we advance the state-of-the-art in Enterprise AI. #ServiceNowResearch #LifeAtNow #Hiring

Joined July 2016
460 Photos and videos
๐Ÿš€ Grok Voice Think Fast 1.0 (@xAI) lands on the Pareto frontier on EVA-Bench โ€” no system in the eval beats it on accuracy without sacrificing experience, or vice versa. ๐Ÿ“Š Leaderboard: servicenow.github.io/eva/#reโ€ฆ @elonmusk #VoiceAgents #ServiceNowResearch #EVABenchย #GrokVoice #xAI

4
25
96
114,260
โญ ๐—˜๐—ฉ๐—”-๐—•๐—ฒ๐—ป๐—ฐ๐—ต ๐——๐—ฎ๐˜๐—ฎ ๐Ÿฎ.๐Ÿฌ: ๐Ÿฏ ๐——๐—ผ๐—บ๐—ฎ๐—ถ๐—ป๐˜€, ๐Ÿญ๐Ÿฎ๐Ÿญ ๐—ง๐—ผ๐—ผ๐—น๐˜€, ๐Ÿฎ๐Ÿญ๐Ÿฏ ๐—ฆ๐—ฐ๐—ฒ๐—ป๐—ฎ๐—ฟ๐—ถ๐—ผ๐˜€ We just published an article detailing the major expansion we have done to the data behind EVA-Bench. ๐Ÿ—‚๏ธ Data: huggingface.co/datasets/Servโ€ฆ ๐Ÿ“„ Article: huggingface.co/blog/ServiceNโ€ฆ #VoiceAgents #OpenSource #Data #AIResearch #ServiceNowResearch
8
13
654
Thrilled to see this benchmark, EnterpriseOps-Gym, featured in Jensen Huang's Computex keynote โ€” a testament to the Core AI team's work building rigorous agentic evaluation for the enterprise. ๐Ÿš€
Big moment for the whole ๐—˜๐—ป๐˜๐—ฒ๐—ฟ๐—ฝ๐—ฟ๐—ถ๐˜€๐—ฒ๐—ข๐—ฝ๐˜€-๐—š๐˜†๐—บ team. ๐Ÿš€ ๐—๐—ฒ๐—ป๐˜€๐—ฒ๐—ป announced Apriel on the K25 stage. This year, EnterpriseOps-Gym made it into his ๐—–๐—ผ๐—บ๐—ฝ๐˜‚๐˜๐—ฒ๐˜… ๐—ธ๐—ฒ๐˜†๐—ป๐—ผ๐˜๐—ฒ. ๐Ÿ™‚ NVIDIA announced ๐—ก๐—ฒ๐—บ๐—ผ๐˜๐—ฟ๐—ผ๐—ป ๐Ÿฏ ๐—จ๐—น๐˜๐—ฟ๐—ฎ, the largest Nemotron 3 model to date, with 550B parameters / 55B active ..and it was incredibly exciting to see results on our benchmark featured in the keynote. Enterprise agents need benchmarks that go beyond static QA, ones that test: - long-horizon planning - reliable tool use - stateful workflows - enterprise realism and ability to act without breaking things downstream That is exactly why we built EnterpriseOps-Gym. Huge congrats to the Nemotron team for building agentic capability at the frontier level of intelligence and cost. More to come. ๐Ÿ”ฅ @NVIDIAAI @nvidia @ServiceNowRSRCH @ServiceNowNews @sagardavasam @shiva_malay @PShravannayak enterpriseops-gym.github.io/
7
853
ServiceNow AI Research retweeted
MosaicLeaks is now on arXiv. The Mosaic Effect captures a simple idea: small fragments can look harmless alone, but become revealing in aggregate. Deep research agents can leak enterprise information in exactly this way. 1/9
1
2
6
1,200
๐Ÿ† ACM CAIS 2026 Best Paper Award! "Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain" was honored at the first conference dedicated to agentic systems ๐Ÿ™Œ Read the story ๐Ÿ‘‡ servicenow.com/in/workflow/aโ€ฆ
1
1
4
545
๐Ÿ“„ Full paper: arxiv.org/pdf/2510.05159 Huge congrats to the team: @LeoBoisvert, @AbhayPuri98, Chandra Kiran Reddy Evuru, Nazanin Sepahvan, @alexandredrouin, @alex_lacoste_, @NicolasChapados, Quentin Cappart, @jstanl, @DjDvij ๐Ÿ‘

2
1
367
Proud to share the latest from our Core AI colleagues! โ€” rethinking what world models should (and shouldn't) be for enterprise agents. ๐Ÿ‘‡
๐ŸšจThe World Model Myth in Enterprise Systems๐Ÿšจ Everyoneโ€™s obsessed with world models for agents. But in enterprise systems? The entire "world" (rules, workflows, transitions) is readable at runtime. No hidden dynamics. We asked the dangerous question: Do enterprise systems need learned world models?
1
1
7
1,423
ServiceNow AI Research retweeted
Our first vLLM V0โ†’V1 run on PipelineRL looked broken. @ehsk0 and I almost reached for an objective-side correction. That would have been the wrong fix. The real problem: four mismatches in the rollout backend. ๐Ÿงต
1
4
7
2,489
ServiceNow AI Research retweeted
Boris Cherny, Head of Claude Code, on the importance of ServiceNow $NOW

95
44
550
88,379
ServiceNow AI Research is heading soon to Seoul with 4 papers accepted at #ICML2026! ๐ŸŽ‰ Our work spans retrieval, multimodal forecasting, knowledge distillation, and diffusion language models. Congrats to all the researchers involved! ๐Ÿ‘
1
1
14
2,396
Here are the 4 papers accepted at #ICML2026! ๐ŸŽ‰๐Ÿ‡ฐ๐Ÿ‡ท ๐Ÿ“„ Hierarchical Retrieval at Scale ๐Ÿ“„ Overcoming the Modality Gap in Context-Aided Forecasting ๐Ÿ“„ Privileged Information Distillation for Language Models ๐Ÿ“„ DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone See you in Seoul! icml.cc/

1
3
1,518
(1/8) ๐Ÿš€ Introducing Super Apriel: One Checkpoint, Many Speeds Train once โ†’ serve at any speed-quality tradeoff We release: โœ“ 15B supernets with 4 mixers/layer โœ“ Training code (Fast-LLM) โœ“ vLLM serving extension ๐Ÿงต How it works โ†“
1
10
32
10,553
(7/8) ๐ŸŽ Bonus: speculative decoding for free Use all-FA as verifier, fast preset as draft Same checkpoint โ†’ up to 2.75ร— speedup, no quality loss ๐Ÿ“„ Paper: arxiv.org/pdf/2604.19877 ๐Ÿค— Models: huggingface.co/ServiceNow-AIโ€ฆ ๐Ÿ’ป Code: github.com/ServiceNow/Fast-Lโ€ฆ

1
1
4
1,560
(8/8) ๐Ÿ™ Amazing team that made this possible: @ostap__alex @Ray97369304 @tscholak @alirezamh_ Aman Tomar Shruthan Radhakrishna Denis Kochetkov Joel Lamy-Poirier @nandahkrishna @muchomuchacho @SathwikTejaswi Srinivas Sunkara @vbecaert
1
7
1,458
ICLR 2026 is underway in Rio de Janeiro ๐ŸŒŽ The ServiceNow AI Research team has 7 papers accepted across the main conference, workshops, and blogpost track โ€” covering AI safety, enterprise benchmarking, computer-use agents, and more. Attending ICLR? Go catch their sessions! ๐Ÿ‘‡ #ICLR2026 #AIResearch #MachineLearning
1
2
1,592
๐Ÿ“„ No, Of Course I Can! โ€“ Deeper Fine-Tuning Attacks That Bypass Token-Level Safety Mechanisms ๐Ÿ“„ DRBench: A Realistic Benchmark for Enterprise Deep Research ๐Ÿ“„ Grounding Computer Use Agents on Human Demonstrations ๐Ÿ“„ Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency โ€” Workshop ๐Ÿ“„ Overcoming the Modality Gap in Context-Aided Forecasting โ€” Workshop ๐Ÿ“„ CUA-Suite: Expert Trajectories and Pixel-Precise Grounding for Computer-use Agents โ€” Workshop ๐Ÿ“„ Destruction is a General Strategy to Learn Generation โ€” Blogpost track
1,222