Scale AI

Scale AI

589 Photos and videos

Tweets

Scale AI

@scale_AI

Jun 11

We spoke with over 50 healthcare professionals to understand what they need from AI. The conversations came down to one thing: trust. Our latest healthcare findings: labs.scale.com/blog/healthca…

57 Healthcare Professionals Told Us What They Need from AI

We surveyed 57 healthcare professionals about what they actually want from AI. Their answers point to three capability gaps that current evaluations miss.

labs.scale.com

1,517

Scale Labs

Scale AI retweeted

Scale Labs

@ScaleAILabs

Jun 1

Today we're releasing HiL-Dynamics, the first open-source tool that measures how production agents actually collaborate with humans under uncertainty. Not just whether they got the answer. Now you can measure exactly when your agent asks for help, when it makes assumptions, and when it'll confidently ship the wrong answer. Our findings 🧵

3,827

Scale AI

Scale AI

@scale_AI

May 26

To understand our story, you have to go back to the beginning. It started with self-driving cars. Ten years later, it's the architecture underneath AI that actually works, across frontier labs, enterprises, governments, and mission-critical systems around the world.

1:23

5,464

Philip de Guzman

Scale AI retweeted

Philip de Guzman

@PhilipofGuzman

May 20

The humans stay. That’s the idea behind @scale_ai's new brand campaign. 10 years of building AI has taught us something: the most important decisions belong to humans. The AI that works in decisions of consequence keeps humans at the center. Going live in SF and NYC. Where to next? 👀

8,689

Scale AI

Scale AI

@scale_AI

May 18

The future runs on proof. 😤

0:53

7,593

Scale AI

Scale AI

@scale_AI

May 14

🚨 JUST IN: Scale AI milestone incoming. Stay tuned.

7,823

Scale AI

Scale AI

@scale_AI

May 14

It's our birthday. 🎂 scale.com/blog/ten-years-of-…

Ten Years of Scale | Building Reliable AI Systems Since 2016

In Scale AI’s 10th anniversary blog, CEO Jason Droege reflects on the company’s journey from pioneering AI data infrastructure to powering reliable AI systems for enterprises, governments, and...

scale.com

3,433

Scale AI

Scale AI

@scale_AI

May 14

This month we turn 10. The hard work started in 2016, and it hasn’t stopped. Shortcuts are for losers. Winners welcome. scale.com/careers

0:41

118

56,290

Scale Labs

Scale AI retweeted

Scale Labs

@ScaleAILabs

May 7

Today we’re releasing Refactoring, the final leaderboard of our SWE Atlas suite. This new leaderboard is the ultimate test of an agent's ability to restructure code without breaking the system. Claude Opus 4.7 with Claude Code takes the top spot🥇

679

106,007

Scale AI

Scale AI

@scale_AI

May 6

Proud to share @CDAODoW has expanded its enterprise agreement with Scale AI raising the ceiling from $100M to $500M. This expansion reflects our continued commitment to accelerating the adoption of AI capabilities across the Pentagon to help America stay prepared, resilient, and strong. scale.com/blog/Scale-ai-pent…

Scale AI Expands Pentagon AI Partnership to $500 Million

The Pentagon's CDAO has expanded its enterprise agreement with Scale from $100M to $500M, giving any DoW component streamlined access to Scale's full AI platform.

scale.com

4,060

Jason Droege

Scale AI retweeted

Jason Droege

@jdroege

May 6

AI pretenders vs. AI contenders. It's those who still haven’t realized reliability is the product vs. those who can deliver reliability and outcomes. That's what the enterprise AI race comes down to. Here's a note I sent the Scale team this week.

Jason Droege

@jdroege

May 6

x.com/i/article/205204886920…

19,248

Scale Labs

Scale AI retweeted

Scale Labs

@ScaleAILabs

May 4

We recently built HiL-Bench, the first benchmark to test a critical question: do AI agents know what they’re missing and when to ask? Frontier models perform well with perfect specs. But remove a few key details, and they confidently guess and ship plausible wrong answers. We just added GPT-5.5, Opus 4.7, and Kimi K2.6 to the leaderboard. Here’s what we’re seeing ⬇️🧵

654

79,633

Scale AI

Scale AI

@scale_AI

Apr 21

Scale AI has acquired ICG Solutions, a defense technology firm specializing in real-time streaming data analytics. This is another step forward in how we support the U.S. defense and intelligence community with AI systems built to serve America’s most important national security missions. scale.com/blog/scale-acquire…

Scale AI Acquires ICG Solutions | National Security AI

Scale AI acquires ICG Solutions to provide the U.S. Department of War and the broader Intelligence Community the most advanced, reliable AI infrastructure available.

scale.com

7,922

Scale AI

Scale AI

@scale_AI

Apr 20

New @ScaleAILabs Research: Your AI agent just gave you an answer but did it actually solve the problem, get lucky, or just sound right? Today’s benchmarks can’t tell. We built HiL-Bench (Human-in-Loop Benchmark) to test a critical skill: does your agent know what it’s missing and when to ask for clarification? 🧵

9,010

more replies

Scale AI

Scale AI

@scale_AI

Apr 20

Key takeaway for model builders: capability and judgment are orthogonal axes. Scaling SWE-Bench alone won't close this. Current post-training doesn’t penalize an agent for confidently solving the wrong problem. Ask-F1 is the first verifiable signal that does, and it transfers across domains. The goal isn't full autonomy. It's selective escalation: agents that know what they don't know.

2,166

Scale AI

Scale AI

@scale_AI

Apr 20

Paper: static.scale.com/uploads/67a… Data: huggingface.co/datasets/Scal… Leaderboard: labs.scale.com/leaderboard/h… Code & Harness: github.com/hilbenchauthors/h…

1,914