Judgment Labs

Judgment Labs

9 Photos and videos

Tweets

Pinned Tweet

Judgment Labs

@JudgmentLabs

May 12

Today is a special day.

Alex Shan

@alexshander03

May 12

We’re launching @JudgmentLabs today and announcing $32M in funding. As AI agents take on more of the work that creates economic value, they generate massive amounts of production data: the clearest record of how they behave with users, software, and the real world. Judgment builds infrastructure for improving AI agents from production data.

2:00

299

2,319,775

Judgment Labs

Judgment Labs

@JudgmentLabs

Jun 5

Bay to Breakers is the city's biggest footrace... And Judgment took it over. 🟠 30 of us. 7.46 miles. This is the Summer of Judgment. Text "JL" to (628) 888-7594 to get on the list for our next event.

1:51

78,713

Enyu

Judgment Labs retweeted

Enyu

@0xhappier

Jun 5

We ran Bay to Breakers. 30 of us. 7.46 miles. One Sunday morning in San Francisco.

ALT Judgment Labs

29,672

Shun

Judgment Labs retweeted

Shun

@22uenos

May 28

Had a lot of fun making these visuals :) Made in @paper with Claude

Judgment Labs

@JudgmentLabs

May 28

We built Agent Judge to evaluate long-horizon agents. As agents take on longer tasks, the evidence needed to evaluate them gets buried across tool calls, retries, logs, database updates, and final outputs. Evaluating these agents requires investigating the trajectory, not just judging the final answer.

127

66,536

Judgment Labs

Judgment Labs

@JudgmentLabs

May 28

74,629

more replies

Judgment Labs

Judgment Labs

@JudgmentLabs

May 28

Agent behavior changes as models, tools, products, and user workflows change. That means the rubric used by the judge has to improve from production data, so it keeps evaluating the behaviors that matter. Rubric Builder turns feedback into concrete rubric updates.

721

Judgment Labs

Judgment Labs

@JudgmentLabs

May 28

The core idea: long-horizon agent evals should be done by agents, not simple LLM judges. Agent Judge searches trajectories, verifies stateful actions, and adapts rubrics from production feedback. judgmentlabs.ai/blogs/agent-…

Agent Judge: Solving Long-Context Evals for Production Agents

Why production agent evals need agentic judges that can search, verify, and adapt.

judgmentlabs.ai

4,833

Judgment Labs

Judgment Labs

@JudgmentLabs

May 27

Thrilled to be recognized by @Redpoint as one of the most promising private AI Infrastructure companies. More exciting news to come!

Redpoint @Redpoint

May 27

The Redpoint InfraRed 100 is now live. These are the companies building the infrastructure that powers everything happening in AI right now, from world models and agent runtimes to the sandboxes, databases, and security tools agents depend on. Congratulations to this year's honorees! Read the full 2026 InfraRed Report: our state of the union on AI and cloud infrastructure 👉 redpoint.com/reports/the-inf…

2,868

Judgment Labs

Judgment Labs retweeted

Judgment Labs

@JudgmentLabs

May 18

Replying to @tbpn

@tbpn thanks for having our team on! P.S. it's Judgment 🧡

0:30

256,231

Judgment Labs

Judgment Labs

@JudgmentLabs

May 16

Scoops will start flying @ 1pm (500 Marina Blvd) Tomorrow @Baytobreakers Finish Line The Flavors: - Fudgement @JudgmentLabs - Berry Brex-fast @brexHQ - Modal Green Tea @modal - Claude au Lait @claudeai - Vercel Road @vercel - Ando Apple Pie @andocorporation

Judgment Labs

@JudgmentLabs

May 16

Summer of Judgment! 2 days left for free ice cream...

0:26

2,706

Judgment Labs

Judgment Labs

@JudgmentLabs

May 16

Summer of Judgment! 2 days left for free ice cream...

0:26

315,311

Philip Kiely

Judgment Labs retweeted

Philip Kiely

@philipkiely

May 15

Great inference requires a great model Great models require great data Great data requires capturing what actually happens in production Enjoyed chatting with the @JudgmentLabs team about everything from agents to GTM strategies (ice cream is surprisingly high ROI)

4,551

Emily Lonetto

Judgment Labs retweeted

Emily Lonetto

@EmilyLonetto

May 15

Naturally Casper needed to check out @JudgmentLabs in South Park

742

Brex

Judgment Labs retweeted

Brex

@brexHQ

May 15

Replying to @alexshander03 @lightspeedvp

Brex-fast is the most important meal of the day

2,128

Judgment Labs

Judgment Labs

@JudgmentLabs

May 15

what handsome guys

Aaron Makelky

@theaaron

May 15

only in SF: a @JudgmentLabs wrapped van handing out free ice cream & the flavors are named after the tech companies they parked it in front of @0xhappier

0:11

1,146

Descript

Judgment Labs retweeted

Descript

@descript

May 14

let us get some of that Descript Red Velvet flavor next time @JudgmentLabs 🍦

Enyu

@0xhappier

May 14

The scoops are flying! @AbridgeHQ @modal @descript pulled up 🍦

0:05

2,352

Judgment Labs

Judgment Labs

@JudgmentLabs

May 14

sweet tooth!

ChinesePowered.com @ChinesePowered

May 14

Third day in a row for ice cream @JudgmentLabs

983

ᴡɪꜱᴅᴏᴍɪᴇʟ De video editor

Judgment Labs retweeted

ᴡɪꜱᴅᴏᴍɪᴇʟ De video editor @Wisdomiel

May 14

This is the kind of IRL marketing the space needs 😂🍦 “Fudgement” is a top-tier flavor name ngl. SF really does feel alive again 🚀

Alex Shan

@alexshander03

May 14

Summer of Judgment continues. We're giving away free icecream in SF for the rest of the week! Check out our schedule at judgmentlabs.ai/icecream Today, we'll be in the Mission from 11am-3pm, right outside of the Modal headquarters in SF (375 Alabama St) Today's Flavors: - Modal Green Tea (@modal) - Together Berry Breakfast (@togethercompute) - Roxy Road (@rox_ai) - Abridge Apple Pie (@AbridgeHQ) - Fudgement (@JudgmentLabs) SF IS SO BACK 🚀

1,269

Ipshita Agarwal

Judgment Labs retweeted

Ipshita Agarwal

@agarwalipshita

May 14

Huge congrats to the @JudgmentLabs team on the launch! killer team working on an important problem

Alex Shan

@alexshander03

May 12

2:00

4,018