We’re extending our Legal Agent Benchmark (LAB) to in-house contracting.
This adds 500 new tasks focused on the contract work in-house legal teams do every day, spanning tasks like drafting, redlining, and escalation, and 90 contract types across 10 areas such as financing, IP, M&A, and employment.
1) Why contracting matters
Contracts are the operating layer of the enterprise. They define how companies sell, buy, hire, partner, finance, license IP, share data, and manage risk.
In-house legal teams handle the high-stakes work behind those contracts. This includes drafting agreements, negotiating redlines, managing fallback positions, and escalating risks.
All of these activities, taken together, are contracting.
2) How LAB models contracting
Each task has four parts:
Environment: all context available at a point in the negotiation, including a data room, playbooks, guidance, emails, and prior negotiation turns.
Instruction: a short next-step prompt, like draft the agreement, respond to a redline, or prepare an issues list.
Output: the artifact that moves the negotiation forward.
Verification: expert rubrics grade the agent's output against the appropriate next step, including what the agent edited, accepted, rejected, negotiated, or escalated.
See
@gabepereyra’s article for a walkthrough of an example LAB task: responding to a counterparty redline of an MSA that has already been heavily negotiated.
3) What comes next
Today, LAB measures how agents handle each stage of a contract negotiation as a discrete step.
Next, we’ll expand across more contract types, more negotiation states, and interactive benchmarks where agents must decide whether to negotiate, escalate, or walk away.
Read more from our cofounder
@gabepereyra below.