Harvey publishes the first open-source benchmark for legal AI agents — 1,200 tasks across 24 practice areas, backed by OpenAI, Anthropic, Mistral and DeepMind. Enter, a São Paulo startup, triples its valuation to $1.2 billion in eight months on a $100M raise from Founders Fund, Sequoia and Ribbit — becoming Latin America's first AI unicorn. Slaughter and May completes the Magic Circle sweep by going firmwide with Harvey. Linklaters launches Applied Intelligence, a team of lawyers and data scientists selling bespoke AI solutions directly to clients. And the US federal government intervenes in the lawsuit challenging Colorado's AI Act, taking sides in the AI regulation fight for the first time.
The release of the Legal Agent Benchmark is, on one reading, a product announcement. On another reading, it is a bid to set the standard before anyone else does. That distinction matters enormously for how this market develops.
Every mature software category eventually gets a shared benchmark that buyers and vendors both reference. Security has CVE and CVSS. Machine learning has MMLU, HELM, and a dozen others. Legal AI has had nothing that measures what actually matters: whether an AI agent can take a real legal matter and produce a work product a lawyer would sign off on.
Harvey's LAB is an attempt to fill that gap. The benchmark is open-source on GitHub, built around 1,200 agent tasks across 24 practice areas and evaluated against 75,000 expert-written rubric criteria. The structure is explicit: an instruction, a client matter containing relevant materials, and a work product. Not a question-answer pair. Not a research retrieval test. A task that looks like the actual work.
The backing is significant. Nvidia, OpenAI, Anthropic, Mistral and DeepMind are all named launch partners — which means the model providers are aligning to a legal-specific capability measure at the same time as the legal-AI vendors. Harvey has declined to publish a leaderboard on day one, saying it expects the dataset to evolve. That is a deliberate signal: the benchmark is designed to be the long-term frame, not a one-week news story.
For the first time, an enterprise legal team evaluating a legal AI agent has a shared vocabulary to work from. The question is no longer "does this tool make lawyers faster?" — which is unmeasurable and unanswerable in a trial. The question is "where on the LAB task set does this agent pass, and where does it fail?" That is a procurement-grade question. It will change the conversation in every RFP that a legal AI vendor answers from this week forward.
The legal-AI market has so far competed on a mix of credentials (Magic Circle clients), fundraise headline ($11B valuation), and narrative ("fiduciary-grade", "agent-native"). The LAB changes the competition surface in Harvey's favour. By publishing the benchmark — and by being the company whose products will inevitably score well against it because Harvey built it — Harvey has done something sophisticated: it has created a public measure of capability that its own training data is optimised for, while inviting every competitor and model provider to be evaluated on it.
The right comparison is not to a software benchmark. It is to what AWS did by publishing the Well-Architected Framework, or what Salesforce did by publishing the enterprise SaaS evaluation criteria it knew its own products would pass. The entity that sets the standard usually wins the market that forms around it.
Enter's $1.2B valuation is not an outlier. It is the clearest evidence yet that the structural problem legal AI is solving is not an Anglo-American problem. It is a feature of every legal system built on high-volume, rule-based work done by expensive human labour.
Brazil's court system receives more new lawsuits per year than any other country in the world. The practical consequence is that large Brazilian companies — banks, fintechs, platforms — each face hundreds of thousands of active cases at any given moment. The traditional response was to maintain enormous internal legal teams or outsource to law firms at per-case rates that added up to material operating costs. Enter builds AI agents that read each incoming lawsuit, pull the case-specific facts, and draft a defence brief. The company processes more than one million cases a year.
The founder's background is worth noting. Mateus Costa-Ribeiro became Brazil's youngest practising lawyer at 18, finished Harvard Law, passed the New York Bar at 20, and left a fully funded Stanford MBA seat to start the company. The $100M Series B was led by Founders Fund and joined by Sequoia Capital and Ribbit Capital — three of the most structurally rigorous investors in technology. All three diligenced the mass-litigation use case and concluded it was venture-scale.
Enter is doing in São Paulo what Flank does in London and New York: taking the category of legal work defined by high volume, rule-based repetition, and a fixed relationship between input and outcome, and moving it from expensive human time to supervised agents. The Brazilian court system is an extreme version of the problem — more volume, less predictability, harder to automate — but it makes the proof stronger, not weaker. If the thesis works at one million Brazilian lawsuits per year, it works at ten thousand UK NDAs per year. The cost structure is the same. The inexpensive work is being done by expensive people. That is the structural inefficiency agents remove.
Slaughter and May's firmwide Harvey rollout is a milestone, but the more interesting story of the week is what Linklaters did next. One firm completed the tool deployment. Another firm started selling the deployment as a product.
Every Magic Circle firm has now publicly committed to a named AI platform. The spread is instructive:
The variation in choice is itself a signal. The market has not converged on a single platform. Harvey has the most Magic Circle logos, but Legora, Microsoft and Anthropic are each present. The competition is real, and the firms with the most sophisticated procurement processes are still reaching different conclusions.
The more structurally interesting event of the week is Linklaters' Applied Intelligence launch. The unit is not an internal AI programme. It is a client-facing team of lawyers and data scientists that co-designs and delivers bespoke AI workflows for client use — analysing large data sets, building custom tooling, and selecting from frontier models on a project-by-project basis.
The framing in Linklaters' own announcement is careful: it is described as an AI-enabled legal service, not a product, keeping it within the firm's existing regulatory structure. But the economic model is different from any legal service that came before it. Clients are paying Linklaters to build AI infrastructure for them. The deliverable is software. The price is not a per-hour rate; it is a project fee. The work is not being done by a partner billing at £1,200 per hour. It is being done by a team designing something that, once built, runs at near-zero marginal cost.
What does a law firm do when its most profitable work — the high-complexity, high-judgment advisory — is still protected by expertise and relationships, but its volume work can be delivered by agents at a fraction of the cost? Applied Intelligence is Linklaters' answer: sell the agent layer as a premium service, not just the advice it produces. Whether that model survives long-term depends on whether clients conclude that a law firm is the right vendor for AI infrastructure. Many will not. But it is a rational first move.
The DOJ's intervention in the Colorado AI Act lawsuit is the most significant US AI regulatory event in months — not because of Colorado specifically, but because of what it signals about how the federal preemption fight will actually be conducted.
The Colorado AI Act (SB 24-205) was the first comprehensive US state AI law — passed in 2024, originally effective June 30, 2026. It imposes duties on developers and deployers of high-risk AI systems to avoid algorithmic discrimination in consequential decisions affecting Coloradans: education, employment, financial services, healthcare, housing, insurance, and — directly relevant here — legal services.
xAI filed a lawsuit seeking to invalidate the Act in April 2026. The Department of Justice intervened on April 28, supporting the challenge. The White House's December 2025 executive order had already established an "AI Litigation Task Force" specifically to challenge state AI laws inconsistent with federal AI policy. This is that task force acting.
At the same time, inside Colorado's own legislature, SB 189 moved through committee proposing to drop the requirement that companies explain how their AI works and push the effective date to January 2027. As of this week, the Colorado AI Act is simultaneously under attack in a federal courthouse and being rewritten by the legislators who passed it.
| Jurisdiction | Measure | Status as of 8 May 2026 |
|---|---|---|
| Colorado | SB 24-205 (Colorado AI Act) | Original June 30, 2026 effective date. SB 189 proposes January 2027. DOJ litigation in parallel. |
| European Union | AI Act Annex III high-risk systems | August 2, 2026 deadline remains in force. Follow-up omnibus trilogue pending (~13 May). |
| Connecticut | Comprehensive AI regulation | Failed. Governor veto threat ended the session without passage. |
| New York | RAISE Act (as amended March 27) | In force. Transparency-led, not risk-classification. Reporting framework only. |
| Federal (US) | AI Litigation Task Force | Active. DOJ intervention in Colorado Act challenge is first direct action. |
The Troutman state AI law tracker for May 4 catalogued another cluster of new proposals across multiple states. The net result of this week is that the US AI regulatory landscape is simultaneously more active and less clear than it was a month ago. The federal preemption play — if successful in Colorado — would simplify the landscape significantly, but it will take years to resolve in the courts. Enterprise legal teams deploying AI are planning against a target that is moving in three directions at once: federal preemption, state legislation, and international obligations.
The compliance burden is, in itself, a kind of market signal. Every hour a legal team spends mapping AI obligations across 50 state regimes and the EU AI Act is an hour not spent on work that creates value. That cost — borne by expensive in-house lawyers doing work that is, structurally, research and mapping rather than legal judgment — is exactly the kind of work agents should be doing.
This week's stories describe a market that has moved past the question of whether AI agents can do legal work and is now engaged in three consequential follow-on questions: how do we measure it, who owns the infrastructure, and what happens when governments start fighting each other about the rules?
Two of this week's stories — Harvey's LAB and Enter's unicorn round — define the same moment from different angles. The LAB defines what "good" looks like for a legal AI agent on a discrete, structured task. Enter shows what the commercial model looks like when you build a business around agents that can do those tasks at scale, on a defined category of work, for clients who need a million of them done per year.
The common thread is the work itself. Mass litigation in Brazil. Contract review in London. Procurement processing in New York. These are all the same structural problem: a large volume of inexpensive work being done by expensive resources inside a billing model that prices the resource, not the outcome. The agent fixes that by decoupling the output from the input cost. Enter proved it on lawsuits. Flank does it on contracts. The market is now validating both at the same time, with the same investors, for the same structural reason.
The benchmark matters for Flank's customers specifically because it makes the conversation easier. The question is no longer whether agents can do legal work — Enter's million cases a year, Harvey's 500+ live agents, and the LAB's 1,200-task evaluation set all answer that in the affirmative. The question is whether your legal team is outsourcing to agents yet, or still paying a law firm to do the same work at ten times the cost. That is the question we are having with every GC we talk to. This week made it easier to ask.