The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020
email us here
Visit us on social media:
LinkedIn
YouTube

Descrybe demo

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

CYPHER Learning celebrates Customer of the Year winners for 2025

CYPHER Learning celebrates Customer of the Year winners for 2025

CYPHER Learning announces this year’s award-winning customers transforming learning and producing measurable business

March 9, 2026

Craters & Freighters of Portland Announces New Ownership

Craters & Freighters of Portland Announces New Ownership

New ownership reinforces custom crating, engineered packaging, and specialty shipping services for high-value and

March 9, 2026

Craters & Freighters Announces New Ownership in Jacksonville, Florida

Craters & Freighters Announces New Ownership in Jacksonville, Florida

New leadership strengthens custom crating, engineered packaging, and specialty shipping services for high-value assets

March 9, 2026

AI & Big Data Expo North America 2026 Comes to San Jose as Demand for AI and Data-Driven Solutions Accelerates

AI & Big Data Expo North America 2026 Comes to San Jose as Demand for AI and Data-Driven Solutions Accelerates

The event will bring together enterprise AI leaders, data scientists, technology innovators, and solution providers.

March 9, 2026

Intelligent Automation Conference North America 2026 Comes to San Jose as Demand for Automation & Workflows Accelerates

Intelligent Automation Conference North America 2026 Comes to San Jose as Demand for Automation & Workflows Accelerates

The event will bring together enterprise leaders, automation engineers, business strategists, and technology providers.

March 9, 2026

SSOJet Debuts AI-Native Enterprise SSO Platform Designed to Work Alongside Existing Auth Systems

SSOJet Debuts AI-Native Enterprise SSO Platform Designed to Work Alongside Existing Auth Systems

AI-native platform adds enterprise SSO capabilities across 25+ identity providers to existing authentication

March 9, 2026

2026 Builders Awards to Honor Westchester’s Leading Builders, Developers, and Design Visionaries

2026 Builders Awards to Honor Westchester’s Leading Builders, Developers, and Design Visionaries

Westchester Home and 914INC. recognize the innovators transforming the region’s residential and commercial landscape.

March 9, 2026

TFSF Ventures Publishes Analysis Comparing AI Agents Against Outsourcing for Consulting Firms

TFSF Ventures Publishes Analysis Comparing AI Agents Against Outsourcing for Consulting Firms

Report examines three-year costs, confidentiality, and operational differences between outsourced teams and AI

March 9, 2026

TFSF Ventures Details How Automation Reduces Client Onboarding from 14 Days to 48 Hours

TFSF Ventures Details How Automation Reduces Client Onboarding from 14 Days to 48 Hours

Guide identifies which onboarding steps can be automated while preserving human-led client relationship elements DUBAI,

March 9, 2026

International Association of Top Professionals (IAOTP) Continues to honor the World’s Most Prestigious Professionals

International Association of Top Professionals (IAOTP) Continues to honor the World’s Most Prestigious Professionals

International Association of Top Professionals (IAOTP) Continues to Gain Global Recognition as a Premier Professional

March 9, 2026

Egypt eVisa Online Simplifies Travel Authorization for International Visitors

Egypt eVisa Online Simplifies Travel Authorization for International Visitors

Egypt has introduced a modern digital visa system that allows travelers to obtain their travel authorization online

March 9, 2026

Space 11 Appoints Former NASA Chief Scientist James L. Green to Its Global Space Advisory Board

Space 11 Appoints Former NASA Chief Scientist James L. Green to Its Global Space Advisory Board

NEW YORK, NY / ACCESS Newswire / March 9, 2026 / Space 11 announces the appointment of James L. Green as Strategic

March 9, 2026

SMX and LIQOS, by algo21, Partner to Build the World’s First Tokenized Market Infrastructure for Verified Industrial Materials

SMX and LIQOS, by algo21, Partner to Build the World’s First Tokenized Market Infrastructure for Verified Industrial Materials

The partnership seeks to combine SMX's physical verification layer with LIQOS, by algo21's autonomous liquidity

March 9, 2026

The New Digital Gold Rush: .AI Domains Triple in Value as Artificial Intelligence Rewrites the Rules of Online Real Estate

The New Digital Gold Rush: .AI Domains Triple in Value as Artificial Intelligence Rewrites the Rules of Online Real Estate

Historic milestone – .AI now worth more than all other alternative extensions combined. SAN FRANCISCO, CA / ACCESS

March 9, 2026

Luminar Media Group Files to Change Corporate Name to Fortun Corp. and Trading Symbol to FRTU

Luminar Media Group Files to Change Corporate Name to Fortun Corp. and Trading Symbol to FRTU

Corporate Rebranding Aligns Public Company Identity With the Fortun Brand MIAMI, FLORIDA / ACCESS Newswire / March 9,

March 9, 2026

Jaguar Health Strengthens Company’s Balance Sheet by Restructuring and Reducing Royalty and Debt Obligations and Extinguishing Warrants

Jaguar Health Strengthens Company’s Balance Sheet by Restructuring and Reducing Royalty and Debt Obligations and Extinguishing Warrants

Strengthening balance sheet and capitalization is a key Jaguar priorityCompany continues its sharp, strategic focus on

March 9, 2026

ZetrOZ Systems is a Premier Sponsor of Arthritis Foundation’s Pathways Conference, Highlighting Drug Free Treatment Options for Knee Osteoarthritis

ZetrOZ Systems is a Premier Sponsor of Arthritis Foundation’s Pathways Conference, Highlighting Drug Free Treatment Options for Knee Osteoarthritis

The developers of sustained acoustic medicine technology and the sam® wearable ultrasound device continue mission of

March 9, 2026

Weight Loss Buddy Encourages Sustainable Nutrition Habits this March, Emphasizing App’s Behavior-Driven Approach

Weight Loss Buddy Encourages Sustainable Nutrition Habits this March, Emphasizing App’s Behavior-Driven Approach

~ AI-powered community platform highlights the role of daily nutrition habits and peer accountability in long-term

March 9, 2026

Priddy Spaces Signs Long-Term Lease to Bring Venture X Coworking to The Forum Peachtree Corners

Priddy Spaces Signs Long-Term Lease to Bring Venture X Coworking to The Forum Peachtree Corners

23,000+ SF Premium Workspace to Open at One of Metro Atlanta’s Top Walkable Lifestyle Destinations Today’s

March 9, 2026

Infinnium First to Achieve ISO 42001:2023 Certification for Responsible AI Governance in Enterprise Data Intelligence

Infinnium First to Achieve ISO 42001:2023 Certification for Responsible AI Governance in Enterprise Data Intelligence

Certification reinforces Infinnium’s leadership in secure, ethical, and compliant AI-native data governance at source.

March 9, 2026

Beyond Celiac Appoints Turner Jenkins as Managing Director of Beyond Celiac Investments

Beyond Celiac Appoints Turner Jenkins as Managing Director of Beyond Celiac Investments

Jenkins to lead venture philanthropy strategy, accelerating celiac disease research and treatment Celiac disease has

March 9, 2026

Skyward Introduces Alira, an AI FOIA Software Platform Designed to Accelerate Transparency Across Government

Skyward Introduces Alira, an AI FOIA Software Platform Designed to Accelerate Transparency Across Government

Speed is nice. Defensibility is required. Day One usability is a dream. Alira delivers all three, and that’s why I put

March 9, 2026

Genesis Systems’ WaterCube® Becomes First Atmospheric Water Technology to Pass U.S. Military Gold Standard Water Testing

Genesis Systems’ WaterCube® Becomes First Atmospheric Water Technology to Pass U.S. Military Gold Standard Water Testing

Rigorous field testing confirms WaterCube exceeds production expectations and meets the U.S. Army’s strict TB MED-577

March 9, 2026

Grit Races Announces Rebirth of Huntington Sprint Triathlon to Cleveland Metroparks

Grit Races Announces Rebirth of Huntington Sprint Triathlon to Cleveland Metroparks

We’ve lost quite a few local triathlons, so I am delighted that Grit Races is resurrecting the Huntington Sprint

March 9, 2026

DrFirst Releases Next-Generation RxInform, Driving 30% More Patient Interactions for Medication Adherence

DrFirst Releases Next-Generation RxInform, Driving 30% More Patient Interactions for Medication Adherence

Redesigned experience guides patients through personalized savings, education, and reminders, achieving 98%

March 9, 2026

Bishop D. A. Davis Releases Powerful New Book Addressing the Crisis of Modern Relationships

Bishop D. A. Davis Releases Powerful New Book Addressing the Crisis of Modern Relationships

Effective Habits for Affective Relationships My book teaches you how to move from emotional survival to spiritual

March 9, 2026

Revefi Launches AI and Agentic Observability for Enterprise LLM and Agent Workflows

Revefi Launches AI and Agentic Observability for Enterprise LLM and Agent Workflows

New capabilities give data, AI, and engineering teams cost attribution, benchmarking, traceability, and integration

March 9, 2026

Leverage Launches AI Workforce Productivity Platform to Help Employees Find Information and Work Faster

Leverage Launches AI Workforce Productivity Platform to Help Employees Find Information and Work Faster

Employees can now leverage their existing tools, apps, and data to get more work done in less time. The future of work

March 9, 2026

Rocketgraph Expands Higher Education Access Through Unimarket Marketplace

Rocketgraph Expands Higher Education Access Through Unimarket Marketplace

High-Performance Graph Analytics Platform Now Available to Universities via Trusted eProcurement Channel NEW YORK, NY,

March 9, 2026

Policy Pathways and VCU’s Wilder School Announce 2026 Summer Academy for Policy Leadership and Public Service

Policy Pathways and VCU’s Wilder School Announce 2026 Summer Academy for Policy Leadership and Public Service

Register Now for Two-Week Summer Leadership Academy for High School and College Students! The Policy Pathways Summer

March 9, 2026

Pathfinder Wealth Consulting Welcomes Wealth Advisor Brice Gibson to Wilmington Office

Pathfinder Wealth Consulting Welcomes Wealth Advisor Brice Gibson to Wilmington Office

Pathfinder Wealth Consulting is pleased to announce Brice Gibson as the newest addition to its growing team of

March 9, 2026

Lentech, Inc. Announces Rose Allen as CEO to Drive Growth and Innovation

Lentech, Inc. Announces Rose Allen as CEO to Drive Growth and Innovation

ELKRIDGE, MD, UNITED STATES, March 9, 2026 /EINPresswire.com/ — Lentech, Inc., an Employee-Owned organization and a

March 9, 2026

Injury Law Partners Expands Philadelphia Presence With Comprehensive Legal Services

Injury Law Partners Expands Philadelphia Presence With Comprehensive Legal Services

Delivering results-driven, detail-focused, and client-centered legal advocacy to protect and empower Philadelphia

March 9, 2026

Chinese Top 3 Leading Stainless Steel Color Pipe Manufacturers – Shaping the Future of Steel Products

Chinese Top 3 Leading Stainless Steel Color Pipe Manufacturers – Shaping the Future of Steel Products

Chinese Top 3 Leading Stainless Steel Color Pipe Manufacturers – Shaping the Future of Steel Products CALIFORNIA, CA,

March 9, 2026

TalkCounsel Acquires LegalSafe, Boosting Legal Services with AI Risk & Compliance Tools

TalkCounsel Acquires LegalSafe, Boosting Legal Services with AI Risk & Compliance Tools

TalkCounsel acquires LegalSafe, adding automated risk assessment and AI compliance readiness tools to its on-demand

March 9, 2026

USNS Selected as Approved Early Numeracy Screener Under Indiana’s Updated Numeracy Act

USNS Selected as Approved Early Numeracy Screener Under Indiana’s Updated Numeracy Act

Indiana schools may adopt Forefront Education’s Universal Screeners for Number Sense (USNS) to meet early numeracy

March 9, 2026

Nakiea Cook, MBA, CFEI®, Chosen to Serve on New York Financial Educators Council’s Professional Advisory Board

Nakiea Cook, MBA, CFEI®, Chosen to Serve on New York Financial Educators Council’s Professional Advisory Board

Nakiea Cook’s blend of executive-level financial strategy and passionate community advocacy is a powerful asset for the

March 9, 2026

MELTRIC Exhibits Inherently Electrically Safe Industrial Plugs & Receptacles at IEEE Electrical Safety Workshop 2026

MELTRIC Exhibits Inherently Electrically Safe Industrial Plugs & Receptacles at IEEE Electrical Safety Workshop 2026

MELTRIC Corporation to showcase switch-rated plugs and receptacles at the IEEE IAS Electrical Safety Workshop (ESW)

March 9, 2026

TechEx North America returns to California on May 18-19, 2026, in San Jose

TechEx North America returns to California on May 18-19, 2026, in San Jose

TechEx North America 2026 comes to San Jose for your annual enterprise technology intelligence briefing. I am

March 9, 2026

Raya Therapeutic Announces Selection of RT1999 (Smilagenin) onto the EXPERTS-ALS Clinical Trial Platform

Raya Therapeutic Announces Selection of RT1999 (Smilagenin) onto the EXPERTS-ALS Clinical Trial Platform

Positive data from EXPERTS-ALS could accelerate RT1999 towards a registration trial; RT1999 to be presented as a poster

March 9, 2026