This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

“We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.
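The overlap figures above can be cross-checked with a little arithmetic. The sketch below is not taken from the white paper. It assumes the three quoted counts are nested (a question missed by all three models also counts among those missed by two or more, and those in turn among the 40 missed by at least one) and derives how many incorrect outputs they imply. The function name and dictionary keys are illustrative only.

```python
# Figures quoted in the release. The "exactly k models" breakdown is derived
# here under the nesting assumption; it is not reported in the release.
MISSED_BY_AT_LEAST_ONE = 40
MISSED_BY_AT_LEAST_TWO = 11
MISSED_BY_ALL_THREE = 1


def overlap_breakdown(at_least_one: int, at_least_two: int, all_three: int) -> dict:
    """Split nested 'missed by at least k models' counts into exact buckets."""
    exactly_one = at_least_one - at_least_two
    exactly_two = at_least_two - all_three
    exactly_three = all_three
    # Each question missed by k models contributes k incorrect outputs.
    total_incorrect_outputs = exactly_one + 2 * exactly_two + 3 * exactly_three
    return {
        "exactly_one": exactly_one,
        "exactly_two": exactly_two,
        "exactly_three": exactly_three,
        "total_incorrect_outputs": total_incorrect_outputs,
    }


result = overlap_breakdown(
    MISSED_BY_AT_LEAST_ONE, MISSED_BY_AT_LEAST_TWO, MISSED_BY_ALL_THREE
)
print(result)
```

Under that assumption the counts imply 29 questions missed by exactly one model, 10 by exactly two, and 1 by all three, for 52 incorrect outputs in total, which agrees with the 52 incorrect outputs reported for the general-purpose models.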

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, which the company says took more than 100 billion tokens of processing to prepare.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media
