This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020
email us here
Visit us on social media:
LinkedIn
YouTube

Descrybe demo

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Why Some Cats Lose Their Appetite and Why ZenaPet Is Part of the Broader Pet Wellness Conversation

Why Some Cats Lose Their Appetite and Why ZenaPet Is Part of the Broader Pet Wellness Conversation

Costa Mesa, California – March 09, 2026 – PRESSADVANTAGE – A cat that suddenly shows little interest in food can

March 9, 2026

GovCore Opens Oklahoma City Office to Strengthen Local Client Support

GovCore Opens Oklahoma City Office to Strengthen Local Client Support

Regulatory technology company opens Oklahoma City office at Arvest Tower, deepening its commitment to delivering

March 8, 2026

Artivist Jacqueline Rudolph to Exhibit Powerful Portraits and Sculptural Works at ArtExpo New York 2026

Artivist Jacqueline Rudolph to Exhibit Powerful Portraits and Sculptural Works at ArtExpo New York 2026

Celebrated Santa Fe-based artist brings socially conscious portraiture and activist-driven artwork to Manhattan's

March 8, 2026

Why Orthodontists Still Pursue Board Certification in an Era of Modern Orthodontics

Why Orthodontists Still Pursue Board Certification in an Era of Modern Orthodontics

Board certification through the American Board of Orthodontics remains a voluntary but respected credential signaling

March 8, 2026

John W. Crane Qualifies for MDRT’s Top of the Table

John W. Crane Qualifies for MDRT’s Top of the Table

Financial advisor John W. Crane earns MDRT Top of the Table while helping high-income families simplify saving, protect

March 8, 2026

Entrepreneur Raymond Palmer Reflects on Ikigai Journey Behind One Dog One Bone

Entrepreneur Raymond Palmer Reflects on Ikigai Journey Behind One Dog One Bone

Inventor of the Bone Pool shares how creativity, craftsmanship, and purpose shaped a life’s work with dogs Being able

March 8, 2026

CTO Spotlights Women’s Impact in Tourism on International Women’s Day

CTO Spotlights Women’s Impact in Tourism on International Women’s Day

BRIDGETOWN, BARBADOS, March 8, 2026 /EINPresswire.com/ — The Caribbean Tourism Organization (CTO) joined the global

March 8, 2026

Author Jane-Marie Auret Explores Immigration, Identity, and Digital Age Struggles in Screens and the Ego

Author Jane-Marie Auret Explores Immigration, Identity, and Digital Age Struggles in Screens and the Ego

Literary work explores immigration, university life, emotional struggle, and spiritual questions, shaping a digitally

March 8, 2026

Webtage LLC Sets a New Industry Standard with AI, GEO, and AEO-Integrated SEO Solutions for Local and Global Businesses

Webtage LLC Sets a New Industry Standard with AI, GEO, and AEO-Integrated SEO Solutions for Local and Global Businesses

Webtage LLC launches a new AI, GEO, and AEO powered SEO framework to help businesses adapt to AI search, conversational

March 8, 2026

New Molecular Switch that Boosts Tooth Regeneration Discovered

New Molecular Switch that Boosts Tooth Regeneration Discovered

Researchers uncover how SMAD7 directly activates Wnt signaling to promote dental pulp stem cell regeneration CHINA,

March 8, 2026

Rogue Collective Names Clara Woods as Its First Artist in Residence on International Women’s Day

Rogue Collective Names Clara Woods as Its First Artist in Residence on International Women’s Day

Rogue United Expands Commitment to Art as Innovation, Resilience and Cultural Impact; Woods Named Official Artist of

March 8, 2026

LongevityNext.com Relaunches as a Longevity Science, Business & Policy Publication

LongevityNext.com Relaunches as a Longevity Science, Business & Policy Publication

The relaunched site will cover longevity research, therapeutics, biomarkers, clinics, regulation, capital and data

March 8, 2026

Eleven-Year-Old Fashion Designer, Actor Charlie LeRoy Wins First ‘Be A Star With A Star’ Contest Through eZWay Network

Eleven-Year-Old Fashion Designer, Actor Charlie LeRoy Wins First ‘Be A Star With A Star’ Contest Through eZWay Network

A Bright Future Ahead HOLLYWOOD, CA, UNITED STATES, March 8, 2026 /EINPresswire.com/ — Eleven-year-old fashion

March 8, 2026

BLZ Fire Skids Launches UTV and Truck Fire Suppression Systems Featuring PolyPro™ Construction and Redline Pumps

BLZ Fire Skids Launches UTV and Truck Fire Suppression Systems Featuring PolyPro™ Construction and Redline Pumps

Mobile fire suppression systems for UTVs and trucks featuring PolyPro™ tanks and BLZ Redline pumps for rapid response.

March 8, 2026

Alternative to Meds Center Highlights Long-Term Fanapt Risks and Individualized Antipsychotic Tapering Support

Alternative to Meds Center Highlights Long-Term Fanapt Risks and Individualized Antipsychotic Tapering Support

Sedona inpatient program educates on iloperidone side effects, anticholinergic burden, and holistic alternatives for

March 8, 2026

Lan Tianyang Introduces Chinese Opera Vocal Techniques Into Global Pop Singing Training

Lan Tianyang Introduces Chinese Opera Vocal Techniques Into Global Pop Singing Training

Chinese opera vocal techniques enter modern pop singing training worldwide. Chinese opera contains centuries of vocal

March 8, 2026

KLOTA Expands E-Commerce Toolkit with Expert-Led SEO and Google Ads Audit Services

KLOTA Expands E-Commerce Toolkit with Expert-Led SEO and Google Ads Audit Services

Fixed-price, manual audits with prioritized action plans join KLOTA’s growing suite of diagnostic tools for online

March 8, 2026

When a ‘Salt Room’ Has No Salt on the Walls: Experts Warn Consumers About a Growing Halotherapy Problem

When a ‘Salt Room’ Has No Salt on the Walls: Experts Warn Consumers About a Growing Halotherapy Problem

Dr. Margaret Smiechowski explains why real salt rooms must include salt walls, proper climate control, and correct

March 8, 2026

Southern Live Oak Wellness Expands Partial Hospitalization Program in Atlanta and South Georgia

Southern Live Oak Wellness Expands Partial Hospitalization Program in Atlanta and South Georgia

Southern Live Oak Wellness provides a structured Partial Hospitalization Program in Atlanta alongside residential and

March 8, 2026

Instacoins Concierge Launches ‘Finesse’ Initiative on International Women’s Day

Instacoins Concierge Launches ‘Finesse’ Initiative on International Women’s Day

Celebrating women who shape decisions, travel, and lifestyle, Finesse by Instacoins provides dedicated concierge

March 8, 2026

MonsGeek Introduces TMR MagMech Magnetic Keyboards: Hybrid Mechanical and Magnetic Switches in One Keyboard

MonsGeek Introduces TMR MagMech Magnetic Keyboards: Hybrid Mechanical and Magnetic Switches in One Keyboard

MonsGeek unveils TMR MagMech keyboards, combining mechanical and magnetic switches in a hybrid design for precision,

March 8, 2026

Advanced eClinical Training Expands Nationwide Clinical Partner Network, Strengthening Medical Assistant Pipeline

Advanced eClinical Training Expands Nationwide Clinical Partner Network, Strengthening Medical Assistant Pipeline

ACT expands its network of 1,000+ healthcare partners nationwide – extern-to-hire workforce pathways for medical

March 8, 2026

Monkey Dooz Inks First Franchisee, Bringing Award-Winning Children’s Salon Concept to Missouri

Monkey Dooz Inks First Franchisee, Bringing Award-Winning Children’s Salon Concept to Missouri

Family-focused brand known for whimsical haircut experiences, philanthropic impact & national recognition signs

March 8, 2026

Figment Design Promotes Robert Santiago to Marketing Channels Manager

Figment Design Promotes Robert Santiago to Marketing Channels Manager

MIRAMAR, FL, UNITED STATES, March 8, 2026 /EINPresswire.com/ — Figment Design, a South Florida agency specializing in

March 8, 2026

Lauren Tobey’s Spiraling into Control Takes Over Times Square, Reframing Trauma, Burnout, and the Myth of ‘Being Fine’

Lauren Tobey’s Spiraling into Control Takes Over Times Square, Reframing Trauma, Burnout, and the Myth of ‘Being Fine’

Lauren Tobey's Spiraling into Control, featured in Times Square, merges memoir and neuroscience to redefine trauma,

March 8, 2026

An Artistic Expedition Across the Storm: ‘Golden Bell Laureates’ Shine at Carnegie Hall

An Artistic Expedition Across the Storm: ‘Golden Bell Laureates’ Shine at Carnegie Hall

Defying the Storm: The Story of an Artistic Expedition from Beijing to New York. NEW YORK, NY, UNITED STATES, March 8,

March 8, 2026

Dr. Juan P. Chisholm, Author of Mission Possible is an Audience Choice Award Winner for Mission Possible Book Award Film

Dr. Juan P. Chisholm, Author of Mission Possible is an Audience Choice Award Winner for Mission Possible Book Award Film

It is an incredible honor to have our movie premiered at the Black Art & Film Festival and be recognized as the

March 8, 2026

Hosted.com Expands SSL Certificate Options to Strengthen Website Security

Hosted.com Expands SSL Certificate Options to Strengthen Website Security

Hosted.com expands its SSL certificate offerings, providing simplified encryption solutions to help improve website

March 8, 2026

Global Finance Enters ‘End of Predictability’ as Demand for Roger Spitz’s Uncertainty Keynotes Surges

Global Finance Enters ‘End of Predictability’ as Demand for Roger Spitz’s Uncertainty Keynotes Surges

Former M&A Banker and Top-Ranked Futurist Roger Spitz Decodes Era of “Metaruptions” as Boards across Global Finance

March 8, 2026

Southern Live Oak Wellness Expands Residential Mental Health Programs Across Atlanta and South Georgia

Southern Live Oak Wellness Expands Residential Mental Health Programs Across Atlanta and South Georgia

Southern Live Oak Wellness provides evidence-based residential mental health programs, therapy, and outpatient services

March 8, 2026

Southern Live Oak Wellness Expands PTSD Treatment for Teens in Atlanta and South Georgia

Southern Live Oak Wellness Expands PTSD Treatment for Teens in Atlanta and South Georgia

Southern Live Oak Wellness provides specialized PTSD treatment for teens through residential care, therapy programs,

March 8, 2026

Southern Live Oak Wellness Expands Personality Disorders Treatment in Atlanta and South Georgia

Southern Live Oak Wellness Expands Personality Disorders Treatment in Atlanta and South Georgia

Southern Live Oak Wellness offers specialized personality disorder treatment, residential care, and therapy programs

March 8, 2026

Martinique Highlights Airlift Momentum and Market Growth at South Florida Travel & Adventure Show

Martinique Highlights Airlift Momentum and Market Growth at South Florida Travel & Adventure Show

FORT LAUDERDALE, FL, UNITED STATES, March 8, 2026 /EINPresswire.com/ — The Martinique Tourism Authority highlighted

March 8, 2026

As Real Estate Discovery Shifts Toward Video, Some Brokerages Are Exploring a New Digital Front Door

As Real Estate Discovery Shifts Toward Video, Some Brokerages Are Exploring a New Digital Front Door

The ReelMap connects agent video content to geographic maps, creating a discovery layer that allows buyers to explore

March 8, 2026

SunTrust Remodeling Expands Professional Exterior Remodeling Services Across California

SunTrust Remodeling Expands Professional Exterior Remodeling Services Across California

SunTrust Remodeling strengthens exterior renovation services including roofing, siding, windows, and exterior painting

March 8, 2026

Women-Owned GMR Transcription Releases Women’s Day Report on the Legacy of Women in Documentation

Women-Owned GMR Transcription Releases Women’s Day Report on the Legacy of Women in Documentation

GMR Transcription marks Women’s Day with a report highlighting women’s historic role in documentation and examining the

March 8, 2026

Miramar Pet Boarding Facility Hits 165 Five-Star Reviews — Pet Parents Say It’s Nothing Like a Kennel

Miramar Pet Boarding Facility Hits 165 Five-Star Reviews — Pet Parents Say It’s Nothing Like a Kennel

Four Paws Inn's cage-free home environment is winning over South Florida families who refused to settle for traditional

March 8, 2026

Fort Worth Roofer Helps Homeowners Fight Back Against Lowball Storm Damage Insurance Claims

Fort Worth Roofer Helps Homeowners Fight Back Against Lowball Storm Damage Insurance Claims

Veteran Brothers Roofing & Restoration is guiding North Texas homeowners through complex insurance claim processes

March 8, 2026

Career Coach Barry Simpson Launches ‘It’s Not About You,’ a New Guide to Getting Hired

Career Coach Barry Simpson Launches ‘It’s Not About You,’ a New Guide to Getting Hired

New book and companion web app challenge job seekers to stop focusing on what they want and start thinking about what

March 8, 2026

Auntie Atom: The Final Harvest Announces 2026 Launch Date for New Roblox Horror Experience

Auntie Atom: The Final Harvest Announces 2026 Launch Date for New Roblox Horror Experience

Chicago, Illinois – Auntie Atom: The Final Harvest announces the official 2026 release of its upcoming horror game on

March 8, 2026