This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

First Benchmark for Legacy Code Comprehension Shows Specialized AI Approach Outperforms General-Purpose Models

LegacyCodeBench tests whether AI can understand COBOL well enough to document it accurately, not just generate plausible text

NEW YORK, NY, UNITED STATES, January 13, 2026 /EINPresswire.com/ -- A new benchmark designed to measure whether AI systems can actually understand legacy enterprise code shows that specialized approaches significantly outperform general-purpose models. LegacyCodeBench, developed by Kalmantic (an applied AI research lab) in collaboration with Hexaview Technologies, evaluates AI comprehension of COBOL, the language still processing 95% of ATM transactions and $3 trillion in daily global transactions.

The benchmark finds that domain-specialized systems like Hexaview’s Legacy Insights achieve 92% accuracy, compared to 86-90% for general-purpose models like GPT-4o and Claude Sonnet 4.

-Why This Matters
Over 220 billion lines of COBOL remain in production worldwide, but the engineers who wrote it are retiring. Modernization projects fail at rates exceeding 60%, and the pattern is usually the same: organizations try to replace systems they never fully understood.

“The risk everyone focuses on is the legacy technology itself, but that’s not actually where projects fall apart,” said Ankit Agarwal, Founder and CTO of Hexaview. “What kills these programs is undocumented business logic. We needed an objective way to measure whether AI can actually understand these systems well enough to trust the output.”


-How It Works
Most AI benchmarks use another LLM to judge output quality, which creates reproducibility problems. LegacyCodeBench takes a different approach: it verifies claims against the original program’s behavior.

The process extracts specific behavioral claims from AI-generated documentation (statements like “PREMIUM is calculated by multiplying BASE-RATE by RISK-FACTOR”) and then verifies them by executing the original COBOL program with test inputs. If the claim doesn’t match what the code actually does, it fails.

“We’re not testing whether documentation reads well,” said Nikita, co-author of the paper. “We wanted to know if you could actually trust it. There’s a difference.”

The benchmark also penalizes gaming. Documentation that avoids making testable claims scores zero on the behavioral track, which carries 50% of the total weight. And if the AI hallucinates variables that don’t exist in the source code, the entire task fails.
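
The verification idea described above can be illustrated with a minimal Python sketch. It is not LegacyCodeBench's actual implementation: the `Claim` shape, the `evaluate` function, the pass-fraction scoring, and the Python stand-in for the COBOL program are all assumptions made for illustration. Only the three rules it encodes come from the description above: claims are checked against program behavior, documentation with no testable claims scores zero on the behavioral track, and a hallucinated variable fails the task.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Inputs = Dict[str, float]


@dataclass
class Claim:
    """A testable statement extracted from documentation (hypothetical shape)."""
    target: str                         # variable the claim describes
    predict: Callable[[Inputs], float]  # value the claim implies for given inputs
    variables: List[str]                # every variable the claim mentions


# Assumption: the variables declared in the original source code.
SOURCE_VARIABLES = {"PREMIUM", "BASE-RATE", "RISK-FACTOR"}


def run_program(inputs: Inputs) -> Dict[str, float]:
    """Stand-in for executing the original COBOL program on test inputs."""
    return {**inputs, "PREMIUM": inputs["BASE-RATE"] * inputs["RISK-FACTOR"]}


def evaluate(claims: List[Claim], test_cases: List[Inputs],
             tolerance: float = 1e-9) -> Tuple[bool, float]:
    """Return (task_failed, behavioral_score), with the score in [0, 1]."""
    # Any variable that does not exist in the source fails the whole task.
    for claim in claims:
        if not {claim.target, *claim.variables} <= SOURCE_VARIABLES:
            return True, 0.0

    # Documentation with no testable claims scores zero on this track.
    if not claims:
        return False, 0.0

    # A claim passes only if it matches observed behavior on every test case.
    passed = 0
    for claim in claims:
        matches = all(
            abs(run_program(case)[claim.target] - claim.predict(case)) <= tolerance
            for case in test_cases
        )
        if matches:
            passed += 1

    # Assumption: the score is the fraction of claims that pass.
    return False, passed / len(claims)


if __name__ == "__main__":
    cases = [{"BASE-RATE": 100.0, "RISK-FACTOR": 1.5}]
    names = ["PREMIUM", "BASE-RATE", "RISK-FACTOR"]
    correct = Claim("PREMIUM", lambda i: i["BASE-RATE"] * i["RISK-FACTOR"], names)
    print(evaluate([correct], cases))
```

In this sketch the checks are deterministic: the same documentation and test inputs always yield the same result, which is the property the benchmark contrasts with LLM-judged scoring.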

-Results


| System | LCB Score | Structural | Doc Quality | Behavioral | T1 Basic | T4 Enterprise |
| --- | --- | --- | --- | --- | --- | --- |
| Legacy Insights (Hexaview) | 92% | 94% | 96% | 90% | 96% | 90% |
| Claude Sonnet 4 (Anthropic) | 90% | 96% | 78% | 91% | 92% | 92% |
| AWS Transform Mainframe | 88% | 98% | 68% | 91% | 88% | 87% |
| IBM Granite 13B | 87% | 93% | 72% | 90% | 89% | 84% |
| GPT-4o (OpenAI) | 86% | 92% | 71% | 89% | 91% | 82% |


Specialized systems (Legacy Insights, AWS Transform) outperform general-purpose models, particularly on documentation quality. All models maintain reasonably strong performance from basic programs (T1) to enterprise-scale COBOL (T4), though GPT-4o shows the largest drop (9 points).

“General-purpose models have gotten quite good at parsing legacy code, which is real progress,” Agarwal said. “But there’s still a gap between understanding the syntax and understanding what the code is actually doing in a business context. That’s where specialization matters.”
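
The T1-to-T4 comparison can be reproduced directly from the results table. The short Python sketch below copies the two tier columns from the table and computes each system's drop in percentage points; the variable names are illustrative only.

```python
# (T1 Basic, T4 Enterprise) scores in percent, copied from the results table.
TIER_SCORES = {
    "Legacy Insights (Hexaview)": (96, 90),
    "Claude Sonnet 4 (Anthropic)": (92, 92),
    "AWS Transform Mainframe": (88, 87),
    "IBM Granite 13B": (89, 84),
    "GPT-4o (OpenAI)": (91, 82),
}

# Drop in percentage points when moving from basic to enterprise programs.
drops = {name: t1 - t4 for name, (t1, t4) in TIER_SCORES.items()}

# System with the largest drop.
largest = max(drops, key=drops.get)

if __name__ == "__main__":
    for name, drop in sorted(drops.items(), key=lambda item: item[1], reverse=True):
        print(f"{name}: {drop} points")
```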

-Open Source
LegacyCodeBench is fully open source with deterministic evaluation. The public leaderboard is at legacycodebench.com, and the team welcomes submissions via GitHub.

-Resources
• Website: legacycodebench.com
• Paper: Available at legacycodebench.com
• GitHub: github.com/kalmantic/legacycodebench
• Legacy Insights: legacyip.hexaview.ai


-About Hexaview
Hexaview is a strategic implementation partner for regulated enterprises, specializing in legacy system preservation and modernization. Learn more: hexaviewtech.com

-About Kalmantic Labs
Kalmantic is an applied AI research lab studying the challenges that emerge when AI meets production systems. They publish research openly and build tools based on their findings. Learn more: kalmantic.com

LegacyCodeBench is open source under the MIT license.

Ankit Agarwal
Hexaview Technologies
+1 845-653-3855

