AI Hallucination ResearchAudiences › AI Labs

For AI Labs

Independent per-regulation testing of how AI models hallucinate on regulatory questions. Each white paper pairs the model's answer with the authenticated regulator text that contradicts it, and verifies every source the model cited — separating real citations from fabricated, pretextual, or contradictory ones.

Browse white papers
Public, on-demand

Per-regulation diagnostics on how specific AI models hallucinate on each rule — grounded in authenticated primary sources by the RLB Specialist Panel.

4 published white papers
Partnership
Strengthen your model

Engage the RLB Specialist Panel to evaluate your model on regulations of strategic priority, diagnose its failure modes per finding, and collaborate on remediation. Includes pre-publication review of any findings we publicly release on your model.

The methodology applies beyond regulation — engagements can be scoped to medical guidelines, tax authorities, investment research, or other critical-accuracy domains your model serves. See indicative list ↓

Engage as AI Labs partner →

What partners see

Public findings carry executive summaries, per-question contrasts, and cited-source verifications. Partner engagements add the Panel's per-finding root-cause analysis, dominant-mode profile of the model's failures, and full substrate context behind each finding — surfaced through Panel-led review of the partner's own AI model.

Beyond regulation

AI products serving these audiences need to be right. We make sure they are.

Across these critical-accuracy domains, AI products that get it wrong have material consequences. RLB applies the same methodology proven on regulation — test the model against the authoritative substrate, classify failures into the 7+2 taxonomy, and collaborate on remediation.

Substrate domain Target audience for AI products
Regulatory rulesLawyers, compliance, regulators, regulated firms (what we do now)
Medical guidelines (WHO, FDA, NICE)Doctors, nurses, patients, pharma
Tax authorities (IRS, HMRC)Accountants, tax advisers, individuals
Investment research (prospectuses, fund factsheets, SEC filings)Advisers, retail investors, asset managers
Banking product T&Cs, rate sheetsRetail bankers, financial advisers
Drug interaction databasesPharmacists, prescribers
Court precedent / case lawLawyers, paralegals
Building codes, safety standardsEngineers, architects, contractors
Cybersecurity standards (NIST, ISO 27001)Security teams, CISOs, IT auditors
Aviation safety (FAA, EASA, ICAO)Pilots, airlines, maintenance technicians
Clinical trial protocols (ICH-GCP, FDA IND)Clinical researchers, regulatory affairs
Discuss a cross-domain engagement →