AI Hallucination ResearchFindings by audienceSectorsInternational / MultilateralPayment InstitutionsCompliance › Harmonised ISO 20022 Data Requirements for Enhancing Cross-Border Payments - Updated Report
Payment Institutions × Compliance — International / Multilateral · Last updated 11 Jun 2026 · methodology v2.3 · Hallucination Register
Share / Print X LinkedIn Email

AI Hallucination on Harmonised ISO 20022 Data Requirements for Enhancing Cross-Border Payments - Updated Report for Compliance teams at Payment Institutions firms in international jurisdictions

Payment Institutions Compliance teams: documentation and reporting gaps possible from AI reading of CPMI ISO 20022 Harmonisation (2026 update)

Compliance teams at Payment Institutions operating under the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) are increasingly using AI to draft regulatory horizon-scanning records on adoption progress, generate correspondent-network readiness assessments, and validate the postal-address mapping in the firm's ISO 20022 message structure. The same tools prepare supervisor-facing descriptions of ISO 20022 readiness.

Two frontier AI models tested by the RLB Specialist Panel on the workflows payment-institution compliance officers use to support advice on the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) produced three discrete hallucinations bound to regulator-issued source text. The Panel records two distinct failure classes, Numeric Drift and Schema Over-Specification across the set. Questions were prepared by the Specialist Panel based on real practical AI usage in the workflows payment-institution compliance officers use AI for, and each finding is bound to verbatim regulator-issued source text held as primary substrate.

For Compliance teams at Payment Institutions, each hallucination has a direct operational consequence in the horizon-scanning record, network-readiness assessment, or supervisor-facing readiness description. The Panel's testing surfaces ISO 20022 adoption rate conflation (RTGS vs faster payments), Fedwire hybrid postal address schema over-specification, and ISO 20022 adoption rate conflation (RTGS vs faster payments). Where these errors flow into a deliverable, the exposure is skewed correspondent-network readiness picture, over-specified vendor due-diligence criteria, and a discoverable error in the firm's regulatory record.

The pattern is uniform across the set: the AI returns a confident, sourced-looking answer that conflicts in a load-bearing specific with the regulator's verbatim text, and the error survives a first-pass review precisely because the surface form is plausible. The Panel records each hallucination with the regulator's primary substrate held as the anchor, so the corrective text is available alongside the failure.

The Specialist Panel records the citation IDs as follows: RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47 (Claude Opus 4.7 (web search on), Numeric Drift); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q010-Opus47 (Claude Opus 4.7 (web search on), Schema Over-Specification); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46 (Claude Sonnet 4.6 (web search on), Numeric Drift). Each citation links to the verbatim regulator-issued source text, the tested AI question, and the recorded AI response, so the Panel's assessment is traceable end to end.

For compliance teams at payment institutions, the citation IDs operate as a reference index: when an AI answer in the working draft matches a known Panel finding, the cited regulator text is already available as the corrective anchor. The full per-finding analysis cards, including the audience-specific impact statement, sit on the cell's detail surface for sign-off use.

Executive Summary

Compliance teams at Payment Institutions firms operating across international jurisdictions are actively working through the CPMI's harmonised ISO 20022 data requirements, mapping their message flows, updating correspondent relationships, and interpreting CPMI guidance on adoption timelines and governance structures. Across the two questions we tested AI tools on for this regulation, AI assistants produced wrong answers on both. The failures span misattributing institutional governance roles to the wrong central bank, with a fabricated named individual inserted for credibility, and conflating two materially distinct adoption-rate statistics into a single invented figure.

Both errors produce wrong deliverables: internal briefings, regulatory horizon-scanning summaries, or business-line responses that carry authoritative-sounding misinformation straight into firm decision-making.

How AI gets this regulation wrong

The AI failures on this regulation fall into two distinct patterns: citing real authoritative sources while misrepresenting what those sources actually say, and collapsing separate statistics into a single invented composite figure. Both modes are particularly dangerous because the responses read as well-sourced and specific, the kind of confident, precise answer a junior analyst would pass straight into a regulatory briefing without a second look.

AI's Failure ModeCountAffected findings
Exposed Fabrication2Finding#1 · Finding#3
Inference Drift1Finding#2

What that means for your team

Both findings map to a single risk category, delivering a wrong work product to an internal or external audience, but the downstream consequences vary: one wrong deliverable misrepresents the regulatory governance chain, the other misrepresents the state of market-wide ISO 20022 adoption in a way that directly distorts a firm's own readiness benchmarking. For a Payment Institutions Compliance team, both failures hit workflows where accuracy is load-bearing: regulatory horizon-scanning, business-line briefings, and correspondent due-diligence.

Risk ImpactCountAffected findings
Wrong deliverable3Finding#1 · Finding#2 · Finding#3

When this affects your department

Compliance teams at international Payment Institutions consult AI tools on the CPMI ISO 20022 harmonisation work most often when building or refreshing regulatory horizon-scanning decks, briefing the business on the global state of adoption, scoping correspondent network reviews against CPMI data-field expectations, or responding quickly to a business-line question about where a particular jurisdiction stands in the implementation timeline.

The governance question, who chairs the CPMI working group, arises in contexts like drafting regulatory engagement briefings, populating compliance registers that map internal workstreams to external counterpart bodies, or preparing materials for board-level regulatory updates where attribution errors embarrass the firm in front of senior stakeholders.

The adoption-rate question is more operationally consequential. When Compliance builds a readiness benchmark or supports Treasury's correspondent-bank due-diligence against a "where is the market" baseline, the difference between more than three-quarters of faster payment systems versus approaching half of RTGS systems is not cosmetic, it tells a firm whether its RTGS counterparts are outliers or the norm. Collapsing both rates into a single 79% figure leads a firm to overestimate RTGS adoption across its network, potentially deprioritising pressure on correspondent banks that are, in reality, still materially behind the curve.

The risk is not hypothetical compliance theatre. Payment Institutions operating internationally face regulatory expectations from multiple bodies simultaneously, and their Compliance teams are routinely asked to produce authoritative-sounding briefings under time pressure. AI tools fill that gap fast, but when the output carries wrong institutional attribution or materially wrong market statistics, the firm's credibility with regulators, correspondent banks, and internal governance bodies is on the line.

The findings at a glance

The two findings below cover distinct question types, one on regulatory governance, one on market adoption statistics, but both produced wrong outputs that would land in firm deliverables unchallenged without source verification.

#Finding titleTypeCitation ID
1ISO 20022 adoption rate conflation: RTGS vs faster payments (Opus 4.7)HallucinationRLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47
2Fedwire hybrid postal address schema over-specificationHallucinationRLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q010-Opus47
3ISO 20022 adoption rate conflation: RTGS vs faster payments (Sonnet 4.6)HallucinationRLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46

Aggregate impact

The two findings in this cell cluster around different knowledge categories, institutional governance and market statistics, but share a common failure signature: AI tools on this regulation produce specific, citation-adjacent responses that are wrong in ways that are not immediately visible to the reader. Neither error is vague or hedged in the AI output; both read as confident, authoritative, and sourced. That is what makes them systematically dangerous for a Compliance team under time pressure.

On governance, the error is a clean misattribution, the wrong central bank named, reinforced by the insertion of a named individual in that role. The Reserve Bank of Australia chairs the CPMI messaging workstream; the AI placed that role with the Federal Reserve Bank of New York and fabricated a named co-lead. For a Compliance team producing a regulatory engagement briefing or an internal governance map, this is the kind of error that gets cited at a senior committee and only surfaces when a counterpart with first-hand knowledge of the CPMI structure pushes back.

On market statistics, the error is a data conflation that flatters the market's readiness. The CPMI's own monitoring data reports more than three-quarters of faster payment systems and approaching half of RTGS systems using ISO 20022, two materially different rates reflecting the well-understood lag in RTGS migration. The AI collapsed both into a single 79% figure for both system types.

For a Payment Institutions Compliance team benchmarking its correspondent network or advising business lines on realistic adoption timelines, overstating RTGS adoption by approximately 30 percentage points is not a rounding error, it is a misjudgement of where the market actually is.

What your team should do

The default position for Compliance teams using AI tools on this regulation should be: AI is useful for orienting to document structure, summarising publicly available CPMI policy papers, and drafting initial frameworks, but every institutional attribution and every statistic must be sourced back to the primary CPMI document or the relevant central bank's public record before it appears in any deliverable. That is not a counsel of perfection; it reflects the specific failure pattern these findings expose. The errors are not in nuanced interpretive territory, they are in factual claims that have authoritative primary sources that take seconds to check.

For governance questions, the single reliable source is the CPMI's own published documentation and the RBA's public communications confirming its role chairing the messaging workstream. If a Compliance team is building a regulatory engagement register or preparing senior committee materials, the named individual holding a CPMI co-chair role should be verified against the RBA's current public record, not taken from an AI response. Personnel in those roles change; AI training data does not update in real time, and, as this finding shows, the AI may have the wrong institution entirely.

For market adoption statistics, the CPMI's monitoring reports and senior CPMI official speeches are the primary sources. Andrew Bailey's March 2026 speech provides the specific two-figure breakdown, more than three-quarters of faster payment systems, approaching half of RTGS systems, and that distinction matters operationally for a firm benchmarking its correspondent network. AI tools are appropriate for flagging that CPMI documents monitoring data and directing the team to where to look, but the figures themselves should come from the source document.

The CPMI data is publicly available; the verification cost is low relative to the cost of mis-scoping a correspondent review or over-assuring a board that the RTGS landscape is further ahead than it is.

How RLB Can Help

RegLeg's published Hallucination Research gives Compliance teams at Payment Institutions firms a practical pre-flight check before placing reliance on AI-assisted output for regulatory questions. Each research entry documents the specific ways AI tools have mis-stated requirements, cited non-existent provisions, or conflated obligations across jurisdictions, giving your team a structured basis for calibrating confidence rather than discovering errors after the fact.

Beyond the published research, RegLeg works with Compliance functions to map which AI-supported workflows carry the highest hallucination exposure for a Payment Institutions firm specifically. Licensing and authorisation timelines, safeguarding and prudential thresholds, cross-border passporting conditions, and AML/CFT obligations each present distinct failure patterns. A bespoke regulator deep-dive surfaces where those patterns are most acute for your operating footprint, so resource and oversight effort is directed where the actual risk sits.

RegLeg can also conduct a confidential review of your firm's existing AI-use policy against our failure-mode catalogue, producing a prioritised remediation plan aligned to the regulatory obligations your Compliance team is already accountable for.

For teams building internal capability, RegLeg develops training material and CPD-aligned content that translates the research into practical guidance, covering how to read AI output critically, what hallucination signals to look for in a regulatory context, and how to document reliance decisions in a way that will withstand supervisory scrutiny. The aim is to leave your Compliance function better equipped to use AI tools responsibly, with the institution's own risk tolerance and regulatory relationships intact.

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.