AI Hallucination on Harmonised ISO 20022 Data Requirements for Enhancing Cross-Border Payments - Updated Report for Compliance teams at Statutory Boards & Agencies firms in international jurisdictions

Compliance teams at Statutory Boards & Agencies responsible for payments infrastructure exposure to the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) are increasingly using AI to draft formal regulatory submissions to central banks or supranational bodies, generate board papers on payments infrastructure benchmarks, and prepare gap-analysis documents against peer adoption rates. The same tools validate citation accuracy in finance-ministry-facing briefings.

Two frontier AI models tested by the RLB Specialist Panel on the workflows statutory-agency compliance officers use to support advice on the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) produced two discrete hallucinations bound to regulator-issued source text. The Panel records a single recurring failure class: Numeric Drift across the set. Questions were prepared by the Specialist Panel based on real practical AI usage in the workflows statutory-agency compliance officers use AI for, and each finding is bound to verbatim regulator-issued source text held as primary substrate.

For Compliance teams at Statutory Boards & Agencies, each hallucination has a direct operational consequence in the regulatory submission, gap-analysis document, or finance-ministry briefing. The Panel's testing surfaces ISO 20022 adoption rate conflation (RTGS vs faster payments), and ISO 20022 adoption rate conflation (RTGS vs faster payments). Where these errors flow into a deliverable, the exposure is a credibility-damaging factual discrepancy in a formal submission, a forced retraction or amended filing, and a misstated baseline for gap analysis presented to the governing board.

The pattern is uniform across the set: the AI returns a confident, sourced-looking answer that conflicts in a load-bearing specific with the regulator's verbatim text, and the error survives a first-pass review precisely because the surface form is plausible. The Panel records each hallucination with the regulator's primary substrate held as the anchor, so the corrective text is available alongside the failure.

The Specialist Panel records the citation IDs as follows: RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47 (Claude Opus 4.7 (web search on), Numeric Drift); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46 (Claude Sonnet 4.6 (web search on), Numeric Drift). Each citation links to the verbatim regulator-issued source text, the tested AI question, and the recorded AI response, so the Panel's assessment is traceable end to end. For compliance teams at statutory boards & agencies, the citation IDs operate as a reference index: when an AI answer in the working draft matches a known Panel finding, the cited regulator text is already available as the corrective anchor.

The full per-finding analysis cards, including the audience-specific impact statement, sit on the cell's detail surface for sign-off use.

This is the consolidated view of findings. Click the Citation IDs or 'see details →' on any item for the full details for each finding.

ISO 20022 adoption rate conflation: RTGS vs faster payments (Opus 4.7)

RLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47

A Compliance team relying on AI to retrieve ISO 20022 adoption statistics for a board paper or regulatory briefing would receive a figure of approximately 79% applied uniformly to both RTGS and faster payment systems, materially overstating RTGS adoption, which the CPMI monitoring data places at approaching half rather than close to three-quarters. For a Statutory Boards & Agencies firm whose compliance obligations centre on RTGS infrastructure, this error misrepresents the peer benchmark and sets the wrong baseline for any gap analysis or implementation-timeline justification presented to the governing board or finance ministry counterpart.

If the figure flows into a formal regulatory submission or is cited in a response to a central bank or supranational body that works from the same CPMI source, the firm risks a credibility-damaging factual discrepancy that is immediately verifiable. Correction after submission requires a formal retraction or amended filing, a reputational and operational cost entirely avoidable had the statistic been verified against the primary CPMI source before the deliverable was finalised.

see details →

ISO 20022 adoption rate conflation: RTGS vs faster payments (Sonnet 4.6)

RLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46

Claude Sonnet 4.6 with web search exhibited the same blended 79% adoption figure as the Opus 4.7 test on the matching question, collapsing the regulator's separately-stated faster-payment and RTGS rates into a single composite. The structural exposure for this audience is identical to the Opus variant: a Compliance team relying on AI to retrieve ISO 20022 adoption statistics for a board paper or regulatory briefing would receive a figure of approximately 79% applied uniformly to both RTGS and faster payment systems, materially overstating RTGS adoption, which the CPMI monitoring data places at approaching half rather than close to three-quarters.

For a Statutory Boards & Agencies firm whose compliance obligations centre on RTGS infrastructure, this error misrepresents the peer benchmark and sets the wrong baseline for any gap analysis or implementation-timeline justification presented to the governing board or finance ministry counterpart. If the figure flows into a formal regulatory submission or is cited in a response to a central bank or supranational body that works from the same CPMI source, the firm risks a credibility-damaging factual discrepancy that is immediately verifiable.

Correction after submission requires a formal retraction or amended filing, a reputational and operational cost entirely avoidable had the statistic been verified against the primary CPMI source before the deliverable was finalised.

see details →

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.