AI Hallucination on Harmonised ISO 20022 Data Requirements for Enhancing Cross-Border Payments - Updated Report for Product & Business Development teams at Payment Institutions firms in international jurisdictions

Product & Business Development teams at Payment Institutions building cross-border payment propositions under the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) are increasingly using AI to build corridor strategy and investor decks, draft business cases for enrichment investment, and populate partner pitch materials with regulator-sourced inquiry-rate benchmarks. The same tools validate competitive-positioning claims with CPMI adoption data.

Two frontier AI models tested by the RLB Specialist Panel on the workflows payment-institution product and business-development teams use to support advice on the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) produced three discrete hallucinations bound to regulator-issued source text. The Panel records two distinct failure classes, False-Negative Retrieval and Numeric Drift across the set. Questions were prepared by the Specialist Panel based on real practical AI usage in the workflows payment-institution product and business-development teams use AI for, and each finding is bound to verbatim regulator-issued source text held as primary substrate.

For Product & Business Development teams at Payment Institutions, each hallucination has a direct operational consequence in the corridor strategy, investor deck, or business case for enrichment. The Panel's testing surfaces ISO 20022 adoption rate conflation (RTGS vs faster payments), missing inquiry-rate and resolution-time benchmarks, and ISO 20022 adoption rate conflation (RTGS vs faster payments). Where these errors flow into a deliverable, the exposure is reputational damage in partner conversations, an investment case with the wrong source attribution for the headline efficiency metric, and a product roadmap built on a false market assumption.

The pattern is uniform across the set: the AI returns a confident, sourced-looking answer that conflicts in a load-bearing specific with the regulator's verbatim text, and the error survives a first-pass review precisely because the surface form is plausible. The Panel records each hallucination with the regulator's primary substrate held as the anchor, so the corrective text is available alongside the failure.

The Specialist Panel records the citation IDs as follows: RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47 (Claude Opus 4.7 (web search on), Numeric Drift); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q007-Sonnet46 (Claude Sonnet 4.6 (web search on), False-Negative Retrieval); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46 (Claude Sonnet 4.6 (web search on), Numeric Drift). Each citation links to the verbatim regulator-issued source text, the tested AI question, and the recorded AI response, so the Panel's assessment is traceable end to end.

For product & business development teams at payment institutions, the citation IDs operate as a reference index: when an AI answer in the working draft matches a known Panel finding, the cited regulator text is already available as the corrective anchor. The full per-finding analysis cards, including the audience-specific impact statement, sit on the cell's detail surface for sign-off use.

This is the consolidated view of findings. Click the Citation IDs or 'see details →' on any item for the full details for each finding.

ISO 20022 adoption rate conflation: RTGS vs faster payments (Opus 4.7)

RLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47

An AI assistant conflated two materially different adoption rates, more than three-quarters for faster payment systems and approaching half for RTGS, into a single '79% for both' figure, then acknowledged on follow-up that the number had been reconstructed. For a Product & Business Development team building a corridor strategy, partner pitch, or board paper, this figure would appear sourced and coherent; the underlying error (overstating RTGS adoption by roughly 30 percentage points) would only surface when a counterpart cites the correct CPMI data.

The firm faces reputational damage in partner or investor conversations, and any product roadmap decisions premised on near-parity RTGS adoption are built on a false market assumption.

see details →

Missing official inquiry-rate and resolution-time benchmarks

RLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q007-Sonnet46

An AI tool searching for the official CPMI/FSB inquiry-rate and resolution-time benchmarks returned a false negative, reporting no official statistic existed, while misattributing the 80% resolution-time figure it did surface to SWIFT and commercial banks rather than to the FSB co-chair statement where it originates. The Panetta speech data (1-3% inquiry rate, 5-10 manual touchpoints, up to 80% resolution-time reduction) is the authoritative quantitative anchor for the operational ROI case for ISO 20022 enrichment investment.

A Product & Business Development team that uses AI to populate this section of a business case will either leave it blank or carry incorrect source attribution forward, weakening the investment case and creating an attribution error that will be visible to any stakeholder who checks the primary record.

see details →

ISO 20022 adoption rate conflation: RTGS vs faster payments (Sonnet 4.6)

RLB-F-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46

Claude Sonnet 4.6 with web search exhibited the same blended 79% adoption figure as the Opus 4.7 test on the matching question, collapsing the regulator's separately-stated faster-payment and RTGS rates into a single composite. The structural exposure for this audience is identical to the Opus variant: an AI assistant conflated two materially different adoption rates, more than three-quarters for faster payment systems and approaching half for RTGS, into a single '79% for both' figure, then acknowledged on follow-up that the number had been reconstructed.

For a Product & Business Development team building a corridor strategy, partner pitch, or board paper, this figure would appear sourced and coherent; the underlying error (overstating RTGS adoption by roughly 30 percentage points) would only surface when a counterpart cites the correct CPMI data. The firm faces reputational damage in partner or investor conversations, and any product roadmap decisions premised on near-parity RTGS adoption are built on a false market assumption.

see details →

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.