Accountants advising on the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) are increasingly using AI to model the ROI of ISO 20022 migration for client business cases, populate audit working papers with regulator-sourced inquiry-rate benchmarks, and validate adoption-rate citations in advisory deliverables. The same tools draft sections of client board memos that depend on official CPMI or FSB quantitative anchors.
Two frontier AI models tested by the RLB Specialist Panel on the workflows accountants use to support advice on the CPMI Harmonised ISO 20022 Data Requirements (Updated Report) produced three discrete hallucinations bound to regulator-issued source text. The Panel records two distinct failure classes, False-Negative Retrieval and Numeric Drift across the set. Questions were prepared by the Specialist Panel based on real practical AI usage in the workflows accountants use AI for, and each finding is bound to verbatim regulator-issued source text held as primary substrate.
For Accountants (CA/PA), each hallucination has a direct operational consequence in the audit working paper, advisory memo, or client board paper. The Panel's testing surfaces ISO 20022 adoption rate conflation (RTGS vs faster payments), missing inquiry-rate and resolution-time benchmarks, and ISO 20022 adoption rate conflation (RTGS vs faster payments). Where these errors flow into a deliverable, the exposure is misstated peer benchmark, weakened investment case, and credibility loss when clients verify the underlying source.
The pattern is uniform across the set: the AI returns a confident, sourced-looking answer that conflicts in a load-bearing specific with the regulator's verbatim text, and the error survives a first-pass review precisely because the surface form is plausible. The Panel records each hallucination with the regulator's primary substrate held as the anchor, so the corrective text is available alongside the failure.
The Specialist Panel records the citation IDs as follows: RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Opus47 (Claude Opus 4.7 (web search on), Numeric Drift); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q007-Sonnet46 (Claude Sonnet 4.6 (web search on), False-Negative Retrieval); RLB-H-INT-BIS-CPMI-ISO-20022-HARMONISATION-UPDATED-2026-Q006-Sonnet46 (Claude Sonnet 4.6 (web search on), Numeric Drift). Each citation links to the verbatim regulator-issued source text, the tested AI question, and the recorded AI response, so the Panel's assessment is traceable end to end. For accountants (ca/pa), the citation IDs operate as a reference index: when an AI answer in the working draft matches a known Panel finding, the cited regulator text is already available as the corrective anchor.
The full per-finding analysis cards, including the audience-specific impact statement, sit on the cell's detail surface for sign-off use.
This is the consolidated view of findings. Click the Citation IDs or 'see details →' on any item for the full details for each finding.
An accountant advising an RTGS operator on competitive positioning or compliance timeline faces a direct exposure: the AI's blended '79% for both' figure overstates RTGS adoption by approximately 30 percentage points relative to the regulator's own 'approaching half' formulation. A client briefing or board memo built on this figure will misrepresent the peer-group baseline and the urgency of the client's migration decision. When the client or their counterparty subsequently references the BIS speech directly, the discrepancy is difficult to explain without acknowledging that the underlying source was not verified.
The Panetta speech figures, 1–3% inquiry rate, 5–10 manual touchpoints, up to 80% resolution-time reduction, are precisely the operational metrics that accountants use to model the ROI of ISO 20022 harmonisation for clients evaluating migration investment. An AI that returns 'no official statistic found' on this question, or attributes the 80% figure to commercial intermediaries rather than an FSB official, produces an advisory memo with either a hole in the business case or a misattributed headline metric. Either outcome undermines the credibility of the advice when clients or opposing advisers do their own source checks.
Claude Sonnet 4.6 with web search exhibited the same blended 79% adoption figure as the Opus 4.7 test on the matching question, collapsing the regulator's separately-stated faster-payment and RTGS rates into a single composite. The structural exposure for this audience is identical to the Opus variant: an accountant advising an RTGS operator on competitive positioning or compliance timeline faces a direct exposure: the AI's blended '79% for both' figure overstates RTGS adoption by approximately 30 percentage points relative to the regulator's own 'approaching half' formulation.
A client briefing or board memo built on this figure will misrepresent the peer-group baseline and the urgency of the client's migration decision. When the client or their counterparty subsequently references the BIS speech directly, the discrepancy is difficult to explain without acknowledging that the underlying source was not verified.
Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.