AI Hallucination ResearchFindings by audienceSectorsInternational / MultilateralRetail BankingLegalDetail › Finding
Retail Banking × Legal — International / Multilateral · Last updated 11 Jun 2026 · Hallucination Register
Share / Print X LinkedIn Email

SARB pre-validation partnership denied

RLB Citation ID: RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47
What the RLB Specialist Panel found
For Claude Opus 4.7 (web search on)
Question (paraphrased to protect IP)

An advisor to a South African financial institution needs to know which central banks are actively involved in piloting or implementing specific API harmonisation recommendations, and whether any central bank is specifically named as partnering with CPMI on the payment pre-validation API recommendation.

RLB's analysis

The model retrieved enough surface signal to identify SARB and the payment pre-validation recommendation but treated retrieval-uncertainty as truth-uncertainty, producing a hedge where CPMI Brief No. 9 had already removed the uncertainty by naming the partnership explicitly. The model's calibration here is biased toward hedge when it cannot directly verify a retrieved fact, even when the regulator has published the verifying source.

AI Head's analysis — what weakness in the AI model caused this

The model retrieved the SARB-CPMI pre-validation partnership context — it had enough signal to know the question was about a named central-bank collaboration on a specific recommendation — but downgraded a regulator-confirmed fact to speculative hedge phrasing ('plausible but unverified'). This is calibration drift in a direction that looks like safety: the model hedged where it should have committed. The training and retrieval pipeline appears to penalise commitment without verifying retrieval, producing under-confidence on facts the regulator has itself published.

For evaluators, the high-value probe is the asymmetry: when the AI has retrieved enough to identify the entity at issue (SARB, payment pre-validation, CPMI Brief No. 9), commitment behaviour should match retrieval depth rather than defaulting to hedge. A confidence-calibration eval that scores against regulator-published source text would surface this drift before it ships to production users.

For Claude Sonnet 4.6 (web search on)
Question (paraphrased to protect IP)

Which central bank is explicitly named as a collaborating partner with CPMI on the payment pre-validation API recommendation from the October 2024 API harmonisation report, and what does that collaboration involve?

RLB's analysis

Sonnet 4.6's web-search loop did not surface CPMI Brief No. 9 (or did surface it without extracting the SARB identification) and the model reported the absence-of-retrieval as an absence-of-fact. The negative answer is presented with the same surface confidence the model uses for verified positive retrieval; nothing in the response signals that the underlying retrieval coverage was incomplete.

AI Head's analysis — what weakness in the AI model caused this

Sonnet 4.6 with web search returned a confident negative — 'available sources do not identify SARB as a named pilot partner' — when CPMI Brief No. 9 (November 2025) explicitly does name SARB. The failure mode is a false-negative retrieval gap presented as a positive knowledge claim. The model's web-search loop either did not surface CPMI Brief No. 9 or surfaced it and did not extract the SARB identification from it; in either case the model treated the absence-of-retrieval as evidence-of-absence rather than as a retrieval-coverage limitation.

For an AI lab, this is a high-value alignment probe: confident negatives on entity-level regulatory questions — 'no named partner exists', 'no specific date is published' — should be evaluated against a corpus of regulator-published material to determine the false-negative rate. The same retrieval pattern almost certainly produces similar false-negatives across other regulator briefs.

Impact for Legal Teams in Retail Banking Sector in international jurisdictions working with the Promoting the Harmonisation of Application Programming Interfaces to Enhance Cross-Border Payments: Recommendations and Toolkit

Retail-bank counsel reads CPMI material when drafting the scheme-membership agreement update, the remittance-partner change-of-law clause, and the consumer-disclosure schedule attached to cross-border products. Opus 4.7 denies any CPMI statement names SARB as the pre-validation pilot partner. CPMI Brief No. 9 (Nov 2025) names SARB outright. A scheme-membership update or remittance-partner clause that relies on the AI denial misses a documented regulator workstream the counterparty already has on its roadmap.

References — raw findings (per AI model)
This finding also affects
Cite this finding

Each finding has a stable Citation ID (RLB-F-… for aggregated case-study findings, RLB-H-… for raw per-model hallucinations) — like a DOI, the ID always resolves to the canonical finding even if URLs change.

RLB Citation ID: RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47
Plain text Download
RegLeg Specialist Panel (2026). "SARB pre-validation partnership denied — Retail Banking × Legal — International / Multilateral." Citation ID: RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47. RegLegBrief AI Hallucination Research, published 2026-06-11. https://reglegbrief.com/regulators/j1/INT/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/sectors/retail_banking/legal/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/
APA 7th edition Download
RegLeg Specialist Panel. (2026). SARB pre-validation partnership denied [Hallucination finding RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47]. RegLegBrief AI Hallucination Research. https://reglegbrief.com/regulators/j1/INT/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/sectors/retail_banking/legal/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/
Bluebook / OSCOLA (US + UK legal) Download
RegLeg Specialist Panel, SARB pre-validation partnership denied [RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47], RegLegBrief AI Hallucination Research (June 11, 2026), https://reglegbrief.com/regulators/j1/INT/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/sectors/retail_banking/legal/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/.
BibTeX Download
@misc{reglegbrief_RLB_H_INT_BIS_CPMI_API_HARMONISATION_CROSS_BORDER_2024_Q007_Opus47,
  author    = {RegLeg Specialist Panel},
  title     = {SARB pre-validation partnership denied},
  year      = {2026},
  publisher = {RegLegBrief AI Hallucination Research},
  note      = {Hallucination finding Citation ID: RLB-H-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007-Opus47},
  url       = {https://reglegbrief.com/regulators/j1/INT/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/sectors/retail_banking/legal/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/}
}
← Back to case study summary Case study detail →

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.