AI Hallucination ResearchFindings by audiencePractitionersInternational / MultilateralLawyersDetail › Finding
Practitioners — Lawyers · Last updated 11 Jun 2026 · Hallucination Register
Share / Print X LinkedIn Email

Finding#1, SARB pre-validation partnership downgraded to speculation

RLB Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007
What the RLB Specialist Panel found
For Claude Opus 4.7 (web search on)
Question (paraphrased to protect IP)

An advisor to a South African financial institution needs to know which central banks are actively involved in piloting or implementing specific API harmonisation recommendations, and whether any central bank is specifically named as partnering with CPMI on the payment pre-validation API recommendation.

RLB's analysis

The model retrieved enough surface signal to identify SARB and the payment pre-validation recommendation but treated retrieval-uncertainty as truth-uncertainty, producing a hedge where CPMI Brief No. 9 had already removed the uncertainty by naming the partnership explicitly. The model's calibration here is biased toward hedge when it cannot directly verify a retrieved fact, even when the regulator has published the verifying source.

AI Head's analysis — what weakness in the AI model caused this

The model retrieved the SARB-CPMI pre-validation partnership context — it had enough signal to know the question was about a named central-bank collaboration on a specific recommendation — but downgraded a regulator-confirmed fact to speculative hedge phrasing ('plausible but unverified'). This is calibration drift in a direction that looks like safety: the model hedged where it should have committed. The training and retrieval pipeline appears to penalise commitment without verifying retrieval, producing under-confidence on facts the regulator has itself published.

For evaluators, the high-value probe is the asymmetry: when the AI has retrieved enough to identify the entity at issue (SARB, payment pre-validation, CPMI Brief No. 9), commitment behaviour should match retrieval depth rather than defaulting to hedge. A confidence-calibration eval that scores against regulator-published source text would surface this drift before it ships to production users.

For Claude Sonnet 4.6 (web search on)
Question (paraphrased to protect IP)

Which central bank is explicitly named as a collaborating partner with CPMI on the payment pre-validation API recommendation from the October 2024 API harmonisation report, and what does that collaboration involve?

RLB's analysis

Sonnet 4.6's web-search loop did not surface CPMI Brief No. 9 (or did surface it without extracting the SARB identification) and the model reported the absence-of-retrieval as an absence-of-fact. The negative answer is presented with the same surface confidence the model uses for verified positive retrieval; nothing in the response signals that the underlying retrieval coverage was incomplete.

AI Head's analysis — what weakness in the AI model caused this

Sonnet 4.6 with web search returned a confident negative — 'available sources do not identify SARB as a named pilot partner' — when CPMI Brief No. 9 (November 2025) explicitly does name SARB. The failure mode is a false-negative retrieval gap presented as a positive knowledge claim. The model's web-search loop either did not surface CPMI Brief No. 9 or surfaced it and did not extract the SARB identification from it; in either case the model treated the absence-of-retrieval as evidence-of-absence rather than as a retrieval-coverage limitation.

For an AI lab, this is a high-value alignment probe: confident negatives on entity-level regulatory questions — 'no named partner exists', 'no specific date is published' — should be evaluated against a corpus of regulator-published material to determine the false-negative rate. The same retrieval pattern almost certainly produces similar false-negatives across other regulator briefs.

Impact for Lawyers in international jurisdictions advising on the Promoting the Harmonisation of Application Programming Interfaces to Enhance Cross-Border Payments: Recommendations and Toolkit

A lawyers lead who asks Claude Opus 4.7 to confirm which central bank CPMI has formally partnered with on the payment pre-validation API receives a hedge that reads as cautious analytic work: the AI says 'plausible but unverified.' In fact CPMI Brief No. 9 published in November 2025 explicitly names the South African Reserve Bank as that partner.

The lawyers team that treats the AI hedge as settled fact mis-scopes its Africa-corridor regulatory engagement, drops SARB from its watch list of named CPMI counterparties, and risks regulatory enforcement exposure when an examiner asks why CPMI Brief No. 9 was not in the regulatory horizon-scan.

References — raw findings (per AI model)
This finding also affects
Next finding → Finding#2, SARB pre-validation partnership denied
Cite this finding

Each finding has a stable Citation ID (RLB-F-… for aggregated case-study findings, RLB-H-… for raw per-model hallucinations) — like a DOI, the ID always resolves to the canonical finding even if URLs change.

RLB Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007
Bluebook / OSCOLA (US + UK legal) Download
RegLeg Specialist Panel, Finding#1, SARB pre-validation partnership downgraded to speculation [RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007], RegLegBrief AI Hallucination Research (June 11, 2026), https://reglegbrief.com/regulators/j1/int/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/practitioners/lawyers/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/.
Plain text Download
RegLeg Specialist Panel (2026). "Finding#1, SARB pre-validation partnership downgraded to speculation — Practitioners — Lawyers." Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007. RegLegBrief AI Hallucination Research, published 2026-06-11. https://reglegbrief.com/regulators/j1/int/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/practitioners/lawyers/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/
APA 7th edition Download
RegLeg Specialist Panel. (2026). Finding#1, SARB pre-validation partnership downgraded to speculation [Hallucination finding RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007]. RegLegBrief AI Hallucination Research. https://reglegbrief.com/regulators/j1/int/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/practitioners/lawyers/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/
BibTeX Download
@misc{reglegbrief_RLB_F_INT_BIS_CPMI_API_HARMONISATION_CROSS_BORDER_2024_Q007,
  author    = {RegLeg Specialist Panel},
  title     = {Finding#1, SARB pre-validation partnership downgraded to speculation},
  year      = {2026},
  publisher = {RegLegBrief AI Hallucination Research},
  note      = {Hallucination finding Citation ID: RLB-F-INT-BIS-CPMI-API-HARMONISATION-CROSS-BORDER-2024-Q007},
  url       = {https://reglegbrief.com/regulators/j1/int/BIS-CPMI/CPMI-API-HARMONISATION-CROSS-BORDER-2024/practitioners/lawyers/finding/INT-BIS-CPMI-INT-001-CPMI-API-HARMONISATION-CROSS-BORDER-2024-v1-007/}
}
← Back to case study summary Case study detail →

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.