AI Hallucination ResearchRegulatorsGlobal standard-settersINTBIS-CPMICPMI-IOSCO-PFMI-2012White paperDetail › Finding
AI Labs · Last updated 7 Jun 2026 · methodology vv2.3 · Hallucination Register

Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON

RLB Citation ID: RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46
What the RLB Specialist Panel found

Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON

  • Question (paraphrased to protect IP): Annex F critical service provider oversight, supervisory scope inverted
  • AI's response: "under the PFMI framework, authorities do not directly supervise or oversee CSPs. The responsibility for ensuring CSP compliance with Annex F expectations rests with the FMI itself; the FMI is expected to contractually require and monitor its CSPs' adherence. Annex F is framed as expectations that flow from the FMI to its CSPs"
  • Regulator's text: A regulator, supervisor, or overseer of an FMI may want to establish expectations for an FMI's critical service providers in order to support the FMI's overall safety and efficiency. The expectations outlined below are specifically targeted at critical service providers.
  • Why the AI went wrong: On Annex F, Claude Sonnet 4.6 with web search inverted the regulator's stated scope, asserting that authorities do not directly supervise or oversee critical service providers and that Annex F's expectations "flow from the FMI to its CSPs." Annex F's opening text expressly contemplates the opposite: a regulator, supervisor, or overseer of an FMI may want to establish expectations directed at CSPs, and the outlined expectations are "specifically targeted at critical service providers." The inversion is structural rather than textual, the model converted a regulator-to-CSP oversight channel into an FMI-internalised contractual obligation, and is the kind of failure that would not surface in standard text-completion evaluations because the surface form of the answer is internally coherent. A probe specifically on Annex F's scope-direction language, tested against the model's default framing of FMI-CSP supervisory relationships, would expose whether the inversion is model-specific or a corpus-level pattern.
  • Citation ID: RLB-F-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011
  • Cited source(s):
Impact for this audience

On Annex F, Claude Sonnet 4.6 with web search inverted the regulator's stated scope, asserting that authorities do not directly supervise or oversee critical service providers and that Annex F's expectations "flow from the FMI to its CSPs." Annex F's opening text expressly contemplates the opposite: a regulator, supervisor, or overseer of an FMI may want to establish expectations directed at CSPs, and the outlined expectations are "specifically targeted at critical service providers." The inversion is structural rather than textual — the model converted a regulator-to-CSP oversight channel into an FMI-internalised contractual obligation — and is the kind of failure that would not surface in standard text-completion evaluations because the surface form of the answer is internally coherent.

A probe specifically on Annex F's scope-direction language, tested against the model's default framing of FMI-CSP supervisory relationships, would expose whether the inversion is model-specific or a corpus-level pattern.

References — raw findings (per AI model)
This finding also affects
← Previous finding Finding on 'Q022 Probe' for Claude Opus 4.7 with web search ON Next finding → Finding on 'Q022 Probe' for Claude Sonnet 4.6 with web search ON
Cite this finding

Each finding has a stable Citation ID (RLB-F-… for aggregated case-study findings, RLB-H-… for raw per-model hallucinations) — like a DOI, the ID always resolves to the canonical finding even if URLs change.

RLB Citation ID: RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46
Plain text Download
RegLeg Specialist Panel (2026). "Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON — AI Labs." Citation ID: RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46. RegLegBrief AI Hallucination Research, published 2026-06-07. https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-iosco-pfmi-2012/whitepaper/finding/INT-BIS-CPMI-INT-001-CPMI-IOSCO-PFMI-2012-v1-011--sonnet-46-websearch/
APA 7th edition Download
RegLeg Specialist Panel. (2026). Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON [Hallucination finding RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46]. RegLegBrief AI Hallucination Research. https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-iosco-pfmi-2012/whitepaper/finding/INT-BIS-CPMI-INT-001-CPMI-IOSCO-PFMI-2012-v1-011--sonnet-46-websearch/
Bluebook / OSCOLA (US + UK legal) Download
RegLeg Specialist Panel, Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON [RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46], RegLegBrief AI Hallucination Research (June 07, 2026), https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-iosco-pfmi-2012/whitepaper/finding/INT-BIS-CPMI-INT-001-CPMI-IOSCO-PFMI-2012-v1-011--sonnet-46-websearch/.
BibTeX Download
@misc{reglegbrief_RLB_H_INT_BIS_CPMI_IOSCO_PFMI_2012_Q011_Sonnet46,
  author    = {RegLeg Specialist Panel},
  title     = {Finding on 'Q011 Probe' for Claude Sonnet 4.6 with web search ON},
  year      = {2026},
  publisher = {RegLegBrief AI Hallucination Research},
  note      = {Hallucination finding Citation ID: RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q011-Sonnet46},
  url       = {https://reglegbrief.com/regulators/j1/int/bis-cpmi/cpmi-iosco-pfmi-2012/whitepaper/finding/INT-BIS-CPMI-INT-001-CPMI-IOSCO-PFMI-2012-v1-011--sonnet-46-websearch/}
}
← Back to case study summary Case study detail →

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.