AI Hallucination ResearchBriefings › Briefing
Practitioner INT BIS-CPMI
Lawyers · Principles for Financial Market Infrastructures (PFMI)

By Kratti A Agrawal, Lead, RegLeg Brief Specialist Panel

Lawyers: AI summaries of PFMI (Principles for Financial Market Infrastructures) may understate professional obligations

RLB Specialist Panel catches where AI cognition drifts across the PFMI 2012 principle landscape.

— RLB Specialist Panel

Frontier AI models invented and misattributed PFMI Key Considerations, regulatory-research panel finds

A frontier AI model tested by the RegLeg Brief Specialist Panel misattributed a soft risk-committee recommendation to PFMI Principle 2 Key Consideration 5, which addresses management roles, not committee architecture.

SINGAPORE. Two frontier artificial-intelligence models generated structurally confident but textually wrong reconstructions of the CPMI-IOSCO Principles for Financial Market Infrastructures (PFMI, 2012), the global standard for systemically important payment systems, central counterparties, and securities settlement infrastructures, according to findings documented by the RegLeg Brief Specialist Panel, the regulatory-research function operated by Singapore-incorporated Verdus Technologies Pte. Ltd. The verbatim regulator-issued source text is held as primary substrate (R2-REGULATION-d101a_PFMI_main_text.pdf).

The pattern in one line

For Lawyers, the pattern is this: the model gets the governance topic substantively in range, then anchors the position to a named PFMI Key Consideration or annex that does not support it. The substantive paraphrase reads as competent regulatory analysis; the citation underneath it does not check out against the published PFMI text. The reader cannot see the failure without opening the PFMI document and reading the cited Key Consideration.

How the RLB Specialist Panel tested this

Questions are prepared by the RLB Specialist Panel based on real practical AI usage in the workflows the respective audience uses AI for. The Panel binds each AI finding to verbatim regulator-issued source text held as primary substrate. For PFMI, the Panel ran two question shapes against frontier AI subjects: Specialist Panel direct questions on what the PFMI text states about CSP oversight and on what Principle 2 Key Consideration 6 requires, and Specialist Panel application-style questions framed in the voice of a practitioner preparing a disclosure-framework response or a board terms-of-reference draft.

Two frontier AI subjects were tested with web search enabled. The Panel does not paraphrase model output; the exact strings the models produced are matched against the regulator-issued primary substrate, and only those answers that diverge from the substrate text are recorded as confirmed hallucinations.

What the models got wrong

Finding RLB-H-INT-BIS-CPMI-IOSCO-PFMI-2012-Q022-Sonnet46: Risk-committee recommendation misattributed to KC 5. Claude Sonnet 4.6, with web search enabled, was asked whether PFMI Principle 2 requires or recommends that FMI boards establish a dedicated risk committee, and if so, which Key Consideration contains that recommendation. The model answered that "the PFMI does not expressly mandate a dedicated risk committee as a hard requirement, but Key Consideration 2.5 states that the board should consider establishing a risk committee with a clear mandate." The verbatim KC 5 text is: "5. The roles and responsibilities of management should be clearly specified.

An FMI's management should have the appropriate experience, a mix of skills, and the integrity necessary to discharge their responsibilities for the operation and risk management of the FMI." KC 5 addresses the roles and responsibilities of management. It contains no committee-architecture recommendation. The misattribution embeds a citation that does not check out against the PFMI text.

Why this matters for Lawyers

Lawyers who paste AI output into a legal opinion or a client-facing compliance memo are the population most exposed to a fabricated Key Consideration citation that the reader will check against the PFMI primary text the moment a counterparty or supervisor disputes the position.

For this audience, the work product is client memos on FMI board obligations, legal opinions on PFMI self-assessment responses, drafted board terms of reference, and counterparty representations on governance compliance. Every item on that list travels under the firm's name to a reader who can locate the cited Key Consideration or annex in the published PFMI document and read it.

The failure pattern is not recoverable at the desktop because the model's output reads as a competent governance paragraph: it uses defined terms correctly (FMI, KC, CSP, Annex F), it tracks the PFMI's structural vocabulary, and it states a position the reader expects to read. The error surfaces only when the reader opens the PFMI and locates the cited Key Consideration, at which point Lawyers' draft is exposed as having attributed a position to the regulator's text that the text does not contain.

The regulator's actual position

The verbatim regulator-issued source text held as primary substrate (R2-REGULATION-d101a_PFMI_main_text.pdf) supports the following positions, each of which contradicts the corresponding AI output.

On Principle 2 Key Consideration 5. The published KC 5 text reads: "5. The roles and responsibilities of management should be clearly specified. An FMI's management should have the appropriate experience, a mix of skills, and the integrity necessary to discharge their responsibilities for the operation and risk management of the FMI." The provision addresses the roles, responsibilities, experience, and integrity of management. It does not address committee structure, does not recommend a risk committee, and does not contain the language the model attributed to it.

What this tells us about AI for Lawyers

The PFMI's structural surface (numbered Principles, numbered Key Considerations, lettered annexes) is the feature that makes the document amenable to AI summarisation and is also the feature that makes a citation-level failure invisible at runtime. The models in these findings did not refuse, did not hedge, and did not flag uncertainty about the cited Key Consideration. They selected a Key Consideration number from the model's prior, paraphrased a substantively plausible governance position, and pinned the position to a number that does not support it.

For Lawyers, the implication is that AI-assisted PFMI drafting work requires a separate verification step in which the cited Key Consideration is opened in the regulator's published text and matched against the position the draft attributes to it. The cost of skipping that step lands in the published work product, not in the AI tool.

What the RLB Specialist Panel is doing about it

The RegLeg Brief Specialist Panel records and documents each confirmed hallucination with an immutable RLB Citation ID, a verbatim copy of the AI subject's exact output, the verbatim regulator-issued source text, and a named failure-class label. The records are open-access and ungated, and the Specialist Panel operates with an aggregate of over 60 years of public-policy and industry experience in financial market infrastructure regulation. AI labs whose subjects appear in any finding have an unconditional right of reply; the Specialist Panel will document any factual correction or contextual response alongside the original finding, with no editorial gatekeeping.

The Panel runs the PFMI test bench on a continuous basis: as model versions and PFMI implementation guidance evolve, the same Specialist Panel questions are re-run against the current subjects, and any change in the failure shape is documented to the same citation ID family. The Panel does not redact the exact strings the AI subjects produced, and does not paraphrase the regulator-issued source text; the value of the audit record sits in the verbatim pairing, and that record stays usable for AI engineering teams, supervisors, and practitioners who need to reproduce the finding against their own deployments.

For the PFMI specifically, the Specialist Panel maintains the source binding to the published primary text and expands the bench as additional Principles and Annexes are tested.

What Lawyers teams should do

The verification cost on the front end is measured in minutes per Key Consideration: open the regulator's PFMI document to the cited number, read the verbatim text, and confirm the AI output's substantive claim is supported. The recovery cost on the back end, if the failure reaches the published work product, is measured in supervisor remediation cycles, counterparty disputes, and reputational exposure for the firm and the practitioner who signed off the draft.

The RegLeg Brief Specialist Panel records each finding so that the failure shape is recognisable on sight, and so the verification step can be targeted at the specific failure class the model exhibits on PFMI material rather than at every line of every output.


Right of Reply

These findings and associated work have been put up in public with a view of the greater good for the development of a safer AI ecosystem. Any party reading this or any finding on reglegbrief.com may contact us and have an unconditional right of reply; the Specialist Panel will publish any factual correction or contextual response alongside the original finding, with no editorial gatekeeping. Researchers, regulators, and compliance teams with questions on methodology or specific findings can reach the Specialist Panel via the same channel.

Source & Methodology Standards

RegLeg Brief is operated by Verdus Technologies Pte. Ltd. (UEN 201616982R), incorporated in Singapore. The RLB Specialist Panel, with an aggregate of over 60 years of public-policy and industry experience, documents only confirmed hallucination findings, under a methodology that requires a verbatim regulator excerpt for every documented claim. All findings, citation IDs, model outputs, regulator excerpts, and methodology notes are open-access.


Primary source verified: CPMI-IOSCO PFMI Report d101, Principles for Financial Market Infrastructures (April 2012) · Substrate documents: R2-REGULATION-d101a_PFMI_main_text.pdf · CPMI portal: bis.org/cpmi

Citation IDs referenced:

Read the full findings page — RLB Citation IDs, AI subject answers, and regulator verbatim text →
← Back to all briefings