Clinical Research Legal teams · BBNJ High Seas Biodiversity Agreement

By Kratti A Agrawal, Lead, RegLeg Brief Specialist Panel

Clinical Research Legal teams: documentation and reporting gaps can arise from AI readings of the BBNJ Agreement

Claude Code charts the hallucination patterns in BBNJ clinical research legal obligations.

— RLB Specialist Panel

Frontier AI models invert the BBNJ Agreement's marine genetic resources retroactivity default.

Two frontier AI subjects tested by the RLB Specialist Panel concluded that the Part II regime applies to legacy collections by default, when Article 10(1) sets the opposite rule.

The pattern in one line

Two frontier AI models tested on the BBNJ Agreement inverted the marine genetic resources retroactivity default, producing legal answers that cited the right article number but reversed the direction of the underlying rule.

How the RLB Specialist Panel tested this

The questions in this cell were prepared by the RLB Specialist Panel to reflect how legal teams at clinical research firms actually use AI in their work under the BBNJ Agreement. Each question targets a specific deliverable type where an AI assistant is plausibly the first draft: a memo, an opinion paragraph, a checklist line, a board-paper bullet, a regulator-facing filing sentence. The Panel issued each question to two frontier AI subjects with web search active.

The Panel then bound every AI response to verbatim regulator-issued source text held as primary substrate, comparing the model output against the deposited treaty text and the regulator-issued source documentation for each provision. Only responses where the AI subject was demonstrably wrong against the verbatim regulator-issued source text are published as findings; responses that were substantively correct, or that refused on calibration grounds, are retained internally and not surfaced.

What the models got wrong

The cell carries a single confirmed finding against the AI subjects on the BBNJ Agreement. It is published against verbatim regulator-issued treaty text and carries explicit model attribution for audit transparency.

Finding 2: Marine genetic resources retroactivity flipped from non-retroactive default to retroactive default. The Specialist Panel asked, in application form, whether the BBNJ Agreement applies its marine genetic resources benefit-sharing obligations to specimens and digital sequence information collected from international waters before the Agreement entered into force. Claude Opus 4.7 with web search active answered that under Article 10(1) the marine genetic resources and digital sequence information provisions apply to utilization of resources that were collected or generated before entry into force, not just after (RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003-Opus47).

Claude Sonnet 4.6 with web search active reached the same substantive conclusion, stating that the benefit-sharing and notification obligations apply to the utilisation of resources collected or generated before the Agreement entered into force on 17 January 2026 (RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003-Sonnet46). The substrate held by the Panel records the opposite position: Article 10(1) limits the Part II regime to marine genetic resources and digital sequence information collected and generated after entry into force for each Party, with non-retroactivity as the default.

This is the highest-stakes class of error in the cell because the substantive direction of the obligation is inverted, not merely the article number.

For legal teams at clinical research firms advising on the BBNJ Agreement, treaty-citation accuracy is load-bearing in legal opinions, contractual representations, due-diligence disclosures, and any pleading or position paper engaging the Agreement. A counterparty or opposing counsel who identifies a misattributed article on first reading calls the entire piece of advice into question. The marine genetic resources retroactivity inversion is the more serious failure: a legal opinion structured around a retroactive-by-default rule when the treaty establishes the opposite default produces fundamentally wrong contract terms and exposes the firm to professional liability if the underlying position is later corrected.

The regulator's actual position

The verbatim regulator-issued source text held by the RLB Specialist Panel as primary substrate for the BBNJ Agreement sets the position as follows. The references below are drawn from the deposited treaty text and are the controlling reference points against which any AI-assisted citation should be validated.

Article 10(1). The marine genetic resources and digital sequence information provisions in Part II apply only to resources collected and generated after the Agreement enters into force for each Party. Non-retroactivity is the default. A number of Parties also recorded formal declarations on this point at deposit. Legacy collections obtained before the Party's date of entry into force are outside the Part II benefit-sharing regime by default; bringing them in requires an affirmative act, not the absence of one.

For legal teams at clinical research firms working with AI on the BBNJ Agreement, the Article 10(1) result is the one to internalise. Citation-style errors are visible to a routine citation check; inverted-position errors are not. The AI subjects pointed at the right article number but reversed the direction of the underlying rule, and a downstream reader running a standard citation-validation workflow would not flag the issue. The only defensive workflow that catches this class of error is a substantive comparison against the deposited treaty text.
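The difference between the two kinds of check can be sketched in a few lines of Python. This is an illustrative sketch only, not the Panel's tooling: the `Claim` record, its field names, and the `SUBSTRATE` table are hypothetical, and the single substrate entry simply restates the Article 10(1) position described above.

```python
from dataclasses import dataclass

# Hypothetical substrate table: article -> default direction of the rule,
# restating the Article 10(1) position described in this briefing.
SUBSTRATE = {"10(1)": "non-retroactive"}


@dataclass(frozen=True)
class Claim:
    """One AI-drafted statement about a treaty provision (hypothetical shape)."""
    cited_article: str
    asserted_default: str  # "retroactive" or "non-retroactive"


def citation_check(claim: Claim) -> bool:
    """Passes whenever the cited article exists in the substrate."""
    return claim.cited_article in SUBSTRATE


def substantive_check(claim: Claim) -> bool:
    """Passes only when the asserted direction matches the substrate."""
    return SUBSTRATE.get(claim.cited_article) == claim.asserted_default


# The failure mode in this finding: right article, reversed direction.
ai_claim = Claim(cited_article="10(1)", asserted_default="retroactive")
print(citation_check(ai_claim))     # True: the article number is valid
print(substantive_check(ai_claim))  # False: the direction is inverted
```

The citation check has nothing to object to, because the article number is correct; only the comparison of the asserted direction against the source exposes the error.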

The practitioner takeaway: never rely on AI to characterise the direction of an obligation under a young multilateral instrument without a substantive read of the underlying provision.

What the RLB Specialist Panel is doing about it

The RLB Specialist Panel is engaging with the AI subjects' developers and with practitioner audiences working under the BBNJ Agreement. The Panel maintains an audit register of confirmed hallucinations bound to verbatim regulator-issued source text, surfaces them on the live regulation page and on each audience-specific briefing, and accepts right-of-reply submissions from the AI subjects' developers and from regulator-side reviewers.

For legal teams at clinical research firms this means the same questions can be re-issued against successor model releases; the bound substrate makes it straightforward to verify whether a specific failure mode has been corrected upstream, or whether the same hallucination is still being produced. Partnership briefings with AI labs are offered against the audit register, not against synthesised demonstrations, so the corrections that matter are evidenced against treaty text rather than against a paraphrase chain.

The register is structured so that each finding records the question put to the AI subject, the AI subject's verbatim answer, the verbatim regulator-issued source text the answer was bound against, the named model and configuration, and the failure mode. That structure lets practitioner readers see exactly where the AI subject diverged from the treaty text without re-doing the underlying verification, and lets AI lab readers see exactly which provision and which phrasing produced the divergence.
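As a rough illustration of that record structure, the fields could be modelled as follows. This is a sketch under stated assumptions, not the register's actual schema: the class name, field names, the `citation_id` field, and the `missing_fields` helper are hypothetical, and the question, answer, and source fields are left as placeholders because the verbatim texts are not reproduced in this briefing.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class RegisterEntry:
    """One audit-register finding (hypothetical field names)."""
    citation_id: str    # assumed key; the briefing cites findings by ID
    question: str       # question put to the AI subject
    model_answer: str   # the AI subject's verbatim answer
    source_text: str    # verbatim regulator-issued source text
    model: str          # named model
    configuration: str  # e.g. whether web search was active
    failure_mode: str   # how the answer diverged from the source


def missing_fields(entry: RegisterEntry) -> list[str]:
    """Return the names of any fields left empty or whitespace-only."""
    return [name for name, value in asdict(entry).items() if not value.strip()]


entry = RegisterEntry(
    citation_id="RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003-Opus47",
    question="<question as put to the AI subject>",
    model_answer="<verbatim model answer>",
    source_text="<verbatim Article 10(1) text>",
    model="Claude Opus 4.7",
    configuration="web search active",
    failure_mode="retroactivity default inverted",
)

print(missing_fields(entry))
print(json.dumps(asdict(entry), indent=2))
```

Keeping every field mandatory means an entry cannot be serialised for publication while the source text or the model attribution is still blank.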

Where a hallucination has been corrected in a successor model release, the register records the rerun and withdraws the finding; where it persists, the finding stays live. This makes the register useful as a continuous-improvement signal for the AI labs and as a defensive checklist for practitioners drawing on AI in regulated workflows.
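The rerun rule can be sketched as a small status transition. The sketch assumes a two-state lifecycle (live or withdrawn) inferred from the description above; the `Finding` class, the `record_rerun` function, and the model labels are hypothetical and do not describe the Panel's implementation.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    LIVE = "live"
    WITHDRAWN = "withdrawn"


@dataclass
class Finding:
    """Minimal finding with a rerun history (hypothetical shape)."""
    citation_id: str
    status: Status = Status.LIVE
    reruns: list[tuple[str, bool]] = field(default_factory=list)


def record_rerun(finding: Finding, model: str, still_wrong: bool) -> Finding:
    """Log a rerun against a successor model and update the status.

    The finding stays live while the hallucination persists and is
    withdrawn once a rerun shows it has been corrected.
    """
    finding.reruns.append((model, still_wrong))
    finding.status = Status.LIVE if still_wrong else Status.WITHDRAWN
    return finding


finding = Finding(citation_id="Q003-example")  # placeholder ID
record_rerun(finding, model="successor-model-a", still_wrong=True)
print(finding.status)  # Status.LIVE
record_rerun(finding, model="successor-model-b", still_wrong=False)
print(finding.status)  # Status.WITHDRAWN
```

Keeping the rerun history alongside the status preserves the audit trail even after a finding is withdrawn.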

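For legal teams at clinical research firms drawing on AI in workflows that touch the BBNJ Agreement, the practical action is direct: check any AI-drafted statement about the direction of an obligation against the deposited treaty text before it enters a deliverable.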


Right of Reply

These findings and the associated work are published in the interest of developing a safer AI ecosystem. Any party reading this or any other finding on reglegbrief.com may contact us and has an unconditional right of reply; the Specialist Panel will publish any factual correction or contextual response alongside the original finding, with no editorial gatekeeping. Researchers, regulators, and compliance teams with questions on methodology or specific findings can reach the Specialist Panel via the same channel.

Source & Methodology Standards

RegLeg Brief is operated by Verdus Technologies Pte. Ltd. (UEN 201616982R), incorporated in Singapore. The RLB Specialist Panel, with an aggregate of over 60 years of public-policy and industry experience, documents only confirmed hallucination findings, under a methodology that requires a verbatim regulator excerpt for every documented claim. All findings, citation IDs, model outputs, regulator excerpts, and methodology notes are open-access.


Primary source verified: UN BBNJ Agreement (2023), Agreement on the Conservation and Sustainable Use of Marine Biological Diversity of Areas Beyond National Jurisdiction · Substrate documents: p_01_ACT_Part_III___Article_22_2____non_undermini_text-bbnj-agreement.html · UN portal: documents.un.org

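Citation IDs referenced: RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003-Opus47; RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003-Sonnet46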
