AI Hallucination ResearchAudiencesSectorsInternational / MultilateralClinical ResearchCompliance › BBNJ High Seas Biodiversity Agreement
Clinical Research × Compliance — International / Multilateral · updated 2026-06-11 · methodology v2.3
Share / Print Twitter LinkedIn Email

AI Hallucination on BBNJ High Seas Biodiversity Agreement for Compliance teams at Clinical Research firms in international jurisdictions

Clinical Research Compliance teams: documentation and reporting gaps possible from AI reading of BBNJ Agreement

Compliance teams at clinical research firms are increasingly using AI to update sample-provenance screening checklists, generate research-governance bulletins on the marine genetic resource regime under the BBNJ Agreement, and validate which obligations apply to legacy specimen collections held before treaty entry into force.

The RLB Specialist Panel put a set of practitioner-grade questions on the BBNJ Agreement to two frontier AI models with web search active. Each question is prepared by the Panel based on the workflows that compliance teams at clinical research firms actually use AI for under this treaty, covering the screening threshold for environmental impact assessments under Part IV, the temporal scope of the marine genetic resources and digital sequence information regime under Part II, the benefit-sharing duty for digital sequence information, and the non-undermining duty constraining Conference of the Parties decisions on area-based management tools under Part III.

The Panel then binds every AI response to verbatim regulator-issued source text held as primary substrate, comparing the AI output line-by-line against the deposited treaty text. Only responses where the AI subject was demonstrably wrong against the verbatim regulator-issued source text are published; responses that were substantively correct, or that refused on calibration grounds, are retained internally and not surfaced. On the BBNJ Agreement, the AI subjects returned a single hallucinated answer in the form of Inverted-Position Hallucination for compliance teams at clinical research firms.

For compliance teams at clinical research firms working under the BBNJ Agreement, internal policies, regulator-facing filings, and supervisor-engagement memos turn on citation accuracy. A compliance submission that mis-numbers the source article will be identified by a national Clearing-House Mechanism reviewer or a treaty-body monitoring reviewer on first reading, and the wider compliance narrative loses credibility.

Where the AI subjects inverted the direction of the marine genetic resources retroactivity default, the consequence is more serious: the firm could initiate costly and unnecessary remediation of legacy collections, or misstate its position in due diligence disclosures, licensing negotiations, and regulatory filings - any of which could attract scrutiny from national implementing authorities or treaty-body monitoring mechanisms.

The published Specialist Panel findings, with model attribution, carry the following citation identifiers, each hyperlinked to the bound regulator-issued source text on the BBNJ Agreement regulation hub. The audit register surfaces these findings for compliance teams at clinical research firms so that any AI-assisted treaty citation, paraphrase, or rule-statement entering a deliverable can be re-validated against the deposited treaty text before the document is issued:

Take me back to my Compliance x Clinical Research (INT) overview

Executive Summary

The BBNJ Agreement, formally the United Nations Treaty on Marine Biodiversity of Areas Beyond National Jurisdiction, establishes a binding framework governing environmental impact assessments, access to marine genetic resources, benefit-sharing over digital sequence information, and area-based management tools in the high seas. The Agreement entered into force on 17 January 2026. For Compliance teams at Clinical Research firms operating across international jurisdictions, this is new law that already affects ongoing programmes and internal documentation.

Across 1 relevant question that the team is likely to encounter, AI tools tested on the Agreement produced incorrect answers, with the principal errors covering the marine genetic resource retroactivity default inverted from prospective to retroactive. The errors are material because the team is the control point for internal policy, regulator-facing filings, contractual representations, or commercial scoping decisions that depend on accurate citation of the Agreement's operative provisions.

How AI gets this regulation wrong

The dominant failure pattern across our research on this regulation for Compliance teams at Clinical Research firms is article-level misattribution paired with substantively correct paraphrases. AI tools stated the operative content of a provision broadly correctly while pinning it to the wrong article number. In one case the AI also inverted the treaty's default rule on retroactivity, producing a position that is the opposite of what the Agreement provides. Both failure types are particularly hazardous because the substantive plausibility of the answer makes the citation error easy to miss on internal review.

AI's Failure ModeCountAffected findings
Misstated Rule1Finding#1

What that means for your team

For Compliance teams at Clinical Research firms, each documented error feeds into a compliance position, internal policy, regulator-facing filing, or board memo that misstates the BBNJ Agreement's operative provision. The risk category is regulatory enforcement: a treaty-body, national implementing authority, or supervisory reviewer who verifies the citation will identify the defect and may extend scrutiny to other representations in the same document.

Risk ImpactCountAffected findings
Regulatory enforcement1Finding#1

When this affects your department

Clinical research organisations encounter the BBNJ Agreement where research portfolios touch marine-derived inputs or where partnering biotechnology or pharmaceutical sponsors source from high-seas marine genetic resources.

The team is most likely to need accurate AI assistance on the Agreement when it is determining the temporal scope of the marine genetic resource and digital sequence information regime under Article 10(1). These are precisely the questions where the documented errors land. An AI tool that misattributes the screening article, inverts the retroactivity default, or places the digital sequence information duty at the wrong article will produce internal documentation that misstates the Agreement's position even when the underlying substantive criterion is correctly paraphrased.

The risk profile is amplified by the BBNJ Agreement's novelty. It entered into force on 17 January 2026, and the secondary body of professional commentary, supervisory guidance, and academic interpretation that normally lets a team triangulate an uncertain AI response is thin. A compliance team that receives a confident AI answer on a BBNJ provision has fewer cross-checks available than for a mature instrument. Where the team's deliverable cites the Agreement's operative articles, independent verification against the deposited treaty text is the only reliable safeguard, and it is the safeguard most likely to be skipped under time pressure.

The findings at a glance

The table below summarises each finding from our research on this regulation for Compliance teams at Clinical Research firms, with the question area tested, the type of AI failure observed, and the risk category that failure creates for the team.

#Finding titleTypeCitation ID
1MGR retroactivity default invertedHallucinationRLB-F-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003

Aggregate impact

The errors documented in this cell cluster on operative provisions that Compliance teams at Clinical Research firms most often need accurate citations for: the marine genetic resource and digital sequence information retroactivity default inverted, with Article 10(1) actually establishing a prospective default.

The retroactivity inversion is the most material error in the set. Two independent AI tools described the marine genetic resource and digital sequence information regime as retroactive by default with a written opt-out, when Article 10(1) establishes the opposite: the regime applies only to resources collected and generated after the entry into force of the Agreement for each Party. A compliance team acting on the inverted rule would treat legacy collections as presumptively in scope, imposing obligations the treaty does not extend to them.

Multiple AI tools produced this error independently, suggesting it is not idiosyncratic to a single model's training data.

For a compliance team at a clinical research firm, the systemic risk is that the AI's confident, structured presentation of the wrong answer closely mimics the format of a competent treaty-law summary. Internal reviewers under time pressure rarely re-check article numbers against the deposited text. The errors documented here are the kind that pass through governance and into client-facing or regulator-facing documentation.

What your team should do

The default position for Compliance teams at Clinical Research firms relying on AI tools for BBNJ Agreement research should be: treat AI-generated article citations as unverified until confirmed against the deposited treaty text at treaties.un.org. The errors documented here are not matters of interpretation. They involve the wrong article number, the wrong operative standard, and the inverted operative default. For any deliverable that will be relied on internally, by a counterparty, or by a regulator, independent verification is not optional.

In practical terms, the team should adopt a short checklist for any AI-assisted research on the BBNJ Agreement: confirm the article number cited, confirm the operative threshold or standard verbatim, and confirm whether a provision is stated as a default or an opt-in. These three checks would have caught every error in this cell. Firms with active BBNJ exposure should consider building treaty-text lookups directly into the research workflow rather than relying on AI to supply article-level precision.

AI tools remain useful for orientation on the BBNJ Agreement, including understanding the broad structure, identifying which Part of the Agreement addresses a given topic, and generating first-draft outlines. The hazard lies in trusting AI to supply precise article numbers, operative thresholds, and default rules without verification. Given the treaty text is freely and publicly accessible, the cost of verification is low; the cost of propagating an inverted retroactivity rule or a misattributed article number into a deliverable is not.

How RLB Can Help

RegLeg's published Hallucination Research gives Compliance teams at Clinical Research firms a structured pre-flight check before relying on AI tools for BBNJ Agreement research. Before an AI-assisted internal memo, regulator-facing filing, or commercial scoping document is finalised, the research identifies precisely which provisions of the Agreement have historically generated confident but incorrect AI output. That forewarning lets the team apply targeted human scrutiny rather than blanket scepticism, making AI assistance genuinely efficient without importing undetected risk into the deliverable.

Beyond the published research, RegLeg works with clinical research firms on bespoke regulator deep-dives that map AI-supported workflows within the Compliance function to actual hallucination exposure. Activities such as scoping new programmes, drafting internal policies, preparing regulator correspondence, or supporting commercial transactions carry different risk profiles, and the deep-dive surfaces which ones warrant additional controls or independent verification steps. RegLeg can also conduct a confidential review of the firm's existing AI-use policy against the failure-mode catalogue, delivering a prioritised remediation plan that distinguishes low-risk efficiency gains from higher-risk applications.

For teams that want to build durable in-house capability, RegLeg develops training material and CPD-aligned content tailored to the Compliance context. This covers how to interpret AI-generated regulatory summaries critically, how to structure verification steps where AI confidence is high but human review is essential, and how to document AI-assisted decision-making consistent with good governance standards.

Every finding on this page compares an AI subject's account of the rule against the regulator's verbatim text from the regulator's own portal. Both are linked. Each delta, its root causes, and impact analysis are documented and published with immutable Citation IDs.