AI Hallucination on BBNJ High Seas Biodiversity Agreement for Compliance teams at Clinical Research firms in international jurisdictions

Take me back to my Compliance x Clinical Research (INT) overview

Executive Summary

The BBNJ Agreement, formally the United Nations Treaty on Marine Biodiversity of Areas Beyond National Jurisdiction, establishes a binding framework governing environmental impact assessments, access to marine genetic resources, benefit-sharing over digital sequence information, and area-based management tools in the high seas. The Agreement entered into force on 17 January 2026. For Compliance teams at Clinical Research firms operating across international jurisdictions, this is new law that already affects ongoing programmes and internal documentation.

Across 1 relevant question that the team is likely to encounter, AI tools tested on the Agreement produced incorrect answers, with the principal errors covering the marine genetic resource retroactivity default inverted from prospective to retroactive. The errors are material because the team is the control point for internal policy, regulator-facing filings, contractual representations, or commercial scoping decisions that depend on accurate citation of the Agreement's operative provisions.

How AI gets this regulation wrong

The dominant failure pattern across our research on this regulation for Compliance teams at Clinical Research firms is article-level misattribution paired with substantively correct paraphrases. AI tools stated the operative content of a provision broadly correctly while pinning it to the wrong article number. In one case the AI also inverted the treaty's default rule on retroactivity, producing a position that is the opposite of what the Agreement provides. Both failure types are particularly hazardous because the substantive plausibility of the answer makes the citation error easy to miss on internal review.

AI's Failure Mode	Count	Affected findings
Misstated Rule	1	Finding#1

What that means for your team

For Compliance teams at Clinical Research firms, each documented error feeds into a compliance position, internal policy, regulator-facing filing, or board memo that misstates the BBNJ Agreement's operative provision. The risk category is regulatory enforcement: a treaty-body, national implementing authority, or supervisory reviewer who verifies the citation will identify the defect and may extend scrutiny to other representations in the same document.

Risk Impact	Count	Affected findings
Regulatory enforcement	1	Finding#1

When this affects your department

Clinical research organisations encounter the BBNJ Agreement where research portfolios touch marine-derived inputs or where partnering biotechnology or pharmaceutical sponsors source from high-seas marine genetic resources.

The team is most likely to need accurate AI assistance on the Agreement when it is determining the temporal scope of the marine genetic resource and digital sequence information regime under Article 10(1). These are precisely the questions where the documented errors land. An AI tool that misattributes the screening article, inverts the retroactivity default, or places the digital sequence information duty at the wrong article will produce internal documentation that misstates the Agreement's position even when the underlying substantive criterion is correctly paraphrased.

The risk profile is amplified by the BBNJ Agreement's novelty. It entered into force on 17 January 2026, and the secondary body of professional commentary, supervisory guidance, and academic interpretation that normally lets a team triangulate an uncertain AI response is thin. A compliance team that receives a confident AI answer on a BBNJ provision has fewer cross-checks available than for a mature instrument. Where the team's deliverable cites the Agreement's operative articles, independent verification against the deposited treaty text is the only reliable safeguard, and it is the safeguard most likely to be skipped under time pressure.

The findings at a glance

The table below summarises each finding from our research on this regulation for Compliance teams at Clinical Research firms, with the question area tested, the type of AI failure observed, and the risk category that failure creates for the team.

#	Finding title	Type	Citation ID
1	MGR retroactivity default inverted	Hallucination	RLB-F-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q003

Aggregate impact

The errors documented in this cell cluster on operative provisions that Compliance teams at Clinical Research firms most often need accurate citations for: the marine genetic resource and digital sequence information retroactivity default inverted, with Article 10(1) actually establishing a prospective default.

The retroactivity inversion is the most material error in the set. Two independent AI tools described the marine genetic resource and digital sequence information regime as retroactive by default with a written opt-out, when Article 10(1) establishes the opposite: the regime applies only to resources collected and generated after the entry into force of the Agreement for each Party. A compliance team acting on the inverted rule would treat legacy collections as presumptively in scope, imposing obligations the treaty does not extend to them.

Multiple AI tools produced this error independently, suggesting it is not idiosyncratic to a single model's training data.

For a compliance team at a clinical research firm, the systemic risk is that the AI's confident, structured presentation of the wrong answer closely mimics the format of a competent treaty-law summary. Internal reviewers under time pressure rarely re-check article numbers against the deposited text. The errors documented here are the kind that pass through governance and into client-facing or regulator-facing documentation.

What your team should do

The default position for Compliance teams at Clinical Research firms relying on AI tools for BBNJ Agreement research should be: treat AI-generated article citations as unverified until confirmed against the deposited treaty text at treaties.un.org. The errors documented here are not matters of interpretation. They involve the wrong article number, the wrong operative standard, and the inverted operative default. For any deliverable that will be relied on internally, by a counterparty, or by a regulator, independent verification is not optional.

In practical terms, the team should adopt a short checklist for any AI-assisted research on the BBNJ Agreement: confirm the article number cited, confirm the operative threshold or standard verbatim, and confirm whether a provision is stated as a default or an opt-in. These three checks would have caught every error in this cell. Firms with active BBNJ exposure should consider building treaty-text lookups directly into the research workflow rather than relying on AI to supply article-level precision.

AI tools remain useful for orientation on the BBNJ Agreement, including understanding the broad structure, identifying which Part of the Agreement addresses a given topic, and generating first-draft outlines. The hazard lies in trusting AI to supply precise article numbers, operative thresholds, and default rules without verification. Given the treaty text is freely and publicly accessible, the cost of verification is low; the cost of propagating an inverted retroactivity rule or a misattributed article number into a deliverable is not.

How RLB Can Help

RegLeg's published Hallucination Research gives Compliance teams at Clinical Research firms a structured pre-flight check before relying on AI tools for BBNJ Agreement research. Before an AI-assisted internal memo, regulator-facing filing, or commercial scoping document is finalised, the research identifies precisely which provisions of the Agreement have historically generated confident but incorrect AI output. That forewarning lets the team apply targeted human scrutiny rather than blanket scepticism, making AI assistance genuinely efficient without importing undetected risk into the deliverable.

Beyond the published research, RegLeg works with clinical research firms on bespoke regulator deep-dives that map AI-supported workflows within the Compliance function to actual hallucination exposure. Activities such as scoping new programmes, drafting internal policies, preparing regulator correspondence, or supporting commercial transactions carry different risk profiles, and the deep-dive surfaces which ones warrant additional controls or independent verification steps. RegLeg can also conduct a confidential review of the firm's existing AI-use policy against the failure-mode catalogue, delivering a prioritised remediation plan that distinguishes low-risk efficiency gains from higher-risk applications.

For teams that want to build durable in-house capability, RegLeg develops training material and CPD-aligned content tailored to the Compliance context. This covers how to interpret AI-generated regulatory summaries critically, how to structure verification steps where AI confidence is high but human review is essential, and how to document AI-assisted decision-making consistent with good governance standards.