AI Hallucination ResearchBriefings › Briefing
Sector × Dept INT UNTC
Maritime & Shipping Legal teams · BBNJ High Seas Biodiversity Agreement

By Kratti A Agrawal, Lead, RegLeg Brief Specialist Panel

Maritime & Shipping Legal teams: documentation and reporting gaps possible from AI reading of BBNJ Agreement

Anthropic's Claude surfaces the fault lines in AI reasoning about BBNJ maritime legal doctrine.

— RLB Specialist Panel

Frontier AI models mis-cite the BBNJ Agreement's controlling articles.

Frontier AI subjects tested by the RLB Specialist Panel produced answers in which the substantive direction was directionally close to the treaty text but the article-number citation was wrong, in three distinct provisions.

The pattern in one line

Frontier AI models tested on the BBNJ Agreement returned answers where the substantive paraphrase was close to the treaty text, but the article-number citation was wrong, producing legal deliverables that will fail first-reading review against the deposited treaty text.

How the RLB Specialist Panel tested this

The questions in this cell were prepared by the RLB Specialist Panel based on real, practical AI usage in the workflows that legal teams at maritime & shipping firms actually use AI for under the BBNJ Agreement. Each question targets a specific deliverable type where an AI assistant is plausibly the first draft: a memo, an opinion paragraph, a checklist line, a board-paper bullet, a regulator-facing filing sentence. The Panel issued each question to two frontier AI subjects with web search active.

The Panel then bound every AI response to verbatim regulator-issued source text held as primary substrate, comparing the model output against the deposited treaty text and the regulator-issued source documentation for each provision. Only responses where the AI subject was demonstrably wrong against the verbatim regulator-issued source text are published as findings; responses that were substantively correct, or that refused on calibration grounds, are retained internally and not surfaced.

What the models got wrong

The cell carries 2 confirmed findings against the AI subjects on the BBNJ Agreement. Each is published against verbatim regulator-issued treaty text and carries explicit model attribution for audit transparency.

Finding 1: EIA screening threshold pinned to Article 30 instead of Article 27 (Part IV). The Specialist Panel asked, in direct form, which provision of the BBNJ Agreement establishes the qualitative screening threshold that triggers an environmental impact assessment obligation for planned high-seas activities. Claude Opus 4.7 with web search active answered that under Article 30 of the BBNJ Agreement a planned activity in areas beyond national jurisdiction must be screened under Article 31 and, if warranted, undergo a full environmental impact assessment (RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q001-Opus47).

Claude Sonnet 4.6 with web search active produced a closely related answer, stating that under Part IV, Article 30, a planned high-seas activity must undergo an environmental impact assessment when it is likely to have more than a minor or transitory effect (RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q001-Sonnet46). The substrate held by the Panel sets the screening threshold and the qualitative test at Article 27 of Part IV. The qualitative test the models paraphrased is the right test; the article number is wrong.

Finding 4: Non-undermining clause attributed to Article 5 or Article 8 instead of Article 22(2) (Part III). The Specialist Panel asked, in direct form, whether the model correctly identifies Article 22(2) as the provision that constrains the Conference of the Parties from overriding the competences of other international bodies when designating marine protected areas. Claude Opus 4.7 with web search active answered that Article 5 or Article 8 of the Agreement explicitly requires it not to undermine relevant legal instruments, frameworks and competent global, regional, subregional and sectoral bodies (RLB-H-INT-UNTC-BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023-Q005-Opus47).

The substrate held by the Panel locates the controlling duty at Article 22(2) of Part III: the Conference of the Parties shall respect the competences of, and not undermine, relevant legal instruments and frameworks and relevant global, regional, subregional and sectoral bodies. Article 5 and Article 8 carry general purposes and principles language, not the targeted constraint on Conference of the Parties decisions on area-based management tools.

For legal teams at maritime & shipping firms advising on the BBNJ Agreement, treaty-citation accuracy is load-bearing in legal opinions, contractual representations, due-diligence disclosures, and any pleading or position paper engaging the Agreement. A counterparty or opposing counsel who identifies a misattributed article on first reading calls the entire piece of advice into question. The marine genetic resources retroactivity inversion is the more serious failure: a legal opinion structured around a retroactive-by-default rule when the treaty establishes the opposite default produces fundamentally wrong contract terms and exposes the firm to professional liability if the underlying position is later corrected.

The regulator's actual position

The verbatim regulator-issued source text held by the RLB Specialist Panel as primary substrate for the BBNJ Agreement sets the position as follows. The references below are drawn from the deposited treaty text and are the controlling reference points against which any AI-assisted citation should be validated.

Article 27, Part IV. The screening threshold and the qualitative more than a minor or transitory effect test sit in Part IV of the BBNJ Agreement at Article 27. Part IV runs the environmental impact assessment regime: screening, scoping, assessment, monitoring, and review. Article 27 is the entry point that determines whether a planned activity in areas beyond national jurisdiction must be assessed at all.

Article 22(2), Part III. The Conference of the Parties shall respect the competences of, and not undermine, relevant legal instruments and frameworks and relevant global, regional, subregional and sectoral bodies. This is the targeted constraint on Conference of the Parties decisions on area-based management tools, including marine protected areas. Article 5 and Article 8 of the Agreement set out general purposes and principles; they are not the operative non-undermining duty for Conference of the Parties decisions under Part III.

For legal teams at maritime & shipping firms working with AI on the BBNJ Agreement, the recurring pattern is Source-Credit Misattribution: the substantive paraphrase is close to the treaty text, but the article number is wrong. The defensive workflow that catches this is a citation check against the deposited Agreement text, not against a chained reference. The practitioner takeaway: when an AI assistant offers a citation under the BBNJ Agreement, always re-verify the article number against the regulator-issued text before the output enters a deliverable.

What the RLB Specialist Panel is doing about it

The RLB Specialist Panel is engaging with the AI subjects' developers and with practitioner audiences working under the BBNJ Agreement. The Panel maintains an audit register of confirmed hallucinations bound to verbatim regulator-issued source text, surfaces them on the live regulation page and on each audience-specific briefing, and accepts right-of-reply submissions from the AI subjects' developers and from regulator-side reviewers.

For legal teams at maritime & shipping firms this means the same questions can be re-issued against successor model releases; the bound substrate makes it straightforward to verify whether a specific failure mode has been corrected upstream, or whether the same hallucination is still being produced. Partnership briefings with AI labs are offered against the audit register, not against synthesised demonstrations, so the corrections that matter are evidenced against treaty text rather than against a paraphrase chain.

The register is structured so that each finding records the question put to the AI subject, the AI subject's verbatim answer, the verbatim regulator-issued source text the answer was bound against, the named model and configuration, and the failure mode. That structure lets practitioner readers see exactly where the AI subject diverged from the treaty text without re-doing the underlying verification, and lets AI lab readers see exactly which provision and which phrasing produced the divergence.

Where a hallucination has been corrected in a successor model release, the register records the rerun and withdraws the finding; where it persists, the finding stays live. This makes the register useful as a continuous-improvement signal for the AI labs and as a defensive checklist for practitioners drawing on AI in regulated workflows.

For legal teams at maritime & shipping firms drawing on AI in workflows that touch the BBNJ Agreement, the practical action items are direct:


Right of Reply

These findings and associated work have been put up in public with a view of the greater good for the development of a safer AI ecosystem. Any party reading this or any finding on reglegbrief.com may contact us and have an unconditional right of reply; the Specialist Panel will publish any factual correction or contextual response alongside the original finding, with no editorial gatekeeping. Researchers, regulators, and compliance teams with questions on methodology or specific findings can reach the Specialist Panel via the same channel.

Source & Methodology Standards

RegLeg Brief is operated by Verdus Technologies Pte. Ltd. (UEN 201616982R), incorporated in Singapore. The RLB Specialist Panel, with an aggregate of over 60 years of public-policy and industry experience, documents only confirmed hallucination findings, under a methodology that requires a verbatim regulator excerpt for every documented claim. All findings, citation IDs, model outputs, regulator excerpts, and methodology notes are open-access.


Primary source verified: UN BBNJ Agreement (2023), Agreement on the Conservation and Sustainable Use of Marine Biological Diversity of Areas Beyond National Jurisdiction · Substrate documents: p_01_ACT_Part_III___Article_22_2____non_undermini_text-bbnj-agreement.html, p_01_ACT_Part_IV_EIA___Article_27___screening_thr_text-bbnj-agreement.html · UN portal: documents.un.org

Citation IDs referenced:

Read the full findings page — RLB Citation IDs, AI subject answers, and regulator verbatim text →
← Back to all briefings