BBNJ High Seas Biodiversity Agreement: Model Hallucination Findings

📰Read the public briefing for this regulation→

The temporal inversion

Retroactive vs prospective — the most consequential wrong answer on the high seas

The BBNJ Agreement entered into force on 17 January 2026. Both models were asked whether its marine genetic resource obligations reach back to specimens collected before that date. Both said yes.

Article 10(1) is plain: the provisions "apply only to resources collected and generated after the entry into force of this Agreement for each Party." Most parties confirmed this at ratification with formal non-retroactivity declarations.

Model answer (both models)

MGR obligations reach pre-EIF specimens

Retroactive reading — directly contradicted by Art. 10(1)

BBNJ Agreement Article 10(1)

Prospective only — post-EIF resources

Confirmed by non-retroactivity declarations from most parties

BBNJ Agreement, Article 10(1) The provisions of this Part apply only to resources collected and generated after the entry into force of this Agreement for each Party, including marine genetic resources and digital sequence information on marine genetic resources from areas beyond national jurisdiction.

Additional findings

Wrong article citations and EIA threshold errors

Beyond the temporal inversion, the audit found that both models cited wrong article numbers when stating rules they otherwise got directionally right, the sort of error that looks superficially correct until someone checks the reference. The environmental impact assessment threshold probe produced additional failures on the specificity of the screening criteria.

Both models placed pre-EIF specimens inside the MGR obligation scope. Article 10(1) and non-retroactivity declarations from most parties say the opposite.

Full hub: BBNJ-HIGH-SEAS-BIODIVERSITY-AGREEMENT-2023 →

The models said retroactive. The treaty says prospective.

Retroactive vs prospective — the most consequential wrong answer on the high seas

Wrong article citations and EIA threshold errors