Seminars

Conversations with leading researchers.

Talks across the agendas that shape AI safety today.

AISAR 2025

Kei Nishimura-Gasparian

Machine Alignment, Transparency & Security (MATS) Scholar

Early Signs of Steganographic Capabilities in Frontier LLMs

Adrià Garriga-Alonso

Independent

Reverse-engineering a neural network that plans: a mesa-optimizer model organism

Real-Time Detection of Hallucinated Entities in Long-Form Generation

Mikhail Terekhov

EPFL · Anthropic Research Fellow

Control Tax: The Price of Keeping AI in Check

Joar Skalse

Deducto

The Theoretical Foundations of Reward Learning

Fernando Rosas

University of Sussex

AI in a vat: Fundamental limits of efficient world modelling

Nora Ammann

ARIA, Safeguarded AI

Safeguarded AI: a scalable workflow for safety-by-construction