Seminars

Conversations with leading researchers.

Talks on the research agendas shaping AI Safety.

AISAR 2025

Kei Nishimura-Gasparian

Machine Alignment, Transparency & Security (MATS) Scholar

Early Signs of Steganographic Capabilities in Frontier LLMs

Adrià Garriga-Alonso

Independent

Reverse-engineering a neural network that plans: a mesa-optimizer model organism

Real-Time Detection of Hallucinated Entities in Long-Form Generation

Mikhail Terekhov

EPFL · Anthropic Research Fellow

Control Tax: The Price of Keeping AI in Check

Joar Skalse

Deducto

The Theoretical Foundations of Reward Learning

Fernando Rosas

University of Sussex

AI in a vat: Fundamental limits of efficient world modelling

Nora Ammann

ARIA, Safeguarded AI

Safeguarded AI: a scalable workflow for safety-by-construction