P.S. The Alignment and Safety Systems teams are hiring!
Alignment Research Blog
Informal updates from the OpenAI team
2025
Dec 1
Debugging misaligned completions with sparse-autoencoder latent attribution
Efficiently finding features that cause behaviors.
Dec 1
A Practical Approach to Verifying Code at Scale
We train and deploy an AI review agent optimised for precision and real-world use, enabling oversight to scale with autonomous code generation.
Dec 1
Hello World
Introducing our blog on alignment research.