In this video, we explain how Anthropic trained “sleeper agent” AIs to study deception. A “sleeper agent” is an AI model that behaves normally until…