
tl;dr
Paper of the month:
• Emergent misalignment arises across many models when training on incorrect data and is largely driven by a single “toxic…