back
Get SIGNAL/NOISE in your inbox daily

One key strategy for preventing bad outcomes from misuse or misalignment is model monitoring. However, one way that monitoring can fail is if LLMs us…