AI #167: The Prior Restraint Era Begins
The era of training frontier models and then releasing them whenever you wanted?
One of the main hopes for AI safety is using AIs to automate AI safety research. However, if models are misaligned, then they may sabotage the safety research. For example, misaligned AIs may try to:
The era of training frontier models and then releasing them whenever you wanted?
Pioneering threat assessment and mitigation for AI systems
The era of training frontier models and then releasing them whenever you wanted? That was fun while it lasted. It looks likely to be over now. The White House wants to get an advance look and have …