Short Course, AGI Safety
Remote, Global
-
This course teaches core concepts in AI alignment and AGI safety through 75 minutes of recorded talks and exercises.
-
Covers alignment problems arising from misaligned goals, including specification gaming and goal misgeneralization.
-
Explores technical alignment approaches covering amplified oversight, interpretability, security and safer design patterns.