The Alignment Problem from a Deep Learning Perspective

Authors

Richard Ngo, Lawrence Chan, Sören Mindermann

This paper discusses the alignment problem in deep learning systems and proposes several approaches to address it.

March 15th, 2022

arXiv

March 16th, 2025

Alignment, Deep Learning, AI Safety