Concrete Problems in AI Safety

Authors

Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mané

Abstract

Rapid progress in machine learning and artificial intelligence has brought increasing attention to the potential impacts of AI technologies on society. This paper focuses on one such impact: the problem of accidents in machine learning systems, defined as unintended and harmful behavior that may emerge from poor design of real-world AI systems.

Publication Details

Published:

June 21st, 2016

Venue:

arXiv

Added to AI Safety Papers:

March 16th, 2025

Metadata

Tags:

AI Safety, Machine Learning, Alignment

Original Paper:

Link