☆ 4.8 Article

Preventing undesirable behavior of intelligent machines

SCIENCE (2019)

Journal

SCIENCE

Volume 366, Issue 6468, Pages 999-+

Publisher

AMER ASSOC ADVANCEMENT SCIENCE

DOI: 10.1126/science.aag3311

Category

Multidisciplinary Sciences

Funding

NSF CAREER [1350984, 1453474]
NSF [1763423]
Institute of Education Sciences [R305A130215]
National Science Foundation, Directorate for Computer and Information Science and Engineering, Division of Computing and Communication Foundations [1453474]
National Science Foundation, Directorate for Computer and Information Science and Engineering, Division of Computing and Communication Foundations [1763423]
National Science Foundation, Directorate for Computer and Information Science and Engineering, Division of Information and Intelligent Systems [1350984]

Abstract

Intelligent machines using machine learning algorithms are ubiquitous, ranging from simple data analysis and pattern recognition tools to complex systems that achieve superhuman performance on various tasks. Ensuring that they do not exhibit undesirable behavior (that they do not, for example, cause harm to humans) is therefore a pressing problem. We propose a general and flexible framework for designing machine learning algorithms. This framework simplifies the problem of specifying and regulating undesirable behavior. To show the viability of this framework, we used it to create machine learning algorithms that precluded the dangerous behavior caused by standard machine learning algorithms in our experiments. Our framework for designing machine learning algorithms simplifies the safe and responsible application of machine learning.
