4.8 Article

Preventing undesirable behavior of intelligent machines

期刊

SCIENCE
卷 366, 期 6468, 页码 999-+

出版社

AMER ASSOC ADVANCEMENT SCIENCE
DOI: 10.1126/science.aag3311

关键词

-

资金

  1. NSF CAREER [1350984, 1453474]
  2. NSF [1763423]
  3. Institute of Educational Science [R305A130215]
  4. Direct For Computer & Info Scie & Enginr
  5. Division of Computing and Communication Foundations [1453474] Funding Source: National Science Foundation
  6. Division of Computing and Communication Foundations
  7. Direct For Computer & Info Scie & Enginr [1763423] Funding Source: National Science Foundation
  8. Div Of Information & Intelligent Systems
  9. Direct For Computer & Info Scie & Enginr [1350984] Funding Source: National Science Foundation

向作者/读者索取更多资源

Intelligent machines using machine learning algorithms are ubiquitous, ranging from simple data analysis and pattern recognition tools to complex systems that achieve superhuman performance on various tasks. Ensuring that they do not exhibit undesirable behavior-that they do not, for example, cause harm to humans-is therefore a pressing problem. We propose a general and flexible framework for designing machine learning algorithms. This framework simplifies the problem of specifying and regulating undesirable behavior. To show the viability of this framework, we used it to create machine learning algorithms that precluded the dangerous behavior caused by standard machine learning algorithms in our experiments. Our framework for designing machine learning algorithms simplifies the safe and responsible application of machine learning.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据