GSTDTAP  > 地球科学
DOI10.1126/science.aar6404
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
Silver, David1,2; Hubert, Thomas1; Schrittwieser, Julian1; Antonoglou, Ioannis1; Lai, Matthew1; Guez, Arthur1; Lanctot, Marc1; Sifre, Laurent1; Kumaran, Dharshan1; Graepel, Thore1; Lillicrap, Timothy1; Simonyan, Karen1; Hassabis, Demis1
2018-12-07
发表期刊SCIENCE
ISSN0036-8075
EISSN1095-9203
出版年2018
卷号362期号:6419页码:1140-+
文章类型Article
语种英语
国家England
英文摘要

The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play. In this paper, we generalize this approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games. Starting from random play and given no domain knowledge except the game rules, AlphaZero convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.


领域地球科学 ; 气候变化 ; 资源环境
收录类别SCI-E
WOS记录号WOS:000452506300053
WOS关键词GAME
WOS类目Multidisciplinary Sciences
WOS研究方向Science & Technology - Other Topics
引用统计
文献类型期刊论文
条目标识符http://119.78.100.173/C666/handle/2XK7JSWQ/200250
专题地球科学
资源环境科学
气候变化
作者单位1.DeepMind, 6 Pancras Sq, London N1C 4AG, England;
2.UCL, Gower St, London WC1E 6BT, England
推荐引用方式
GB/T 7714
Silver, David,Hubert, Thomas,Schrittwieser, Julian,et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play[J]. SCIENCE,2018,362(6419):1140-+.
APA Silver, David.,Hubert, Thomas.,Schrittwieser, Julian.,Antonoglou, Ioannis.,Lai, Matthew.,...&Hassabis, Demis.(2018).A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.SCIENCE,362(6419),1140-+.
MLA Silver, David,et al."A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play".SCIENCE 362.6419(2018):1140-+.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Silver, David]的文章
[Hubert, Thomas]的文章
[Schrittwieser, Julian]的文章
百度学术
百度学术中相似的文章
[Silver, David]的文章
[Hubert, Thomas]的文章
[Schrittwieser, Julian]的文章
必应学术
必应学术中相似的文章
[Silver, David]的文章
[Hubert, Thomas]的文章
[Schrittwieser, Julian]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。