A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

doi:10.1126/science.aar6404

GSTDTAP > 地球科学

DOI	10.1126/science.aar6404
	A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
	Silver, David 1,2; Hubert, Thomas 1; Schrittwieser, Julian 1; Antonoglou, Ioannis 1; Lai, Matthew 1; Guez, Arthur 1; Lanctot, Marc 1; Sifre, Laurent 1; Kumaran, Dharshan 1; Graepel, Thore 1; Lillicrap, Timothy 1; Simonyan, Karen 1; Hassabis, Demis 1
	2018-12-07
发表期刊	SCIENCE
ISSN	0036-8075
EISSN	1095-9203
出版年	2018
卷号	362 期号:6419 页码:1140-+
文章类型	Article
语种	英语
国家	England
英文摘要	The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play. In this paper, we generalize this approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games. Starting from random play and given no domain knowledge except the game rules, AlphaZero convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.
领域	地球科学 ; 气候变化 ; 资源环境
收录类别	SCI-E
WOS记录号	WOS:000452506300053
WOS关键词	GAME
WOS类目	Multidisciplinary Sciences
WOS研究方向	Science & Technology - Other Topics
引用统计
文献类型	期刊论文
条目标识符	http://119.78.100.173/C666/handle/2XK7JSWQ/200250
专题	地球科学资源环境科学气候变化
作者单位	1.DeepMind, 6 Pancras Sq, London N1C 4AG, England; 2.UCL, Gower St, London WC1E 6BT, England
推荐引用方式 GB/T 7714	Silver, David,Hubert, Thomas,Schrittwieser, Julian,et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play[J]. SCIENCE,2018,362(6419):1140-+.
APA	Silver, David.,Hubert, Thomas.,Schrittwieser, Julian.,Antonoglou, Ioannis.,Lai, Matthew.,...&Hassabis, Demis.(2018).A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.SCIENCE,362(6419),1140-+.
MLA	Silver, David,et al."A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play".SCIENCE 362.6419(2018):1140-+.