GSTDTAP

Browse/search results: 14 records in total, showing 1-10

A distributional code for value in dopamine-based reinforcement learning (journal article)
NATURE, 2020, 577 (7792) : 671-+
Authors:  Dabney, Will;  Kurth-Nelson, Zeb;  Uchida, Naoshige;  Starkweather, Clara Kwon;  Hassabis, Demis;  Munos, Remi;  Botvinick, Matthew
Views/Downloads: 61/0  |  Submitted: 2020/07/03

Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain(1-3). According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning(4-6). We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.


Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.
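
The distributional idea in the abstract above can be illustrated with a small simulation. The sketch below is not the authors' code. It assumes one common formulation from distributional reinforcement learning, in which each value predictor scales positive and negative reward prediction errors by different learning rates (an expectile-style update). The function name `learn_value_codes`, the asymmetry parameters `taus`, and the reward set are all hypothetical.

```python
import random


def learn_value_codes(rewards, taus, base_lr=0.02, steps=20000, seed=0):
    """Train one value predictor per asymmetry level tau (hypothetical helper).

    Each predictor sees the same stream of sampled rewards but scales
    positive prediction errors by tau and negative ones by (1 - tau).
    """
    rng = random.Random(seed)
    values = [0.0 for _ in taus]
    for _ in range(steps):
        reward = rng.choice(rewards)  # sample one stochastic outcome
        for i, tau in enumerate(taus):
            delta = reward - values[i]  # reward prediction error
            # Assumption: optimistic predictors (tau near 1) weight
            # positive errors more; pessimistic ones weight negative errors more.
            scale = tau if delta > 0 else (1.0 - tau)
            values[i] += base_lr * scale * delta
    return values


# Hypothetical reward distribution: usually nothing, occasionally a large reward.
rewards = [0.0, 0.0, 0.0, 10.0]
taus = [0.1, 0.5, 0.9]
values = learn_value_codes(rewards, taus)
for tau, value in zip(taus, values):
    print(f"tau={tau:.1f} -> learned value {value:.2f}")
```

Predictors with a larger `tau` settle on more optimistic values and those with a smaller `tau` on more pessimistic ones, so the population jointly carries information about the spread of outcomes. The balanced predictor (`tau = 0.5`) tracks the mean, as a single scalar predictor would in the canonical account.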


  
Urban public transport and air quality: Empirical study of China cities (journal article)
ENERGY POLICY, 2019, 135
Authors:  Sun, Chuanwang;  Zhang, Wenyue;  Fang, Xingming;  Gao, Xiang;  Xu, Meilian
Views/Downloads: 9/0  |  Submitted: 2020/02/17
Air quality index  Urban public transport  Urban built-up area  2SLS  Air pollution  
Satellite-detected gain in built-up area as a leading economic indicator (journal article)
ENVIRONMENTAL RESEARCH LETTERS, 2019, 14 (11)
Authors:  Ying, Qing;  Hansen, Matthew C.;  Sun, Laixiang;  Wang, Lei;  Steininger, Marc
Views/Downloads: 8/0  |  Submitted: 2020/02/17
leading economic indicator  built-up area  anthropogenic bare ground gain  global and regional scale  the great recession  spatio-temporal dynamics  landsat  
Does economic integration damage or benefit the environment? Africa's experience (journal article)
ENERGY POLICY, 2019, 132: 991-999
Authors:  Awad, Atif
Views/Downloads: 1/0  |  Submitted: 2019/11/27
Economic integration  Environment  Trade  Africa  Free trade area  
An area-based modelling approach for planning heating electrification (journal article)
ENERGY POLICY, 2019, 131: 262-280
Authors:  Calderon, Carlos;  Underwood, Chris;  Yi, Jialiang;  Mcloughlin, Adrian;  Williams, Brian
Views/Downloads: 14/0  |  Submitted: 2019/11/27
Residential buildings  Energy  Planning  Policy  Cities  Heat electrification  Area-based  
Examining the relationship between energy poverty and measures of deprivation (journal article)
ENERGY POLICY, 2019, 130: 206-217
Authors:  Marchand, Robert;  Genovese, Andrea;  Koh, S. C. Lenny;  Brennan, Alan
Views/Downloads: 6/0  |  Submitted: 2019/11/27
Fuel poverty  Energy poverty  Index of multiple deprivation  Area based targeting  Housing  Energy policy  
Impact of land requirements on electricity system decarbonisation pathways (journal article)
ENERGY POLICY, 2019, 129: 193-205
Authors:  Palmer-Wilson, Kevin;  Donald, James;  Robertson, Bryson;  Lyseng, Benjamin;  Keller, Victor;  Fowler, McKenzie;  Wade, Cameron;  Scholtysik, Sven;  Wild, Peter;  Rowe, Andrew
Views/Downloads: 13/0  |  Submitted: 2019/11/26
Decarbonisation  Land  Area  Capacity expansion  OSeMOSYS  Electricity system  
Does China's air pollution abatement policy matter? An assessment of the Beijing-Tianjin-Hebei region based on a multi-regional CGE model (journal article)
ENERGY POLICY, 2019, 127: 213-227
Authors:  Li, Na;  Zhang, Xiaoling;  Shi, Minjun;  Hewings, Geoffrey J. D.
Views/Downloads: 5/0  |  Submitted: 2019/11/26
Air pollution abatement policy  PM2.5 concentration  Cost effectiveness  Multi-regional CGE model  Beijing-Tianjin-Hebei area  
Biodiversity conservation effectiveness provided by a protection status in temperate forest commons of north Spain (journal article)
FOREST ECOLOGY AND MANAGEMENT, 2019, 433: 656-666
Authors:  Guadilla-Saez, Sara;  Pardo-de-Santayana, Manuel;  Reyes-Garcia, Victoria;  Svenning, Jens-Christian
Views/Downloads: 1/0  |  Submitted: 2019/04/09
Anthropogenic disturbances  Biodiversity indices  Protected area  Species richness  Temperate deciduous forests  
Transaction costs (TCs) in green building (GB) incentive schemes: Gross Floor Area (GFA) Concession Scheme in Hong Kong (journal article)
ENERGY POLICY, 2018, 119: 563-573
Authors:  Fan, Ke;  Chan, Edwin H. W.;  Qian, Queena K.
Views/Downloads: 5/0  |  Submitted: 2019/04/09
Green Building (GB) Incentive Scheme  Gross Floor Area (GFA) Concession Scheme  Transaction costs (TCs)  Hong Kong