GSTDTAP

浏览/检索结果: 共5条,第1-5条 帮助

已选(0)清除 条数/页:   排序方式:
中美研究展示强化学习可显著降低气候适应决策的成本效益 快报文章
气候变化快报,2025年第7期
作者:  秦冰雪
Microsoft Word(14Kb)  |  收藏  |  浏览/下载:411/0  |  提交时间:2025/04/05
Reinforcement Learning  Climate Change Adaptation  Coastal Flood  
Local and global consequences of reward-evoked striatal dopamine release 期刊论文
NATURE, 2020, 580 (7802) : 239-+
作者:  Wagner, Felix R.;  Dienemann, Christian;  Wang, Haibo;  Stuetzer, Alexandra;  Tegunov, Dimitry;  Urlaub, Henning;  Cramer, Patrick
收藏  |  浏览/下载:26/0  |  提交时间:2020/07/03

The neurotransmitter dopamine is required for the reinforcement of actions by rewarding stimuli(1). Neuroscientists have tried to define the functions of dopamine in concise conceptual terms(2), but the practical implications of dopamine release depend on its diverse brain-wide consequences. Although molecular and cellular effects of dopaminergic signalling have been extensively studied(3), the effects of dopamine on larger-scale neural activity profiles are less well-understood. Here we combine dynamic dopamine-sensitive molecular imaging(4) and functional magnetic resonance imaging to determine how striatal dopamine release shapes local and global responses to rewarding stimulation in rat brains. We find that dopamine consistently alters the duration, but not the magnitude, of stimulus responses across much of the striatum, via quantifiable postsynaptic effects that vary across subregions. Striatal dopamine release also potentiates a network of distal responses, which we delineate using neurochemically dependent functional connectivity analyses. Hot spots of dopaminergic drive notably include cortical regions that are associated with both limbic and motor function. Our results reveal distinct neuromodulatory actions of striatal dopamine that extend well beyond its sites of peak release, and that result in enhanced activation of remote neural populations necessary for the performance of motivated actions. Our findings also suggest brain-wide biomarkers of dopaminergic function and could provide a basis for the improved interpretation of neuroimaging results that are relevant to learning and addiction.


Molecular and functional magnetic resonance imaging in the rat reveals distinct neuromodulatory effects of striatal dopamine that extend beyond peak release sites and activate remote neural populations necessary for performing motivated actions.


  
A distributional code for value in dopamine-based reinforcement learning 期刊论文
NATURE, 2020, 577 (7792) : 671-+
作者:  House, Robert A.;  Maitra, Urmimala;  Perez-Osorio, Miguel A.;  Lozano, Juan G.;  Jin, Liyu;  Somerville, James W.;  Duda, Laurent C.;  Nag, Abhishek;  Walters, Andrew;  Zhou, Ke-Jin;  Roberts, Matthew R.;  Bruce, Peter G.
收藏  |  浏览/下载:75/0  |  提交时间:2020/07/03

Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain(1-3). According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning(4-6). We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.


Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.


  
An agent-based simulation of power generation company behavior in electricity markets under different market-clearing mechanisms 期刊论文
ENERGY POLICY, 2017, 100
作者:  Aliabadi, Danial Esmaeili;  Kaya, Murat;  Sahin, Guvenc
收藏  |  浏览/下载:9/0  |  提交时间:2019/04/09
Agent-based simulation  Reinforcement learning  Uniform pricing  Pay-as-bid pricing  DC-OPF  Game-theory  
A framework for mapping and comparing behavioural theories in models of social-ecological systems 期刊论文
ECOLOGICAL ECONOMICS, 2017, 131
作者:  Schluter, Maja;  Baeza, Andres;  Dressler, Gunnar;  Frank, Karin;  Groeneveld, Juergen;  Jager, Wander;  Janssen, Marco A.;  McAllister, Ryan R. J.;  Mueller, Birgit;  Orach, Kirill;  Schwarz, Nina;  Wijermans, Nanda
收藏  |  浏览/下载:35/0  |  提交时间:2019/04/09
Human decision-making  Natural resource management  Rational actor  Bounded rationality  Theory of planned behaviour  Descriptive norm  Habitual  Reinforcement learning  Prospect theory