GSTDTAP

浏览/检索结果: 共2条,第1-2条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
COSORE: A community database for continuous soil respiration and other soil‐atmosphere greenhouse gas flux data 期刊论文
Global Change Biology, 2020
作者:  Ben Bond‐;  Lamberty;  Danielle S. Christianson;  Avni Malhotra;  Stephanie C. Pennington;  Debjani Sihi;  Amir AghaKouchak;  Hassan Anjileli;  M. Altaf Arain;  Juan J. Armesto;  Samaneh Ashraf;  Mioko Ataka;  Dennis Baldocchi;  Thomas Andrew Black;  Nina Buchmann;  Mariah S. Carbone;  Shih‐;  Chieh Chang;  Patrick Crill;  Peter S. Curtis;  Eric A. Davidson;  Ankur R. Desai;  John E. Drake;  Tarek S. El‐;  Madany;  Michael Gavazzi;  Carolyn‐;  Monika Gö;  rres;  Christopher M. Gough;  Michael Goulden;  Jillian Gregg;  Omar Gutié;  rrez del Arroyo;  Jin‐;  Sheng He;  Takashi Hirano;  Anya Hopple;  Holly Hughes;  ;  rvi Jä;  rveoja;  Rachhpal Jassal;  Jinshi Jian;  Haiming Kan;  Jason Kaye;  Yuji Kominami;  Naishen Liang;  David Lipson;  Catriona A. Macdonald;  Kadmiel Maseyk;  Kayla Mathes;  Marguerite Mauritz;  Melanie A. Mayes;  Steve McNulty;  Guofang Miao;  Mirco Migliavacca;  Scott Miller;  Chelcy F. Miniat;  Jennifer G. Nietz;  Mats B. Nilsson;  Asko Noormets;  Hamidreza Norouzi;  Christine S. O’;  Connell;  Bruce Osborne;  Cecilio Oyonarte;  Zhuo Pang;  Matthias Peichl;  Elise Pendall;  Jorge F. Perez‐;  Quezada;  Claire L. Phillips;  Richard P. Phillips;  James W. Raich;  Alexandre A. Renchon;  Nadine K. Ruehr;  Enrique P. Sá;  nchez‐;  Cañ;  ete;  Matthew Saunders;  Kathleen E. Savage;  Marion Schrumpf;  Russell L. Scott;  Ulli Seibt;  Whendee L. Silver;  Wu Sun;  Daphne Szutu;  Kentaro Takagi;  Masahiro Takagi;  Munemasa Teramoto;  Mark G. Tjoelker;  Susan Trumbore;  Masahito Ueyama;  Rodrigo Vargas;  Ruth K. Varner;  Joseph Verfaillie;  Christoph Vogel;  Jinsong Wang;  Greg Winston;  Tana E. Wood;  Juying Wu;  Thomas Wutzler;  Jiye Zeng;  Tianshan Zha;  Quan Zhang;  Junliang Zou
收藏  |  浏览/下载:15/0  |  提交时间:2020/10/12
A distributional code for value in dopamine-based reinforcement learning 期刊论文
NATURE, 2020, 577 (7792) : 671-+
作者:  House, Robert A.;  Maitra, Urmimala;  Perez-Osorio, Miguel A.;  Lozano, Juan G.;  Jin, Liyu;  Somerville, James W.;  Duda, Laurent C.;  Nag, Abhishek;  Walters, Andrew;  Zhou, Ke-Jin;  Roberts, Matthew R.;  Bruce, Peter G.
收藏  |  浏览/下载:61/0  |  提交时间:2020/07/03

Since its introduction, the reward prediction error theory of dopamine has explained a wealth of empirical phenomena, providing a unifying framework for understanding the representation of reward and value in the brain(1-3). According to the now canonical theory, reward predictions are represented as a single scalar quantity, which supports learning about the expectation, or mean, of stochastic outcomes. Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial intelligence research on distributional reinforcement learning(4-6). We hypothesized that the brain represents possible future rewards not as a single mean, but instead as a probability distribution, effectively representing multiple future outcomes simultaneously and in parallel. This idea implies a set of empirical predictions, which we tested using single-unit recordings from mouse ventral tegmental area. Our findings provide strong evidence for a neural realization of distributional reinforcement learning.


Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.