Kenji Doya - Publications

Affiliations:

Okinawa Institute of Science and Technology, Onna-son, Okinawa-ken, Japan

Area:

reinforcement learning

Website:

http://www.oist.jp

Tree Info Grants Similar researchers PubMed Report error

Year	Citation	Score
2023	Blackwell KT, Doya K. Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks. Plos Computational Biology. 19: e1011385. PMID 37594982 DOI: 10.1371/journal.pcbi.1011385	0.364
2021	Uchibe E, Doya K. Forward and inverse reinforcement learning sharing network weights and hyperparameters. Neural Networks : the Official Journal of the International Neural Network Society. 144: 138-153. PMID 34492548 DOI: 10.1016/j.neunet.2021.08.017	0.325
2020	Abe Y, Takata N, Sakai Y, Hamada H, Hiraoka Y, Aida T, Tanaka K, Bihan DL, Doya K, Tanaka KF. Diffusion functional MRI reveals global brain network functional abnormalities driven by targeted local activity in a neuropsychiatric disease mouse model. Neuroimage. 117318. PMID 32882386 DOI: 10.1016/j.neuroimage.2020.117318	0.749
2020	Girard B, Lienard J, Gutierrez CE, Delord B, Doya K. A biologically constrained spiking neural network model of the primate basal ganglia with overlapping pathways exhibits action selection. The European Journal of Neuroscience. PMID 32564449 DOI: 10.1111/ejn.14869	0.55
2020	Han D, Doya K, Tani J. Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks. Neural Networks : the Official Journal of the International Neural Network Society. 129: 149-162. PMID 32534378 DOI: 10.1016/j.neunet.2020.06.002	0.67
2019	Doya K. Neural circuits for reinforcement learning and mental simulation Ibro Reports. 6. DOI: 10.1016/J.Ibror.2019.07.160	0.366
2019	Doya K, Taniguchi T. Toward evolutionary and developmental intelligence Current Opinion in Behavioral Sciences. 29: 91-96. DOI: 10.1016/J.Cobeha.2019.04.006	0.338
2018	Miyazaki K, Miyazaki KW, Yamanaka A, Tokuda T, Tanaka KF, Doya K. Reward probability and timing uncertainty alter the effect of dorsal raphe serotonin neurons on patience. Nature Communications. 9: 2048. PMID 29858574 DOI: 10.1038/S41467-018-04496-Y	0.343
2018	Elfwing S, Uchibe E, Doya K. Sigmoid-weighted linear units for neural network function approximation in reinforcement learning. Neural Networks : the Official Journal of the International Neural Network Society. PMID 29395652 DOI: 10.1016/j.neunet.2017.12.012	0.42
2018	Doya K, Wang D. Fostering deep learning and beyond Neural Networks. 97: iii-iv. DOI: 10.1016/S0893-6080(17)30270-8	0.402
2018	Kinjo K, Uchibe E, Doya K. Robustness of linearly solvable Markov games employing inaccurate dynamics model Artificial Life and Robotics. 23: 1-9. DOI: 10.1007/S10015-017-0401-2	0.343
2017	Yoshida K, Shimizu Y, Yoshimoto J, Takamura M, Okada G, Okamoto Y, Yamawaki S, Doya K. Prediction of clinical depression scores and detection of changes in whole-brain using resting-state functional MRI data with partial least squares regression. Plos One. 12: e0179638. PMID 28700672 DOI: 10.1371/journal.pone.0179638	0.307
2017	Wang J, Uchibe E, Doya K. Adaptive Baseline Enhances EM-Based Policy Search: Validation in a View-Based Positioning Task of a Smartphone Balancer. Frontiers in Neurorobotics. 11: 1. PMID 28167910 DOI: 10.3389/fnbot.2017.00001	0.334
2017	Miyazaki K, Miyazaki K, Doya K. Brain computation mechanism of prediction and decision making by dorsal raphe serotonin neurons. Nihon Yakurigaku Zasshi. Folia Pharmacologica Japonica. 149: 34-39. PMID 28049876 DOI: 10.1254/fpj.149.34	0.305
2016	Funamizu A, Kuhn B, Doya K. Neural substrate of dynamic Bayesian inference in the cerebral cortex. Nature Neuroscience. PMID 27643432 DOI: 10.1038/nn.4390	0.324
2016	Elfwing S, Uchibe E, Doya K. From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning. Neural Networks : the Official Journal of the International Neural Network Society. 84: 17-27. PMID 27639720 DOI: 10.1016/j.neunet.2016.07.013	0.366
2016	Fermin AS, Yoshida T, Yoshimoto J, Ito M, Tanaka SC, Doya K. Model-based action planning involves cortico-cerebellar and basal ganglia networks. Scientific Reports. 6: 31378. PMID 27539554 DOI: 10.1038/srep31378	0.547
2016	Sharpee TO, Destexhe A, Kawato M, Sekulić V, Skinner FK, Wójcik DK, Chintaluri C, Cserpán D, Somogyvári Z, Kim JK, Kilpatrick ZP, Bennett MR, Josić K, Elices I, Arroyo D, ... ... Doya K, et al. 25th Annual Computational Neuroscience Meeting: CNS-2016 Bmc Neuroscience. 17: 54. PMID 27534393 DOI: 10.1186/S12868-016-0283-6	0.715
2016	Caligiore D, Pezzulo G, Baldassarre G, Bostan AC, Strick PL, Doya K, Helmich RC, Dirkx M, Houk J, Jörntell H, Lago-Rodriguez A, Galea JM, Miall RC, Popa T, Kishore A, et al. Consensus Paper: Towards a Systems-Level View of Cerebellar Function: the Interplay Between Cerebellum, Basal Ganglia, and Cortex. Cerebellum (London, England). PMID 26873754 DOI: 10.1007/S12311-016-0763-3	0.331
2016	Wang J, Uchibe E, Doya K. EM-based policy hyper parameter exploration: application to standing and balancing of a two-wheeled smartphone robot Artificial Life and Robotics. 1-7. DOI: 10.1007/s10015-015-0260-7	0.342
2015	Ito M, Doya K. Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum. Plos Computational Biology. 11: e1004540. PMID 26529522 DOI: 10.1371/journal.pcbi.1004540	0.372
2015	Shimizu Y, Yoshimoto J, Toki S, Takamura M, Yoshimura S, Okamoto Y, Yamawaki S, Doya K. Toward Probabilistic Diagnosis and Understanding of Depression Based on Functional MRI Data Analysis with Logistic Group LASSO. Plos One. 10: e0123524. PMID 25932629 DOI: 10.1371/journal.pone.0123524	0.319
2015	Funamizu A, Ito M, Doya K, Kanzaki R, Takahashi H. Condition interference in rats performing a choice task with switched variable- and fixed-reward conditions. Frontiers in Neuroscience. 9: 27. PMID 25741231 DOI: 10.3389/fnins.2015.00027	0.338
2015	Nakano T, Otsuka M, Yoshimoto J, Doya K. A spiking neural network model of model-free reinforcement learning with high-dimensional sensory input and perceptual ambiguity. Plos One. 10: e0115620. PMID 25734662 DOI: 10.1371/Journal.Pone.0115620	0.542
2015	Ito M, Doya K. Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 35: 3499-514. PMID 25716849 DOI: 10.1523/JNEUROSCI.1962-14.2015	0.355
2015	Elfwing S, Uchibe E, Doya K. Expected energy-based restricted Boltzmann machine for classification. Neural Networks : the Official Journal of the International Neural Network Society. 64: 29-38. PMID 25318375 DOI: 10.1016/j.neunet.2014.09.006	0.363
2015	Balleine BW, Dezfouli A, Ito M, Doya K. Hierarchical control of goal-directed action in the cortical-basal ganglia network Current Opinion in Behavioral Sciences. 5: 1-7. DOI: 10.1016/J.Cobeha.2015.06.001	0.331
2014	Okada G, Okamoto Y, Shishida K, Ueda K, Onoda K, Kunisato Y, Tanaka SC, Doya K, Yamawaki S. [Brain mechanisms of depression--preliminary evidence from fMRI studies]. Seishin Shinkeigaku Zasshi = Psychiatria Et Neurologia Japonica. 116: 825-31. PMID 25672209	0.428
2014	Miyazaki KW, Miyazaki K, Tanaka KF, Yamanaka A, Takahashi A, Tabuchi S, Doya K. Optogenetic activation of dorsal raphe serotonin neurons enhances patience for future rewards. Current Biology : Cb. 24: 2033-40. PMID 25155504 DOI: 10.1016/J.Cub.2014.07.041	0.329
2014	Miyapuram KP, Pamnani U, Doya K, Bapi RS. Inter Subject Correlation of Brain Activity during Visuo-Motor Sequence learning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8834: 35-41.	0.742
2013	Nakano T, Yoshimoto J, Doya K. A model-based prediction of the calcium responses in the striatal synaptic spines depending on the timing of cortical and dopaminergic inputs and post-synaptic spikes. Frontiers in Computational Neuroscience. 7: 119. PMID 24062681 DOI: 10.3389/Fncom.2013.00119	0.452
2013	Kinjo K, Uchibe E, Doya K. Evaluation of linearly solvable Markov decision process with dynamic model learning in a mobile robot navigation task. Frontiers in Neurorobotics. 7: 7. PMID 23576983 DOI: 10.3389/fnbot.2013.00007	0.361
2013	Elfwing S, Uchibe E, Doya K. Scaled free-energy based reinforcement learning for robust and efficient learning in high-dimensional state spaces. Frontiers in Neurorobotics. 7: 3. PMID 23450126 DOI: 10.3389/fnbot.2013.00003	0.406
2012	Miyazaki KW, Miyazaki K, Doya K. Activation of dorsal raphe serotonin neurons is necessary for waiting for delayed rewards. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 32: 10451-7. PMID 22855794 DOI: 10.1523/JNEUROSCI.0915-12.2012	0.334
2012	Okamoto Y, Okada G, Shishida K, Fukumoto T, Machino A, Yamashita H, Tanaka SC, Doya K, Yamawaki S. [Effects of serotonin on delay discounting for rewards--an application for understanding of pathophysiology in psychiatric disorders]. Seishin Shinkeigaku Zasshi = Psychiatria Et Neurologia Japonica. 114: 108-14. PMID 22568113	0.401
2012	Funamizu A, Ito M, Doya K, Kanzaki R, Takahashi H. Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats. The European Journal of Neuroscience. 35: 1180-9. PMID 22487046 DOI: 10.1111/j.1460-9568.2012.08025.x	0.404
2012	Miyazaki K, Miyazaki KW, Doya K. The role of serotonin in the regulation of patience and impulsivity. Molecular Neurobiology. 45: 213-24. PMID 22262065 DOI: 10.1007/s12035-012-8232-6	0.319
2012	Demoto Y, Okada G, Okamoto Y, Kunisato Y, Aoyama S, Onoda K, Munakata A, Nomura M, Tanaka SC, Schweighofer N, Doya K, Yamawaki S. Neural and personality correlates of individual differences related to the effects of acute tryptophan depletion on future reward evaluation. Neuropsychobiology. 65: 55-64. PMID 22222380 DOI: 10.1159/000328990	0.501
2012	Sugimoto N, Haruno M, Doya K, Kawato M. MOSAIC for multiple-reward environments. Neural Computation. 24: 577-606. PMID 22168558 DOI: 10.1162/NECO_a_00246	0.636
2012	Pammi VS, Miyapuram KP, Ahmed, Samejima K, Bapi RS, Doya K. Changing the structure of complex visuo-motor sequences selectively activates the fronto-parietal network. Neuroimage. 59: 1180-9. PMID 21867758 DOI: 10.1016/J.Neuroimage.2011.08.006	0.772
2011	Onoda K, Okamoto Y, Kunisato Y, Aoyama S, Shishida K, Okada G, Tanaka SC, Schweighofer N, Yamaguchi S, Doya K, Yamawaki S. Inter-individual discount factor differences in reward prediction are topographically associated with caudate activation. Experimental Brain Research. 212: 593-601. PMID 21695536 DOI: 10.1007/s00221-011-2771-3	0.515
2011	Ito M, Doya K. Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit. Current Opinion in Neurobiology. 21: 368-73. PMID 21531544 DOI: 10.1016/j.conb.2011.04.001	0.43
2011	Miyazaki K, Miyazaki KW, Doya K. Activation of dorsal raphe serotonin neurons underlies waiting for delayed rewards. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 31: 469-79. PMID 21228157 DOI: 10.1523/JNEUROSCI.3714-10.2011	0.322
2011	Miyazaki KW, Miyazaki K, Doya K. Activation of the central serotonergic system in response to delayed but not omitted rewards. The European Journal of Neuroscience. 33: 153-60. PMID 21070390 DOI: 10.1111/j.1460-9568.2010.07480.x	0.32
2011	Yoshimoto J, Doya K. 1SK-05 A statistical learning method for identifying synaptic connections from spike train data(1SK High Performance Computational Approaches to Biological Functions,The 49th Annual Meeting of the Biophysical Society of Japan) Seibutsu Butsuri. 51. DOI: 10.2142/Biophys.51.S9_4	0.342
2011	Elfwing S, Uchibe E, Doya K, Christensen HI. Darwinian embodied evolution of the learning ability for survival Adaptive Behavior. 19: 101-120. DOI: 10.1177/1059712310397633	0.388
2011	Doya K, Ito M, Samejima K. Model-based analysis of decision variables Decision Making, Affect, and Learning: Attention and Performance Xxiii. DOI: 10.1093/acprof:oso/9780199600434.003.0009	0.447
2011	Uchibe E, Doya K. Evolution of rewards and learning mechanisms in Cyber Rodents Neuromorphic and Brain-Based Robots. 109-128. DOI: 10.1017/CBO9780511994838.007	0.316
2011	Cutsuridis V, Heida T, Duch W, Doya K. Neurocomputational models of brain disorders Neural Networks. 24: 513-514. DOI: 10.1016/j.neunet.2011.03.016	0.316
2011	Miyapuram KP, Doya K, Bapi RS. Chunking During Learning of Visuomotor Sequences with Spatial and Arbitrary Rules: Preliminary Findings Psychological Studies. 57: 22-28. DOI: 10.1007/S12646-011-0118-6	0.758
2010	Fermin A, Yoshida T, Ito M, Yoshimoto J, Doya K. Evidence for model-based action planning in a sequential finger movement task. Journal of Motor Behavior. 42: 371-9. PMID 21184355 DOI: 10.1080/00222895.2010.526467	0.439
2010	Nakano T, Doi T, Yoshimoto J, Doya K. A kinetic model of dopamine- and calcium-dependent striatal synaptic plasticity. Plos Computational Biology. 6: e1000670. PMID 20169176 DOI: 10.1371/Journal.Pcbi.1000670	0.422
2010	Klein M, Kamp H, Palm G, Doya K. A computational neural model of goal-directed utterance selection. Neural Networks : the Official Journal of the International Neural Network Society. 23: 592-606. PMID 20116973 DOI: 10.1016/j.neunet.2010.01.003	0.384
2010	Morimura T, Uchibe E, Yoshimoto J, Peters J, Doya K. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning. Neural Computation. 22: 342-76. PMID 19842990 DOI: 10.1162/Neco.2009.12-08-922	0.355
2010	Nakashi T, Yoshimoto J, Wickens J, Doya K. Electrophysiological and molecular mechanisms of synaptic plasticity in the striatum Neuroscience Research. 68. DOI: 10.1016/J.Neures.2010.07.1534	0.547
2010	Nonomura S, Samejima K, Doya K, Tanji J. Neural activity in the dorsal striatum during cognitive decision making Neuroscience Research. 68: e299. DOI: 10.1016/J.Neures.2010.07.1328	0.631
2010	Fermin A, Yoshida T, Ito M, Yoshimoto J, Doya K. Neural mechanisms for model-free and model-based reinforcement strategies in humans performing a multi-step navigation task Neuroscience Research. 68. DOI: 10.1016/J.Neures.2010.07.1269	0.315
2009	Tanaka SC, Shishida K, Schweighofer N, Okamoto Y, Yamawaki S, Doya K. Serotonin affects association of aversive outcomes to past actions. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 29: 15669-74. PMID 20016081 DOI: 10.1523/JNEUROSCI.2799-09.2009	0.533
2009	Ito M, Doya K. Validation of decision-making models and analysis of decision variables in the rat basal ganglia. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 29: 9861-74. PMID 19657038 DOI: 10.1523/JNEUROSCI.6157-08.2009	0.441
2009	Fujiwara Y, Yamashita O, Kawawaki D, Doya K, Kawato M, Toyama K, Sato MA. A hierarchical Bayesian method to resolve an inverse problem of MEG contaminated with eye movement artifacts. Neuroimage. 45: 393-409. PMID 19150653 DOI: 10.1016/j.neuroimage.2008.12.012	0.546
2009	Ito M, Doya K. Different representation of action and reward in the dorsal and the ventral striatum Neuroscience Research. 65. DOI: 10.1016/J.Neures.2009.09.505	0.316
2009	Yoshida T, Ito M, Morimura T, Samejima K, Okuda J, Yoshimoto J, Doya K. Brain mechanisms for evaluating probabilistic and delayed rewards Neuroscience Research. 65: S239. DOI: 10.1016/J.Neures.2009.09.1350	0.536
2009	Fermin A, Takehiko Y, Tanaka S, Ito M, Yoshimoto J, Doya K. Reinforcement learning strategies for sequential action learning Neuroscience Research. 65. DOI: 10.1016/J.Neures.2009.09.1332	0.556
2009	Nakano T, Yoshimoto J, Wickens J, Doya K. Calcium responses model in striatum dependent on timed input sources Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5768: 249-258. DOI: 10.1007/978-3-642-04274-4_26	0.582
2008	Uchibe E, Doya K. Finding intrinsic rewards by embodied evolution and constrained reinforcement learning. Neural Networks : the Official Journal of the International Neural Network Society. 21: 1447-55. PMID 19013054 DOI: 10.1016/j.neunet.2008.09.013	0.385
2008	Ito M, Doya K. [Mathematical models of decision making and learning]. Brain and Nerve = Shinkei Kenkyå« No Shinpo. 60: 791-8. PMID 18646619	0.346
2008	Schweighofer N, Bertin M, Shishida K, Okamoto Y, Tanaka SC, Yamawaki S, Doya K. Low-serotonin levels increase delayed reward discounting in humans. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 28: 4528-32. PMID 18434531 DOI: 10.1523/JNEUROSCI.4982-07.2008	0.54
2008	Bissmarck F, Nakahara H, Doya K, Hikosaka O. Combining modalities with different latencies for optimal motor control. Journal of Cognitive Neuroscience. 20: 1966-79. PMID 18416676 DOI: 10.1162/jocn.2008.20133	0.412
2008	Doya K. Modulators of decision making. Nature Neuroscience. 11: 410-6. PMID 18368048 DOI: 10.1038/nn2077	0.341
2008	Elfwing S, Uchibe E, Doya K, Christensen HI. Co-evolution of shaping rewards and meta-parameters in reinforcement learning Adaptive Behavior. 16: 400-412. DOI: 10.1177/1059712308092835	0.442
2008	Sato T, Uchibe E, Doya K. Learning how, what, and whether to communicate: Emergence of protocommunication in reinforcement learning agents Artificial Life and Robotics. 12: 70-74. DOI: 10.1007/s10015-007-0444-x	0.388
2008	Uchibe E, Doya K. Finding exploratory rewards by embodied evolution and constrained reinforcement learning in the cyber rodents Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4985: 167-176. DOI: 10.1007/978-3-540-69162-4_18	0.325
2008	Samejima K, Doya K. Estimating internal variables of a decision maker's brain: A model-based approach for neuroscience Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4984: 596-603. DOI: 10.1007/978-3-540-69158-7_62	0.462
2008	Schweighofer N, Bertin M, Shishida K, Okamoto Y, Tanaka SC, Yamawaki S, Doya K. Low-serotonin levels increase delayed reward discounting in humans (Journal of Neuroscience (2008) (4528-4532)) Journal of Neuroscience. 28: 5619.	0.385
2007	Doya K. Reinforcement learning: Computational theory and biological mechanisms. Hfsp Journal. 1: 30-40. PMID 19404458 DOI: 10.2976/1.2732246/10.2976/1	0.423
2007	Tanaka SC, Schweighofer N, Asahi S, Shishida K, Okamoto Y, Yamawaki S, Doya K. Serotonin differentially regulates short- and long-term prediction of rewards in the ventral and dorsal striatum. Plos One. 2: e1333. PMID 18091999 DOI: 10.1371/journal.pone.0001333	0.486
2007	Corrado G, Doya K. Understanding neural coding through the model-based analysis of decision making. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 27: 8178-80. PMID 17670963 DOI: 10.1523/JNEUROSCI.1590-07.2007	0.318
2007	Bertin M, Schweighofer N, Doya K. Multiple model-based reinforcement learning explains dopamine neuronal activity. Neural Networks : the Official Journal of the International Neural Network Society. 20: 668-75. PMID 17611074 DOI: 10.1016/j.neunet.2007.04.028	0.4
2007	Samejima K, Doya K. Multiple representations of belief states and action values in corticobasal ganglia loops. Annals of the New York Academy of Sciences. 1104: 213-28. PMID 17435124 DOI: 10.1196/Annals.1390.024	0.569
2007	Schweighofer N, Tanaka SC, Doya K. Serotonin and the evaluation of future rewards: theory, experiments, and possible neural mechanisms. Annals of the New York Academy of Sciences. 1104: 289-300. PMID 17360806 DOI: 10.1196/annals.1390.011	0.507
2007	Morimoto J, Doya K. Reinforcement learning state estimator. Neural Computation. 19: 730-56. PMID 17298231 DOI: 10.1162/neco.2007.19.3.730	0.345
2007	Ogasawara H, Doi T, Doya K, Kawato M. Nitric oxide regulates input specificity of long-term depression and context dependence of cerebellar learning. Plos Computational Biology. 3: e179. PMID 17222054 DOI: 10.1371/journal.pcbi.0020179	0.638
2007	Elfwing S, Uchibe E, Doya K, Christensen HI. Evolutionary development of hierarchical learning structures Ieee Transactions On Evolutionary Computation. 11: 249-264. DOI: 10.1109/TEVC.2006.890270	0.381
2007	Uchibe E, Doya K. Constrained reinforcement learning from intrinsic and extrinsic rewards 2007 Ieee 6th International Conference On Development and Learning, Icdl. 163-168. DOI: 10.1109/DEVLRN.2007.4354030	0.321
2007	Miyazaki K, Miyazaki K, Doya K. Activity of serotonergic neurons in the dorsal raphe nucleus of freely moving rats during reward and non-reward delay period Neuroscience Research. 58. DOI: 10.1016/J.Neures.2007.06.715	0.301
2007	Samejima K, Ueda Y, Doya K, Kimura M. Action value in the striatum and reinforcement-learning model of cortico-basal ganglia network Neuroscience Research. 58: S22. DOI: 10.1016/J.Neures.2007.06.127	0.609
2007	Ueda Y, Samejima K, Doya K, Kimura M. Selective impairment of reward-based adaptive choice of actions by intra-striatal injection of dopamine D1 receptor antagonist Neuroscience Research. 58: S114. DOI: 10.1016/J.Neures.2007.06.1237	0.513
2007	Doya K. In pursuit of the brain mechanism of reinforcement learning Neuroscience Research. 58. DOI: 10.1016/J.Neures.2007.06.007	0.397
2007	Tanaka SC, Samejima K, Okada G, Ueda K, Okamoto Y, Yamawaki S, Doya K. Erratum to “Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics” [Neural Netw. 19 (8) (2006) 1233–1241] Neural Networks. 20: 285-286. DOI: 10.1016/J.Neunet.2006.12.001	0.588
2007	Matsubara T, Morimoto J, Nakanishi J, Sato MA, Doya K. Learning a dynamic policy by using policy gradient: Application to biped walking Systems and Computers in Japan. 38: 25-38. DOI: 10.1002/scj.20441	0.352
2006	Schweighofer N, Shishida K, Han CE, Okamoto Y, Tanaka SC, Yamawaki S, Doya K. Humans can adopt optimal discounting strategy under real-time constraints. Plos Computational Biology. 2: e152. PMID 17096592 DOI: 10.1371/Journal.Pcbi.0020152	0.502
2006	Tanaka SC, Samejima K, Okada G, Ueda K, Okamoto Y, Yamawaki S, Doya K. Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics. Neural Networks : the Official Journal of the International Neural Network Society. 19: 1233-41. PMID 16979871 DOI: 10.1016/J.Neunet.2006.05.039	0.675
2006	Bapi RS, Miyapuram KP, Graydon FX, Doya K. fMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences. Neuroimage. 32: 714-27. PMID 16798015 DOI: 10.1016/j.neuroimage.2006.04.205	0.76
2006	Daw ND, Doya K. The computational neurobiology of learning and reward. Current Opinion in Neurobiology. 16: 199-204. PMID 16563737 DOI: 10.1016/j.conb.2006.03.006	0.442
2006	Kawawaki D, Shibata T, Goda N, Doya K, Kawato M. Anterior and superior lateral occipito-temporal cortex responsible for target motion prediction during overt and covert visual pursuit. Neuroscience Research. 54: 112-23. PMID 16337706 DOI: 10.1016/j.neures.2005.10.015	0.569
2006	Capi G, Doya K. Application of evolutionary computation for efficient reinforcement learning Applied Artificial Intelligence. 20: 35-55. DOI: 10.1080/08839510500191299	0.427
2006	Matsubara T, Morimoto J, Nakanishi J, Sato Ma, Doya K. Learning CPG-based biped locomotion with a policy gradient method Robotics and Autonomous Systems. 54: 911-920. DOI: 10.1016/j.robot.2006.05.012	0.343
2006	Samejima K, Katagiri K, Doya K, Kawato M. Symbolization and imitation learning of motion sequence using competitive modules Electronics and Communications in Japan, Part Iii: Fundamental Electronic Science (English Translation of Denshi Tsushin Gakkai Ronbunshi). 89: 42-53. DOI: 10.1002/Ecjc.20267	0.702
2006	Samejima K, Katagiri K, Doya K, Kawato M. Multiple model-based reinforcement learning for nonlinear control Electronics and Communications in Japan, Part Iii: Fundamental Electronic Science (English Translation of Denshi Tsushin Gakkai Ronbunshi). 89: 54-69. DOI: 10.1002/Ecjc.20266	0.727
2006	Miyapuram KP, Bapi RS, Pammi CVS, Ahmed, Doya K. Hierarchical chunking during learning of visuomotor sequences Ieee International Conference On Neural Networks - Conference Proceedings. 249-253.	0.744
2005	Samejima K, Ueda Y, Doya K, Kimura M. Representation of action-specific reward values in the striatum. Science (New York, N.Y.). 310: 1337-40. PMID 16311337 DOI: 10.1126/Science.1115270	0.548
2005	Morimoto J, Doya K. Robust reinforcement learning. Neural Computation. 17: 335-59. PMID 15720771 DOI: 10.1162/0899766053011528	0.312
2005	Ogasawara H, Doi T, Doya K, Kawato M. NO Regulates Input-Specificity of LTD and Context Dependence of Cerebellar Learning Plos Computational Biology. DOI: 10.1371/Journal.Pcbi.0020179.Eor	0.637
2005	Doya K, Uchibe E. The cyber rodent project: Exploration of adaptive mechanisms for self-preservation and self-reproduction Adaptive Behavior. 13: 149-160. DOI: 10.1177/105971230501300206	0.356
2005	Capi G, Doya K. Evolution of neural architecture fitting environmental dynamics Adaptive Behavior. 13: 53-66. DOI: 10.1177/105971230501300103	0.332
2004	Sato MA, Yoshioka T, Kajihara S, Toyama K, Goda N, Doya K, Kawato M. Hierarchical Bayesian estimation for MEG inverse problem. Neuroimage. 23: 806-26. PMID 15528082 DOI: 10.1016/j.neuroimage.2004.06.037	0.557
2004	Tanaka SC, Doya K, Okada G, Ueda K, Okamoto Y, Yamawaki S. Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nature Neuroscience. 7: 887-93. PMID 15235607 DOI: 10.1038/nn1279	0.552
2004	Schweighofer N, Doya K, Fukai H, Chiron JV, Furukawa T, Kawato M. Chaos may enhance information transmission in the inferior olive. Proceedings of the National Academy of Sciences of the United States of America. 101: 4655-60. PMID 15070773 DOI: 10.1073/pnas.0305966101	0.576
2004	Miyamoto H, Morimoto J, Doya K, Kawato M. Reinforcement learning with via-point representation. Neural Networks : the Official Journal of the International Neural Network Society. 17: 299-305. PMID 15037348 DOI: 10.1016/j.neunet.2003.11.004	0.648
2004	Schweighofer N, Doya K, Kuroda S. Cerebellar aminergic neuromodulation: towards a functional understanding. Brain Research. Brain Research Reviews. 44: 103-16. PMID 15003388 DOI: 10.1016/j.brainresrev.2003.10.004	0.409
2004	Haruno M, Kuroda T, Doya K, Toyama K, Kimura M, Samejima K, Imamizu H, Kawato M. A neural correlate of reward-based behavioral learning in caudate nucleus: a functional magnetic resonance imaging study of a stochastic decision task. The Journal of Neuroscience : the Official Journal of the Society For Neuroscience. 24: 1660-5. PMID 14973239 DOI: 10.1523/Jneurosci.3417-03.2004	0.731
2004	Uchibe E, Doya K. Hierarchical Reinforcement Learning for Multiple Reward Functions Journal of the Robotics Society of Japan. 22: 120-129. DOI: 10.7210/Jrsj.22.120	0.433
2004	Chandrasekhar Pammi VS, Miyapuram KP, Bapi RS, Doya K. Chunking phenomenon in complex sequential skill learning in humans Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3316: 294-299.	0.734
2004	Elfwing S, Uchibe E, Doya K, Christensen HI. Multi-agent reinforcement learning: Using macro actions to learn a mating task 2004 Ieee/Rsj International Conference On Intelligent Robots and Systems (Iros). 4: 3164-3169.	0.303
2004	Samejima K, Doya K, Ueda Y, Kimura M. Estimating internal variables and parameters of a learning agent by a particle filter Advances in Neural Information Processing Systems.	0.477
2003	Samejima K, Doya K, Kawato M. Inter-module credit assignment in modular reinforcement learning. Neural Networks : the Official Journal of the International Neural Network Society. 16: 985-94. PMID 14692633 DOI: 10.1016/S0893-6080(02)00235-6	0.716
2003	Wolpert DM, Doya K, Kawato M. A unifying computational framework for motor control and social interaction. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 358: 593-602. PMID 12689384 DOI: 10.1098/rstb.2002.1238	0.555
2003	Schweighofer N, Doya K. Meta-learning in reinforcement learning. Neural Networks : the Official Journal of the International Neural Network Society. 16: 5-9. PMID 12576101 DOI: 10.1016/S0893-6080(02)00228-9	0.417
2003	Koike Y, Doya K. Driver model based on reinforced learning with multiple-step state estimation Electronics and Communications in Japan, Part Iii: Fundamental Electronic Science (English Translation of Denshi Tsushin Gakkai Ronbunshi). 86: 85-95. DOI: 10.1002/ecjc.10123	0.392
2002	Doya K. Metalearning and neuromodulation. Neural Networks : the Official Journal of the International Neural Network Society. 15: 495-506. PMID 12371507 DOI: 10.1016/S0893-6080(02)00044-8	0.371
2002	Doya K, Samejima K, Katagiri K, Kawato M. Multiple model-based reinforcement learning. Neural Computation. 14: 1347-69. PMID 12020450 DOI: 10.1162/089976602753712972	0.727
2002	Doya K, Dayan P, Hasselmo ME. Introduction for 2002 Special Issue: Computational models of neuromodulation Neural Networks. 15: 475-477. DOI: 10.1016/S0893-6080(02)00042-4	0.439
2001	Nakahara H, Doya K, Hikosaka O. Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach. Journal of Cognitive Neuroscience. 13: 626-47. PMID 11506661 DOI: 10.1162/089892901750363208	0.412
2001	Schweighofer N, Doya K, Lay F. Unsupervised learning of granule cell sparse codes enhances cerebellar adaptive control. Neuroscience. 103: 35-50. PMID 11311786 DOI: 10.1016/S0306-4522(00)00548-0	0.377
2001	Kuroda S, Yamamoto K, Miyamoto H, Doya K, Kawat M. Statistical characteristics of climbing fiber spikes necessary for efficient cerebellar learning. Biological Cybernetics. 84: 183-92. PMID 11252636 DOI: 10.1007/s004220000206	0.401
2001	Morimoto J, Doya K. Acquisition of Stand-up Behavior by a 3-link 2-joint Robot using Hierarchical Reinforcement Learning Journal of the Robotics Society of Japan. 19: 574-579. DOI: 10.7210/Jrsj.19.574	0.344
2001	Samejima K, Doya K, Kawato M. Journal of the Robotics Society of Japan. 19: 551-556. DOI: 10.7210/Jrsj.19.551	0.657
2001	Doya K, Kimura H, Kawato M. Neural mechanisms of learning and control Ieee Control Systems Magazine. 21: 42-51. DOI: 10.1109/37.939943	0.667
2001	Morimoto J, Doya K. Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning Robotics and Autonomous Systems. 36: 37-51. DOI: 10.1016/S0921-8890(01)00113-0	0.417
2000	Doya K. Complementary roles of basal ganglia and cerebellum in learning and motor control. Current Opinion in Neurobiology. 10: 732-9. PMID 11240282 DOI: 10.1016/S0959-4388(00)00153-7	0.42
2000	Bapi RS, Doya K, Harner AM. Evidence for effector independent and dependent representations and their differential time course of acquisition during motor sequence learning. Experimental Brain Research. 132: 149-62. PMID 10853941 DOI: 10.1007/s002219900332	0.353
2000	Doya K. Reinforcement learning in continuous time and space. Neural Computation. 12: 219-45. PMID 10636940 DOI: 10.1162/089976600300015961	0.366
1999	Doya K. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks : the Official Journal of the International Neural Network Society. 12: 961-974. PMID 12662639 DOI: 10.1016/S0893-6080(99)00046-5	0.462
1999	Hikosaka O, Nakahara H, Rand MK, Sakai K, Lu X, Nakamura K, Miyachi S, Doya K. Parallel neural networks for learning sequential procedures. Trends in Neurosciences. 22: 464-71. PMID 10481194 DOI: 10.1016/S0166-2236(99)01439-3	0.441
1999	Schweighofer N, Doya K, Kawato M. Electrophysiological properties of inferior olive neurons: A compartmental model. Journal of Neurophysiology. 82: 804-17. PMID 10444678 DOI: 10.1152/Jn.1999.82.2.804	0.542
1999	Morimoto J, Doya K. Hierarchical reinforcement learning for motion learning: Learning `stand-up' trajectories Advanced Robotics. 13: 267-268. DOI: 10.1163/156855399X00513	0.4
1994	Doya K, Selverston AI. Dimension Reduction of Biological Neuron Models by Artificial Neural Networks Neural Computation. 6: 696-717. DOI: 10.1162/neco.1994.6.4.696	0.583
1991	Doya K, Yoshizawa S. Neural network model of temporal pattern memory Systems and Computers in Japan. 22: 61-69. DOI: 10.1002/Scj.4690220907	0.433
1989	Doya K, Yoshizawa S. Adaptive neural oscillator using continuous-time back-propagation learning Neural Networks. 2: 375-385. DOI: 10.1016/0893-6080(89)90022-1	0.385
Show low-probability matches.