Andrew Barto - Publications

Affiliations: 
University of Massachusetts, Amherst, Amherst, MA 
Area:
Reinforcement Learning
Website:
http://www-anw.cs.umass.edu/~barto/

67 high-probability publications.

Year Citation  Score
2019 Santucci VG, Oudeyer PY, Barto A, Baldassarre G. Editorial: Intrinsically Motivated Open-Ended Learning in Autonomous Robots. Frontiers in Neurorobotics. 13: 115. PMID 32009927 DOI: 10.3389/fnbot.2019.00115  0.411
2019 Barto AG. Reinforcement Learning: Connections, Surprises, and Challenges AI Magazine. 40: 3-15. DOI: 10.1609/AIMAG.V40I1.2844  0.369
2018 Frankenhuis WE, Panchanathan K, Barto AG. Enriching Behavioral Ecology with Reinforcement Learning Methods. Behavioural Processes. PMID 29412143 DOI: 10.1016/J.Beproc.2018.01.008  0.322
2015 Niekum S, Osentoski S, Konidaris G, Chitta S, Marthi B, Barto AG. Learning grounded finite-state representations from unstructured demonstrations International Journal of Robotics Research. 34: 131-157. DOI: 10.1177/0278364914554471  0.693
2015 Niekum S, Osentoski S, Atkeson CG, Barto AG. Online Bayesian changepoint detection for articulated motion models Proceedings - IEEE International Conference On Robotics and Automation. 2015: 1468-1475. DOI: 10.1109/ICRA.2015.7139383  0.505
2015 Botvinick M, Weinstein A, Solway A, Barto A. Reinforcement learning, efficient coding, and the statistics of natural tasks Current Opinion in Behavioral Sciences. 5: 71-77. DOI: 10.1016/J.Cobeha.2015.08.009  0.459
2014 Baldassarre G, Stafford T, Mirolli M, Redgrave P, Ryan RM, Barto A. Intrinsic motivations and open-ended development in animals, humans, and robots: an overview. Frontiers in Psychology. 5: 985. PMID 25249998 DOI: 10.3389/Fpsyg.2014.00985  0.394
2014 Da Silva BC, Baldassarre G, Konidaris G, Barto A. Learning parameterized motor skills on a humanoid robot Proceedings - IEEE International Conference On Robotics and Automation. 5239-5244. DOI: 10.1109/ICRA.2014.6907629  0.386
2014 Da Silva BC, Konidaris G, Barto A. Active learning of parameterized skills 31st International Conference On Machine Learning, ICML 2014. 5: 3736-3745.  0.361
2013 Levy YZ, Levy DJ, Barto AG, Meyer JS. A computational hypothesis for allostasis: delineation of substance dependence, conventional therapies, and alternative treatments. Frontiers in Psychiatry. 4: 167. PMID 24391601 DOI: 10.3389/Fpsyt.2013.00167  0.709
2013 Barto A, Mirolli M, Baldassarre G. Novelty or surprise? Frontiers in Psychology. 4: 907. PMID 24376428 DOI: 10.3389/fpsyg.2013.00907  0.306
2013 Shah A, Barto AG, Fagg AH. A dual process account of coarticulation in motor skill acquisition. Journal of Motor Behavior. 45: 531-49. PMID 24116847 DOI: 10.1080/00222895.2013.837423  0.528
2013 Kuindersma SR, Grupen RA, Barto AG. Variable risk control via stochastic optimization International Journal of Robotics Research. 32: 806-825. DOI: 10.1177/0278364913476124  0.766
2013 Barto AG. Intrinsic motivation and reinforcement learning Intrinsically Motivated Learning in Natural and Artificial Systems. 17-47. DOI: 10.1007/978-3-642-32375-1_2  0.373
2013 Kuindersma S, Grupen R, Barto A. Variational Bayesian optimization for runtime risk-sensitive control Robotics: Science and Systems. 8: 201-208.  0.609
2012 Konidaris G, Kuindersma S, Grupen R, Barto A. Robot learning from demonstration by constructing skill trees International Journal of Robotics Research. 31: 360-375. DOI: 10.1177/0278364911428653  0.762
2012 Niekum S, Osentoski S, Konidaris G, Barto AG. Learning and generalization of complex tasks from unstructured demonstrations IEEE International Conference On Intelligent Robots and Systems. 5239-5246. DOI: 10.1109/IROS.2012.6386006  0.348
2012 Konidaris G, Scheidwasser I, Barto AG. Transfer in reinforcement learning via shared features Journal of Machine Learning Research. 13: 1333-1371.  0.306
2012 Da Silva BC, Konidaris G, Barto AG. Learning parameterized skills Proceedings of the 29th International Conference On Machine Learning, ICML 2012. 2: 1679-1686.  0.383
2012 Dabney W, Barto AG. Adaptive step-size for online temporal difference learning Proceedings of the National Conference On Artificial Intelligence. 2: 872-878.  0.317
2011 Ribas-Fernandes JJ, Solway A, Diuk C, McGuire JT, Barto AG, Niv Y, Botvinick MM. A neural signature of hierarchical reinforcement learning. Neuron. 71: 370-9. PMID 21791294 DOI: 10.1016/J.Neuron.2011.05.042  0.361
2011 Niekum S, Spector L, Barto A. Evolution of reward functions for reinforcement learning Genetic and Evolutionary Computation Conference, GECCO'11 - Companion Publication. 177-178. DOI: 10.1145/2001858.2001957  0.331
2011 Kuindersma S, Grupen R, Barto A. Learning dynamic arm motions for postural recovery IEEE-RAS International Conference On Humanoid Robots. 7-12. DOI: 10.1109/Humanoids.2011.6100881  0.77
2011 Botvinick MM, Niv Y, Barto AG. Hierarchically organised behaviour and its neural foundations: A reinforcement-learning perspective Modelling Natural Action Selection. 264-299. DOI: 10.1017/CBO9780511731525.017  0.355
2011 Konidaris G, Kuindersma S, Grupen R, Barto A. Autonomous skill acquisition on a mobile manipulator Proceedings of the National Conference On Artificial Intelligence. 2: 1468-1473.  0.599
2010 Konidaris G, Kuindersma S, Barto A, Grupen R. Constructing skill trees for reinforcement learning agents from demonstration trajectories Advances in Neural Information Processing Systems 23: 24th Annual Conference On Neural Information Processing Systems 2010, NIPS 2010.  0.662
2009 Shah A, Barto AG. Effect on movement selection of an evolving sensory representation: a multiple controller model of skill acquisition. Brain Research. 1299: 55-73. PMID 19595991 DOI: 10.1016/j.brainres.2009.07.006  0.543
2009 Botvinick MM, Niv Y, Barto AG. Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition. 113: 262-80. PMID 18926527 DOI: 10.1016/j.cognition.2008.08.011  0.388
2009 Konidaris G, Barto A. Skill discovery in continuous reinforcement learning domains using skill chaining Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. 1015-1023.  0.324
2009 Konidaris G, Barto A. Efficient skill learning using abstraction selection IJCAI International Joint Conference On Artificial Intelligence. 1107-1112.  0.343
2007 Konidaris G, Barto A. Building portable options: Skill transfer in reinforcement learning IJCAI International Joint Conference On Artificial Intelligence. 895-900.  0.333
2006 Konidaris G, Barto A. Autonomous shaping: Knowledge transfer in reinforcement learning ACM International Conference Proceeding Series. 148: 489-496. DOI: 10.1145/1143844.1143906  0.325
2006 Rosenstein MT, Barto AG, Van Emmerik REA. Learning at the level of synergies for a robot weightlifter Robotics and Autonomous Systems. 54: 706-717. DOI: 10.1016/J.Robot.2006.03.002  0.773
2006 Wolfe AP, Barto AG. Decision tree methods for finding reusable MDP homomorphisms Proceedings of the National Conference On Artificial Intelligence. 1: 530-535.  0.458
2005 Berthier NE, Rosenstein MT, Barto AG. Approximate optimal control as a model for motor learning. Psychological Review. 112: 329-46. PMID 15783289 DOI: 10.1037/0033-295X.112.2.329  0.747
2005 Singh S, Barto AG, Chentanez N. Intrinsically motivated reinforcement learning Advances in Neural Information Processing Systems.  0.373
2005 Şimşek Ö, Wolfe AP, Barto AG. Identifying useful subgoals in reinforcement learning by local graph partitioning ICML 2005 - Proceedings of the 22nd International Conference On Machine Learning. 817-824.  0.594
2005 Şimşek Ö, Barto AG. Learning skills in reinforcement learning using relative novelty Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3607: 367-374.  0.386
2004 Shah A, Fagg AH, Barto AG. Cortical involvement in the recruitment of wrist muscles. Journal of Neurophysiology. 91: 2445-56. PMID 14749314 DOI: 10.1152/Jn.00879.2003  0.394
2004 Rosenstein MT, Barto AG. Reinforcement learning with supervision by a stable controller Proceedings of the American Control Conference. 5: 4517-4522. DOI: 10.1109/ACC.2004.182663  0.753
2004 Şimşek Ö, Wolfe AP, Barto AG. Local graph partitioning as a basis for generating temporally-extended actions in reinforcement learning AAAI Workshop - Technical Report. 91-96.  0.592
2004 Şimşek Ö, Barto AG. Using relative novelty to identify useful temporal abstractions in reinforcement learning Proceedings, Twenty-First International Conference On Machine Learning, ICML 2004. 751-758.  0.342
2003 Barto AG, Mahadevan S. Recent Advances in Hierarchical Reinforcement Learning Discrete Event Dynamic Systems: Theory and Applications. 13: 343-379+382. DOI: 10.1023/A:1022140919877  0.372
2003 Perkins TJ, Barto AG. Lyapunov design for safe reinforcement learning Journal of Machine Learning Research. 3: 803-832.  0.636
2002 Fagg AH, Shah A, Barto AG. A computational model of muscle recruitment for wrist movements. Journal of Neurophysiology. 88: 3348-58. PMID 12466451 DOI: 10.1152/Jn.00621.2002  0.398
2002 Ravindran B, Barto AG. Model minimization in hierarchical reinforcement learning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2371: 196-211.  0.364
2001 Rosenstein MT, Barto AG. Robot weightlifting by direct policy search IJCAI International Joint Conference On Artificial Intelligence. 839-844.  0.714
2001 Perkins TJ, Barto AG. Heuristic search in infinite state spaces guided by Lyapunov analysis IJCAI International Joint Conference On Artificial Intelligence. 242-247.  0.511
1999 Moll R, Barto AG, Perkins TJ, Sutton RS. Learning instance-independent value functions to enhance local search Advances in Neural Information Processing Systems. 1017-1023.  0.701
1998 Monaco JF, Ward DG, Barto AG. Automated aircraft recovery via reinforcement learning: Initial experiments Advances in Neural Information Processing Systems. 1022-1028.  0.328
1998 Crites RH, Barto AG. Elevator Group Control Using Multiple Reinforcement Learning Agents Machine Learning. 33: 235-262.  0.31
1997 Barto AG, Sutton RS. Chapter 19 Reinforcement learning in artificial intelligence Advances in Psychology. 121: 358-386. DOI: 10.1016/S0166-4115(97)80105-7  0.57
1997 Hansen EA, Barto AG, Zilberstein S. Reinforcement learning for mixed open-loop and closed-loop control Advances in Neural Information Processing Systems. 1026-1032.  0.307
1997 Duff MO, Barto AG. Local bandit approximation for optimal learning problems Advances in Neural Information Processing Systems. 1019-1025.  0.666
1994 Barto AG. Reinforcement learning control. Current Opinion in Neurobiology. 4: 888-93. PMID 7888773 DOI: 10.1016/0959-4388(94)90138-4  0.379
1994 Gullapalli V, Barto AG, Grupen RA. Learning admittance mappings for force-guided assembly Proceedings - IEEE International Conference On Robotics and Automation. 2633-2638.  0.609
1992 Sutton RS, Barto AG, Williams RJ. Reinforcement Learning is Direct Adaptive Optimal Control IEEE Control Systems. 12: 19-22. DOI: 10.1109/37.126844  0.567
1992 Gullapalli V, Grupen RA, Barto AG. Learning reactive admittance control Proceedings - IEEE International Conference On Robotics and Automation. 2: 1475-1480.  0.359
1991 Jacobs RA, Jordan MI, Barto AG. Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks Cognitive Science. 15: 219-250. DOI: 10.1016/0364-0213(91)80006-Q  0.64
1986 Moore JW, Desmond JE, Berthier NE, Blazis DE, Sutton RS, Barto AG. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals. Behavioural Brain Research. 21: 143-54. PMID 3755947 DOI: 10.1016/0166-4328(86)90092-6  0.392
1985 Barto AG, Anandan P. Pattern-Recognizing Stochastic Learning Automata IEEE Transactions On Systems, Man and Cybernetics. 360-375. DOI: 10.1109/TSMC.1985.6313371  0.329
1983 Barto AG, Sutton RS, Anderson CW. Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems IEEE Transactions On Systems, Man and Cybernetics. 834-846. DOI: 10.1109/TSMC.1983.6313077  0.536
1982 Barto AG, Anderson CW, Sutton RS. Synthesis of nonlinear control surfaces by a layered associative search network. Biological Cybernetics. 43: 175-85. PMID 7093360 DOI: 10.1007/BF00319977  0.48
1982 Barto AG, Sutton RS. Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behavioural Brain Research. 4: 221-35. PMID 6277346 DOI: 10.1016/0166-4328(82)90001-8  0.387
1981 Barto AG, Sutton RS. Landmark learning: an illustration of associative search. Biological Cybernetics. 42: 1-8. PMID 7326277 DOI: 10.1007/BF00335152  0.518
1981 Sutton RS, Barto AG. Toward a modern theory of adaptive networks: expectation and prediction. Psychological Review. 88: 135-70. PMID 7291377 DOI: 10.1037/0033-295X.88.2.135  0.469
1979 Barto AG, Sutton RS, Brouwer PS. Associative search network: A reinforcement learning associative memory Biological Cybernetics. 40: 201-211. DOI: 10.1007/BF00453370  0.452