Hamid Reza Maei, Ph.D.
Affiliations: | Computing Science | University of Alberta, Edmonton, Alberta, Canada |
Area:
Machine Learning, Reinforcement LearningGoogle:
"Hamid Reza Maei"Cross-listing: Neurotree
Parents
Sign in to add mentorJané Kondev | grad student | 2001-2002 | Brandeis | |
Larry F. Abbott | grad student | 2002-2003 | Brandeis (Neurotree) | |
Peter Latham | grad student | 2003-2005 | UCL (Neurotree) | |
Paul W. Frankland | grad student | 2005-2006 | University of Toronto (Neurotree) | |
Richard S. Sutton | grad student | 2007-2011 | University of Alberta | |
(Gradient Temporal-Difference Learning Algorithms.) |
BETA: Related publications
See more...
Publications
You can help our author matching system! If you notice any publications incorrectly attributed to this author, please sign in and mark matches as correct or incorrect. |
Wen Z, O'Neill D, Maei H. (2015) Optimal demand response using device-based reinforcement learning Ieee Transactions On Smart Grid. 6: 2312-2324 |
Wallace E, Maei HR, Latham PE. (2013) Randomly connected networks have short temporal memory. Neural Computation. 25: 1408-39 |
Maei HR, Szepesvari C, Bhatnagar S, et al. (2010) Toward off-policy learning control with function approximation Icml 2010 - Proceedings, 27th International Conference On Machine Learning. 719-726 |
Maei HR, Sutton RS. (2010) GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces Artificial General Intelligence - Proceedings of the Third Conference On Artificial General Intelligence, Agi 2010. 91-96 |
Maei HR, Zaslavsky K, Wang AH, et al. (2009) Development and validation of a sensitive entropy-based measure for the water maze. Frontiers in Integrative Neuroscience. 3: 33 |
Maei HR, Zaslavsky K, Teixeira CM, et al. (2009) What is the Most Sensitive Measure of Water Maze Probe Test Performance? Frontiers in Integrative Neuroscience. 3: 4 |
Sutton RS, Maei HR, Precup D, et al. (2009) Fast gradient-descent methods for temporal-difference learning with linear function approximation Proceedings of the 26th International Conference On Machine Learning, Icml 2009. 993-1000 |
Maei HR, Szepesvari C, Bhatnagar S, et al. (2009) Convergent temporal-difference learning with arbitrary smooth function approximation Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. 1204-1212 |
Sutton RS, Szepesvári C, Maei HR. (2009) A convergent O(n) algorithm for off-policy temporal-difference learning with linear function approximation Advances in Neural Information Processing Systems 21 - Proceedings of the 2008 Conference. 1609-1616 |
Sutton RS, Maei HR, Precup D, et al. (2009) Fast gradient-descent methods for temporal-difference learning with linear function approximation Proceedings of the 26th International Conference On Machine Learning, Icml 2009. 993-1000 |