Satinder Singh's NIPS Papers

NIPS papers

Go back to publications main page.

Action-Conditional Video Prediction Using Deep Networks in ATARI Games.
by Juhnyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard Lewis, and Satinder Singh.
In Neural Information Processing Systems, 2015.
online videos
arxiv pdf, NIPS pdf, NIPS Appendix pdf.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning.
by Xiaoxiao Guo, Satinder Singh, Honglak Lee, Richard Lewis, and Xiaoshi Wang.
In Neural Information Processing Systems (NIPS), 2014.
pdf.
Reward Mapping for Transfer in Long-Lived Agents.
by Xiaoxiao Guo, Satinder Singh, and Richard L Lewis.
In Advances in Neural Information Processing Systems (NIPS), 26, 2013.
pdf.
Reward Design via Online Gradient Ascent
by Jonathan Sorg, Satinder Singh, and Richard Lewis.
In Neural Information Processing Systems (NIPS), 2010.
pdf.
Simple Local Models for Complex Dynamical Systems
by Erik Talvitie and Satinder Singh.
In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS), 2008.
pdf
Exponential Family Predictive Representations of State
by David Wnigate and Satinder Singh.
In Proceedings of the Advances in Neural Information Processing Systems, 20 (NIPS), pages 1617-1624, 2007.
pdf
Off-policy Learning with Options and Recognizers
by Doina Precup, Richard Sutton, Cosmin Paduraru, Anna Koop and Satinder Singh.
In Proceedings of Advances in Neural Information Processing Systems 18 (NIPS), pages 1097-1104, 2006.
pdf
Intrinsically Motivated Reinforcement Learning by Satinder Singh, Andrew G. Barto and Nuttapong Chentanez. To appear in Proceedings of Advances in Neural Information Processing Systems 17 (NIPS), 2005.
pdf.
Approximately Efficient Online Mechanism Design by David Parkes, Satinder Singh and Dimah Yanovsky. To appear in Proceedings of Advances in Neural Information Processing Systems 17 (NIPS), 2005.
pdf.
A Nonlinear Predictive State Representation by Matthew Rudary and Satinder Singh. In Advances in Neural Information Processing Systems 16 (NIPS), pages 855-862, 2004.
pdf.
An MDP-Based Approach to Online Mechanism Design by David Parkes and Satinder Singh. In Advances in Neural Information Processing Systems 16 (NIPS), pages 791-798, 2004.
pdf.
Predictive Representations of State by Michael Littman, Richard Sutton and Satinder Singh. In Advances in Neural Information Processing Systems 14 (NIPS), pages 1555-1561, 2001.
gzipped postscript pdf.
An Efficient Exact Algorithm for Single Connected Graphical Games by Michael Littman, Michael Kearns and Satinder Singh. In Advances in Neural Information Processing Systems 14 (NIPS), pages 817-823, 2002.
gzipped postscript pdf.
Cobot: A Social Reinforcement Learning Agent by Charles Isbell, Christian Shelton, Michael Kearns, Satinder Singh and Peter Stone. In Advances in Neural Information Processing Systems 14 (NIPS) pages 1393-1400, 2002.
gzipped postscript pdf.
Reinforcement Learning for Spoken Dialogue Systems by Satinder Singh, Michael Kearns, Diane Litman and Marilyn Walker. In Advances in Neural Information Processing Systems 12 (NIPS), 2000.
gzipped postscript.
Policy Gradient Methods for Reinforcement Learning with Function Approximation by Richard Sutton, Dave McAllester, Satinder Singh and Yishay Mansour. In Advances in Neural Information Processing Systems 12 (NIPS), 2000.
gzipped postscript.
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms by Michael Kearns and Satinder Singh. In Advances in Neural Information Processing Systems 11 (NIPS), pages 996-1002, 1999.
gzipped postscript.
Optimizing admission control while ensuring quality of service in multimedia networks via reinforcement learning by Timothy Brown, Hong Tong, and Satinder Singh. In Advances in Neural Information Processing Systems 11 (NIPS), pages 982-988, 1999.
gzipped postscript.
Experimental Results on Learning Stochastic Memoryless Policies for Partially Observable Markov Decision Processes by John K. Williams and Satinder Singh. In Advances in Neural Information Processing Systems 11 (NIPS), pages 1073-1079, 1999.
gzipped postscript.
Improved switching among temporally abstract actions by Richard Sutton, Satinder Singh, Doina Precup and Balaraman Ravindran. In Advances in Neural Information Processing Systems 11 (NIPS), pages 1066-1072, 1999.
gzipped postscript.
How to Dynamically Merge Markov Decision Processes by Satinder Singh and David Cohn. In Advances in Neural Information Processing Systems 10 (NIPS), pages 1057-1063, 1998.
gzipped postscript pdf.
Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems by Satinder Singh and Dimitri Bertsekas. In Advances in Neural Information Processing Systems 9 (NIPS), pages 974-980, 1997.
gzipped postscript
Predicting Lifetimes in Dynamically Allocated Memory by David Cohn and Satinder Singh. In Advances in Neural Information Processing Systems 9 (NIPS), pages 939-945, 1997.
gzipped postscript pdf.
Analytical Mean Squared Error Curves for Temporal Difference Learning by Satinder Singh and Peter Dayan. In Advances in Neural Information Processing Systems 9 (NIPS), pages 1054-1060, 1997.
gzipped postscript.
Improving Policies Without Measuring Merits by Peter Dayan and Satinder Singh. In Advances in Neural Information Processing Systems 8 (NIPS), pages 1059-1065, 1996.
gzipped postscript.
Reinforcement Learning With Soft State Aggregation by Satinder Singh, Tommi Jaakkola and Michael Jordan. In Advances in Neural Information Processing Systems 7 (NIPS), pages 361-368, 1995.
gzipped postscript pdf.
Reinforcement Learning Algorithm for Partially Observable Markov Problems by Tommi Jaakkola, Satinder Singh and Michael Jordan. In Advances in Neural Information Processing Systems 7 (NIPS), pages 345-352, 1995.
gzipped postscript pdf.
Robust Reinforcement Learning in Motion Planning by Satinder Singh, Andrew Barto, Roderic Grupen, and Christopher Connolly. In Advances in Neural Information Processing Systems 6 (NIPS), pages 655-662, 1994.
gzipped postscript.( 68 KBytes)
Stochastic Convergence of Iterative DP Algorithms by Tommi Jaakkola, Michael Jordan and Satinder Singh. In Advances in Neural Information Processing Systems 6 (NIPS), pages 703-710, 1994.
gzipped postscript pdf.
A Cortico-Cerebellar model that learns to generate distributed motor commands to control a kinetic arm by Satinder Singh, Neil Berthier, Andrew Barto, and Jim Houk. In Advances in Neural Information Processing Systems 4 (NIPS), pages 611-618, 1992.
The Efficient Learning of Multiple Task Sequences by Satinder Singh. In Advances in Neural Information Processing Systems 4 (NIPS), pages 251-258, 1992.
gzipped postscript.

Go back to publications main page.