| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 1 | Jonathan Sorg, Satinder P. Singh, Richard L. Lewis |
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Diane J. Litman, Satinder P. Singh, Marilyn A. Walker |
Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Michael L. Littman, Satinder P. Singh, Peter Stone |
ATTac-2000: An Adaptive Autonomous Bidding Agent  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Quang Duong, Michael P. Wellman, Satinder P. Singh |
Modeling Information Diffusion in Networks with Unobserved Links.  |
SocialCom/PASSAT  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Robert Cohn, Edmund H. Durfee, Satinder P. Singh |
Comparing action-query strategies in semi-autonomous agents.  |
AAMAS  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Robert Cohn, Edmund H. Durfee, Satinder P. Singh |
Comparing Action-Query Strategies in Semi-Autonomous Agents.  |
AAAI  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh, Richard L. Lewis |
Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents.  |
AAAI  |
2011 |
DBLP BibTeX RDF |
|
| 1 | David C. Parkes, Ruggiero Cavallo, Florin Constantin, Satinder P. Singh |
Dynamic Incentive Mechanisms.  |
AI Magazine  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Richard L. Lewis, Andrew G. Barto, Jonathan Sorg |
Intrinsically Motivated Reinforcement Learning: An Evolutionary Perspective.  |
IEEE T. Autonomous Mental Development  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Robert Cohn, Michael Maxim, Edmund H. Durfee, Satinder P. Singh |
Selecting Operator Queries Using Expected Myopic Gain.  |
IAT  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh, Richard L. Lewis |
Internal Rewards Mitigate Agent Boundedness.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh, Richard L. Lewis |
Variance-Based Rewards for Approximate Bayesian Reinforcement Learning.  |
UAI  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Quang Duong, Michael P. Wellman, Satinder P. Singh, Yevgeniy Vorobeychik |
History-dependent graphical multiagent models.  |
AAMAS  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh |
Linear options.  |
AAMAS  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh, Richard L. Lewis |
Reward Design via Online Gradient Ascent.  |
NIPS  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Quang Duong, Yevgeniy Vorobeychik, Satinder P. Singh, Michael P. Wellman |
Learning Graphical Game Models.  |
IJCAI  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Jonathan Sorg, Satinder P. Singh |
Transfer via soft homomorphisms.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
Markov decision process, homomorphism, transfer learning |
| 1 | Michael R. James, Satinder P. Singh |
SarsaLandmark: an algorithm for learning in POMDPs with landmarks.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning, landmark, POMDP, partial observability |
| 1 | Matthew R. Rudary, Satinder P. Singh |
Predictive Linear-Gaussian Models of Dynamical Systems with Vector-Valued Actions and Observations.  |
ISAIM  |
2008 |
DBLP BibTeX RDF |
|
| 1 | David Wingate, Satinder P. Singh |
Efficiently learning linear-linear exponential family predictive representations of state.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Quang Duong, Michael P. Wellman, Satinder P. Singh |
Knowledge Combination in Graphical Multiagent Models.  |
UAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Britton Wolfe, Michael R. James, Satinder P. Singh |
Approximate predictive state representations.  |
AAMAS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh |
Learning payoff functions in infinite games.  |
Machine Learning  |
2007 |
DBLP DOI BibTeX RDF |
Learning in games, Nash equilibrium approximation, Game theory |
| 1 | David Wingate, Vishal Soni, Britton Wolfe, Satinder P. Singh |
Relational Knowledge with Predictive State Representations.  |
IJCAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Vishal Soni, Satinder P. Singh, Michael P. Wellman |
Constraint satisfaction algorithms for graphical games.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
constraint satisfaction, graphical games |
| 1 | David Wingate, Satinder P. Singh |
On discovery and learning of models with predictive representations of state for agents with continuous actions and observations.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
predictive representations of state, information theory, dynamical system modeling |
| 1 | Vishal Soni, Satinder P. Singh |
Abstraction in Predictive State Representations.  |
AAAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Charles Lee Isbell Jr., Michael J. Kearns, Satinder P. Singh, Christian R. Shelton, Peter Stone, David P. Kormann |
Cobot in LambdaMOO: An Adaptive Social Statistics Agent.  |
Autonomous Agents and Multi-Agent Systems  |
2006 |
DBLP DOI BibTeX RDF |
Chat agents, Game theory, Reinforcement learning, Autonomous agents, Believable agents, Social modeling |
| 1 | Matthew R. Rudary, Satinder P. Singh |
Predictive linear-Gaussian models of controlled stochastic dynamical systems.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | David Wingate, Satinder P. Singh |
Kernel Predictive Linear Gaussian models for nonlinear stochastic dynamical systems.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Britton Wolfe, Satinder P. Singh |
Predictive state representations with options.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Ruggiero Cavallo, David C. Parkes, Satinder P. Singh |
Optimal Coordinated Planning Amongst Self-Interested Agents with Private State.  |
UAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | David Wingate, Satinder P. Singh |
Mixtures of Predictive Linear Gaussian Models for Nonlinear, Stochastic Dynamical Systems.  |
AAAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Vishal Soni, Satinder P. Singh |
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains.  |
AAAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Michael P. Wellman, Joshua Estelle, Satinder P. Singh, Yevgeniy Vorobeychik, Christopher Kiekintveld, Vishal Soni |
Strategic Interactions in a Supply Chain Game.  |
Computational Intelligence  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Nicholas L. Cassimatis, Sean Luke, Simon D. Levy, Ross W. Gayler, Pentti Kanerva, Chris Eliasmith, Timothy W. Bickmore, Alan C. Schultz, Randall Davis, James A. Landay, Robert C. Miller, Eric Saund, Thomas F. Stahovich, Michael L. Littman, Satinder P. Singh, Shlomo Argamon, Shlomo Dubnov |
Reports on the 2004 AAAI Fall Symposia.  |
AI Magazine  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Michael R. James, Britton Wolfe, Satinder P. Singh |
Combining Memory and Landmarks with Predictive State Representations.  |
IJCAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh |
Learning Payoff Functions in Infinite Games.  |
IJCAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Britton Wolfe, Michael R. James, Satinder P. Singh |
Learning predictive state representations in dynamical systems without reset.  |
ICML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthew R. Rudary, Satinder P. Singh, David Wingate |
Predictive Linear-Gaussian Models of Stochastic Dynamical Systems.  |
UAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Michael R. James, Satinder P. Singh |
Planning in Models that Combine Memory with Predictive Representations of State.  |
AAAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Doina Precup, Richard S. Sutton, Cosmin Paduraru, Anna Koop, Satinder P. Singh |
Off-policy Learning with Options and Recognizers.  |
NIPS ![In: Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, NIPS 2005, December 5-8, 2005, Vancouver, British Columbia, Canada], 2005. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP BibTeX RDF |
|
| 1 | Christopher Kiekintveld, Michael P. Wellman, Satinder P. Singh, Vishal Soni |
Value-driven procurement in the TAC supply chain game.  |
SIGecom Exchanges  |
2004 |
DBLP DOI BibTeX RDF |
algorithms, e-commerce, economics, supply chains, trading agents |
| 1 | Christopher Kiekintveld, Michael P. Wellman, Satinder P. Singh, Joshua Estelle, Yevgeniy Vorobeychik, Vishal Soni, Matthew R. Rudary |
Distributed Feedback Control for Decision Making on Supply Chains.  |
ICAPS  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Joshua Estelle, Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh, Christopher Kiekintveld, Vishal Soni |
Strategic Interactions in the TAC 2003 Supply Chain Tournament.  |
Computers and Games  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael R. James, Satinder P. Singh, Michael L. Littman |
Planning with predictive state representations.  |
ICMLA  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Michael R. James, Satinder P. Singh |
Learning and discovery of predictive state representations in dynamical systems with reset.  |
ICML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthew R. Rudary, Satinder P. Singh, Martha E. Pollack |
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning.  |
ICML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael R. James, Matthew R. Rudary |
Predictive State Representations: A New Theory for Modeling Dynamical Systems.  |
UAI  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Vishal Soni, Michael P. Wellman |
Computing approximate bayes-nash equilibria in tree-games of incomplete information.  |
ACM Conference on Electronic Commerce  |
2004 |
DBLP DOI BibTeX RDF |
approximate bayes-nash equilibria, games of incomplete information, structured games |
| 1 | David C. Parkes, Satinder P. Singh, Dimah Yanovsky |
Approximately Efficient Online Mechanism Design.  |
NIPS ![In: Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, NIPS 2004, December 13-18, 2004, Vancouver, British Columbia, Canada], 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Andrew G. Barto, Nuttapong Chentanez |
Intrinsically Motivated Reinforcement Learning.  |
NIPS ![In: Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, NIPS 2004, December 13-18, 2004, Vancouver, British Columbia, Canada], 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael L. Littman, Nicholas K. Jong, David Pardoe, Peter Stone |
Learning Predictive State Representations.  |
ICML  |
2003 |
DBLP BibTeX RDF |
|
| 1 | David C. Parkes, Satinder P. Singh |
An MDP-Based Approach to Online Mechanism Design.  |
NIPS ![In: Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, NIPS 2003, December 8-13, 2003, Vancouver and Whistler, British Columbia, Canada], 2003, MIT Press, 0-262-20152-6. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP BibTeX RDF |
|
| 1 | Matthew R. Rudary, Satinder P. Singh |
A Nonlinear Predictive State Representation.  |
NIPS ![In: Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, NIPS 2003, December 8-13, 2003, Vancouver and Whistler, British Columbia, Canada], 2003, MIT Press, 0-262-20152-6. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh |
Introduction.  |
Machine Learning  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael J. Kearns, Satinder P. Singh |
Near-Optimal Reinforcement Learning in Polynomial Time.  |
Machine Learning  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Satinder P. Singh, Diane J. Litman, Michael J. Kearns, Marilyn A. Walker |
Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System.  |
J. Artif. Intell. Res. (JAIR)  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael J. Kearns, Charles Lee Isbell Jr., Satinder P. Singh, Diane J. Litman, Jessica Howe |
CobotDS: A Spoken Dialogue System for Chat.  |
AAAI/IAAI  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns |
ATTac-2000: An Adaptive Autonomous Bidding Agent.  |
J. Artif. Intell. Res. (JAIR)  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Charles Lee Isbell Jr., Christian R. Shelton, Michael J. Kearns, Satinder P. Singh, Peter Stone |
A social reinforcement learning agent.  |
Agents  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns |
ATTac-2000: an adaptive autonomous bidding agent.  |
Agents  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | János A. Csirik, Michael L. Littman, Satinder P. Singh, Peter Stone |
FAucS : An FCC Spectrum Auction Simulator for Autonomous Bidding Agents.  |
WELCOM  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael J. Kearns, Michael L. Littman, Satinder P. Singh |
Graphical Models for Game Theory.  |
UAI  |
2001 |
DBLP BibTeX RDF |
|
| 1 | Charles Lee Isbell Jr., Christian R. Shelton, Michael J. Kearns, Satinder P. Singh, Peter Stone |
Cobot: A Social Reinforcement Learning Agent.  |
NIPS ![In: Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, December 3-8, 2001, Vancouver, British Columbia, Canada], pp. 1393-1400, 2001, MIT Press. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Richard S. Sutton, Satinder P. Singh |
Predictive Representations of State.  |
NIPS ![In: Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, December 3-8, 2001, Vancouver, British Columbia, Canada], pp. 1555-1561, 2001, MIT Press. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Michael J. Kearns, Satinder P. Singh |
An Efficient, Exact Algorithm for Solving Tree-Structured Graphical Games.  |
NIPS ![In: Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, NIPS 2001, December 3-8, 2001, Vancouver, British Columbia, Canada], pp. 817-823, 2001, MIT Press. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári |
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms.  |
Machine Learning  |
2000 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Stone, Richard S. Sutton, Satinder P. Singh |
Reinforcement Learning for 3 vs. 2 Keepaway  |
RoboCup  |
2000 |
DBLP DOI BibTeX RDF |
|
| 1 | Diane J. Litman, Michael S. Kearns, Satinder P. Singh, Marilyn A. Walker |
Automatic Optimization of Dialogue Management.  |
COLING  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Doina Precup, Richard S. Sutton, Satinder P. Singh |
Eligibility Traces for Off-Policy Policy Evaluation.  |
ICML  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Kary Myers, Michael J. Kearns, Satinder P. Singh, Marilyn A. Walker |
A Boosting Approach to Topic Spotting on Subdialogues.  |
ICML  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Yishay Mansour, Satinder P. Singh |
Fast Planning in Stochastic Games.  |
UAI  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael J. Kearns, Yishay Mansour |
Nash Convergence of Gradient Dynamics in General-Sum Games.  |
UAI  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Satinder P. Singh |
Bias-Variance Error Bounds for Temporal Difference Updates.  |
COLT  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Charles Lee Isbell Jr., Michael J. Kearns, David P. Kormann, Satinder P. Singh, Peter Stone |
Cobot in LambdaMOO: A Social Statistics Agent.  |
AAAI/IAAI  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael J. Kearns, Diane J. Litman, Marilyn A. Walker |
Empirical Evaluation of a Reinforcement Learning Spoken Dialogue System.  |
AAAI/IAAI  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Richard S. Sutton, Doina Precup, Satinder P. Singh |
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning.  |
Artif. Intell.  |
1999 |
DBLP DOI BibTeX RDF |
|
| 1 | David A. McAllester, Satinder P. Singh |
Approximate Planning for Factored POMDPs using Belief State Simplification.  |
UAI  |
1999 |
DBLP BibTeX RDF |
|
| 1 | Yishay Mansour, Satinder P. Singh |
On the Complexity of Policy Iteration.  |
UAI  |
1999 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael J. Kearns, Diane J. Litman, Marilyn A. Walker |
Reinforcement Learning for Spoken Dialogue Systems.  |
NIPS ![In: Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29 - December 4, 1999], pp. 956-962, 1999, The MIT Press, 0-262-19450-3. The full citation details ...](Pics/full.jpeg) |
1999 |
DBLP BibTeX RDF |
|
| 1 | Richard S. Sutton, David A. McAllester, Satinder P. Singh, Yishay Mansour |
Policy Gradient Methods for Reinforcement Learning with Function Approximation.  |
NIPS ![In: Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29 - December 4, 1999], pp. 1057-1063, 1999, The MIT Press, 0-262-19450-3. The full citation details ...](Pics/full.jpeg) |
1999 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Peter Dayan |
Analytical Mean Squared Error Curves for Temporal Difference Learning.  |
Machine Learning  |
1998 |
DBLP DOI BibTeX RDF |
|
| 1 | Doina Precup, Richard S. Sutton, Satinder P. Singh |
Theoretical Results on Reinforcement Learning with Temporally Abstract Options.  |
ECML  |
1998 |
DBLP DOI BibTeX RDF |
|
| 1 | Richard S. Sutton, Doina Precup, Satinder P. Singh |
Intra-Option Learning about Temporally Abstract Actions.  |
ICML  |
1998 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Satinder P. Singh |
Near-Optimal Reinforcement Learning in Polynominal Time.  |
ICML  |
1998 |
DBLP BibTeX RDF |
|
| 1 | John Loch, Satinder P. Singh |
Using Eligibility Traces to Find the Best Memoryless Policy in Partially Observable Markov Decision Processes.  |
ICML  |
1998 |
DBLP BibTeX RDF |
|
| 1 | John K. Williams, Satinder P. Singh |
Experimental Results on Learning Stochastic Memoryless Policies for Partially Observable Markov Decision Processes.  |
NIPS ![In: Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30 - December 5, 1998], pp. 1073-1080, 1998, The MIT Press, 0-262-11245-0. The full citation details ...](Pics/full.jpeg) |
1998 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Satinder P. Singh |
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms.  |
NIPS ![In: Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30 - December 5, 1998], pp. 996-1002, 1998, The MIT Press, 0-262-11245-0. The full citation details ...](Pics/full.jpeg) |
1998 |
DBLP BibTeX RDF |
|
| 1 | Timothy X. Brown, Hui Tong, Satinder P. Singh |
Optimizing Admission Control while Ensuring Quality of Service in Multimedia Networks via Reinforcement Learning.  |
NIPS ![In: Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30 - December 5, 1998], pp. 982-988, 1998, The MIT Press, 0-262-11245-0. The full citation details ...](Pics/full.jpeg) |
1998 |
DBLP BibTeX RDF |
|
| 1 | Richard S. Sutton, Satinder P. Singh, Doina Precup, Balaraman Ravindran |
Improved Switching among Temporally Abstract Actions.  |
NIPS ![In: Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30 - December 5, 1998], pp. 1066-1072, 1998, The MIT Press, 0-262-11245-0. The full citation details ...](Pics/full.jpeg) |
1998 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, David Cohn |
How to Dynamically Merge Markov Decision Processes.  |
NIPS ![In: Advances in Neural Information Processing Systems 10, [NIPS Conference, Denver, Colorado, USA, 1997], 1997, The MIT Press, 0-262-10076-2. The full citation details ...](Pics/full.jpeg) |
1997 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Richard S. Sutton |
Reinforcement Learning with Replacing Eligibility Traces.  |
Machine Learning  |
1996 |
DBLP DOI BibTeX RDF |
|
| 1 | Lawrence K. Saul, Satinder P. Singh |
Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards.  |
COLT  |
1996 |
DBLP DOI BibTeX RDF |
|
| 1 | David A. Cohn, Satinder P. Singh |
Predicting Lifetimes in Dynamically Allocated Memory.  |
NIPS  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Dimitri P. Bertsekas |
Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems.  |
NIPS  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Peter Dayan |
Analytical Mean Squared Error Curves in Temporal Difference Learning.  |
NIPS  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh |
Learning to Act Using Real-Time Dynamic Programming.  |
Artif. Intell.  |
1995 |
DBLP DOI BibTeX RDF |
|
| 1 | Lawrence K. Saul, Satinder P. Singh |
Markov Decision Processes in Large State Spaces.  |
COLT  |
1995 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Dayan, Satinder P. Singh |
Improving Policies without Measuring Merits.  |
NIPS  |
1995 |
DBLP BibTeX RDF |
|