The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Publications of "Satinder P. Singh" ( http://dblp.L3S.de/Authors/Satinder_P._Singh )

  Author page on DBLP  Author page in RDF  Community of Satinder P. Singh in ASPL-2

Publication years (Num. hits)
1991-1995 (17) 1996-1998 (15) 1999-2000 (15) 2001-2003 (15) 2004-2005 (18) 2006-2008 (16) 2009-2011 (17) 2012 (1)
Publication types (Num. hits)
article(22) inproceedings(92)
Venues (Conferences, Journals, ...)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 22 occurrences of 20 keywords

Results
Found 114 publication records. Showing 114 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
1Jonathan Sorg, Satinder P. Singh, Richard L. Lewis Variance-Based Rewards for Approximate Bayesian Reinforcement Learning Search on Bibsonomy CoRR The full citation details ... 2012 DBLP  BibTeX  RDF
1Michael J. Kearns, Diane J. Litman, Satinder P. Singh, Marilyn A. Walker Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System Search on Bibsonomy CoRR The full citation details ... 2011 DBLP  BibTeX  RDF
1Michael J. Kearns, Michael L. Littman, Satinder P. Singh, Peter Stone ATTac-2000: An Adaptive Autonomous Bidding Agent Search on Bibsonomy CoRR The full citation details ... 2011 DBLP  BibTeX  RDF
1Quang Duong, Michael P. Wellman, Satinder P. Singh Modeling Information Diffusion in Networks with Unobserved Links. Search on Bibsonomy SocialCom/PASSAT The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Robert Cohn, Edmund H. Durfee, Satinder P. Singh Comparing action-query strategies in semi-autonomous agents. Search on Bibsonomy AAMAS The full citation details ... 2011 DBLP  BibTeX  RDF
1Robert Cohn, Edmund H. Durfee, Satinder P. Singh Comparing Action-Query Strategies in Semi-Autonomous Agents. Search on Bibsonomy AAAI The full citation details ... 2011 DBLP  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh, Richard L. Lewis Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents. Search on Bibsonomy AAAI The full citation details ... 2011 DBLP  BibTeX  RDF
1David C. Parkes, Ruggiero Cavallo, Florin Constantin, Satinder P. Singh Dynamic Incentive Mechanisms. Search on Bibsonomy AI Magazine The full citation details ... 2010 DBLP  BibTeX  RDF
1Satinder P. Singh, Richard L. Lewis, Andrew G. Barto, Jonathan Sorg Intrinsically Motivated Reinforcement Learning: An Evolutionary Perspective. Search on Bibsonomy IEEE T. Autonomous Mental Development The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
1Robert Cohn, Michael Maxim, Edmund H. Durfee, Satinder P. Singh Selecting Operator Queries Using Expected Myopic Gain. Search on Bibsonomy IAT The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh, Richard L. Lewis Internal Rewards Mitigate Agent Boundedness. Search on Bibsonomy ICML The full citation details ... 2010 DBLP  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh, Richard L. Lewis Variance-Based Rewards for Approximate Bayesian Reinforcement Learning. Search on Bibsonomy UAI The full citation details ... 2010 DBLP  BibTeX  RDF
1Quang Duong, Michael P. Wellman, Satinder P. Singh, Yevgeniy Vorobeychik History-dependent graphical multiagent models. Search on Bibsonomy AAMAS The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh Linear options. Search on Bibsonomy AAMAS The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh, Richard L. Lewis Reward Design via Online Gradient Ascent. Search on Bibsonomy NIPS The full citation details ... 2010 DBLP  BibTeX  RDF
1Quang Duong, Yevgeniy Vorobeychik, Satinder P. Singh, Michael P. Wellman Learning Graphical Game Models. Search on Bibsonomy IJCAI The full citation details ... 2009 DBLP  BibTeX  RDF
1Jonathan Sorg, Satinder P. Singh Transfer via soft homomorphisms. Search on Bibsonomy AAMAS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Markov decision process, homomorphism, transfer learning
1Michael R. James, Satinder P. Singh SarsaLandmark: an algorithm for learning in POMDPs with landmarks. Search on Bibsonomy AAMAS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF reinforcement learning, landmark, POMDP, partial observability
1Matthew R. Rudary, Satinder P. Singh Predictive Linear-Gaussian Models of Dynamical Systems with Vector-Valued Actions and Observations. Search on Bibsonomy ISAIM The full citation details ... 2008 DBLP  BibTeX  RDF
1David Wingate, Satinder P. Singh Efficiently learning linear-linear exponential family predictive representations of state. Search on Bibsonomy ICML The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
1Quang Duong, Michael P. Wellman, Satinder P. Singh Knowledge Combination in Graphical Multiagent Models. Search on Bibsonomy UAI The full citation details ... 2008 DBLP  BibTeX  RDF
1Britton Wolfe, Michael R. James, Satinder P. Singh Approximate predictive state representations. Search on Bibsonomy AAMAS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
1Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh Learning payoff functions in infinite games. Search on Bibsonomy Machine Learning The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Learning in games, Nash equilibrium approximation, Game theory
1David Wingate, Vishal Soni, Britton Wolfe, Satinder P. Singh Relational Knowledge with Predictive State Representations. Search on Bibsonomy IJCAI The full citation details ... 2007 DBLP  BibTeX  RDF
1Vishal Soni, Satinder P. Singh, Michael P. Wellman Constraint satisfaction algorithms for graphical games. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF constraint satisfaction, graphical games
1David Wingate, Satinder P. Singh On discovery and learning of models with predictive representations of state for agents with continuous actions and observations. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF predictive representations of state, information theory, dynamical system modeling
1Vishal Soni, Satinder P. Singh Abstraction in Predictive State Representations. Search on Bibsonomy AAAI The full citation details ... 2007 DBLP  BibTeX  RDF
1Charles Lee Isbell Jr., Michael J. Kearns, Satinder P. Singh, Christian R. Shelton, Peter Stone, David P. Kormann Cobot in LambdaMOO: An Adaptive Social Statistics Agent. Search on Bibsonomy Autonomous Agents and Multi-Agent Systems The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Chat agents, Game theory, Reinforcement learning, Autonomous agents, Believable agents, Social modeling
1Matthew R. Rudary, Satinder P. Singh Predictive linear-Gaussian models of controlled stochastic dynamical systems. Search on Bibsonomy ICML The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
1David Wingate, Satinder P. Singh Kernel Predictive Linear Gaussian models for nonlinear stochastic dynamical systems. Search on Bibsonomy ICML The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
1Britton Wolfe, Satinder P. Singh Predictive state representations with options. Search on Bibsonomy ICML The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
1Ruggiero Cavallo, David C. Parkes, Satinder P. Singh Optimal Coordinated Planning Amongst Self-Interested Agents with Private State. Search on Bibsonomy UAI The full citation details ... 2006 DBLP  BibTeX  RDF
1David Wingate, Satinder P. Singh Mixtures of Predictive Linear Gaussian Models for Nonlinear, Stochastic Dynamical Systems. Search on Bibsonomy AAAI The full citation details ... 2006 DBLP  BibTeX  RDF
1Vishal Soni, Satinder P. Singh Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains. Search on Bibsonomy AAAI The full citation details ... 2006 DBLP  BibTeX  RDF
1Michael P. Wellman, Joshua Estelle, Satinder P. Singh, Yevgeniy Vorobeychik, Christopher Kiekintveld, Vishal Soni Strategic Interactions in a Supply Chain Game. Search on Bibsonomy Computational Intelligence The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
1Nicholas L. Cassimatis, Sean Luke, Simon D. Levy, Ross W. Gayler, Pentti Kanerva, Chris Eliasmith, Timothy W. Bickmore, Alan C. Schultz, Randall Davis, James A. Landay, Robert C. Miller, Eric Saund, Thomas F. Stahovich, Michael L. Littman, Satinder P. Singh, Shlomo Argamon, Shlomo Dubnov Reports on the 2004 AAAI Fall Symposia. Search on Bibsonomy AI Magazine The full citation details ... 2005 DBLP  BibTeX  RDF
1Michael R. James, Britton Wolfe, Satinder P. Singh Combining Memory and Landmarks with Predictive State Representations. Search on Bibsonomy IJCAI The full citation details ... 2005 DBLP  BibTeX  RDF
1Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh Learning Payoff Functions in Infinite Games. Search on Bibsonomy IJCAI The full citation details ... 2005 DBLP  BibTeX  RDF
1Britton Wolfe, Michael R. James, Satinder P. Singh Learning predictive state representations in dynamical systems without reset. Search on Bibsonomy ICML The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
1Matthew R. Rudary, Satinder P. Singh, David Wingate Predictive Linear-Gaussian Models of Stochastic Dynamical Systems. Search on Bibsonomy UAI The full citation details ... 2005 DBLP  BibTeX  RDF
1Michael R. James, Satinder P. Singh Planning in Models that Combine Memory with Predictive Representations of State. Search on Bibsonomy AAAI The full citation details ... 2005 DBLP  BibTeX  RDF
1Doina Precup, Richard S. Sutton, Cosmin Paduraru, Anna Koop, Satinder P. Singh Off-policy Learning with Options and Recognizers. Search on Bibsonomy NIPS The full citation details ... 2005 DBLP  BibTeX  RDF
1Christopher Kiekintveld, Michael P. Wellman, Satinder P. Singh, Vishal Soni Value-driven procurement in the TAC supply chain game. Search on Bibsonomy SIGecom Exchanges The full citation details ... 2004 DBLP  DOI  BibTeX  RDF algorithms, e-commerce, economics, supply chains, trading agents
1Christopher Kiekintveld, Michael P. Wellman, Satinder P. Singh, Joshua Estelle, Yevgeniy Vorobeychik, Vishal Soni, Matthew R. Rudary Distributed Feedback Control for Decision Making on Supply Chains. Search on Bibsonomy ICAPS The full citation details ... 2004 DBLP  BibTeX  RDF
1Joshua Estelle, Yevgeniy Vorobeychik, Michael P. Wellman, Satinder P. Singh, Christopher Kiekintveld, Vishal Soni Strategic Interactions in the TAC 2003 Supply Chain Tournament. Search on Bibsonomy Computers and Games The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
1Michael R. James, Satinder P. Singh, Michael L. Littman Planning with predictive state representations. Search on Bibsonomy ICMLA The full citation details ... 2004 DBLP  BibTeX  RDF
1Michael R. James, Satinder P. Singh Learning and discovery of predictive state representations in dynamical systems with reset. Search on Bibsonomy ICML The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
1Matthew R. Rudary, Satinder P. Singh, Martha E. Pollack Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning. Search on Bibsonomy ICML The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
1Satinder P. Singh, Michael R. James, Matthew R. Rudary Predictive State Representations: A New Theory for Modeling Dynamical Systems. Search on Bibsonomy UAI The full citation details ... 2004 DBLP  BibTeX  RDF
1Satinder P. Singh, Vishal Soni, Michael P. Wellman Computing approximate bayes-nash equilibria in tree-games of incomplete information. Search on Bibsonomy ACM Conference on Electronic Commerce The full citation details ... 2004 DBLP  DOI  BibTeX  RDF approximate bayes-nash equilibria, games of incomplete information, structured games
1David C. Parkes, Satinder P. Singh, Dimah Yanovsky Approximately Efficient Online Mechanism Design. Search on Bibsonomy NIPS The full citation details ... 2004 DBLP  BibTeX  RDF
1Satinder P. Singh, Andrew G. Barto, Nuttapong Chentanez Intrinsically Motivated Reinforcement Learning. Search on Bibsonomy NIPS The full citation details ... 2004 DBLP  BibTeX  RDF
1Satinder P. Singh, Michael L. Littman, Nicholas K. Jong, David Pardoe, Peter Stone Learning Predictive State Representations. Search on Bibsonomy ICML The full citation details ... 2003 DBLP  BibTeX  RDF
1David C. Parkes, Satinder P. Singh An MDP-Based Approach to Online Mechanism Design. Search on Bibsonomy NIPS The full citation details ... 2003 DBLP  BibTeX  RDF
1Matthew R. Rudary, Satinder P. Singh A Nonlinear Predictive State Representation. Search on Bibsonomy NIPS The full citation details ... 2003 DBLP  BibTeX  RDF
1Satinder P. Singh Introduction. Search on Bibsonomy Machine Learning The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
1Michael J. Kearns, Satinder P. Singh Near-Optimal Reinforcement Learning in Polynomial Time. Search on Bibsonomy Machine Learning The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
1Satinder P. Singh, Diane J. Litman, Michael J. Kearns, Marilyn A. Walker Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System. Search on Bibsonomy J. Artif. Intell. Res. (JAIR) The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
1Michael J. Kearns, Charles Lee Isbell Jr., Satinder P. Singh, Diane J. Litman, Jessica Howe CobotDS: A Spoken Dialogue System for Chat. Search on Bibsonomy AAAI/IAAI The full citation details ... 2002 DBLP  BibTeX  RDF
1Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns ATTac-2000: An Adaptive Autonomous Bidding Agent. Search on Bibsonomy J. Artif. Intell. Res. (JAIR) The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
1Charles Lee Isbell Jr., Christian R. Shelton, Michael J. Kearns, Satinder P. Singh, Peter Stone A social reinforcement learning agent. Search on Bibsonomy Agents The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
1Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns ATTac-2000: an adaptive autonomous bidding agent. Search on Bibsonomy Agents The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
1János A. Csirik, Michael L. Littman, Satinder P. Singh, Peter Stone FAucS : An FCC Spectrum Auction Simulator for Autonomous Bidding Agents. Search on Bibsonomy WELCOM The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
1Michael J. Kearns, Michael L. Littman, Satinder P. Singh Graphical Models for Game Theory. Search on Bibsonomy UAI The full citation details ... 2001 DBLP  BibTeX  RDF
1Charles Lee Isbell Jr., Christian R. Shelton, Michael J. Kearns, Satinder P. Singh, Peter Stone Cobot: A Social Reinforcement Learning Agent. Search on Bibsonomy NIPS The full citation details ... 2001 DBLP  BibTeX  RDF
1Michael L. Littman, Richard S. Sutton, Satinder P. Singh Predictive Representations of State. Search on Bibsonomy NIPS The full citation details ... 2001 DBLP  BibTeX  RDF
1Michael L. Littman, Michael J. Kearns, Satinder P. Singh An Efficient, Exact Algorithm for Solving Tree-Structured Graphical Games. Search on Bibsonomy NIPS The full citation details ... 2001 DBLP  BibTeX  RDF
1Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. Search on Bibsonomy Machine Learning The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
1Peter Stone, Richard S. Sutton, Satinder P. Singh Reinforcement Learning for 3 vs. 2 Keepaway Search on Bibsonomy RoboCup The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
1Diane J. Litman, Michael S. Kearns, Satinder P. Singh, Marilyn A. Walker Automatic Optimization of Dialogue Management. Search on Bibsonomy COLING The full citation details ... 2000 DBLP  BibTeX  RDF
1Doina Precup, Richard S. Sutton, Satinder P. Singh Eligibility Traces for Off-Policy Policy Evaluation. Search on Bibsonomy ICML The full citation details ... 2000 DBLP  BibTeX  RDF
1Kary Myers, Michael J. Kearns, Satinder P. Singh, Marilyn A. Walker A Boosting Approach to Topic Spotting on Subdialogues. Search on Bibsonomy ICML The full citation details ... 2000 DBLP  BibTeX  RDF
1Michael J. Kearns, Yishay Mansour, Satinder P. Singh Fast Planning in Stochastic Games. Search on Bibsonomy UAI The full citation details ... 2000 DBLP  BibTeX  RDF
1Satinder P. Singh, Michael J. Kearns, Yishay Mansour Nash Convergence of Gradient Dynamics in General-Sum Games. Search on Bibsonomy UAI The full citation details ... 2000 DBLP  BibTeX  RDF
1Michael J. Kearns, Satinder P. Singh Bias-Variance Error Bounds for Temporal Difference Updates. Search on Bibsonomy COLT The full citation details ... 2000 DBLP  BibTeX  RDF
1Charles Lee Isbell Jr., Michael J. Kearns, David P. Kormann, Satinder P. Singh, Peter Stone Cobot in LambdaMOO: A Social Statistics Agent. Search on Bibsonomy AAAI/IAAI The full citation details ... 2000 DBLP  BibTeX  RDF
1Satinder P. Singh, Michael J. Kearns, Diane J. Litman, Marilyn A. Walker Empirical Evaluation of a Reinforcement Learning Spoken Dialogue System. Search on Bibsonomy AAAI/IAAI The full citation details ... 2000 DBLP  BibTeX  RDF
1Richard S. Sutton, Doina Precup, Satinder P. Singh Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Search on Bibsonomy Artif. Intell. The full citation details ... 1999 DBLP  DOI  BibTeX  RDF
1David A. McAllester, Satinder P. Singh Approximate Planning for Factored POMDPs using Belief State Simplification. Search on Bibsonomy UAI The full citation details ... 1999 DBLP  BibTeX  RDF
1Yishay Mansour, Satinder P. Singh On the Complexity of Policy Iteration. Search on Bibsonomy UAI The full citation details ... 1999 DBLP  BibTeX  RDF
1Satinder P. Singh, Michael J. Kearns, Diane J. Litman, Marilyn A. Walker Reinforcement Learning for Spoken Dialogue Systems. Search on Bibsonomy NIPS The full citation details ... 1999 DBLP  BibTeX  RDF
1Richard S. Sutton, David A. McAllester, Satinder P. Singh, Yishay Mansour Policy Gradient Methods for Reinforcement Learning with Function Approximation. Search on Bibsonomy NIPS The full citation details ... 1999 DBLP  BibTeX  RDF
1Satinder P. Singh, Peter Dayan Analytical Mean Squared Error Curves for Temporal Difference Learning. Search on Bibsonomy Machine Learning The full citation details ... 1998 DBLP  DOI  BibTeX  RDF
1Doina Precup, Richard S. Sutton, Satinder P. Singh Theoretical Results on Reinforcement Learning with Temporally Abstract Options. Search on Bibsonomy ECML The full citation details ... 1998 DBLP  DOI  BibTeX  RDF
1Richard S. Sutton, Doina Precup, Satinder P. Singh Intra-Option Learning about Temporally Abstract Actions. Search on Bibsonomy ICML The full citation details ... 1998 DBLP  BibTeX  RDF
1Michael J. Kearns, Satinder P. Singh Near-Optimal Reinforcement Learning in Polynominal Time. Search on Bibsonomy ICML The full citation details ... 1998 DBLP  BibTeX  RDF
1John Loch, Satinder P. Singh Using Eligibility Traces to Find the Best Memoryless Policy in Partially Observable Markov Decision Processes. Search on Bibsonomy ICML The full citation details ... 1998 DBLP  BibTeX  RDF
1John K. Williams, Satinder P. Singh Experimental Results on Learning Stochastic Memoryless Policies for Partially Observable Markov Decision Processes. Search on Bibsonomy NIPS The full citation details ... 1998 DBLP  BibTeX  RDF
1Michael J. Kearns, Satinder P. Singh Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms. Search on Bibsonomy NIPS The full citation details ... 1998 DBLP  BibTeX  RDF
1Timothy X. Brown, Hui Tong, Satinder P. Singh Optimizing Admission Control while Ensuring Quality of Service in Multimedia Networks via Reinforcement Learning. Search on Bibsonomy NIPS The full citation details ... 1998 DBLP  BibTeX  RDF
1Richard S. Sutton, Satinder P. Singh, Doina Precup, Balaraman Ravindran Improved Switching among Temporally Abstract Actions. Search on Bibsonomy NIPS The full citation details ... 1998 DBLP  BibTeX  RDF
1Satinder P. Singh, David Cohn How to Dynamically Merge Markov Decision Processes. Search on Bibsonomy NIPS The full citation details ... 1997 DBLP  BibTeX  RDF
1Satinder P. Singh, Richard S. Sutton Reinforcement Learning with Replacing Eligibility Traces. Search on Bibsonomy Machine Learning The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
1Lawrence K. Saul, Satinder P. Singh Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards. Search on Bibsonomy COLT The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
1David A. Cohn, Satinder P. Singh Predicting Lifetimes in Dynamically Allocated Memory. Search on Bibsonomy NIPS The full citation details ... 1996 DBLP  BibTeX  RDF
1Satinder P. Singh, Dimitri P. Bertsekas Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems. Search on Bibsonomy NIPS The full citation details ... 1996 DBLP  BibTeX  RDF
1Satinder P. Singh, Peter Dayan Analytical Mean Squared Error Curves in Temporal Difference Learning. Search on Bibsonomy NIPS The full citation details ... 1996 DBLP  BibTeX  RDF
1Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh Learning to Act Using Real-Time Dynamic Programming. Search on Bibsonomy Artif. Intell. The full citation details ... 1995 DBLP  DOI  BibTeX  RDF
1Lawrence K. Saul, Satinder P. Singh Markov Decision Processes in Large State Spaces. Search on Bibsonomy COLT The full citation details ... 1995 DBLP  DOI  BibTeX  RDF
1Peter Dayan, Satinder P. Singh Improving Policies without Measuring Merits. Search on Bibsonomy NIPS The full citation details ... 1995 DBLP  BibTeX  RDF
Displaying result #1 - #100 of 114 (100 per page; Change: )
Pages: [1][2][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.