| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 1 | Michael L. Littman |
A new way to search game trees: technical perspective.  |
Commun. ACM  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | John Asmuth, Michael L. Littman |
Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Shimon Whiteson, Michael L. Littman |
Introduction to the special issue on empirical evaluations in reinforcement learning.  |
Machine Learning  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman, Thomas J. Walsh, Alexander L. Strehl |
Knows what it knows: a framework for self-aware learning.  |
Machine Learning  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | János A. Csirik, Michael L. Littman, David A. McAllester, Robert E. Schapire, Peter Stone |
Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Nikos Vlassis, Michael L. Littman, David Barber |
On the computational complexity of stochastic controller optimization in POMDPs  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Michael J. Kearns, Michael L. Littman, Satinder P. Singh, Peter Stone |
ATTac-2000: An Adaptive Autonomous Bidding Agent  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Fusun Yaman, Thomas J. Walsh, Michael L. Littman, Marie desJardins |
Democratic approximation of lexicographic preference models.  |
Artif. Intell.  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Brian Russell, Michael L. Littman, Wade Trappe |
Integrating machine learning in ad hoc routing: A wireless adaptive routing protocol.  |
Int. J. Communication Systems  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael L. Littman, Daniel M. Reeves |
Puzzle: baffling raffling.  |
SIGecom Exchanges  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Changhe Yuan, Heejin Lim, Michael L. Littman |
Most Relevant Explanation: computational complexity and approximation methods.  |
Ann. Math. Artif. Intell.  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Sergiu Goschin, Michael L. Littman, David H. Ackley |
The effects of selection on noisy fitness optimization.  |
GECCO  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Christopher R. Mansley, Ari Weinstein, Michael L. Littman |
Sample-Based Planning for Continuous Action Markov Decision Processes.  |
ICAPS  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Jordan Ash, Monica Babes, Gal Cohen, Sameen Jalal, Sam Lichtenberg, Michael L. Littman, Vukosi N. Marivate, Phillip Quiza, Blase Ur, Emily Zhang |
Scratchable Devices: User-Friendly Programming for Household Appliances.  |
HCI  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Monica Babes, Vukosi N. Marivate, Kaushik Subramanian, Michael L. Littman |
Apprenticeship Learning About Multiple Intentions.  |
ICML  |
2011 |
DBLP BibTeX RDF |
|
| 1 | John Asmuth, Michael L. Littman |
Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search.  |
UAI  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Michael Wunder, Michael Kaisers, John Robert Yaros, Michael L. Littman |
Using iterated reasoning to predict opponent strategies.  |
AAMAS  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Ali Nouri, Michael L. Littman |
Dimension reduction and its application to model-based exploration in continuous spaces.  |
Machine Learning  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman |
Reducing reinforcement learning to KWIK online regression.  |
Ann. Math. Artif. Intell.  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael Wunder, Michael L. Littman, Monica Babes |
Classes of Multiagent Q-learning Dynamics with epsilon-greedy Exploration.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Thomas J. Walsh, Kaushik Subramanian, Michael L. Littman, Carlos Diuk |
Generalizing Apprenticeship Learning across Hypothesis Classes.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Marie desJardins, Michael L. Littman |
Broadening student enthusiasm for computer science with a great insights course.  |
SIGCSE  |
2010 |
DBLP DOI BibTeX RDF |
attitudes towards computing, introductory courses |
| 1 | Kaushik Subramanian, Michael L. Littman |
Efficient Apprenticeship Learning with Smart Humans.  |
Enabling Intelligence through Middleware  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Michael Wunder, Michael L. Littman, Michael Kaisers, John Robert Yaros |
A Cognitive Hierarchy Model Applied to the Lemonade Game.  |
Interactive Decision Theory and Game Theory  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Thomas J. Walsh, Sergiu Goschin, Michael L. Littman |
Integrating Sample-Based Planning and Model-Based Reinforcement Learning.  |
AAAI  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Emma Brunskill, Bethany R. Leffler, Lihong Li, Michael L. Littman, Nicholas Roy |
Provably Efficient Learning with Typed Parametric Models.  |
Journal of Machine Learning Research  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Alexander L. Strehl, Lihong Li, Michael L. Littman |
Reinforcement Learning in Finite MDPs: PAC Analysis.  |
Journal of Machine Learning Research  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Thomas J. Walsh, Ali Nouri, Lihong Li, Michael L. Littman |
Learning and planning in environments with delayed feedback.  |
Autonomous Agents and Multi-Agent Systems  |
2009 |
DBLP DOI BibTeX RDF |
Delayed feedback, Reinforcement learning, Markov decision processes |
| 1 | Carlos Diuk, Michael L. Littman |
Hierarchical Reinforcement Learning.  |
Encyclopedia of Artificial Intelligence  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Andrea Pohoreckyj Danyluk, Léon Bottou, Michael L. Littman (eds.) |
Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, Montreal, Quebec, Canada, June 14-18, 2009  |
ICML  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Thomas J. Walsh, Istvan Szita, Carlos Diuk, Michael L. Littman |
Exploring compact reinforcement-learning representations with linear regression.  |
UAI  |
2009 |
DBLP BibTeX RDF |
|
| 1 | John Asmuth, Lihong Li, Michael L. Littman, Ali Nouri, David Wingate |
A Bayesian Sampling Approach to Exploration in Reinforcement Learning.  |
UAI  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman, Christopher R. Mansley |
Online exploration in least-squares policy iteration.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
PAC-MDP, least-squares policy iteration (LSPI), reinforcement learning, Markov decision processes, exploration |
| 1 | David L. Roberts, Charles L. Isbell, Michael L. Littman |
Optimization problems involving collections of dependent objects.  |
Annals OR  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Alexander L. Strehl, Michael L. Littman |
An analysis of model-based Interval Estimation for Markov Decision Processes.  |
J. Comput. Syst. Sci.  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman |
Efficient Value-Function Approximation via Online Linear Regression.  |
ISAIM  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Fusun Yaman, Thomas J. Walsh, Michael L. Littman, Marie desJardins |
Democratic approximation of lexicographic preference models.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman, Thomas J. Walsh |
Knows what it knows: a framework for self-aware learning.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Ronald Parr, Lihong Li, Gavin Taylor, Christopher Painter-Wakefield, Michael L. Littman |
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Carlos Diuk, Andre Cohen, Michael L. Littman |
An object-oriented representation for efficient reinforcement learning.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Emma Brunskill, Bethany R. Leffler, Lihong Li, Michael L. Littman, Nicholas Roy |
CORL: A Continuous-state Offset-dynamics Reinforcement Learner.  |
UAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Enrique Munoz de Cote, Michael L. Littman |
A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic Games.  |
UAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman |
Autonomous Model Learning for Reinforcement Learning.  |
QEST  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Monica Babes, Enrique Munoz de Cote, Michael L. Littman |
Social reward shaping in the prisoner's dilemma.  |
AAMAS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | John Asmuth, Michael L. Littman, Robert Zinkov |
Potential-based Shaping in Model-based Reinforcement Learning.  |
AAAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Thomas J. Walsh, Michael L. Littman |
Efficient Learning of Action Schemas and Web-Service Descriptions.  |
AAAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Ali Nouri, Michael L. Littman |
Multi-resolution Exploration in Continuous Spaces.  |
NIPS  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Amy Greenwald, Michael L. Littman |
Introduction to the special issue on learning and computational game theory.  |
Machine Learning  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Martin Zinkevich, Amy Greenwald, Michael L. Littman |
A hierarchy of prescriptive goals for multiagent learning.  |
Artif. Intell.  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Thomas J. Walsh, Ali Nouri, Lihong Li, Michael L. Littman |
Planning and Learning in Environments with Delayed Feedback.  |
ECML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Ronald Parr, Christopher Painter-Wakefield, Lihong Li, Michael L. Littman |
Analyzing feature generation for value-function approximation.  |
ICML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Bethany R. Leffler, Michael L. Littman, Timothy Edmunds |
Efficient Reinforcement Learning with Relocatable Action Models.  |
AAAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Alexander L. Strehl, Carlos Diuk, Michael L. Littman |
Efficient Structure Learning in Factored-State MDPs.  |
AAAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Alexander L. Strehl, Michael L. Littman |
Online Linear Regression and Its Application to Model-Based Reinforcement Learning.  |
NIPS  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Lihong Li, Thomas J. Walsh, Michael L. Littman |
Towards a Unified Theory of State Abstraction for MDPs.  |
ISAIM  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Carlos Diuk, Michael L. Littman |
A Change Detection Model for Non-Stationary k-Armed Bandit Problems.  |
AAAI Spring Symposium: Between a Rock and a Hard Place: Cognitive Science Principles Meet AI-Hard Problems  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Alexander L. Strehl, Chris Mesterharm, Michael L. Littman, Haym Hirsh |
Experience-efficient learning in associative bandit problems.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Alexander L. Strehl, Lihong Li, Eric Wiewiora, John Langford, Michael L. Littman |
PAC model-free reinforcement learning.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Alexander L. Strehl, Lihong Li, Michael L. Littman |
Incremental Model-based Learners With Formal Learning-Time Guarantees.  |
UAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Nishkam Ravi, Arjun Talwar, Martin Zinkevich |
An Efficient Optimal-Equilibrium Algorithm for Two-player Game Trees.  |
UAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Carlos Diuk, Alexander L. Strehl, Michael L. Littman |
A hierarchical approach to efficient reinforcement learning in deterministic domains.  |
AAMAS  |
2006 |
DBLP DOI BibTeX RDF |
factored representations, reinforcement learning, hierarchical reinforcement learning, sample complexity |
| 1 | David L. Roberts, Mark J. Nelson, Charles Lee Isbell Jr., Michael Mateas, Michael L. Littman |
Targeting Specific Distributions of Trajectories in MDPs.  |
AAAI  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Peter Stone |
A polynomial-time Nash equilibrium algorithm for repeated games.  |
Decision Support Systems  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Corpus-based Learning of Analogies and Semantic Relations.  |
Machine Learning  |
2005 |
DBLP DOI BibTeX RDF |
noun-modifier pairs, metaphor, analogy, vector space model, semantic relations, cosine similarity |
| 1 | Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder |
Combining Independent Modules in Lexical Multiple-Choice Problems  |
CoRR  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Corpus-based Learning of Analogies and Semantic Relations  |
CoRR  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Håkan L. S. Younes, Michael L. Littman, David Weissman, John Asmuth |
The First Probabilistic Track of the International Planning Competition.  |
J. Artif. Intell. Res. (JAIR)  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Nicholas L. Cassimatis, Sean Luke, Simon D. Levy, Ross W. Gayler, Pentti Kanerva, Chris Eliasmith, Timothy W. Bickmore, Alan C. Schultz, Randall Davis, James A. Landay, Robert C. Miller, Eric Saund, Thomas F. Stahovich, Michael L. Littman, Satinder P. Singh, Shlomo Argamon, Shlomo Dubnov |
Reports on the 2004 AAAI Fall Symposia.  |
AI Magazine  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Bethany R. Leffler, Michael L. Littman, Alexander L. Strehl, Thomas J. Walsh |
Efficient Exploration With Latent Structure.  |
Robotics: Science and Systems  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Alexander L. Strehl, Michael L. Littman |
A theoretical analysis of Model-Based Interval Estimation.  |
ICML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Lihong Li, Michael L. Littman |
Lazy Approximation for Solving Continuous Finite-Horizon MDPs.  |
AAAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Nishkam Ravi, Nikhil Dandekar, Preetham Mysore, Michael L. Littman |
Activity Recognition from Accelerometer Data.  |
AAAI  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Martin Zinkevich, Amy Greenwald, Michael L. Littman |
Cyclic Equilibria in Markov Games.  |
NIPS ![In: Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, NIPS 2005, December 5-8, 2005, Vancouver, British Columbia, Canada], 2005. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Nishkam Ravi, Eitan Fenson, Richard Howard |
Reinforcement Learning for Autonomic Network Repair.  |
ICAC  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael R. James, Satinder P. Singh, Michael L. Littman |
Planning with predictive state representations.  |
ICMLA  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Alexander L. Strehl, Michael L. Littman |
An Empirical Evaluation of Interval Estimation for Markov Decision Processes.  |
ICTAI  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael L. Littman, Nishkam Ravi, Eitan Fenson, Richard Howard |
An Instance-Based State Representation for Network Repair.  |
AAAI  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder |
Combining Independent Modules to Solve Multiple-choice Synonym and Analogy Problems  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Learning Analogies and Semantic Relations  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Measuring Praise and Criticism: Inference of Semantic Orientation from Association  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Measuring praise and criticism: Inference of semantic orientation from association.  |
ACM Trans. Inf. Syst.  |
2003 |
DBLP DOI BibTeX RDF |
text mining, web mining, text classification, unsupervised learning, mutual information, latent semantic analysis, semantic association, semantic orientation |
| 1 | Peter Stone, Robert E. Schapire, Michael L. Littman, János A. Csirik, David A. McAllester |
Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions.  |
J. Artif. Intell. Res. (JAIR)  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Stephen M. Majercik, Michael L. Littman |
Contingent planning under uncertainty via stochastic satisfiability.  |
Artif. Intell.  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Yukio Ohsawa, Peter McBurney, Simon Parsons, Christopher A. Miller, Alan C. Schultz, Jean Scholtz, Michael A. Goodrich, Eugene Santos Jr., Benjamin Bell, Charles Lee Isbell Jr., Michael L. Littman |
AAAI-2002 Fall Symposium Series.  |
AI Magazine  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder |
Combining independent modules in lexical multiple-choice problems.  |
RANLP  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Michael L. Littman, Nicholas K. Jong, David Pardoe, Peter Stone |
Learning Predictive State Representations.  |
ICML  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Peter Stone |
A polynomial-time nash equilibrium algorithm for repeated games.  |
ACM Conference on Electronic Commerce  |
2003 |
DBLP DOI BibTeX RDF |
nash equilibrium, complexity analysis, repeated games, computational game theory |
| 1 | Michael L. Littman |
Tutorial: Learning Topics in Game-Theoretic Decision Making.  |
COLT  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter D. Turney, Michael L. Littman |
Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus  |
CoRR  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Michael L. Littman, Greg A. Keim, Noam M. Shazeer |
A probabilistic approach to solving crossword puzzles.  |
Artif. Intell.  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Michail G. Lagoudakis, Ronald Parr, Michael L. Littman |
Least-Squares Methods in Reinforcement Learning for Control.  |
SETN  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Robert E. Schapire, Peter Stone, David A. McAllester, Michael L. Littman, János A. Csirik |
Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation.  |
ICML  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Paul S. A. Reitsma, Peter Stone, János A. Csirik, Michael L. Littman |
Randomized strategic demand reduction: getting more by asking for less.  |
AAMAS  |
2002 |
DBLP DOI BibTeX RDF |
strategic demand reduction, auctions, bidding agents |
| 1 | Peter Stone, Robert E. Schapire, János A. Csirik, Michael L. Littman, David A. McAllester |
ATTac-2001: A Learning, Autonomous Bidding Agent.  |
AMEC  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Paul S. A. Reitsma, Peter Stone, János A. Csirik, Michael L. Littman |
Self-Enforcing Strategic Demand Reduction.  |
AMEC  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Michail G. Lagoudakis, Michael L. Littman |
Learning to Select Branching Rules in the DPLL Procedure for Satisfiability.  |
Electronic Notes in Discrete Mathematics  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael L. Littman |
Value-function reinforcement learning in Markov games.  |
Cognitive Systems Research  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael L. Littman, Stephen M. Majercik, Toniann Pitassi |
Stochastic Boolean Satisfiability.  |
J. Autom. Reasoning  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns |
ATTac-2000: An Adaptive Autonomous Bidding Agent.  |
J. Artif. Intell. Res. (JAIR)  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Peter Stone, Michael L. Littman, Satinder P. Singh, Michael J. Kearns |
ATTac-2000: an adaptive autonomous bidding agent.  |
Agents  |
2001 |
DBLP DOI BibTeX RDF |
|