|
|
|
|
Venues (Conferences, Journals, ...)
|
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 89 occurrences of 61 keywords
|
|
|
|
|
Results
Found 123 publication records. Showing 123 according to the selection in the facets
| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 2 | Yuki Taniguchi, Takeshi Mori, Shin Ishii |
A Continuous Internal-State Controller for Partially Observable Markov Decision Processes.  |
ICANN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Derek W. Seward, Conrad Pace, Rahee Agate |
Safe and effective navigation of autonomous robots in hazardous environments.  |
Auton. Robots  |
2007 |
DBLP DOI BibTeX RDF |
Task effective, Safety, Risk analysis, Autonomous vehicles, Partially observable Markov decision processes, Unstructured environments, Real-time control system, Robot architecture |
| 2 | Kaustubh R. Joshi, William H. Sanders, Matti A. Hiltunen, Richard D. Schlichting |
Automatic Recovery Using Bounded Partially Observable Markov Decision Processes.  |
DSN  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Masoumeh T. Izadi, Doina Precup |
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes.  |
ECML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong |
Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes.  |
Discrete Event Dynamic Systems  |
2004 |
DBLP DOI BibTeX RDF |
rollout, multiclass scheduling, simulation, buffer management, partially observable Markov decision process |
| 1 | A. Vozikis, J. E. Goulionis, V. K. Benos |
The partially observable Markov decision processes in healthcare: an application to patients with ischemic heart disease (IHD).  |
Operational Research  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Fabio Martinelli, Charles Morisset |
Quantitative access control with partially-observable Markov decision processes.  |
CODASPY  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Stéphane Ross, Joelle Pineau, Brahim Chaib-draa, Pierre Kreitmann |
A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes.  |
Journal of Machine Learning Research  |
2011 |
DBLP BibTeX RDF |
|
| 1 | John Goulionis, D. Stengos |
Partially Observable Markov Decision Processes and periodic Policies with Applications.  |
International Journal of Information Technology and Decision Making  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Nevin Lianwen Zhang, Weihong Zhang |
Speeding Up the Convergence of Value Iteration in Partially Observable Markov Decision Processes  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Judy Goldsmith, Christopher Lusena, Martin Mundhenk |
Nonapproximability Results for Partially Observable Markov Decision Processes  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Milos Hauskrecht |
Value-Function Approximations for Partially Observable Markov Decision Processes  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Harold Soh, Yiannis Demiris |
Evolving policies for multi-reward partially observable markov decision processes (MR-POMDPs).  |
GECCO  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Emad Saad |
Learning to Act Optimally in Partially Observable Markov Decision Processes Using Hybrid Probabilistic Logic Programs.  |
SUM  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Rajneesh Sharma, Matthijs T. J. Spaan |
Fuzzy reinforcement learning control for decentralized partially observable Markov decision processes.  |
FUZZ-IEEE  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Nathalie Bertrand, Blaise Genest |
Minimal Disclosure in Partially Observable Markov Decision Processes.  |
FSTTCS  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Hao Zhang |
Partially Observable Markov Decision Processes: A Geometric Technique and Analysis.  |
Operations Research  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Emad Saad |
Reinforcement Learning in Partially Observable Markov Decision Processes using Hybrid Probabilistic Logic Programs  |
CoRR  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Pascal Poupart |
Partially Observable Markov Decision Processes.  |
Encyclopedia of Machine Learning  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Krishnendu Chatterjee, Laurent Doyen, Thomas A. Henzinger |
Qualitative Analysis of Partially-Observable Markov Decision Processes.  |
MFCS  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami, Kohji Dohsaka |
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes.  |
COLING  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Yong (Yates) Lin, Kyungseo Park, Fillia Makedon |
From dialogue management to pervasive interaction based assistive technology.  |
PETRA  |
2010 |
DBLP DOI BibTeX RDF |
pervasive interaction, multimodal, POMDP, assistive environment |
| 1 | Augusto Cesar Espíndola Baffa, Angelo E. M. Ciarlini |
Modeling POMDPs for generating and simulating stock investment policies.  |
SAC  |
2010 |
DBLP DOI BibTeX RDF |
simulation, stock market, POMDP, technical analysis |
| 1 | Krishnendu Chatterjee, Laurent Doyen, Thomas A. Henzinger |
Qualitative Analysis of Partially-observable Markov Decision Processes  |
CoRR  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Pablo Samuel Castro, Prakash Panangaden, Doina Precup |
Equivalence Relations in Fully and Partially Observable Markov Decision Processes.  |
IJCAI  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Camille Besse, Brahim Chaib-draa |
Quasi-Deterministic Partially Observable Markov Decision Processes.  |
ICONIP  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael R. James, Satinder P. Singh |
SarsaLandmark: an algorithm for learning in POMDPs with landmarks.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning, landmark, POMDP, partial observability |
| 1 | Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J. Spaan |
Lossless clustering of histories in decentralized POMDPs.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
decentralized POMDPs, planning under uncertainty, cooperative multiagent systems |
| 1 | Verena Heidrich-Meisner, Christian Igel |
Uncertainty handling CMA-ES for reinforcement learning.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
covariance matrix adaptation evolution strategy, direct policy search, reinforcement learning, uncertainty handling |
| 1 | Abdeslam Boularias, Brahim Chaib-draa |
Predictive representations for policy gradient in POMDPs.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Patrick Dallaire, Camille Besse, Stéphane Ross, Brahim Chaib-draa |
Bayesian reinforcement learning in continuous POMDPs with gaussian processes.  |
IROS  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Yunxia Chen, Qing Zhao, Ananthram Swami |
Distributed Spectrum Sensing and Access in Cognitive Radio Networks With Energy Constraint.  |
IEEE Transactions on Signal Processing  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Sylvie C. W. Ong, David Hsu, Wee Sun Lee, Hanna Kurniawati |
Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction.  |
HCI  |
2009 |
DBLP DOI BibTeX RDF |
human-computer interaction, Sign language recognition, planning under uncertainty |
| 1 | Prashant Doshi, Yifeng Zeng, Qiongyu Chen |
Graphical models for interactive POMDPs: representations and solutions.  |
Autonomous Agents and Multi-Agent Systems  |
2009 |
DBLP DOI BibTeX RDF |
Interactive POMDPs, Sequential multiagent decision making, Probabilistic graphical models |
| 1 | Yanjie Li, Baoqun Yin, Hongsheng Xi |
Partially Observable Markov Decision Processes and Performance Sensitivity Analysis.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Yaodong Ni, Zhi-Qiang Liu |
Bounded-Parameter Partially Observable Markov Decision Processes.  |
ICAPS  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Will Thompson, Darren Gergle |
Modeling situated conversational agents as partially observable Markov decision processes.  |
IUI  |
2008 |
DBLP DOI BibTeX RDF |
situated conversational agents, decision-theoretic planning |
| 1 | Sven R. Schmidt-Rohr, Steffen Knoop, Martin Lösch, Rüdiger Dillmann |
Reasoning for a multi-modal service robot considering uncertainty in human-robot interaction.  |
HRI  |
2008 |
DBLP DOI BibTeX RDF |
robot decision making, pomdp, HRI |
| 1 | Finale Doshi, Joelle Pineau, Nicholas Roy |
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Enlu Zhou, Michael C. Fu, Steven I. Marcus |
A density projection approach to dimension reduction for continuous-state POMDPs.  |
CDC  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chaib-draa |
Prediction-Directed Compression of POMDPs.  |
ICMLA  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Stéphane Ross, Brahim Chaib-draa, Joelle Pineau |
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation.  |
ICRA  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Ai-Hua Bian, Chong-Jun Wang, Shifu Chen |
Preprocessing for Point-Based Algorithms of POMDPs.  |
ICTAI  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Zonghua Zhang, Farid Naït-Abdesselam, Pin-Han Ho |
Boosting Markov Reward Models for Probabilistic Security Evaluation by Characterizing Behaviors of Attacker and Defender.  |
ARES  |
2008 |
DBLP DOI BibTeX RDF |
Network security, anomaly detection, security evaluation, Markov Reward Models |
| 1 | Masoumeh T. Izadi, Doina Precup |
Point-Based Planning for Predictive State Representations.  |
Canadian Conference on AI  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Sarah Filippi, Olivier Cappé, Fabrice Clérot, Eric Moulines |
A Near Optimal Policy for Channel Allocation in Cognitive Radio.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Christel Baier, Nathalie Bertrand, Marcus Größer |
On Decision Problems for Probabilistic Büchi Automata.  |
FoSSaCS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Steven David Prestwich, Armagan Tarim, Roberto Rossi, Brahim Hnich |
A Cultural Algorithm for POMDPs from Stochastic Inventory Control.  |
Hybrid Metaheuristics  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Jesse Hoey, James J. Little |
Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes.  |
IEEE Trans. Pattern Anal. Mach. Intell.  |
2007 |
DBLP DOI BibTeX RDF |
machine learning, dynamic programming, motion, video analysis, statistical models, clustering algorithms, control theory, Face and gesture recognition, parameter learning |
| 1 | Abraham Grosfeld-Nir |
Control limits for two-state partially observable Markov decision processes.  |
European Journal of Operational Research  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Shihao Ji, Ronald Parr, Lawrence Carin |
Nonmyopic Multiaspect Sensing With Partially Observable Markov Decision Processes.  |
IEEE Transactions on Signal Processing  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Hideaki Itoh, Kiyohiko Nakamura |
Partially observable Markov decision processes with imprecise parameters.  |
Artif. Intell.  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Jason D. Williams, Steve Young |
Partially observable Markov decision processes for spoken dialog systems.  |
Computer Speech & Language  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Nicholas Armstrong-Crews, Manuela M. Veloso |
Oracular Partially Observable Markov Decision Processes: A Very Special Case.  |
ICRA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Doran Chakraborty, Sandip Sen |
Distributed intrusion detection in partially observable Markov decision processes.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
fault tolerance, multiagent planning |
| 1 | Olivier Buffet, Alain Dutech, François Charpillet |
Shaping multi-agent systems with gradient reinforcement learning.  |
Autonomous Agents and Multi-Agent Systems  |
2007 |
DBLP DOI BibTeX RDF |
Policy-gradient, Multi-agent systems, Reinforcement learning, Shaping, Partially observable Markov decision processes |
| 1 | Anton Chechetka, Katia P. Sycara |
Subjective approximate solutions for decentralized POMDPs.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
perception and action, coordination, cooperation, teamwork, multiagent planning |
| 1 | Prashant Doshi, Yifeng Zeng, Qiongyu Chen |
Graphical models for online solutions to interactive POMDPs.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
dynamic influence diagrams, decision-making, agent modeling |
| 1 | Kyle Polich, Piotr J. Gmytrasiewicz |
Interactive dynamic influence diagrams.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Ping Xuan |
Modeling plan coordination in multiagent decision processes.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
coordination, multiagent systems, cooperation, teamwork, multiagent planning |
| 1 | Maciej A. Mazurowski, Jacek M. Zurada |
Solving decentralized multi-agent control problems with genetic algorithms.  |
IEEE Congress on Evolutionary Computation  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Le Tien Dung, Takashi Komeda, Motoki Takagi |
Mixed Reinforcement Learning for Partially Observable Markov Decision Process.  |
CIRA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Mohammad Rezaeian |
Sensor Scheduling for Optimal Observability Using Estimation Entropy.  |
PerCom Workshops  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | J. D. Williams, S. Young |
Scaling POMDPs for Spoken Dialog Management.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Stephan Timmer, Martin Riedmiller |
Safe Q-Learning on Complete History Spaces.  |
ECML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Daan Wierstra, Jürgen Schmidhuber |
Policy Gradient Critics.  |
ECML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki |
Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs.  |
PRIMA  |
2007 |
DBLP DOI BibTeX RDF |
Trembling-hand perfect equilibrium, Multiagent systems, Nash equilibrium, Partially Observable Markov Decision Process |
| 1 | Francisco S. Melo, M. Isabel Ribeiro |
Transition Entropy in Partially Observable Markov Decision Processes.  |
IAS  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Hui Li, Xuejun Liao, Lawrence Carin |
Region-based value iteration for partially observable Markov decision processes.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Nathan Brannon, Gregory Conrad, Timothy Draelos, John Seiffertt, Donald C. Wunsch |
Information Fusion and Situation Awareness using ARTMAP and Partially Observable Markov Decision Processes.  |
IJCNN  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos A. Vlassis |
Decentralized planning under uncertainty for teams of communicating agents.  |
AAMAS  |
2006 |
DBLP DOI BibTeX RDF |
decentralized POMDPs, artificial intelligence, planning under uncertainty, cooperative multiagent systems |
| 1 | Deepak Verma, Rajesh P. N. Rao |
Planning and Acting in Uncertain Environments using Probabilistic Inference.  |
IROS  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Masoumeh T. Izadi, Doina Precup, Danielle Azar |
Belief Selection in Point-Based Planning Algorithms for POMDPs.  |
Canadian Conference on AI  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Jeongsoo Han |
Network-Adaptive QoS Routing Using Local Information.  |
APNOMS  |
2006 |
DBLP DOI BibTeX RDF |
Localized Adaptive QoS Routing, Exploration Bonus, Certainty Equivalency Approximation, Edge-disjoint multi-path, Reinforcement Learning, POMDP |
| 1 | Joaquín Lopez Fernández, Rafael Sanz, Reid G. Simmons, Amador R. Diéguez |
Heuristic anytime approaches to stochastic decision processes.  |
J. Heuristics  |
2006 |
DBLP DOI BibTeX RDF |
Planning, Heuristic algorithms, POMDP, Partially observable Markov decision process, Decision Systems |
| 1 | Hiroshi Osada, Satoshi Fujita |
CHQ: A Multi-Agent Reinforcement Learning Scheme for Partially Observable Markov Decision Processes.  |
IEICE Transactions  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Koichiro Takita, Masafumi Hagiwara |
A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes.  |
Systems and Computers in Japan  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Robin Jaulmes, Joelle Pineau, Doina Precup |
Active Learning in Partially Observable Markov Decision Processes.  |
ECML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | María Elena López Guillén, Luis Miguel Bergasa, Rafael Barea, María Soledad Escudero |
A Navigation System for Assistant Robots Using Visually Augmented POMDPs.  |
Auton. Robots  |
2005 |
DBLP DOI BibTeX RDF |
probabilistic navigation, multisensorial fusion, assistant robots, Partially Observable Markov Decision Processes, planning under uncertainty |
| 1 | Anthony R. Cassandra, Marian H. Nodine, Shilpa Bondale, Steve Ford, David L. Wells |
Using decision-theoretic models to enhance agent system survivability.  |
AAMAS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Chenggang Wang, James G. Schmolze |
Planning with POMDPs Using a Compact, Logic-Based Representation.  |
ICTAI  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | J. S. Ivy, S. M. Pollock |
Marginally monotonic maintenance policies for a multi-state deteriorating machine with probabilistic monitoring, and silent failures.  |
IEEE Transactions on Reliability  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Francisco Martín, Vicente Matellán, José María Cañas, Pablo Barrera |
Visual Based Localization for a Legged Robot.  |
RoboCup  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | María Elena López Guillén, Rafael Barea, Luis Miguel Bergasa, María Soledad Escudero |
A Human-Robot Cooperative Learning System for Easy Installation of Assistant Robots in New Working Environments.  |
Journal of Intelligent and Robotic Systems  |
2004 |
DBLP DOI BibTeX RDF |
probabilistic navigation, learning under uncertainty, expectation-maximization algorithm, assistant robots, partially observable Markov decision processes |
| 1 | Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian Thrun |
Learning low dimensional predictive representations.  |
ICML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Daan Wierstra, Marco Wiering |
Utile distinction hidden Markov models.  |
ICML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Piotr J. Gmytrasiewicz, Prashant Doshi |
Interactive POMDPs: Properties and Preliminary Results.  |
AAMAS  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Hiroshi Osada, Satoshi Fujita |
CHQ: A Multi-Agent Reinforcement Learning Scheme CHQ: A Multi-Agent Reinforcement Learning Scheme.  |
IAT  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Georgios Theocharous, Kevin P. Murphy, Leslie Pack Kaelbling |
Representing Hierarchical POMDPs as DBNs for Multi-scale Robot Localization.  |
ICRA  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Arthur Plínio de S. Braga, Aluizio F. R. Araújo, Jeremy Wyatt |
Incremental topological reinforcement learning agent in non-structured environments.  |
SMC  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Rinat Khoussainov |
Towards Well-Defined Multi-agent Reinforcement Learning.  |
AIMSA  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Gabriel Catalin Balan, Sean Luke |
A Demonstration of Neural Programming Applied to Non-Markovian Problems.  |
GECCO  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Piotr J. Gmytrasiewicz |
Issues in Rational Planning in Multi-Agent Settings. (PDF / PS)  |
HICSS  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasiewicz |
Formalizing Multi-Agent POMDP's in the context of network routing. (PDF / PS)  |
HICSS  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun |
Planning under Uncertainty for Reliable Health Care Robotics.  |
FSR  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Blai Bonet |
An epsilon-Optimal Grid-Based Algorithm for Partially Observable Markov Decision Processes.  |
ICML  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Iadine Chades, Bruno Scherrer, François Charpillet |
A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem.  |
SAC  |
2002 |
DBLP DOI BibTeX RDF |
decision theoretic agents, multiagent systems |
| 1 | Bruno Scherrer, François Charpillet |
Cooperative Co-Learning: A Model-Based Approach for Solving Multi Agent Reinforcement Problems.  |
ICTAI  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Martijn C. Schut, Michael Wooldridge, Simon Parsons |
On Partially Observable MDPs and BDI Models.  |
Foundations and Applications of Multi-Agent Systems  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Christopher Lusena, Judy Goldsmith, Martin Mundhenk |
Nonapproximability Results for Partially Observable Markov Decision Processes.  |
J. Artif. Intell. Res. (JAIR)  |
2001 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 123 (100 per page; Change: ) Pages: [ 1][ 2][ >>] |
|