The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for POMDPs with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1996-2000 (16) 2001-2002 (16) 2003-2004 (17) 2005 (28) 2006 (26) 2007 (43) 2008 (45) 2009 (25) 2010 (34) 2011 (28) 2012 (45) 2013 (49) 2014 (27) 2015 (28) 2016 (32) 2017 (27) 2018 (43) 2019 (38) 2020 (30) 2021 (45) 2022 (47) 2023 (61) 2024 (10)
Publication types (Num. hits)
article(263) book(1) data(7) incollection(4) inproceedings(478) phdthesis(7)
Venues (Conferences, Journals, ...)
CoRR(176) AAAI(48) AAMAS(48) IJCAI(40) UAI(30) ICRA(23) ICML(22) ICAPS(17) NIPS(17) J. Artif. Intell. Res.(13) NeurIPS(13) AAMAS (1)(10) IROS(9) CDC(8) Artif. Intell.(7) IEEE Trans. Autom. Control.(7) More (+10 of total 191)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 88 occurrences of 55 keywords

Results
Found 760 publication records. Showing 760 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
102Kyle Polich, Piotr J. Gmytrasiewicz Interactive dynamic influence diagrams. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
99Prashant Doshi, Yifeng Zeng, Qiongyu Chen Graphical models for interactive POMDPs: representations and solutions. Search on Bibsonomy Auton. Agents Multi Agent Syst. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Interactive POMDPs, Sequential multiagent decision making, Probabilistic graphical models
88Nevin Lianwen Zhang, Weihong Zhang Space-Progressive Value Iteration: An Anytime Algorithm for a Class of POMDPs. Search on Bibsonomy ECSQARU The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
74Steven D. Prestwich, Armagan Tarim, Roberto Rossi 0002, Brahim Hnich A Cultural Algorithm for POMDPs from Stochastic Inventory Control. Search on Bibsonomy Hybrid Metaheuristics The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
74Makoto Tasaki, Yuichi Yabu, Yuki Iwanari, Makoto Yokoo, Milind Tambe, Janusz Marecki, Pradeep Varakantham Introducing Communication in Dis-POMDPs with Locality of Interaction. Search on Bibsonomy IAT The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
74Prashant Doshi, Yifeng Zeng, Qiongyu Chen Graphical models for online solutions to interactive POMDPs. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF dynamic influence diagrams, decision-making, agent modeling
74Bharaneedharan Rathnasabapathy, Prashant Doshi, Piotr J. Gmytrasiewicz Exact solutions of interactive POMDPs using behavioral equivalence. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
74Georgios Theocharous, Kevin P. Murphy, Leslie Pack Kaelbling Representing Hierarchical POMDPs as DBNs for Multi-scale Robot Localization. Search on Bibsonomy ICRA The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
73Weihong Zhang Value Iteration over Belief Subspace. Search on Bibsonomy ECSQARU The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
70Akshat Kumar, Shlomo Zilberstein Constraint-based dynamic programming for decentralized POMDPs with structured interactions. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF DEC-POMDPs, multiagent planning
70Frans A. Oliehoek, Nikos Vlassis Q-value functions for decentralized POMDPs. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF decentralized POMDPs, planning under uncertainty, cooperative multiagent systems
70Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein Solving POMDPs using quadratically constrained linear programs. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF optimization, POMDPs, planning under uncertainty
59Michael R. James 0001, Satinder Singh 0001 SarsaLandmark: an algorithm for learning in POMDPs with landmarks. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF reinforcement learning, landmark, POMDP, partial observability
59Enlu Zhou, Michael C. Fu 0001, Steven I. Marcus A density projection approach to dimension reduction for continuous-state POMDPs. Search on Bibsonomy CDC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
59Pradeep Varakantham, Janusz Marecki, Yuichi Yabu, Milind Tambe, Makoto Yokoo Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF distributed POMDP, globally optimal solution, partially observable markov decision process (POMDP), multi-agent systems
59Pradeep Varakantham, Rajiv T. Maheswaran, Milind Tambe Implementation Techniques for Solving POMDPs in Personal Assistant Agents. Search on Bibsonomy PROMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
59Chenggang Wang, James G. Schmolze Planning with POMDPs Using a Compact, Logic-Based Representation. Search on Bibsonomy ICTAI The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
59Pradeep Varakantham, Rajiv T. Maheswaran, Milind Tambe Exploiting belief bounds: practical POMDPs for personal assistant agents. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF meeting rescheduling, partially observable markov decision process (POMDP), task allocation
59Piotr J. Gmytrasiewicz, Prashant Doshi Interactive POMDPs: Properties and Preliminary Results. Search on Bibsonomy AAMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
58Diego R. Pereira, Luciano V. Gonçalves, Graçaliz Pereira Dimuro, Antônio Carlos da Rocha Costa Towards the Self-regulation of Personality-Based Social Exchange Processes in Multiagent Systems. Search on Bibsonomy SBIA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF self-regulation of social exchanges, Belief-Desire-Intention, multiagent systems, social simulation, Partially Observable Markov Decision Process
58Maayan Roth, Reid G. Simmons, Manuela M. Veloso Reasoning about joint beliefs for execution-time communication decisions. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF communication, POMDP, distributed execution, robot teams
55Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Brahim Chaib-draa Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF decentralized pomdps, point-based solver, artificial intelligence, branch-and-bound, planning under uncertainty
55Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J. Spaan Lossless clustering of histories in decentralized POMDPs. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF decentralized POMDPs, planning under uncertainty, cooperative multiagent systems
45Abdeslam Boularias, Brahim Chaib-draa Predictive representations for policy gradient in POMDPs. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
45Noel Welsh, Jeremy L. Wyatt United We Stand: Population Based Methods for Solving Unknown POMDPs. Search on Bibsonomy EWRL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
45Finale Doshi, Joelle Pineau, Nicholas Roy Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. Search on Bibsonomy ICML The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
45Jason D. Williams, S. Young Scaling POMDPs for Spoken Dialog Management. Search on Bibsonomy IEEE Trans. Speech Audio Process. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
45Anton Chechetka, Katia P. Sycara Subjective approximate solutions for decentralized POMDPs. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF perception and action, coordination, cooperation, teamwork, multiagent planning
45Pradeep Varakantham, Ranjit Nair, Milind Tambe, Makoto Yokoo Winning back the CUP for distributed POMDPs: planning over continuous belief spaces. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF continuous initial beliefs, distributed POMDP, partially observable Markov decision process (POMDP), multi-agent systems
44Masoumeh T. Izadi, Doina Precup Point-Based Planning for Predictive State Representations. Search on Bibsonomy Canadian AI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
44Milind Tambe, Emma Bowring, Hyuckchul Jung, Gal A. Kaminka, Rajiv T. Maheswaran, Janusz Marecki, Pragnesh Jay Modi, Ranjit Nair, Stephen Okamoto, Jonathan P. Pearce, Praveen Paruchuri, David V. Pynadath, Paul Scerri, Nathan Schurr, Pradeep Varakantham Conflicts in teamwork: hybrids to the rescue. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF game theory, BDI, POMDP, DCOP
40Matthijs T. J. Spaan, Geoffrey J. Gordon, Nikos Vlassis Decentralized planning under uncertainty for teams of communicating agents. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF decentralized POMDPs, artificial intelligence, planning under uncertainty, cooperative multiagent systems
31Jonathan Cohen 0001 Formation dynamique d'équipes dans les DEC-POMDPS ouverts à base de méthodes Monte-Carlo. (Dynamic team formation in open DEC-POMDPs with Monte-Carlo methods). Search on Bibsonomy 2019   RDF
31Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs. Search on Bibsonomy Auton. Agents Multi Agent Syst. The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
31Ranjit Nair, Pradeep Varakantham, Milind Tambe, Makoto Yokoo Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs. Search on Bibsonomy AAAI The full citation details ... 2005 DBLP  BibTeX  RDF
31Ranjit Nair, Pradeep Varakantham, Milind Tambe, Makoto Yokoo Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs. Search on Bibsonomy IJCAI The full citation details ... 2005 DBLP  BibTeX  RDF
30Patrick Dallaire, Camille Besse, Stéphane Ross, Brahim Chaib-draa Bayesian reinforcement learning in continuous POMDPs with gaussian processes. Search on Bibsonomy IROS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
30Vikram Krishnamurthy Optimal Threshold Policies for Multivariate Stopping-Time POMDPs. Search on Bibsonomy ECSQARU The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
30Christopher Amato, Shlomo Zilberstein Achieving goals in decentralized POMDPs. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF
30Stéphane Ross, Brahim Chaib-draa, Joelle Pineau Bayesian reinforcement learning in continuous POMDPs with application to robot navigation. Search on Bibsonomy ICRA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30Nicholas Armstrong-Crews, Manuela M. Veloso An approximate algorithm for solving oracular POMDPs. Search on Bibsonomy ICRA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chaib-draa Prediction-Directed Compression of POMDPs. Search on Bibsonomy ICMLA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30François Laviolette, Ludovic Tobin A Stochastic Point-Based Algorithm for POMDPs. Search on Bibsonomy Canadian AI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30Ai-Hua Bian, Chong-Jun Wang, Shifu Chen Preprocessing for Point-Based Algorithms of POMDPs. Search on Bibsonomy ICTAI (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30Feng Wu 0001, Xiaoping Chen Solving Large-Scale and Sparse-Reward DEC-POMDPs with Correlation-MDPs. Search on Bibsonomy RoboCup The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
30Daan Wierstra, Alexander Förster, Jan Peters 0001, Jürgen Schmidhuber Solving Deep Memory POMDPs with Recurrent Policy Gradients. Search on Bibsonomy ICANN (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
30Masoumeh T. Izadi, Doina Precup, Danielle Azar Belief Selection in Point-Based Planning Algorithms for POMDPs. Search on Bibsonomy Canadian AI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
30David J. Montana, Eric Van Wyk, Marshall Brinn, Joshua Montana, Stephen Milligan Genomic computing networks learn complex POMDPs. Search on Bibsonomy GECCO The full citation details ... 2006 DBLP  DOI  BibTeX  RDF POMDP, evolutionary neural networks
30Sébastien Paquet, Ludovic Tobin, Brahim Chaib-draa Real-Time Decision Making for Large POMDPs. Search on Bibsonomy Canadian AI The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
30Ranjit Nair, Milind Tambe, Maayan Roth, Makoto Yokoo Communication for Improving Policy Computation in Distributed POMDPs. Search on Bibsonomy AAMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
29Yanjie Li, Baoqun Yin, Hongsheng Xi Partially Observable Markov Decision Processes and Performance Sensitivity Analysis. Search on Bibsonomy IEEE Trans. Syst. Man Cybern. Part B The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
29Kazuteru Miyazaki, Shigenobu Kobayashi Proposal of Exploitation-Oriented Learning PS-r#. Search on Bibsonomy IDEAL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
29Daan Wierstra, Tom Schaul, Jan Peters 0001, Jürgen Schmidhuber Episodic Reinforcement Learning by Logistic Reward-Weighted Regression. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
29Yang Xiang 0004, Franklin Hanshar Planning in Multiagent Expedition with Collaborative Design Networks. Search on Bibsonomy Canadian AI The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
29Le Tien Dung, Takashi Komeda, Motoki Takagi Mixed Reinforcement Learning for Partially Observable Markov Decision Process. Search on Bibsonomy CIRA The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
29Daan Wierstra, Jürgen Schmidhuber Policy Gradient Critics. Search on Bibsonomy ECML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
29Finale Doshi, Nicholas Roy Efficient model learning for dialog management. Search on Bibsonomy HRI The full citation details ... 2007 DBLP  DOI  BibTeX  RDF human-robot interaction, decision-making under uncertainty, model learning
29Deepak Verma, Rajesh P. N. Rao Planning and Acting in Uncertain Environments using Probabilistic Inference. Search on Bibsonomy IROS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
29Joelle Pineau, Geoffrey J. Gordon POMDP Planning for Robust Robot Control. Search on Bibsonomy ISRR The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
29Anthony R. Cassandra, Marian H. Nodine, Shilpa Bondale, Steve Ford, David L. Wells Using decision-theoretic models to enhance agent system survivability. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
29Sébastien Paquet, Ludovic Tobin, Brahim Chaib-draa An online POMDP algorithm for complex multiagent environments. Search on Bibsonomy AAMAS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF online search, POMDP
29Ranjit Nair, Milind Tambe Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach. Search on Bibsonomy PROMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
29Rinat Khoussainov Towards Well-Defined Multi-agent Reinforcement Learning. Search on Bibsonomy AIMSA The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
29David V. Pynadath, Stacy Marsella Fitting and Compilation of Multiagent Models through Piecewise Linear Functions. Search on Bibsonomy AAMAS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
29Martijn C. Schut, Michael J. Wooldridge, Simon Parsons On Partially Observable MDPs and BDI Models. Search on Bibsonomy Foundations and Applications of Multi-Agent Systems The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
29Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber Market-Based Reinforcement Learning in Partially Observable Worlds. Search on Bibsonomy ICANN The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
29Martin Mundhenk, Judy Goldsmith, Eric Allender The Complexity of Policy Evaluation for Finite-Horizon Partially-Observable Markov Decision Processes. Search on Bibsonomy MFCS The full citation details ... 1997 DBLP  DOI  BibTeX  RDF
25Stéphane Ross, Masoumeh T. Izadi, Mark Mercer, David L. Buckeridge Sensitivity Analysis of POMDP Value Functions. Search on Bibsonomy ICMLA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Value Function Error Bound, Perturbation Analysis, POMDPs
25Simon Andrew Williamson, Enrico H. Gerding, Nicholas R. Jennings Reward shaping for valuing communications during multi-agent coordination. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF decentralised POMDPs, communication, agents
25Xi-Ren Cao Basic Ideas for Event-Based Optimization of Markov Systems. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF Markov decision processes (MDPs), performance potentials, policy gradients, aggregation, perturbation analysis, POMDPs, policy iteration
16Or Wertheim, Dan R. Suissa, Ronen I. Brafman Plug'n Play Task-Level Autonomy for Robotics Using POMDPs and Probabilistic Programs. Search on Bibsonomy IEEE Robotics Autom. Lett. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Daniele Meli, Alberto Castellini, Alessandro Farinelli Learning Logic Specifications for Policy Guidance in POMDPs: an Inductive Logic Programming Approach. Search on Bibsonomy J. Artif. Intell. Res. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16J.-Anne Yow, Neha Priyadarshini Garg, Wei Tech Ang Shared Autonomy of a Robotic Manipulator for Grasping Under Human Intent Uncertainty Using POMDPs. Search on Bibsonomy IEEE Trans. Robotics The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Daniele Meli, Alberto Castellini, Alessandro Farinelli Learning Logic Specifications for Policy Guidance in POMDPs: an Inductive Logic Programming Approach. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Johan Peralez, Aurélien Delage, Olivier Buffet, Jilles Steeve Dibangoye Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Michael Lanier, Ying Xu, Nathan Jacobs, Chongjie Zhang, Yevgeniy Vorobeychik Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Huifan Gao, Yifeng Zeng, Yinghui Pan Inducing Individual Students' Learning Strategies through Homomorphic POMDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Yannick Eich, Bastian Alt, Heinz Koeppl Approximate Control for Continuous-Time POMDPs. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Maris F. L. Galesloot, Thiago D. Simão, Sebastian Junges, Nils Jansen 0001 Factored Online Planning in Many-Agent POMDPs. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
16Yannick Eich, Bastian Alt, Heinz Koeppl Approximate Control for Continuous-Time POMDPs. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
16Wei Zheng, Hai Lin 0002 Provable-Correct Partitioning Approach for Continuous-Observation POMDPs With Special Observation Distributions. Search on Bibsonomy IEEE Control. Syst. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Manav Vora, Pranay Thangeda, Michael N. Grussing, Melkior Ornik Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs. Search on Bibsonomy IEEE Control. Syst. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Marijana Peti, Frano Petric, Stjepan Bogdan Decentralized Coordination of Multi-Agent Systems Based on POMDPs and Consensus for Active Perception. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Giacomo Arcieri, Cyprien Hoelzl, Oliver Schwery, Daniel Straub, Konstantinos G. Papakonstantinou, Eleni N. Chatzi Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems. Search on Bibsonomy Reliab. Eng. Syst. Saf. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Junchao Li, Mingyu Cai, Zhaoan Wang, Shaoping Xiao Model-based motion planning in POMDPs with temporal logic specifications. Search on Bibsonomy Adv. Robotics The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Moran Barenboim, Moshe Shienman, Vadim Indelman Monte Carlo Planning in Hybrid Belief POMDPs. Search on Bibsonomy IEEE Robotics Autom. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Franck Djeumou, Christian Ellis, Murat Cubuktepe, Craig Lennon, Ufuk Topcu Task-guided IRL in POMDPs that scales. Search on Bibsonomy Artif. Intell. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Majid Khonji Approximability and efficient algorithms for constrained fixed-horizon POMDPs with durative actions. Search on Bibsonomy Artif. Intell. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Timothy L. Molloy, Girish N. Nair Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs. Search on Bibsonomy IEEE Trans. Autom. Control. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Victor Cohen, Axel Parmentier Future memories are not needed for large classes of POMDPs. Search on Bibsonomy Oper. Res. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg Optimality Guarantees for Particle Belief Approximation of POMDPs. Search on Bibsonomy J. Artif. Intell. Res. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Maris F. L. Galesloot, Thiago D. Simão, Sebastian Junges, Nils Jansen 0001 Factored Online Planning in Many-Agent POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Hai Nguyen, Sammie Katt, Yuchen Xiao, Christopher Amato On-Robot Bayesian Reinforcement Learning for POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Rui Yan 0002, Gabriel Santos, Gethin Norman, David Parker 0001, Marta Kwiatkowska Point-based Value Iteration for Neuro-Symbolic POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Jonathan N. Lee, Alekh Agarwal, Christoph Dann, Tong Zhang 0001 Learning in POMDPs is Sample-Efficient with Hindsight Observability. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Thiago D. Simão, Marnix Suilen, Nils Jansen 0001 Safe Policy Improvement for POMDPs via Finite-State Controllers. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Manav Vora, Pranay Thangeda, Michael N. Grussing, Melkior Ornik Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Marcus Hörger, Hanna Kurniawati, Dirk P. Kroese, Nan Ye Adaptive Discretization using Voronoi Trees for Continuous POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Soichiro Nishimori, Sotetsu Koyamada, Shin Ishii End-to-End Policy Gradient Method for POMDPs and Explainable Agents. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
16Roman Andriushchenko, Alexander Bork, Milan Ceska 0002, Sebastian Junges, Joost-Pieter Katoen, Filip Macák Search and Explore: Symbiotic Policy Synthesis in POMDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 760 (100 per page; Change: )
Pages: [1][2][3][4][5][6][7][8][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license