|
|
|
|
Venues (Conferences, Journals, ...)
|
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 358 occurrences of 208 keywords
|
|
|
|
|
Results
Found 674 publication records. Showing 674 according to the selection in the facets
| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 3 | Juan Frausto Solís, Elizabeth Santiago D., Jaime Mora-Vargas |
Cosine Policy Iteration for Solving Infinite-Horizon Markov Decision Processes.  |
MICAI  |
2009 |
DBLP DOI BibTeX RDF |
cosine simplex method, Markov decision processes, hybrid method, policy iteration |
| 3 | Mohammed Shahid Abdulla, Shalabh Bhatnagar |
Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes.  |
Discrete Event Dynamic Systems  |
2007 |
DBLP DOI BibTeX RDF |
Actor-critic algorithms, Two timescale stochastic approximation, Simultaneous perturbation stochastic approximation, Normalized Hadamard matrices, TD-learning, Reinforcement learning, Markov decision processes, Policy iteration |
| 3 | Qi Sui, Haiyang Wang |
A Dynamic Generation Algorithm for Meta Process Using Markov Decision Processes.  |
IMSCCS  |
2006 |
DBLP DOI BibTeX RDF |
Services Composition, Markov Decision Processes, Services Computing |
| 3 | Osman Abul, Reda Alhajj, Faruk Polat |
Markov Decision Processes Based Optimal Control Policies for Probabilistic Boolean Network.  |
BIBE  |
2004 |
DBLP DOI BibTeX RDF |
probabilistic boolean networks, monitoring, Markov decision processes, optimal control |
| 3 | Andrew G. Barto, Sridhar Mahadevan |
Recent Advances in Hierarchical Reinforcement Learning.  |
Discrete Event Dynamic Systems  |
2003 |
DBLP DOI BibTeX RDF |
reinforcement learning, hierarchy, Markov decision processes, temporal abstraction, semi-Markov decision processes |
| 3 | Bruno Scherrer, François Charpillet |
Coevolutive planning in markov decision processes.  |
AAMAS  |
2002 |
DBLP DOI BibTeX RDF |
coordinating multi-agent & activites, evolution adaptation and learning, markov decision processes, action selection and planning |
| 3 | Pierre Laroche |
Building efficient partial plans using Markov decision processes.  |
ICTAI  |
2000 |
DBLP DOI BibTeX RDF |
efficient partial plan building, optimal action sequences, actuator uncertainties, goal state, uncertainty, planning, mobile robots, mobile robot, Markov processes, directed graphs, directed graph, path planning, Markov decision processes, decision theory, state space, uncertainty handling |
| 2 | Henri Hansen, Marta Z. Kwiatkowska, Hongyang Qu |
Partial Order Reduction for Model Checking Markov Decision Processes under Unconditional Fairness.  |
QEST  |
2011 |
DBLP DOI BibTeX RDF |
unconditional fairness, Markov decision processes, partial order reduction, Probabilistic model checking |
| 2 | Omid Madani, Mikkel Thorup, Uri Zwick |
Discounted deterministic Markov decision processes and discounted all-pairs shortest paths.  |
SODA  |
2009 |
DBLP DOI BibTeX RDF |
|
| 2 | Husain Aljazzar, Stefan Leue |
Generation of Counterexamples for Model Checking of Markov Decision Processes.  |
QEST  |
2009 |
DBLP DOI BibTeX RDF |
Stochastic Model Checking, $k$-Shortest-Paths Search, K$^*$, Markov Decision Processes, Counterexamples, Directed Search |
| 2 | Stefan J. Witwicki, Edmund H. Durfee |
Flexible approximation of structured interactions in decentralized Markov decision processes.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
decentralized markov decision processes, event-driven interactions, multiagent systems, commitments |
| 2 | Scott Proper, Prasad Tadepalli |
Solving multiagent assignment Markov decision processes.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
coordination graphs, reinforcement learning, Markov decision processes, assignment problem |
| 2 | Martin R. Neuhäußer, Mariëlle Stoelinga, Joost-Pieter Katoen |
Delayed Nondeterminism in Continuous-Time Markov Decision Processes.  |
FOSSACS  |
2009 |
DBLP DOI BibTeX RDF |
|
| 2 | Lihong Li, Michael L. Littman, Christopher R. Mansley |
Online exploration in least-squares policy iteration.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
PAC-MDP, least-squares policy iteration (LSPI), reinforcement learning, Markov decision processes, exploration |
| 2 | Xin Li, Qianchuan Zhao, Xiaohong Guan, Lang Tong |
On the performance of cognitive access with periodic spectrum sensing.  |
MOBICOM-CoRoNet  |
2009 |
DBLP DOI BibTeX RDF |
constrained markov decision processes, resource allocation, dynamic spectrum access |
| 2 | Yanjie Li, Baoqun Yin, Hongsheng Xi |
Partially Observable Markov Decision Processes and Performance Sensitivity Analysis.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Tomás Brázdil, Vojtech Forejt, Antonín Kucera |
Controller Synthesis and Verification for Markov Decision Processes with Qualitative Branching Time Objectives.  |
ICALP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith |
Information state for Markov decision processes with network delays.  |
CDC  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Yuki Taniguchi, Takeshi Mori, Shin Ishii |
A Continuous Internal-State Controller for Partially Observable Markov Decision Processes.  |
ICANN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Pritam Roy, David Parker, Gethin Norman, Luca de Alfaro |
Symbolic Magnifying Lens Abstraction in Markov Decision Processes.  |
QEST  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Calin Ciufudean, Otilia Ciufudean, Constantin Filote |
New Models for Immune Mechanism Diagnosis.  |
MDA  |
2008 |
DBLP DOI BibTeX RDF |
Markov Decision Processes (MDPs), Immune mechanisms diagnosis, Petri nets |
| 2 | Sankalp S. Kallakuri, Alex Doboli |
Customization of Arbitration Policies and Buffer Space Distribution Using Continuous-Time Markov Decision Processes.  |
IEEE Trans. VLSI Syst.  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Tomás Brázdil, Vojtech Forejt |
Strategy Synthesis for Markov Decision Processes and Branching-Time Logics.  |
CONCUR  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Martin R. Neuhäußer, Joost-Pieter Katoen |
Bisimulation and Logical Preservation for Continuous-Time Markov Decision Processes.  |
CONCUR  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Joke Lambert, Benny Van Houdt, Chris Blondia |
A policy iteration algorithm for Markov decision processes skip-free in one direction.  |
VALUETOOLS  |
2007 |
DBLP DOI BibTeX RDF |
fibre delay lines, policy iteration algorithm, skip-free in one direction, Markov decision process, matrix analytic methods, loss rate, optical buffer |
| 2 | Kousha Etessami, Marta Z. Kwiatkowska, Moshe Y. Vardi, Mihalis Yannakakis |
Multi-objective Model Checking of Markov Decision Processes.  |
TACAS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Elizabeth Novoa |
Simple Model-Based Exploration and Exploitation of Markov Decision Processes Using the Elimination Algorithm.  |
MICAI  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Anders Jonsson, Andrew G. Barto |
Active Learning of Dynamic Bayesian Networks in Markov Decision Processes.  |
SARA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Hugo Gimbert |
Pure Stationary Optimal Strategies in Markov Decision Processes.  |
STACS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Erick Delage, Shie Mannor |
Percentile optimization in uncertain Markov decision processes with application to efficient exploration.  |
ICML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Ronald Ortner |
Pseudometrics for State Aggregation in Average Reward Markov Decision Processes.  |
ALT  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Luca de Alfaro, Pritam Roy |
Magnifying-Lens Abstraction for Markov Decision Processes.  |
CAV  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Hugo Gimbert, Wieslaw Zielonka |
Limits of Multi-Discounted Markov Decision Processes.  |
LICS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Janusz Marecki, Milind Tambe |
On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
decentralized Markov decision process, locally optimal solution, multi-agent systems, temporal constraints |
| 2 | Ambuj Tewari, Peter L. Bartlett |
Bounded Parameter Markov Decision Processes with Average Reward Criterion.  |
COLT  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Baohua Li, Jennie Si |
Approximate Robust Policy Iteration for Discounted Infinite-Horizon Markov Decision Processes with Uncertain Stationary Parametric Transition Matrices.  |
IJCNN  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Hadi Bannazadeh, Alberto Leon-Garcia |
Allocating Services to Applications using Markov Decision Processes.  |
SOCA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Krishnendu Chatterjee |
Markov Decision Processes with Multiple Long-Run Average Objectives.  |
FSTTCS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 2 | Sooraj Bhat, David L. Roberts, Mark J. Nelson, Charles L. Isbell, Michael Mateas |
A globally optimal algorithm for TTD-MDPs.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
Markov decision processes, convex optimization, interactive entertainment |
| 2 | José Niño-Mora |
Characterization and computation of restless bandit marginal productivity indices.  |
VALUETOOLS  |
2007 |
DBLP DOI BibTeX RDF |
marginal productivity index, restless bandits, Markov decision processes, block algorithms, index policies |
| 2 | Christel Baier, Nathalie Bertrand, Ph. Schnoebelen |
Verifying nondeterministic probabilistic channel systems against ω-regular linear-time properties.  |
ACM Trans. Comput. Log.  |
2007 |
DBLP DOI BibTeX RDF |
lossy channels, probabilistic models, Communication protocols, Markov decision processes |
| 2 | Derek W. Seward, Conrad Pace, Rahee Agate |
Safe and effective navigation of autonomous robots in hazardous environments.  |
Auton. Robots  |
2007 |
DBLP DOI BibTeX RDF |
Task effective, Safety, Risk analysis, Autonomous vehicles, Partially observable Markov decision processes, Unstructured environments, Real-time control system, Robot architecture |
| 2 | Jennifer Boger, Jesse Hoey, Pascal Poupart, Craig Boutilier, Geoff Fernie, Alex Mihailidis |
A Planning System Based on Markov Decision Processes to Guide People With Dementia Through Activities of Daily Living.  |
IEEE Transactions on Information Technology in Biomedicine  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Tomás Brázdil, Václav Brozek, Vojtech Forejt, Antonín Kucera |
Reachability in Recursive Markov Decision Processes.  |
CONCUR  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Di Wu, Xenofon D. Koutsoukos |
Probabilistic Verification of Uncertain Systems Using Bounded-Parameter Markov Decision Processes.  |
MDAI  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Alberto Reyes, Luis Enrique Sucar, Eduardo F. Morales, Pablo H. Ibargüengoytia |
Solving Hybrid Markov Decision Processes.  |
MICAI  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Felipe W. Trevizan, Fabio Gagliardi Cozman, Leliane Nunes de Barros |
Unifying Nondeterministic and Probabilistic Planning Through Imprecise Markov Decision Processes.  |
IBERAMIA-SBIA  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Kaustubh R. Joshi, William H. Sanders, Matti A. Hiltunen, Richard D. Schlichting |
Automatic Recovery Using Bounded Partially Observable Markov Decision Processes.  |
DSN  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Kousha Etessami, Mihalis Yannakakis |
Efficient Qualitative Analysis of Classes of Recursive Markov Decision Processes and Simple Stochastic Games.  |
STACS  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Krishnendu Chatterjee, Rupak Majumdar, Thomas A. Henzinger |
Markov Decision Processes with Multiple Objectives.  |
STACS  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Marc Toussaint, Amos J. Storkey |
Probabilistic inference for solving discrete and continuous state Markov Decision Processes.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin |
Learning the structure of Factored Markov Decision Processes in reinforcement learning problems.  |
ICML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Marta Z. Kwiatkowska, Gethin Norman, David Parker |
Game-based Abstraction for Markov Decision Processes.  |
QEST  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Karel Sladký |
Risk-Sensitive Optimality Criteria in Markov Decision Processes.  |
OR  |
2006 |
DBLP DOI BibTeX RDF |
|
| 2 | Ji Wu, Chaoqun Ye, Shiyao Jin |
Opponent Learning for Multi-agent System Simulation.  |
RSKT  |
2006 |
DBLP DOI BibTeX RDF |
reinforcement learning, Markov decision processes, multi-agent simulation, Opponent modeling |
| 2 | Christel Baier, Frank Ciesinski, Marcus Größer |
ProbMela and verification of Markov decision processes.  |
SIGMETRICS Performance Evaluation Review  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Masami Kurano, Masami Yasuda, Jun-ichi Nakagami, Yuji Yoshida |
Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes.  |
MDAI  |
2005 |
DBLP DOI BibTeX RDF |
Fuzzy perceptive model, fuzzy perceptive reward, optimal policy function, Markov decision process |
| 2 | Kousha Etessami, Mihalis Yannakakis |
Recursive Markov Decision Processes and Recursive Stochastic Games.  |
ICALP  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Marcus Größer, Christel Baier |
Partial Order Reduction for Markov Decision Processes: A Survey.  |
FMCO  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Masoumeh T. Izadi, Doina Precup |
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes.  |
ECML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Khashayar Rohanimanesh, Sridhar Mahadevan |
Coarticulation: an approach for generating concurrent plans in Markov decision processes.  |
ICML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Aurélie Beynier, Abdel-Illah Mouaddib |
A polynomial algorithm for decentralized Markov decision processes with temporal constraints.  |
AAMAS  |
2005 |
DBLP DOI BibTeX RDF |
multi-agent systems, uncertainty, planning, Markov decision processes |
| 2 | Aiqiang Gao, Dongqing Yang, Shiwei Tang, Ming Zhang |
Web Service Composition Using Markov Decision Processes.  |
WAIM  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | Antonín Kucera, Oldrich Strazovský |
On the Controller Synthesis for Finite-State Markov Decision Processes.  |
FSTTCS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 2 | María Elena López Guillén, Luis Miguel Bergasa, Rafael Barea, María Soledad Escudero |
A Navigation System for Assistant Robots Using Visually Augmented POMDPs.  |
Auton. Robots  |
2005 |
DBLP DOI BibTeX RDF |
probabilistic navigation, multisensorial fusion, assistant robots, Partially Observable Markov Decision Processes, planning under uncertainty |
| 2 | Xi-Ren Cao |
Basic Ideas for Event-Based Optimization of Markov Systems.  |
Discrete Event Dynamic Systems  |
2005 |
DBLP DOI BibTeX RDF |
Markov decision processes (MDPs), performance potentials, policy gradients, aggregation, perturbation analysis, POMDPs, policy iteration |
| 2 | Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong |
Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes.  |
Discrete Event Dynamic Systems  |
2004 |
DBLP DOI BibTeX RDF |
rollout, multiclass scheduling, simulation, buffer management, partially observable Markov decision process |
| 2 | Shun-Pin Hsu, Aristotle Arapostathis |
Competitive Markov decision processes with partial observation.  |
SMC  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Shun-Pin Hsu, Aristotle Arapostathis |
Strict-sense constrained Markov decision processes.  |
SMC  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Hyeong Soo Chang |
An adaptation of particle swarm optimization for Markov decision processes.  |
SMC  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Graçaliz Pereira Dimuro, Antônio Carlos da Rocha Costa |
Interval-Based Markov Decision Processes for Regulating Interactions Between Two Agents in Multi-agent Systems.  |
PARA  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Julien Burlet, Olivier Aycard, Thierry Fraichard |
Robust Motion Planning using Markov Decision Processes and Quadtree Decomposition.  |
ICRA  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Prashant Doshi, Richard Goodwin, Rama Akkiraju, Kunal Verma |
Dynamic Workflow Composition using Markov Decision Processes.  |
ICWS  |
2004 |
DBLP DOI BibTeX RDF |
|
| 2 | Frank Ciesinski, Marcus Größer |
On Probabilistic Computation Tree Logic.  |
Validation of Stochastic Systems  |
2004 |
DBLP DOI BibTeX RDF |
PCTL, PCTL*, probabilistic deterministic systems, probabilistic nondeterministic systems, quantitative model checking, scheduler, fairness, Markov decision processes, discrete time Markov chains |
| 2 | Xi-Ren Cao |
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning.  |
Discrete Event Dynamic Systems  |
2003 |
DBLP DOI BibTeX RDF |
gradient-based policy iteration, perturbation realization, TD(), Q-learning, Poisson equations, Potentials |
| 2 | Qiying Hu, Jianyong Liu, Wuyi Yue |
Continuous Time Markov Decision Processes with Expected Discounted Total Rewards.  |
International Conference on Computational Science  |
2003 |
DBLP DOI BibTeX RDF |
|
| 2 | Seon Wook Kim, Hyeong Soo Chang |
Parallelizing Parallel Rollout Algorithm for Solving Markov Decision Processes.  |
WOMPAT  |
2003 |
DBLP DOI BibTeX RDF |
|
| 2 | Olivier Buffet, Alain Dutech, François Charpillet |
Automatic generation of an agent's basic behaviors.  |
AAMAS  |
2003 |
DBLP DOI BibTeX RDF |
adaptation, scalability, reinforcement learning, Markov decision processes, complex environments |
| 2 | Bohdana Ratitch, Doina Precup |
Characterizing Markov Decision Processes.  |
ECML  |
2002 |
DBLP DOI BibTeX RDF |
|
| 2 | Mohammad Ghavamzadeh, Sridhar Mahadevan |
A multiagent reinforcement learning algorithm by dynamically merging markov decision processes.  |
AAMAS  |
2002 |
DBLP DOI BibTeX RDF |
|
| 2 | Eyal Even-Dar, Shie Mannor, Yishay Mansour |
PAC Bounds for Multi-armed Bandit and Markov Decision Processes.  |
COLT  |
2002 |
DBLP DOI BibTeX RDF |
|
| 2 | David N. Jansen, Holger Hermanns, Joost-Pieter Katoen |
A Probabilistic Extension of UML Statecharts.  |
FTRTFT  |
2002 |
DBLP DOI BibTeX RDF |
model checking, semantics, probabilities, Markov decision processes, UML statecharts |
| 2 | Hannu Rummukainen, Jorma T. Virtamo |
Polynomial cost approximations in markov decision theory based call admission control.  |
IEEE/ACM Trans. Netw.  |
2001 |
DBLP DOI BibTeX RDF |
network revenue, piecewise polynomial approximation, Markov decision processes, telecommunication network routing, telecommunication congestion control, Broadband networks, connection admission control |
| 2 | Ping Xuan, Victor R. Lesser, Shlomo Zilberstein |
Communication in Multi-Agent Markov Decision Processes.  |
ICMAS  |
2000 |
DBLP DOI BibTeX RDF |
|
| 2 | Martin Mundhenk, Judy Goldsmith, Christopher Lusena, Eric Allender |
Complexity of finite-horizon Markov decision process problems.  |
J. ACM  |
2000 |
DBLP DOI BibTeX RDF |
NPPP, computational complexity, Markov decision processes, NP, PL, partially observable Markov decision processes, PSPACE, succinct representations |
| 2 | Pierre Laroche, François Charpillet, René Schott |
Mobile Robotics Planning Using Abstract Markov Decision Processes. (PDF / PS)  |
ICTAI  |
1999 |
DBLP DOI BibTeX RDF |
Markov Decision Process, planning under uncertainty, state aggregation |
| 2 | Danièle Beauquier, Dima Burago, Anatol Slissenko |
On the Complexity of Finite Memory Policies for Markov Decision Processes.  |
MFCS  |
1995 |
DBLP DOI BibTeX RDF |
|
| 1 | Xianping Guo, Yonghui Huang, XinYuan Song |
Linear Programming and Constrained Average Optimality for General Continuous-Time Markov Decision Processes in History-Dependent Policies.  |
SIAM J. Control and Optimization  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Weifen Zhuang, Michael Z. F. Li |
Monotone optimal control for a class of Markov decision processes.  |
European Journal of Operational Research  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Xianping Guo, Liuer Ye, George Yin |
A mean-variance optimization problem for discounted Markov decision processes.  |
European Journal of Operational Research  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Moser Silva Fagundes, Sascha Ossowski, Michael Luck, Simon Miles |
Using Normative Markov Decision Processes for evaluating electronic contracts.  |
AI Commun.  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Lucian Busoniu, Rémi Munos |
Optimistic planning for Markov decision processes.  |
Journal of Machine Learning Research - Proceedings Track  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Sachin Adlakha, Sanjay Lall, Andrea Goldsmith |
Networked Markov Decision Processes With Delays.  |
IEEE Trans. Automat. Contr.  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Bruno Scherrer |
On the Use of Non-Stationary Policies for Infinite-Horizon Discounted Markov Decision Processes  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Kousha Etessami, Alistair Stewart, Mihalis Yannakakis |
Polynomial Time Algorithms for Branching Markov Decision Processes and Probabilistic Min(Max) Polynomial Bellman Equations  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Krishnendu Chatterjee, Manas Joglekar, Nisarg Shah |
Average Case Analysis of the Classical Algorithm for Markov Decision Processes with Büchi Objectives  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Takayuki Osogami |
Iterated risk measures for risk-sensitive Markov decision processes with discounted cost  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Shalabh Bhatnagar, K. Lakshmanan |
An Online Actor-Critic Algorithm with Function Approximation for Constrained Markov Decision Processes.  |
J. Optimization Theory and Applications  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Qingda Wei, Xianping Guo |
New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces.  |
J. Optimization Theory and Applications  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | A. Vozikis, J. E. Goulionis, V. K. Benos |
The partially observable Markov decision processes in healthcare: an application to patients with ischemic heart disease (IHD).  |
Operational Research  |
2012 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 674 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ >>] |
|