FacetedDBLP


Searching all metadata for the phrase "Policy-gradient" (changed automatically), with no syntactic query expansion.

Publication years (Num. hits)
1999-2003 (18) 2004-2005 (19) 2006-2007 (25) 2008 (25) 2009-2010 (18) 2011-2012 (12)
Publication types (Num. hits)
article(27) incollection(2) inproceedings(88)
GrowBag graphs for keyword (Num. hits/coverage)
The graphs summarize 23 occurrences of 20 keywords.

Results
Found 117 publication records. Showing 117 according to the selection in the facets
Hits | Authors | Title | Venue | Year | Author keywords
3 | Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya | A New Natural Policy Gradient by Stationary Distribution Metric. | ECML/PKDD | 2008 | policy gradient reinforcement learning, Riemannian metric matrix, Markov decision process, natural gradient
2 | Emmanuel Daucé | A Model of Neuronal Specialization Using Hebbian Policy-Gradient with "Slow" Noise. | ICANN | 2009
2 | Abdeslam Boularias, Brahim Chaib-draa | Predictive representations for policy gradient in POMDPs. | ICML | 2009
2 | Thomas Rückstieß, Martin Felder, Jürgen Schmidhuber | State-Dependent Exploration for Policy Gradient Methods. | ECML/PKDD | 2008
2 | Maarten Peeters, Ville Könönen, Katja Verbeeck, Ann Nowé | A Learning Automata Approach to Multi-agent Policy Gradient Learning. | KES | 2008
2 | Yu Hiei, Takeshi Mori, Shin Ishii | Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments. | ICANN | 2008
2 | Tomoya Tamei, Tomohiro Shibata | Policy Gradient Learning of Cooperative Interaction with a Robot Using User's Biological Signals. | ICONIP | 2008
2 | Nguyen Hoang Viet, Ngo Anh Vien, TaeChoong Chung | Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks. | ICNSC | 2008
2 | Seiji Ishihara, Harukazu Igarashi | Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies. | PRICAI | 2008
2 | Harukazu Igarashi, K. Nakamura, Seiji Ishihara | Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks. | IJCNN | 2008
2 | Andrea Cherubini, Francesca Giannone, Luca Iocchi, Pier Francesco Palamara | An extended policy gradient algorithm for robot task learning. | IROS | 2007
2 | Daan Wierstra, Jürgen Schmidhuber | Policy Gradient Critics. | ECML | 2007
2 | Dongbing Gu, Erfu Yang | Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems. | Journal of Intelligent and Robotic Systems | 2007 | flocking behavior, policy gradient reinforcement learning, cooperative control, multi-agent reinforcement learning
2 | Jan Peters, Stefan Schaal | Policy Gradient Methods for Robotics. | IROS | 2006
2 | Yutaka Nakamura, Takeshi Mori, Shin Ishii | An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process. | ICANN | 2005
2 | Yutaka Nakamura, Takeshi Mori, Shin Ishii | Natural Policy Gradient Reinforcement Learning for a CPG Control of a Biped Robot. | PPSN | 2004
2 | Nate Kohl, Peter Stone | Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion. | ICRA | 2004
2 | Ville Könönen | Policy Gradient Method for Team Markov Games. | IDEAL | 2004
2 | Bikramjit Banerjee, Jing Peng | Adaptive policy gradient in multiagent learning. | AAMAS | 2003 | gradient ascent learning, game theory, nash equilibria
1 | Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama | Analysis and improvement of policy gradient estimation. | Neural Networks | 2012
1 | Ngo Anh Vien, Hwanjo Yu, TaeChoong Chung | Hessian matrix distribution for Bayesian policy gradient reinforcement learning. | Inf. Sci. | 2011
1 | Jervis Pinto, Alan Fern, Tim Bauer, Martin Erwig | Improving Policy Gradient Estimates with Influence Information. | Journal of Machine Learning Research - Proceedings Track | 2011
1 | Peter L. Bartlett, Jonathan Baxter | Infinite-Horizon Policy-Gradient Estimation | CoRR | 2011
1 | Michael Fairbank, Eduardo Alonso | The Local Optimality of Reinforcement Learning by Value Gradients, and its Relationship to Policy Gradient Learning | CoRR | 2011
1 | Peter L. Bartlett, Jonathan Baxter, Lex Weaver | Experiments with Infinite-Horizon, Policy-Gradient Estimation | CoRR | 2011
1 | Kfir Y. Levy, Nahum Shimkin | Unified Inter and Intra Options Learning Using Policy Gradient Methods. | EWRL | 2011
1 | Seiji Ishihara, Harukazu Igarashi | Policy Gradient Reinforcement Learning with Environmental Dynamics and Action-Values in Policies. | KES | 2011
1 | Hunor Jakab, Lehel Csató | Improving Gaussian Process Value Function Approximation in Policy Gradient Algorithms. | ICANN | 2011
1 | Mark Crowley, David Poole | Policy Gradient Planning for Environmental Decision Making with Existing Simulators. | AAAI | 2011
1 | Philip S. Thomas | Policy Gradient Coagent Networks. | NIPS | 2011
1 | Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama | Analysis and Improvement of Policy Gradient Estimation. | NIPS | 2011
1 | Andrea Cherubini, Francesca Giannone, Luca Iocchi, Daniele Nardi, Pier Francesco Palamara | Policy gradient learning for quadruped soccer robots. | Robotics and Autonomous Systems | 2010
1 | Ngo Anh Vien, SeungGwan Lee, TaeChoong Chung | Policy Gradient Based Semi-Markov Decision Problems: Approximation and Estimation Errors. | IEICE Transactions | 2010
1 | Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Jan Peters, Kenji Doya | Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning. | Neural Computation | 2010
1 | Jan Peters | Policy gradient methods. | Scholarpedia | 2010
1 | Yan-Jie Li, Fang Cao 0003, Xi-Ren Cao | On-Line Policy Gradient Estimation with Multi-Step Sampling. | Discrete Event Dynamic Systems | 2010
1 | Jan Peters, J. Andrew Bagnell | Policy Gradient Methods. | Encyclopedia of Machine Learning | 2010
1 | John W. Roberts, Lionel Moret, Jun Zhang, Russ Tedrake | Motor Learning at Intermediate Reynolds Number: Experiments with Policy Gradient on the Flapping Flight of a Rigid Wing. | From Motor Learning to Interaction Learning in Robots | 2010
1 | Atsushi Miyamae, Yuichi Nagata, Isao Ono, Shigenobu Kobayashi | Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks. | NIPS | 2010
1 | Jie Tang, Pieter Abbeel | On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient. | NIPS | 2010
1 | Andrea Cherubini, Francesca Giannone, Luca Iocchi, M. Lombardo, Giuseppe Oriolo | Policy gradient learning for a humanoid soccer robot. | Robotics and Autonomous Systems | 2009
1 | Ngo Anh Vien, Nguyen Hoang Viet, SeungGwan Lee, TaeChoong Chung | Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks. | IEICE Transactions | 2009
1 | Eleni Vasilaki, Nicolas Frémaux, Robert Urbanczik, Walter Senn, Wulfram Gerstner | Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail. | PLoS Computational Biology | 2009
1 | Olivier Buffet, Douglas Aberdeen | The factored policy-gradient planner. | Artif. Intell. | 2009
1 | Henning Sprekeler, Guillaume Hennequin, Wulfram Gerstner | Code-specific policy gradient rules for spiking neurons. | NIPS | 2009
1 | Verena Heidrich-Meisner, Christian Igel | Uncertainty handling CMA-ES for reinforcement learning. | GECCO | 2009 | covariance matrix adaptation evolution strategy, direct policy search, reinforcement learning, uncertainty handling
1 | David Silver, Gerald Tesauro | Monte-Carlo simulation balancing. | ICML | 2009
1 | Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng | Learning CPG-based Biped Locomotion with a Policy Gradient Method: Application to a Humanoid Robot. | I. J. Robotic Res. | 2008
1 | Huaxiang Zhang, Ying Fan | An adaptive policy gradient in learning Nash equilibria. | Neurocomputing | 2008
1 | Francisco S. Melo | Exploiting locality of interactions using a policy-gradient approach in multiagent learning. | ECAI | 2008
1 | Andres El-Fakdi, Marc Carreras | Policy gradient based Reinforcement Learning for real autonomous underwater cable tracking. | IROS | 2008
1 | Verena Heidrich-Meisner, Christian Igel | Similarities and differences between policy gradient methods and evolution strategies. | ESANN | 2008
1 | Ngo Anh Vien, TaeChoong Chung | Policy Gradient Semi-markov Decision Process. | ICTAI | 2008
1 | Pierre-Arnaud Coquelin, Romain Deguest, Rémi Munos | Particle Filter-based Policy Gradient in POMDPs. | NIPS | 2008
1 | John W. Roberts, Russ Tedrake | Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms. | NIPS | 2008
1 | Kristian Kersting, Kurt Driessens | Non-parametric policy gradients: a unified treatment of propositional and relational domains. | ICML | 2008
1 | Matthew Zucker, James Kuffner, James A. Bagnell | Adaptive workspace biasing for sampling-based planners. | ICRA | 2008
1 | Chenfeng Xu, Jian Yang, Hongsheng Xi, Qi Jiang, Baoqun Yin | Event-related optimization for a class of resource location with admission control. | IJCNN | 2008
1 | Matthias Rungger, Hao Ding, Olaf Stursberg | Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning. | ABiALS | 2008 | hybrid automaton, behavioral programming, artificial intelligence, Reinforcement learning, planning, hierarchical model
1 | Sertan Girgin, Philippe Preux | Basis Expansion in Natural Actor Critic Methods. | EWRL | 2008
1 | Jan Peters, Jens Kober, Duy Nguyen-Tuong | Policy Learning - A Unified Perspective with Applications in Robotics. | EWRL | 2008
1 | Frank Sehnke, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, Jürgen Schmidhuber | Policy Gradients with Parameter-Based Exploration for Control. | ICANN | 2008
1 | Yuki Taniguchi, Takeshi Mori, Shin Ishii | A Continuous Internal-State Controller for Partially Observable Markov Decision Processes. | ICANN | 2008
1 | Verena Heidrich-Meisner, Christian Igel | Evolution Strategies for Direct Policy Search. | PPSN | 2008
1 | Sumeetpal S. Singh, Vladislav B. Tadic, Arnaud Doucet | A policy gradient method for semi-Markov decision processes with application to call admission control. | European Journal of Operational Research | 2007
1 | Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya | Learning a dynamic policy by using policy gradient: application to biped walking. | Systems and Computers in Japan | 2007
1 | Olivier Buffet, Douglas Aberdeen | FF + FPG: Guiding a Policy-Gradient Planner. | ICAPS | 2007
1 | Olivier Buffet, Alain Dutech, François Charpillet | Shaping multi-agent systems with gradient reinforcement learning. | Autonomous Agents and Multi-Agent Systems | 2007 | Policy-gradient, Multi-agent systems, Reinforcement learning, Shaping, Partially observable Markov decision processes
1 | Mohammad Ghavamzadeh, Yaakov Engel | Bayesian actor-critic algorithms. | ICML | 2007
1 | Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanathan | Conditional random fields for multi-agent reinforcement learning. | ICML | 2007
1 | Diego E. Pardo Ayala, Cecilio Angulo Bahón | Understanding Sensori-motor Coordination during a Humanoid Robot Dynamic Task. | FUZZ-IEEE | 2007
1 | Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Sang-Ho Hyon, Joshua G. Hale, Gordon Cheng | Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks. | ICRA | 2007
1 | Pawel Wawrzynski | Reinforcement Learning in Fine Time Discretization. | ICANNGA | 2007
1 | Daniel Schneegaß, Steffen Udluft, Thomas Martinetz | Improving Optimality of Neural Rewards Regression for Data-Efficient Batch Near-Optimal Policy Identification. | ICANN | 2007
1 | Yuki Taniguchi, Takeshi Mori, Shin Ishii | Reinforcement Learning for Cooperative Actions in a Partially Observable Multi-agent System. | ICANN | 2007
1 | Daan Wierstra, Alexander Förster, Jan Peters, Jürgen Schmidhuber | Solving Deep Memory POMDPs with Recurrent Policy Gradients. | ICANN | 2007
1 | Eiji Uchibe, Kenji Doya | Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents. | ICONIP | 2007
1 | Diego E. Pardo, Cecilio Angulo | Emerging Behaviors by Learning Joint Coordination in Articulated Mobile Robots. | IWANN | 2007 | Sensor-Motor control, Coordination, Reinforcement Learning, Cognitive Robotics
1 | Andrea Cherubini, Francesca Giannone, Luca Iocchi | Layered Learning for a Soccer Legged Robot Helped with a 3D Simulator. | RoboCup | 2007
1 | Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya | Learning CPG-based biped locomotion with a policy gradient method. | Robotics and Autonomous Systems | 2006
1 | Rémi Munos | Policy Gradient in Continuous Time. | Journal of Machine Learning Research | 2006
1 | Seiji Ishihara, Harukazu Igarashi | Applying the policy gradient method to behavior learning in multiagent systems: The pursuit problem. | Systems and Computers in Japan | 2006
1 | Mohammad Ghavamzadeh, Yaakov Engel | Bayesian Policy Gradient Algorithms. | NIPS | 2006
1 | Xuening Wang, Wei Chen 0009, Daxue Liu, Tao Wu, Hangen He | The Optimality Analysis of Hybrid Reinforcement Learning Combined with SVMs. | ISDA | 2006
1 | Manish Saggar, Thomas D'Silva, Nate Kohl, Peter Stone | Autonomous Learning of Stable Quadruped Locomotion. | RoboCup | 2006
1 | Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya | Learning CPG-based biped locomotion with a policy gradient method. | Humanoids | 2005
1 | Rémi Munos | Policy gradient in continuous time. | CAP | 2005
1 | Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura, Shin Ishii | On-line learning of a feedback controller for quasi-passive-dynamic walking by a stochastic policy gradient method. | IROS | 2005
1 | Noriaki Mitsunaga, Christian Smith, Takayuki Kanda, Hiroshi Ishiguro, Norihiro Hagita | Robot behavior adaptation for human-robot interaction based on policy gradient reinforcement learning. | IROS | 2005
1 | Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya | Learning Sensory Feedback to CPG with Policy Gradient for Biped Locomotion. | ICRA | 2005
1 | Huizhen Yu | A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies. | UAI | 2005
1 | Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng | Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid. | AAAI | 2005
1 | Nicol N. Schraudolph, Douglas Aberdeen, Jin Yu | Fast Online Policy Gradient Learning with SMD Gain Vector Adaptation. | NIPS | 2005
1 | Douglas Aberdeen | Policy-Gradient Methods for Planning. | NIPS | 2005
1 | Zonghua Zhang, Hong Shen | Constructing Multi-Layered Boundary to Defend Against Intrusive Anomalies: An Autonomic Detection Coordinator. | DSN | 2005
1 | Jooyoung Park, Jongho Kim, Daesung Kang | An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. | CIS | 2005
1 | Jan Peters, Sethu Vijayakumar, Stefan Schaal | Natural Actor-Critic. | ECML | 2005
1 | Zonghua Zhang, Hong Shen | Dynamic Combination of Multiple Host-Based Anomaly Detectors with Broader Detection Coverage and Fewer False Alerts. | ICN | 2005
1 | Xi-Ren Cao | Basic Ideas for Event-Based Optimization of Markov Systems. | Discrete Event Dynamic Systems | 2005 | Markov decision processes (MDPs), performance potentials, policy gradients, aggregation, perturbation analysis, POMDPs, policy iteration
1 | Douglas Aberdeen | Filtered Reinforcement Learning. | ECML | 2004
Displaying results #1 - #100 of 117 (100 per page)
Maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.