Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Haci Mehmet Guzey, Hao Xu 0002, Sarangapani Jagannathan |
Neural network-based adaptive optimal consensus control of leaderless networked mobile robots. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Lucian Busoniu, Rémi Munos, Elod Páll |
An analysis of optimistic, best-first search for minimax sequential decision making. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Li-Bing Wu, Dan Ye 0001, Xin-Gang Zhao |
Adaptive fault identification for a class of nonlinear dynamic systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Avimanyu Sahoo, Hao Xu 0002, Sarangapani Jagannathan |
Event-based optimal regulator design for nonlinear networked control systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Lei Liu 0006, Zhanshan Wang, Zhengwei Shen |
Neural-network-based adaptive dynamic surface control for MIMO systems with unknown hysteresis. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Hao Xu 0002, Sarangapani Jagannathan |
Model-free Q-learning over finite horizon for uncertain linear continuous-time systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Hengshuai Yao, Csaba Szepesvári, Bernardo Ávila Pires, Xinhua Zhang |
Pseudo-MDPs and factored linear action models. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Taishi Fujita, Toshimitsu Ushio |
Reinforcement learning-based optimal control considering L computation time delay of linear discrete-time systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Minwoo Lee 0001, Charles W. Anderson |
Convergent reinforcement learning control with neural networks and continuous action search. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Yunpeng Pan, Evangelos A. Theodorou |
Nonparametric infinite horizon Kullback-Leibler stochastic control. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Joschka Boedecker, Jost Tobias Springenberg, Jan Wülfing, Martin A. Riedmiller |
Approximate real-time optimal control based on sparse Gaussian process models. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Abhijit Gosavi, Sajal K. Das 0001, Susan L. Murray |
Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Marco A. Wiering, Maikel Withagen, Madalina M. Drugan |
Model-based multi-objective reinforcement learning. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Eugene A. Feinberg, Pavlo O. Kasyanov, Michael Z. Zgurovsky |
Convergence of value iterations for total-cost MDPs and POMDPs with general state and action sets. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He |
Data-driven partially observable dynamic processes using adaptive dynamic programming. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Hadrien Glaude, Olivier Pietquin, Cyrille Enderli |
Subspace identification for predictive state representation by nuclear norm minimization. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Simon Haykin 0001, Ashkan Amiri, Mehdi Fatemi |
Cognitive control in cognitive dynamic systems: A new way of thinking inspired by the brain. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Madalina M. Drugan, Ann Nowé, Bernard Manderick |
Pareto Upper Confidence Bounds algorithms: An empirical study. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Simone Parisi, Matteo Pirotta, Nicola Smacchia, Luca Bascetta, Marcello Restelli |
Policy gradient approaches for multi-objective sequential decision making: A comparison. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Wei Sun 0032, Evangelos A. Theodorou, Panagiotis Tsiotras |
Continuous-time differential dynamic programming with terminal constraints. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Balázs Csanád Csáji, András Kovács, József Váncza |
Adaptive aggregated predictions for renewable energy systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Daniel L. Elliott, Charles Anderson 0001 |
Using supervised training signals of observable state dynamics to speed-up and improve reinforcement learning. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Saba Q. Yahyaa, Madalina M. Drugan, Bernard Manderick |
Annealing-Pareto multi-objective multi-armed bandit algorithm. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Yuhai Hu, Boris Defourny |
Near-optimality bounds for greedy periodic policies with application to grid-level storage. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Qinglai Wei, Derong Liu 0001, Guang Shi, Yu Liu, Qiang Guan |
Optimal self-learning battery control in smart residential grids by iterative Q-learning algorithm. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | |
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014, Orlando, FL, USA, December 9-12, 2014 |
ADPRL |
2014 |
DBLP BibTeX RDF |
|
1 | Seyed Reza Ahmadzadeh, Petar Kormushev, Darwin G. Caldwell |
Multi-objective reinforcement learning for AUV thruster failure recovery. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Oktay Arslan, Evangelos A. Theodorou, Panagiotis Tsiotras |
Information-theoretic stochastic optimal control via incremental sampling-based algorithms. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Ahmad A. Al-Talabi, Howard M. Schwartz 0001 |
A two stage learning technique for dual learning in the pursuit-evasion differential game. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Xiaohong Cui, Yanhong Luo, Huaguang Zhang |
An adaptive dynamic programming algorithm to solve optimal control of uncertain nonlinear systems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Yuanheng Zhu, Dongbin Zhao |
A data-based online reinforcement learning algorithm with high-efficient exploration. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Sumit Kumar Jha 0004, Shubhendu Bhasin |
On-policy Q-learning for adaptive optimal control. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Yang Liu 0077, Yanhong Luo, Huaguang Zhang |
Adaptive dynamic programming for discrete-time LQR optimal tracking control problems with unknown dynamics. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Vincent François-Lavet, Raphaël Fonteneau, Damien Ernst |
Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Dominik Meyer, Rémy Degenne, Ahmed Omrane, Hao Shen |
Accelerated gradient temporal difference learning algorithms. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Ali Heydari |
Theoretical analysis of a reinforcement learning based switching scheme. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Martin W. Allen, David Hahn, Douglas C. MacFarland |
Heuristics for multiagent reinforcement learning in decentralized decision problems. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Deon Garrett, Jordi Bieger, Kristinn R. Thórisson |
Tunable and generic problem instance generation for multi-objective reinforcement learning. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Daniel R. Jiang, Thuy V. Pham, Warren B. Powell, Daniel F. Salas, Warren R. Scott |
A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Timothé Collet, Olivier Pietquin |
Active learning for classification: An optimistic approach. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Yanhong Luo, Geyang Xiao |
ADP-based optimal control for a class of nonlinear discrete-time systems with inequality constraints. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Regina Padmanabhan, Nader Meskin, Wassim M. Haddad |
Closed-loop control of anesthesia and mean arterial pressure using reinforcement learning. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Xiaofeng Lin, Qiang Ding, Weikai Kong, Chunning Song, Qingbao Huang |
Adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration. |
ADPRL |
2014 |
DBLP DOI BibTeX RDF |
|
1 | Raphaël Fonteneau, Lucian Busoniu, Rémi Munos |
Optimistic planning for belief-augmented Markov Decision Processes. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Teck-Hou Teng, Ah-Hwee Tan |
Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Mostafa D. Awheda, Howard M. Schwartz 0001 |
Exponential moving average Q-learning algorithm. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Sachiko Soga, Ichiro Kobayashi |
A study on the efficiency of learning a robot controller in various environments. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Hisashi Handa |
On the coordination system for the dimensionality-reduced inputs of Mario. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Luuk Bom, Ruud Henken, Marco A. Wiering |
Reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Xiaofeng Lin, Nuyun Cao, Yuzhang Lin |
Optimal control for a class of nonlinear systems with state delay based on Adaptive Dynamic Programming with ε-error bound. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Evangelos A. Theodorou, Jiri Najemnik, Emanuel Todorov |
Free energy based policy gradients. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Michiel van der Ree, Marco A. Wiering |
Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Toshiyuki Yasuda, Nanami Wada, Kazuhiro Ohkura, Yoshiyuki Matsumura |
Analyzing collective behavior in evolutionary swarm robotic systems based on an ethological approach. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Xiong Luo, Jennie Si, Yuchao Zhou |
An integrated design for intensified direct heuristic dynamic programming. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Zhanshan Wang, Fufei Chu, Hongjing Liang, Huaguang Zhang |
Fault accommodation for complete synchronization of complex neural networks. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Hao Xu 0002, Sarangapani Jagannathan |
Finite horizon stochastic optimal control of uncertain linear networked control system. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Chunbin Qin, Huaguang Zhang, Yanhong Luo |
Adaptive optimal control for nonlinear discrete-time systems. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Yujiao Huang, Huaguang Zhang, Dongsheng Yang 0001 |
Local stability analysis of high-order recurrent neural networks with multi-step piecewise linear activation functions. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Mingyuan Zhong 0002, M. Johnson, Yuval Tassa, Tom Erez, Emo Todorov |
Value function approximation and model predictive control. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Jian Wang 0011, Zhenhua Huang 0004, Xin Xu 0001 |
A novel approach for constructing basis functions in approximate dynamic programming for feedback control. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | |
Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013, IEEE Symposium Series on Computational Intelligence (SSCI), 16-19 April 2013, Singapore |
ADPRL |
2013 |
DBLP BibTeX RDF |
|
1 | Yifan Cai, Simon X. Yang, Xin Xu 0001 |
A combined hierarchical reinforcement learning based approach for multi-robot cooperative target searching in complex unknown environments. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Ruizhuo Song, Wendong Xiao, Yanhong Luo |
Optimal control for a class of nonlinear system with controller constraints based on finite-approximation-errors ADP algorithm. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Donghun Lee, Boris Defourny, Warren B. Powell |
Bias-corrected Q-learning to control max-operator bias in Q-learning. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Tobias Jung, Damien Ernst, Francis Maes |
Optimized look-ahead trees: Extensions to large and continuous action spaces. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Kristof Van Moffaert, Madalina M. Drugan, Ann Nowé |
Scalarized multi-objective reinforcement learning: Novel design techniques. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Robert Lowe, Tom Ziemke |
Exploring the relationship of reward and punishment in reinforcement learning. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Qi-ming Fu 0001, Quan Liu, Fei Xiao, Guixin Chen |
The second order temporal difference error for Sarsa(λ). |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Lucian Busoniu, Alexander Daniels, Rémi Munos, Robert Babuska |
Optimistic planning for continuous-action deterministic systems. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Qiming Zhao, Hao Xu 0002, Sarangapani Jagannathan |
Finite-horizon optimal control design for uncertain linear discrete-time systems. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu 0001 |
Real-time tracking on adaptive critic design with uniformly ultimately bounded condition. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | A. Y. F. Lau, Dipti Srinivasan, Thomas Reindl |
A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces. |
ADPRL |
2013 |
DBLP DOI BibTeX RDF |
|
1 | Abdeslem Boukhtouta, Jean Berger, Warren B. Powell, Abraham P. George |
An adaptive-learning framework for semi-cooperative multi-agent coordination. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthew J. Reindorp, Michael C. Fu 0001 |
Dynamic lead time promising. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Marco A. Wiering, Hado van Hasselt, Auke-Dirk Pietersma, Lambert Schomaker |
Reinforcement learning algorithms for solving classification problems. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Suman Chakravorty, Richard Scott Erwin |
Information space receding horizon control. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel, Damien Ernst |
Active exploration by searching for experiments that falsify the computed control policy. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Mingyuan Zhong 0002, Emanuel Todorov |
Moving least-squares approximations for linearly-solvable MDP. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | |
2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, ADPRL 2011, Paris, France, April 12-14, 2011 |
ADPRL |
2011 |
DBLP BibTeX RDF |
|
1 | Hassan Zargarzadeh, Sarangapani Jagannathan, James A. Drallmeier |
Online near optimal control of unknown nonaffine systems with application to HCCI engines. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Andreas Witsch, Roland Reichle, Kurt Geihs, Sascha Lange, Martin A. Riedmiller |
Enhancing the episodic natural actor-critic algorithm by a regularisation term to stabilize learning of control structures. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Lucian Busoniu, Rémi Munos, Bart De Schutter, Robert Babuska |
Optimistic planning for sparsely stochastic systems. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Kun Deng, Joelle Pineau, Susan A. Murphy |
Active learning for personalizing treatment. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Abhijit Gosavi, Susan L. Murray, Jiaqiao Hu |
Model-building semi-Markov adaptive critics. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | George G. Lendaris |
Higher-level application of Adaptive Dynamic Programming/Reinforcement Learning - a next phase for controls and system identification? |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Dongbin Zhao, Zhaohui Hu |
Supervised adaptive dynamic programming based adaptive cruise control. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Jian Fu, Haibo He, Zhen Ni |
Adaptive dynamic programming with balanced weights seeking strategy. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Daniel A. Braun 0001, Pedro A. Ortega, Evangelos A. Theodorou, Stefan Schaal |
Path integral control and bounded rationality. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Ilya O. Ryzhov, Warren B. Powell |
Bayesian active learning with basis functions. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Alex Simpkins, Emanuel Todorov |
Complex object manipulation with hierarchical optimal control. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Shivaram Kalyanakrishnan, Peter Stone |
On learning with imperfect representations. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Vishnuteja Nanduri |
Application of reinforcement learning-based algorithms in CO2 allowance and electricity markets. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Petru Emanuel Stingu, Frank L. Lewis |
An approximate Dynamic Programming based controller for an underactuated 6DoF quadrotor. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Derong Liu 0001, Ding Wang 0001, Dongbin Zhao |
Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Mohsen Davarynejad, Jelmer van Ast, Jos L. M. Vrancken, Jan van den Berg |
Evolutionary value function approximation. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Xin Zhang, Huaguang Zhang, Lili Cui, Yanhong Luo |
Global optimal strategies of a class of finite-horizon continuous-time nonaffine nonlinear zero-sum game using a new iteration algorithm. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Lucian Busoniu, Damien Ernst, Bart De Schutter, Robert Babuska |
Approximate reinforcement learning: An overview. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Yuval Tassa, Emanuel Todorov |
High-order local dynamic programming. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthieu Geist, Olivier Pietquin |
Parametric value function approximation: A unified view. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Martino Migliavacca, Alessio Pecorino, Matteo Pirotta, Marcello Restelli, Andrea Bonarini |
Fitted policy search. |
ADPRL |
2011 |
DBLP DOI BibTeX RDF |
|