The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Publications at "ADPRL"( http://dblp.L3S.de/Venues/ADPRL )

URL (DBLP): http://dblp.uni-trier.de/db/conf/adprl

Publication years (Num. hits)
2009 (35) 2011 (46) 2013 (29) 2014 (43)
Publication types (Num. hits)
inproceedings(149) proceedings(4)
Venues (Conferences, Journals, ...)
ADPRL(153)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
No Growbag Graphs found.

Results
Found 153 publication records. Showing 153 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
1Haci Mehmet Guzey, Hao Xu 0002, Sarangapani Jagannathan Neural network-based adaptive optimal consensus control of leaderless networked mobile robots. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Lucian Busoniu, Rémi Munos, Elod Páll An analysis of optimistic, best-first search for minimax sequential decision making. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Li-Bing Wu, Dan Ye 0001, Xin-Gang Zhao Adaptive fault identification for a class of nonlinear dynamic systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Avimanyu Sahoo, Hao Xu 0002, Sarangapani Jagannathan Event-based optimal regulator design for nonlinear networked control systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Lei Liu 0006, Zhanshan Wang, Zhengwei Shen Neural-network-based adaptive dynamic surface control for MIMO systems with unknown hysteresis. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Hao Xu 0002, Sarangapani Jagannathan Model-free Q-learning over finite horizon for uncertain linear continuous-time systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Hengshuai Yao, Csaba Szepesvári, Bernardo Ávila Pires, Xinhua Zhang Pseudo-MDPs and factored linear action models. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Taishi Fujita, Toshimitsu Ushio Reinforcement learning-based optimal control considering L computation time delay of linear discrete-time systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Minwoo Lee 0001, Charles W. Anderson Convergent reinforcement learning control with neural networks and continuous action search. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Yunpeng Pan, Evangelos A. Theodorou Nonparametric infinite horizon Kullback-Leibler stochastic control. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Joschka Boedecker, Jost Tobias Springenberg, Jan Wülfing, Martin A. Riedmiller Approximate real-time optimal control based on sparse Gaussian process models. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Abhijit Gosavi, Sajal K. Das 0001, Susan L. Murray Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Marco A. Wiering, Maikel Withagen, Madalina M. Drugan Model-based multi-objective reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Eugene A. Feinberg, Pavlo O. Kasyanov, Michael Z. Zgurovsky Convergence of value iterations for total-cost MDPs and POMDPs with general state and action sets. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Xiangnan Zhong, Zhen Ni, Yufei Tang, Haibo He Data-driven partially observable dynamic processes using adaptive dynamic programming. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Hadrien Glaude, Olivier Pietquin, Cyrille Enderli Subspace identification for predictive state representation by nuclear norm minimization. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Simon Haykin 0001, Ashkan Amiri, Mehdi Fatemi Cognitive control in cognitive dynamic systems: A new way of thinking inspired by the brain. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Madalina M. Drugan, Ann Nowé, Bernard Manderick Pareto Upper Confidence Bounds algorithms: An empirical study. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Simone Parisi, Matteo Pirotta, Nicola Smacchia, Luca Bascetta, Marcello Restelli Policy gradient approaches for multi-objective sequential decision making: A comparison. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Wei Sun 0032, Evangelos A. Theodorou, Panagiotis Tsiotras Continuous-time differential dynamic programming with terminal constraints. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Balázs Csanád Csáji, András Kovács, József Váncza Adaptive aggregated predictions for renewable energy systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Daniel L. Elliott, Charles Anderson 0001 Using supervised training signals of observable state dynamics to speed-up and improve reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Saba Q. Yahyaa, Madalina M. Drugan, Bernard Manderick Annealing-pareto multi-objective multi-armed bandit algorithm. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Yuhai Hu, Boris Defourny Near-optimality bounds for greedy periodic policies with application to grid-level storage. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Qinglai Wei, Derong Liu 0001, Guang Shi, Yu Liu, Qiang Guan Optimal self-learning battery control in smart residential grids by iterative Q-learning algorithm. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014, Orlando, FL, USA, December 9-12, 2014 Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  BibTeX  RDF
1Seyed Reza Ahmadzadeh, Petar Kormushev, Darwin G. Caldwell Multi-objective reinforcement learning for AUV thruster failure recovery. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Oktay Arslan, Evangelos A. Theodorou, Panagiotis Tsiotras Information-theoretic stochastic optimal control via incremental sampling-based algorithms. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Ahmad A. Al-Talabi, Howard M. Schwartz 0001 A two stage learning technique for dual learning in the pursuit-evasion differential game. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Xiaohong Cui, Yanhong Luo, Huaguang Zhang An adaptive dynamic programming algorithm to solve optimal control of uncertain nonlinear systems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Yuanheng Zhu, Dongbin Zhao A data-based online reinforcement learning algorithm with high-efficient exploration. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Sumit Kumar Jha 0004, Shubhendu Bhasin On-policy Q-learning for adaptive optimal control. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Yang Liu 0077, Yanhong Luo, Huaguang Zhang Adaptive dynamic programming for discrete-time LQR optimal tracking control problems with unknown dynamics. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Vincent François-Lavet, Raphaël Fonteneau, Damien Ernst Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Dominik Meyer, Rémy Degenne, Ahmed Omrane, Hao Shen Accelerated gradient temporal difference learning algorithms. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Ali Heydari Theoretical analysis of a reinforcement learning based switching scheme. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Martin W. Allen, David Hahn, Douglas C. MacFarland Heuristics for multiagent reinforcement learning in decentralized decision problems. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Deon Garrett, Jordi Bieger, Kristinn R. Thórisson Tunable and generic problem instance generation for multi-objective reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Daniel R. Jiang, Thuy V. Pham, Warren B. Powell, Daniel F. Salas, Warren R. Scott A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work? Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Timothé Collet, Olivier Pietquin Active learning for classification: An optimistic approach. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Yanhong Luo, Geyang Xiao ADP-based optimal control for a class of nonlinear discrete-time systems with inequality constraints. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Regina Padmanabhan, Nader Meskin, Wassim M. Haddad Closed-loop control of anesthesia and mean arterial pressure using reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Xiaofeng Lin, Qiang Ding, Weikai Kong, Chunning Song, Qingbao Huang Adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Raphaël Fonteneau, Lucian Busoniu, Rémi Munos Optimistic planning for belief-augmented Markov Decision Processes. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Teck-Hou Teng, Ah-Hwee Tan Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Mostafa D. Awheda, Howard M. Schwartz 0001 Exponential moving average Q-learning algorithm. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Sachiko Soga, Ichiro Kobayashi A study on the efficiency of learning a robot controller in various environments. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Hisashi Handa On the coordination system for the dimensionality-reduced inputs of mario. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Luuk Bom, Ruud Henken, Marco A. Wiering Reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Xiaofeng Lin, Nuyun Cao, Yuzhang Lin Optimal control for a class of nonlinear systems with state delay based on Adaptive Dynamic Programming with ε-error bound. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Evangelos A. Theodorou, Jiri Najemnik, Emanuel Todorov Free energy based policy gradients. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Michiel van der Ree, Marco A. Wiering Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Toshiyuki Yasuda, Nanami Wada, Kazuhiro Ohkura, Yoshiyuki Matsumura Analyzing collective behavior in evolutionary swarm robotic systems based on an ethological approach. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Xiong Luo, Jennie Si, Yuchao Zhou An integrated design for intensified direct heuristic dynamic programming. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Zhanshan Wang, Fufei Chu, Hongjing Liang, Huaguang Zhang Fault accommodation for complete synchronization of complex neural networks. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Hao Xu 0002, Sarangapani Jagannathan Finite horizon stochastic optimal control of uncertain linear networked control system. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Chunbin Qin, Huaguang Zhang, Yanhong Luo Adaptive optimal control for nonlinear discrete-time systems. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Yujiao Huang, Huaguang Zhang, Dongsheng Yang 0001 Local stability analysis of high-order recurrent neural networks with multi-step piecewise linear activation functions. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Mingyuan Zhong 0002, M. Johnson, Yuval Tassa, Tom Erez, Emo Todorov Value function approximation and model predictive control. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Jian Wang 0011, Zhenhua Huang 0004, Xin Xu 0001 A novel approach for constructing basis functions in approximate dynamic programming for feedback control. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1 Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013, IEEE Symposium Series on Computational Intelligence (SSCI), 16-19 April 2013, Singapore Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  BibTeX  RDF
1Yifan Cai, Simon X. Yang, Xin Xu 0001 A combined hierarchical reinforcement learning based approach for multi-robot cooperative target searching in complex unknown environments. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Ruizhuo Song, Wendong Xiao, Yanhong Luo Optimal control for a class of nonlinear system with controller constraints based on finite-approximation-errors ADP algorithm. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Donghun Lee, Boris Defourny, Warren B. Powell Bias-corrected Q-learning to control max-operator bias in Q-learning. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Tobias Jung, Damien Ernst, Francis Maes Optimized look-ahead trees: Extensions to large and continuous action spaces. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Kristof Van Moffaert, Madalina M. Drugan, Ann Nowé Scalarized multi-objective reinforcement learning: Novel design techniques. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Robert Lowe, Tom Ziemke Exploring the relationship of reward and punishment in reinforcement learning. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Qi-ming Fu 0001, Quan Liu, Fei Xiao, Guixin Chen The second order temporal difference error for Sarsa(λ). Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Lucian Busoniu, Alexander Daniels, Rémi Munos, Robert Babuska Optimistic planning for continuous-action deterministic systems. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Qiming Zhao, Hao Xu 0002, Sarangapani Jagannathan Finite-horizon optimal control design for uncertain linear discrete-time systems. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu 0001 Real-time tracking on adaptive critic design with uniformly ultimately bounded condition. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1A. Y. F. Lau, Dipti Srinivasan, Thomas Reindl A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces. Search on Bibsonomy ADPRL The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
1Abdeslem Boukhtouta, Jean Berger, Warren B. Powell, Abraham P. George An adaptive-learning framework for semi-cooperative multi-agent coordination. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Matthew J. Reindorp, Michael C. Fu 0001 Dynamic lead time promising. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Marco A. Wiering, Hado van Hasselt, Auke-Dirk Pietersma, Lambert Schomaker Reinforcement learning algorithms for solving classification problems. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Suman Chakravorty, Richard Scott Erwin Information space receding horizon control. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel, Damien Ernst Active exploration by searching for experiments that falsify the computed control policy. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Mingyuan Zhong 0002, Emanuel Todorov Moving least-squares approximations for linearly-solvable MDP. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, ADPRL 2011, Paris, France, April 12-14, 2011 Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  BibTeX  RDF
1Hassan Zargarzadeh, Sarangapani Jagannathan, James A. Drallmeier Online near optimal control of unknown nonaffine systems with application to HCCI engines. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Andreas Witsch, Roland Reichle, Kurt Geihs, Sascha Lange, Martin A. Riedmiller Enhancing the episodic natural actor-critic algorithm by a regularisation term to stabilize learning of control structures. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Lucian Busoniu, Rémi Munos, Bart De Schutter, Robert Babuska Optimistic planning for sparsely stochastic systems. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Kun Deng, Joelle Pineau, Susan A. Murphy Active learning for personalizing treatment. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Abhijit Gosavi, Susan L. Murray, Jiaqiao Hu Model-building semi-Markov adaptive critics. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1George G. Lendaris Higher-level application of Adaptive Dynamic Programming/Reinforcement Learning - a next phase for controls and system identification? Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Dongbin Zhao, Zhaohui Hu Supervised adaptive dynamic programming based adaptive cruise control. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Jian Fu, Haibo He, Zhen Ni Adaptive dynamic programming with balanced weights seeking strategy. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Daniel A. Braun 0001, Pedro A. Ortega, Evangelos A. Theodorou, Stefan Schaal Path integral control and bounded rationality. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Ilya O. Ryzhov, Warren B. Powell Bayesian active learning with basis functions. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Alex Simpkins, Emanuel Todorov Complex object manipulation with hierarchical optimal control. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Shivaram Kalyanakrishnan, Peter Stone On learning with imperfect representations. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Vishnuteja Nanduri Application of reinforcement learning-based algorithms in CO2 allowance and electricity markets. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Petru Emanuel Stingu, Frank L. Lewis An approximate Dynamic Programming based controller for an underactuated 6DoF quadrotor. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Derong Liu 0001, Ding Wang 0001, Dongbin Zhao Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Mohsen Davarynejad, Jelmer van Ast, Jos L. M. Vrancken, Jan van den Berg Evolutionary value function approximation. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Xin Zhang, Huaguang Zhang, Lili Cui, Yanhong Luo Global optimal strategies of a class of finite-horizon continuous-time nonaffine nonlinear zero-sum game using a new iteration algorithm. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Lucian Busoniu, Damien Ernst, Bart De Schutter, Robert Babuska Approximate reinforcement learning: An overview. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Yuval Tassa, Emanuel Todorov High-order local dynamic programming. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Matthieu Geist, Olivier Pietquin Parametric value function approximation: A unified view. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
1Martino Migliavacca, Alessio Pecorino, Matteo Pirotta, Marcello Restelli, Andrea Bonarini Fitted policy search. Search on Bibsonomy ADPRL The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 153 (100 per page; Change: )
Pages: [1][2][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license