FacetedDBLP


Searching for "critic" in all metadata, with no syntactic query expansion.

Publication years (Num. hits)
1973-1990 (16) 1991-1996 (20) 1997-1999 (23) 2000-2001 (23) 2002 (15) 2003 (17) 2004 (19) 2005 (26) 2006 (20) 2007 (42) 2008 (49) 2009 (30) 2010 (34) 2011 (18) 2012 (27) 2013 (30) 2014 (19) 2015-2016 (43) 2017 (61) 2018 (99) 2019 (146) 2020 (197) 2021 (281) 2022 (265) 2023 (337) 2024 (97)
Publication types (Num. hits)
article (1125) incollection (1) inproceedings (826) phdthesis (2)
GrowBag graphs for keywords (Num. hits/coverage)
The graphs summarize 106 occurrences of 86 keywords.

Results
Found 1954 publication records. Showing 1954 according to the selection in the facets.
Hits | Authors | Title | Venue | Year | Author keywords
112 | Huaglory Tianfield, Ruwen Wang | Critic Systems - Towards Human-Computer Collaborative Problem Solving. | Artif. Intell. Rev. | 2004 | human-computer collaborative problem solving, knowledge-based system, expert system, critic system, human-computer collaboration
101 | Thomas Hanselmann, Lyle Noakes, Anthony Zaknich | Continuous-Time Adaptive Critics. | IEEE Trans. Neural Networks | 2007 | -
88 | Chuan-Kai Lin | Adaptive critic autopilot design of Bank-to-turn missiles using fuzzy basis function networks. | IEEE Trans. Syst. Man Cybern. Part B | 2005 | -
87 | Shamama Anwar, K. Sridhar Patnaik | Actor Critic Learning: A Near Set Approach. | RSCTC | 2008 | ethogram, ethology, actor critic, rough sets, Adaptive learning, approximation space, near sets
75 | Jan Peters 0001, Sethu Vijayakumar, Stefan Schaal | Natural Actor-Critic. | ECML | 2005 | -
63 | Alok Kanti Deb, Jayadeva, Madan Gopal, Suresh Chandra 0001 | SVM-Based Tree-Type Neural Networks as a Critic in Adaptive Critic Designs for Control. | IEEE Trans. Neural Networks | 2007 | -
63 | Jooyoung Park, Jongho Kim, Daesung Kang | An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. | CIS (1) | 2005 | -
63 | Ayose Falcón, Jared Stark, Alex Ramírez, Konrad Lai, Mateo Valero | Prophet/Critic Hybrid Branch Prediction. | ISCA | 2004 | -
63 | Andrew Ireland, Alan Bundy | Extensions to a Generalization Critic for Inductive Proof. | CADE | 1996 | -
63 | Barry G. Silverman | Building a Better Critic-Recent Empirical Results. | IEEE Expert | 1992 | -
63 | Peter Shih, Brian C. Kaul, Sarangapani Jagannathan, James A. Drallmeier | Reinforcement-Learning-Based Dual-Control Methodology for Complex Nonlinear Discrete-Time Systems With Application to Spark Engine EGR Operation. | IEEE Trans. Neural Networks | 2008 | -
63 | Pingan He 0002, Sarangapani Jagannathan | Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints. | IEEE Trans. Syst. Man Cybern. Part B | 2007 | -
62 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi | An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots. | Soft Comput. | 2007 | Actor-critic algorithms, Multi-step prediction, Nonlinear predictive model, Simulated experience, Kinematic model, Nonholonomic mobile robot
62 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi | Adaptive actor-critic learning for the control of mobile robots by applying predictive models. | Soft Comput. | 2005 | Actor-critic algorithms, Tracking control problem, Predictive model, Temporal difference learning, Nonholonomic mobile robot
62 | Andrés Pérez-Uribe | Using a Time-Delay Actor-Critic Neural Architecture with Dopamine-Like Reinforcement Signal for Learning in Autonomous Robots. | Emergent Neural Computational Architectures Based on Neuroscience | 2001 | Learning robots, actor-critic architecture, TD-learning, dopamine neurons, human teaching signals, reinforcement learning, time-delay neural networks
62 | Cleidson R. B. de Souza, Jair S. Ferreira Jr., Kléder Miranda Gonçalves, Jacques Wainer | A Group Critic System for Object-Oriented Analysis and Design. | ASE | 2000 | group critic system, critiquing system, cooperative software development, design rationale
50 | Norhayati Mohd. Ali, John G. Hosking, Jun Huh, John C. Grundy | Critic Authoring Templates for Specifying Domain-Specific Visual Language Tool Critics. | Australian Software Engineering Conference | 2009 | -
50 | Francisco S. Melo, Manuel Lopes 0001 | Fitted Natural Actor-Critic: A New Algorithm for Continuous State-Action MDPs. | ECML/PKDD (2) | 2008 | -
50 | Efraín Franco Flores, Julio Waissman Vilanova, Jair García Lamont | Learning the Filling Policy of a Biodegradation Process by Fuzzy Actor-Critic Learning Methodology. | MICAI | 2008 | -
50 | Dapeng Zhang, Aiguo Wu, Fuli Wang, Zhiling Lin | The Application of Adaptive Critic Design in the Nosiheptide Fermentation. | ISNN (1) | 2007 | -
50 | James F. Peters | Granular Computing in Actor-Critic Learning. | FOCI | 2007 | -
50 | Norhayati Mohd. Ali | A Generic Visual Critic Authoring Tool. | VL/HCC | 2007 | -
50 | Zenon Hendzel | Adaptive Critic Neural Networks for Identification of Wheeled Mobile Robot. | ICAISC | 2006 | -
50 | Toby Walsh | A Divergence Critic. | CADE | 1994 | -
50 | James F. Peters, Christopher J. Henry, Sheela Ramanna | Reinforcement Learning in Swarms that Learn. | IAT | 2005 | -
49 | Mohammed Shahid Abdulla, Shalabh Bhatnagar | Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. | Discret. Event Dyn. Syst. | 2007 | Actor-critic algorithms, Two timescale stochastic approximation, Simultaneous perturbation stochastic approximation, Normalized Hadamard matrices, TD-learning, Reinforcement learning, Markov decision processes, Policy iteration
38 | Patañjali S. Venkatacharya, Jonathan Kessler, Tami Hardeman, Ed Seiber, Bill Buxton | What makes a good design critic?: food design vs. product design criticism. | CHI Extended Abstracts | 2010 | culinary, user experience, metaphors, criticism, food
38 | Derong Liu 0001, Hossein Javaherian, Olesia Kovalenko, Ting Huang | Adaptive Critic Learning Techniques for Engine Torque and Air-Fuel Ratio Control. | IEEE Trans. Syst. Man Cybern. Part B | 2008 | -
38 | Jih-Wen Sheu, Wei-Song Lin | Designing Automatic Train Regulation for MRT system by adaptive critic method. | IJCNN | 2008 | -
38 | Shingo Mabu, Yan Chen 0008, Kotaro Hirasawa, Jinglu Hu | Stock trading rules using genetic network programming with actor-critic. | IEEE Congress on Evolutionary Computation | 2007 | -
38 | Huaguang Zhang, Yanhong Luo, Derong Liu 0001 | A New Fuzzy Identification Method Based on Adaptive Critic Designs. | ISNN (1) | 2006 | -
38 | Ayose Falcón, Jared Stark, Alex Ramírez, Konrad K. Lai, Mateo Valero | Better Branch Prediction Through Prophet/Critic Hybrids. | IEEE Micro | 2005 | -
38 | Matti Aksela, Jorma Laaksonen | On Adaptive Confidences for Critic-Driven Classifier Combining. | ICAPR (1) | 2005 | -
38 | Zhongwu Huang, S. N. Balakrishnan | Robust Adaptive Critic Based Neurocontrollers for Systems with Input Uncertainties. | IJCNN (3) | 2000 | -
38 | Haifeng Chen, Guofei Jiang, Hui Zhang 0002, Kenji Yoshihira | Boosting the performance of computing systems through adaptive configuration tuning. | SAC | 2009 | configuration tuning, reinforcement learning, system management
38 | Sarangapani Jagannathan, Pingan He 0002 | Neural-Network-Based State Feedback Control of a Nonlinear Discrete-Time System in Nonstrict Feedback Form. | IEEE Trans. Neural Networks | 2008 | -
38 | Jia Ma, Tao Yang 0011, Zeng-Guang Hou, Min Tan 0001, Derong Liu 0001 | Dual Heuristic Programming Based Neurocontroller for Vibration Isolation Control. | ICNSC | 2008 | -
38 | Peter Shih, Brian C. Kaul, Sarangapani Jagannathan, James A. Drallmeier | Near Optimal Output-Feedback Control of Nonlinear Discrete-time Systems in Nonstrict Feedback Form with Application to Engines. | IJCNN | 2007 | -
38 | Richard L. Welch, Ganesh K. Venayagamoorthy | Optimal Control of a Photovoltaic Solar Energy System with Adaptive Critics. | IJCNN | 2007 | -
38 | Hossein Javaherian, Derong Liu 0001, Olesia Kovalenko | Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming. | IJCNN | 2006 | -
38 | Junichiro Yoshimoto, Shin Ishii, Masa-aki Sato | On-Line EM Reinforcement Learning. | IJCNN (3) | 2000 | -
38 | Rajit Gadh, Donna Herbert, Alexander Kott, Charles P. Kollar | Feature-Based Design for Manufacturability Critique in Concurrent Engineering. | MIT-JSME Workshop | 1989 | -
37 | Chrisantha Fernando | Neuronal replicators solve the stability-plasticity dilemma. | GECCO | 2010 | actor-critic, neuronal replicator hypothesis, robotics, reinforcement learning
37 | Dusko Katic, Aleksandar Rodic 0001, Miomir Vukobratovic | Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning. | J. Intell. Robotic Syst. | 2008 | Biped locomotion, Integrated dynamic control, Actor-critic method, Reinforcement learning, Humanoid robots
37 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda | A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space. | ICIC (2) | 2008 | neuro-fuzzy net, swarm behavior, actor-critic algorithm, goal-exploration problem, multi-agent system, reinforcement learning
37 | James F. Peters | Toward Approximate Adaptive Learning. | RSEISP | 2007 | Actor-critic, behaviour pattern, stopping time, perception, adaptive learning, approximation space
37 | Yoichiro Matsuno, Tatsuya Yamazaki, Shin Ishii | A multi-agent reinforcement learning method for a partially-observable competitive game. | Agents | 2001 | actor-critic model, competitive game, reinforcement learning, multi-agent
25 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin | Actor-Critic or Critic-Actor? A Tale of Two Time Scales. | IEEE Control. Syst. Lett. | 2023 | -
25 | Prashansa Panda, Shalabh Bhatnagar | Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms. | CoRR | 2023 | -
25 | Spilios Evmorfos, Athina P. Petropulu, H. Vincent Poor | Actor-Critic Methods for IRS Design in Correlated Channel Environments: A Closer Look Into the Neural Tangent Kernel of the Critic. | IEEE Trans. Signal Process. | 2023 | -
25 | Swaminathan Gurumurthy, Zachary Manchester, J. Zico Kolter | Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning. | L4DC | 2023 | -
25 | Xin Huo, Hamid Reza Karimi, Xudong Zhao 0001, Bohui Wang, Guangdeng Zong | Adaptive-Critic Design for Decentralized Event-Triggered Control of Constrained Nonlinear Interconnected Systems Within an Identifier-Critic Framework. | IEEE Trans. Cybern. | 2022 | -
25 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin | Actor-Critic or Critic-Actor? A Tale of Two Time Scales. | CoRR | 2022 | -
25 | Riazat Ryan, Ming Shao | Critic-over-Actor-Critic Modeling: Finding Optimal Strategy in ICU Environments. | IEEE Big Data | 2022 | -
25 | Arushi Jain, Khimya Khetarpal, Doina Precup | Safe option-critic: learning safety in the option-critic architecture. | Knowl. Eng. Rev. | 2021 | -
25 | Gengzhi Zhang, Liang Feng 0001, Yaqing Hou | Multi-task Actor-Critic with Knowledge Transfer via a Shared Critic. | ACML | 2021 | -
25 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales | Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. | CoRR | 2020 | -
25 | Aras Dargazany | Model-based actor-critic: GAN + DRL (actor-critic) => AGI. | CoRR | 2020 | -
25 | Jiajun Fan, He Ba, Xian Guo, Jianye Hao | Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning. | CoRR | 2020 | -
25 | Roumeissa Kitouni, Abderrahim Kitouni, Feng Jiang 0001 | Generalized Critic Policy Optimization: A Model For Combining Advantage Estimates In Actor Critic Methods. | ICIP | 2020 | -
25 | Yen-Chen Wu, Bo-Hsiang Tseng, Milica Gasic | Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue Systems. | EMNLP (Findings) | 2020 | -
25 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales | Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. | NeurIPS | 2020 | -
25 | Norman L. Tasfi, Miriam A. M. Capretz | Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay. | IJCNN | 2020 | -
25 | Jonathan Lebensold, William L. Hamilton, Borja Balle, Doina Precup | Actor Critic with Differentially Private Critic. | CoRR | 2019 | -
25 | Ala'eddin Masadeh, Zhengdao Wang, Ahmed E. Kamal 0001 | Selector-Actor-Critic and Tuner-Actor-Critic Algorithms for Reinforcement Learning. | WCSP | 2019 | -
25 | Arushi Jain, Khimya Khetarpal, Doina Precup | Safe Option-Critic: Learning Safety in the Option-Critic Architecture. | CoRR | 2018 | -
25 | Jing Wang 0044, Ioannis Ch. Paschalidis | An Actor-Critic Algorithm With Second-Order Actor and Critic. | IEEE Trans. Autom. Control. | 2017 | -
25 | Ian J. Livingston, Regan L. Mandryk, Kevin G. Stanley | Critic-proofing: how using critic reviews and game genres can refine heuristic evaluations. | Future Play | 2010 | -
25 | Petia D. Koprinkova-Hristova, Günther Palm | Adaptive Critic Design with ESN Critic for Bioprocess Optimization. | ICANN (2) | 2010 | -
25 | Swakshar Ray, Ganesh K. Venayagamoorthy, Balarko Chaudhuri, Rajat Majumder | Comparison of Adaptive Critic-Based and Classical Wide-Area Controllers for Power Systems. | IEEE Trans. Syst. Man Cybern. Part B | 2008 | -
25 | Byungchan Kim, Byungduk Kang, Shinsuk Park, Sungchul Kang | Learning robot stiffness for contact tasks using the natural actor-critic. | ICRA | 2008 | -
25 | Sertan Girgin, Philippe Preux | Basis Expansion in Natural Actor Critic Methods. | EWRL | 2008 | -
25 | Zhao Sun, Xi Chen, Zhihai He | Adaptive Critic Design for Energy Minimization of Portable Video Communication Devices. | ICCCN | 2008 | -
25 | Asma Al-Tamimi, Murad Abu-Khalaf, Frank L. Lewis | Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H∞ Control. | IEEE Trans. Syst. Man Cybern. Part B | 2007 | -
25 | Mohammad Ghavamzadeh, Yaakov Engel | Bayesian actor-critic algorithms. | ICML | 2007 | -
25 | Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, Tomohiro Shibata, Koh Hosoda, Shin Ishii | Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic. | IROS | 2006 | -
25 | Mehdi Khamassi, Louis-Emmanuel Martinet, Agnès Guillot | Combining Self-organizing Maps with Mixtures of Experts: Application to an Actor-Critic Model of Reinforcement Learning in the Basal Ganglia. | SAB | 2006 | -
25 | Xin Xu 0001, Xuening Wang, Dewen Hu | Mobile Robot Path-Tracking Using an Adaptive Critic Learning PD Controller. | ISNN (2) | 2004 | -
25 | Farzan Rashidi, Behzad Moshiri | Improvement of Low Frequency Oscillation Damping in Power Systems Via an Adaptive Critic Based NeuroFuzzy Controller. | KES | 2004 | -
25 | George G. Lendaris, Larry Schultz, Thaddeus T. Shannon | Adaptive Critic Design for Intelligent Steering and Speed Control of a 2-Axle Vehicle. | IJCNN (3) | 2000 | -
25 | Tomas Hrycej | An Estimate of the Number of Samples to Convergence for Critic Algorithms. | IJCNN (3) | 2000 | -
25 | Donald C. Wunsch | The Cellular Simultaneous Recurrent Network Adaptive Critic Design for the Generalized Maze Problem Has a Simple Closed-Form Solution. | IJCNN (3) | 2000 | -
25 | Louise A. Dennis, Alan Bundy, Ian Green | Using A Generalisation Critic to Find Bisimulations for Coinductive Proofs. | CADE | 1997 | -
25 | Anthony G. Pipe, Terence C. Fogarty, Alan F. T. Winfield | Hybrid Adaptive Heuristic Critic Architectures for Learning in Mazes with Continuous Search Spaces. | PPSN | 1994 | -
25 | Hassab Elgawi Osman | Architecture of behavior-based and robotics self-optimizing memory controller. | ICRA | 2009 | -
25 | Abdeslam Boularias, Brahim Chaib-draa | Predictive representations for policy gradient in POMDPs. | ICML | 2009 | -
25 | Qinmin Yang, Sarangapani Jagannathan | A Suite of Robust Controllers for the Manipulation of Microscale Objects. | IEEE Trans. Syst. Man Cybern. Part B | 2008 | -
25 | Asma Al-Tamimi, Frank L. Lewis, Murad Abu-Khalaf | Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof. | IEEE Trans. Syst. Man Cybern. Part B | 2008 | -
25 | Qinmin Yang, Jonathan Blake Vance, Sarangapani Jagannathan | Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks. | IEEE Trans. Syst. Man Cybern. Part B | 2008 | -
25 | Silvia Ferrari, Mark Jensenius | A Constrained Optimization Approach to Preserving Prior Knowledge During Incremental Training. | IEEE Trans. Neural Networks | 2008 | -
25 | Xiaohua Wang, S. N. Balakrishnan | Optimal controller synthesis of variable-time impulsive problems using single network adaptive critics. | IJCNN | 2008 | -
25 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda | A reinforcement learning system for swarm behaviors. | IJCNN | 2008 | -
25 | Wipawee Usaha, Javier A. Barria | Reinforcement Learning for Resource Allocation in LEO Satellite Networks. | IEEE Trans. Syst. Man Cybern. Part B | 2007 | -
25 | Danil V. Prokhorov | Training Recurrent Neurocontrollers for Real-Time Applications. | IEEE Trans. Neural Networks | 2007 | -
25 | Huai-Yu Wu, Chunhong Pan, Qing Yang 0002, Songde Ma | Consistent Correspondence between Arbitrary Manifold Surfaces. | ICCV | 2007 | -
25 | Asma Al-Tamimi, Draguna L. Vrabie, Murad Abu-Khalaf, Frank L. Lewis | Model-free Approximate Dynamic Programming Schemes for Linear Systems. | IJCNN | 2007 | -
25 | Daan Wierstra, Jürgen Schmidhuber | Policy Gradient Critics. | ECML | 2007 | -
25 | Thaddeus T. Shannon | Qualitative Adaptive Critics. | IJCNN | 2006 | -
25 | Chia-Feng Juang | Combination of online clustering and Q-value based GA for reinforcement fuzzy system design. | IEEE Trans. Fuzzy Syst. | 2005 | -
25 | Nathan Denny, Michael M. Marefat | Exploiting similarity metrics and case-bases for knowledge sharing between case-based reasoners. | IRI | 2005 | -
Displaying results #1 - #100 of 1954 (100 per page).
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Open data: data released under the ODC-BY 1.0 license.