FacetedDBLP

Searching for phrase actor-critic (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1994-2003 (18) 2004-2005 (17) 2006-2007 (30) 2008 (27) 2009-2010 (28) 2011-2012 (25) 2013 (17) 2014 (15) 2015-2016 (25) 2017 (43) 2018 (63) 2019 (115) 2020 (140) 2021 (222) 2022 (199) 2023 (255) 2024 (64)
Publication types (Num. hits)
article(753) inproceedings(548) phdthesis(2)
Venues (Conferences, Journals, ...)
CoRR(380) ICML(28) NeurIPS(25) CDC(20) IEEE Access(19) IJCNN(19) IEEE Trans. Neural Networks Le...(18) IEEE Internet Things J.(17) Neurocomputing(17) GLOBECOM(16) AAAI(15) AAMAS(15) ICRA(14) ICLR(13) ACC(12) Inf. Sci.(10) More (+10 of total 387)

Results
Found 1303 publication records. Showing 1303 according to the selection in the facets
Hits | Authors | Title | Venue | Year | Author keywords
158 | Thomas Hanselmann, Lyle Noakes, Anthony Zaknich | Continuous-Time Adaptive Critics. | IEEE Trans. Neural Networks | 2007 |
146 | Shamama Anwar, K. Sridhar Patnaik | Actor Critic Learning: A Near Set Approach. | RSCTC | 2008 | ethogram, ethology, actor critic, rough sets, Adaptive learning, approximation space, near sets
127 | Jan Peters 0001, Sethu Vijayakumar, Stefan Schaal | Natural Actor-Critic. | ECML | 2005 |
113 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi | An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots. | Soft Comput. | 2007 | Actor-critic algorithms, Multi-step prediction, Nonlinear predictive model, Simulated experience, Kinematic model, Nonholonomic mobile robot
113 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi | Adaptive actor-critic learning for the control of mobile robots by applying predictive models. | Soft Comput. | 2005 | Actor-critic algorithms, Tracking control problem, Predictive model, Temporal difference learning, Nonholonomic mobile robot
107 | Jooyoung Park, Jongho Kim, Daesung Kang | An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. | CIS (1) | 2005 |
105 | Andrés Pérez-Uribe | Using a Time-Delay Actor-Critic Neural Architecture with Dopamine-Like Reinforcement Signal for Learning in Autonomous Robots. | Emergent Neural Computational Architectures Based on Neuroscience | 2001 | Learning robots, actor-critic architecture, TD-learning, dopamine neurons, human teaching signals, reinforcement learning, time-delay neural networks
94 | James F. Peters | Granular Computing in Actor-Critic Learning. | FOCI | 2007 |
86 | Francisco S. Melo, Manuel Lopes 0001 | Fitted Natural Actor-Critic: A New Algorithm for Continuous State-Action MDPs. | ECML/PKDD (2) | 2008 |
86 | Efraín Franco Flores, Julio Waissman Vilanova, Jair García Lamont | Learning the Filling Policy of a Biodegradation Process by Fuzzy Actor-Critic Learning Methodology. | MICAI | 2008 |
83 | James F. Peters, Christopher J. Henry, Sheela Ramanna | Reinforcement Learning in Swarms that Learn. | IAT | 2005 |
81 | Mohammed Shahid Abdulla, Shalabh Bhatnagar | Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. | Discret. Event Dyn. Syst. | 2007 | Actor-critic algorithms, Two timescale stochastic approximation, Simultaneous perturbation stochastic approximation, Normalized Hadamard matrices, TD-learning, Reinforcement learning, Markov decision processes, Policy iteration
69 | Yoichiro Matsuno, Tatsuya Yamazaki, Shin Ishii | A multi-agent reinforcement learning method for a partially-observable competitive game. | Agents | 2001 | actor-critic model, competitive game, reinforcement learning, multi-agent
65 | Shingo Mabu, Yan Chen 0008, Kotaro Hirasawa, Jinglu Hu | Stock trading rules using genetic network programming with actor-critic. | IEEE Congress on Evolutionary Computation | 2007 |
62 | Junichiro Yoshimoto, Shin Ishii, Masa-aki Sato | On-Line EM Reinforcement Learning. | IJCNN (3) | 2000 |
60 | Chrisantha Fernando | Neuronal replicators solve the stability-plasticity dilemma. | GECCO | 2010 | actor-critic, neuronal replicator hypothesis, robotics, reinforcement learning
60 | Dusko Katic, Aleksandar Rodic 0001, Miomir Vukobratovic | Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning. | J. Intell. Robotic Syst. | 2008 | Biped locomotion, Integrated dynamic control, Actor-critic method, Reinforcement learning, Humanoid robots
60 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda | A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space. | ICIC (2) | 2008 | neuro-fuzzy net, swarm behavior, actor-critic algorithm, goal-exploration problem, multi-agent system, reinforcement learning
60 | James F. Peters | Toward Approximate Adaptive Learning. | RSEISP | 2007 | Actor-critic, behaviour pattern, stopping time, perception, adaptive learning, approximation space
47 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin | Actor-Critic or Critic-Actor? A Tale of Two Time Scales. | IEEE Control. Syst. Lett. | 2023 |
47 | Prashansa Panda, Shalabh Bhatnagar | Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms. | CoRR | 2023 |
47 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin | Actor-Critic or Critic-Actor? A Tale of Two Time Scales. | CoRR | 2022 |
47 | Aras Dargazany | Model-based actor-critic: GAN + DRL (actor-critic) => AGI. | CoRR | 2020 |
47 | Norman L. Tasfi, Miriam A. M. Capretz | Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay. | IJCNN | 2020 |
47 | Ala'eddin Masadeh, Zhengdao Wang, Ahmed E. Kamal 0001 | Selector-Actor-Critic and Tuner-Actor-Critic Algorithms for Reinforcement Learning. | WCSP | 2019 |
47 | Jing Wang 0044, Ioannis Ch. Paschalidis | An Actor-Critic Algorithm With Second-Order Actor and Critic. | IEEE Trans. Autom. Control. | 2017 |
44 | Byungchan Kim, Byungduk Kang, Shinsuk Park, Sungchul Kang | Learning robot stiffness for contact tasks using the natural actor-critic. | ICRA | 2008 |
44 | Sertan Girgin, Philippe Preux | Basis Expansion in Natural Actor Critic Methods. | EWRL | 2008 |
44 | Mohammad Ghavamzadeh, Yaakov Engel | Bayesian actor-critic algorithms. | ICML | 2007 |
44 | Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, Tomohiro Shibata, Koh Hosoda, Shin Ishii | Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic. | IROS | 2006 |
44 | Mehdi Khamassi, Louis-Emmanuel Martinet, Agnès Guillot | Combining Self-organizing Maps with Mixtures of Experts: Application to an Actor-Critic Model of Reinforcement Learning in the Basal Ganglia. | SAB | 2006 |
41 | Hassab Elgawi Osman | Architecture of behavior-based and robotics self-optimizing memory controller. | ICRA | 2009 |
41 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda | A reinforcement learning system for swarm behaviors. | IJCNN | 2008 |
41 | Daan Wierstra, Jürgen Schmidhuber | Policy Gradient Critics. | ECML | 2007 |
41 | Yutaka Nakamura, Takeshi Mori, Shin Ishii | Natural Policy Gradient Reinforcement Learning for a CPG Control of a Biped Robot. | PPSN | 2004 |
36 | Spilios Evmorfos, Athina P. Petropulu, H. Vincent Poor | Actor-Critic Methods for IRS Design in Correlated Channel Environments: A Closer Look Into the Neural Tangent Kernel of the Critic. | IEEE Trans. Signal Process. | 2023 |
36 | Swaminathan Gurumurthy, Zachary Manchester, J. Zico Kolter | Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning. | L4DC | 2023 |
36 | Riazat Ryan, Ming Shao | Critic-over-Actor-Critic Modeling: Finding Optimal Strategy in ICU Environments. | IEEE Big Data | 2022 |
36 | Gengzhi Zhang, Liang Feng 0001, Yaqing Hou | Multi-task Actor-Critic with Knowledge Transfer via a Shared Critic. | ACML | 2021 |
36 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales | Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. | CoRR | 2020 |
36 | Jiajun Fan, He Ba, Xian Guo, Jianye Hao | Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning. | CoRR | 2020 |
36 | Roumeissa Kitouni, Abderrahim Kitouni, Feng Jiang 0001 | Generalized Critic Policy Optimization: A Model For Combining Advantage Estimates In Actor Critic Methods. | ICIP | 2020 |
36 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales | Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. | NeurIPS | 2020 |
36 | Jonathan Lebensold, William L. Hamilton, Borja Balle, Doina Precup | Actor Critic with Differentially Private Critic. | CoRR | 2019 |
35 | Lin Li, Yuze Li, Wei Wei 0018, Yujia Zhang, Jiye Liang | Multi-actor mechanism for actor-critic reinforcement learning. | Inf. Sci. | 2023 |
35 | Bo Li 0004, Shuangxia Bai, Shiyang Liang, Rui Ma, Evgeny Sergeevich Neretin, Jingyi Huang | Manoeuvre decision-making of unmanned aerial vehicles in air combat based on an expert actor-based soft actor critic algorithm. | CAAI Trans. Intell. Technol. | 2023 |
35 | Yuhu Cheng, Longyang Huang, C. L. Philip Chen, Xuesong Wang 0001 | Robust Actor-Critic With Relative Entropy Regulating Actor. | IEEE Trans. Neural Networks Learn. Syst. | 2023 |
35 | Thibault Lahire | Actor Loss of Soft Actor Critic Explained. | CoRR | 2021 |
35 | Siddharth Mysore, Bassel El Mabsout, Renato Mancuso 0001, Kate Saenko | Honey. I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL. | CoG | 2021 |
35 | Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine | Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. | CoRR | 2018 |
35 | Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine | Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. | ICML | 2018 |
33 | Abdeslam Boularias, Brahim Chaib-draa | Predictive representations for policy gradient in POMDPs. | ICML | 2009 |
33 | Qinmin Yang, Jonathan Blake Vance, Sarangapani Jagannathan | Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks. | IEEE Trans. Syst. Man Cybern. Part B | 2008 |
33 | Wipawee Usaha, Javier A. Barria | Reinforcement Learning for Resource Allocation in LEO Satellite Networks. | IEEE Trans. Syst. Man Cybern. Part B | 2007 |
29 | Atsushi Shimizu, Yuko Osana | Reinforcement Learning Using Kohonen Feature Map Associative Memory with Refractoriness Based on Area Representation. | ICONIP (2) | 2008 |
29 | Dimitri Ognibene, Angelo Rega, Gianluca Baldassarre | A Model of Reaching that Integrates Reinforcement Learning and Population Encoding of Postures. | SAB | 2006 |
24 | Hongjun Zhu, Yong Xie, Suijun Zheng | A double Actor-Critic learning system embedding improved Monte Carlo tree search. | Neural Comput. Appl. | 2024 |
24 | Hao Li, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu 0004, Zhen-Qiu Feng, Zeng-Guang Hou | CASOG: Conservative Actor-Critic With SmOoth Gradient for Skill Learning in Robot-Assisted Intervention. | IEEE Trans. Ind. Electron. | 2024 |
24 | Jingliang Duan, Yangang Ren, Fawang Zhang, Jie Li 0042, Shengbo Eben Li, Yang Guan, Keqiang Li | Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-Lane Scenarios [Research Frontier] [Research Frontier]. | IEEE Comput. Intell. Mag. | 2024 |
24 | Bingyi Liu, Weizhen Han, Enshu Wang, Shengwu Xiong, Libing Wu, Qian Wang 0002, Jianping Wang 0001, Chunming Qiao | Multi-Agent Attention Double Actor-Critic Framework for Intelligent Traffic Light Control in Urban Scenarios With Hybrid Traffic. | IEEE Trans. Mob. Comput. | 2024 |
24 | Ali Beikmohammadi, Sindri Magnússon | Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge. | Inf. Sci. | 2024 |
24 | Amgad Abdallah Mahmoud, Nada Adel Alyan, Ahmed Elkerdawy, Shihori Tanabe, Frédéric Andrès, Andreas Pester, Hesham H. Ali | Geom-SAC: Geometric Multi-Discrete Soft Actor Critic With Applications in De Novo Drug Design. | IEEE Access | 2024 |
24 | Sudheer Mangalampalli, Ganesh Reddy Karri, Sachi Nandan Mohanty, Shahid Ali, Muhammad Ijaz Khan, Sherzod Sh. Abdullaev, Salman A. AlQahtani | Multi-Objective Prioritized Task Scheduler Using Improved Asynchronous Advantage Actor Critic (a3c) Algorithm in Multi Cloud Environment. | IEEE Access | 2024 |
24 | Dan Wu, Liming Wang, Meiyan Liang, Yunpeng Kang, Qi Jiao, Yajun Cheng, Jian Li 0010 | UAV-Assisted Real-Time Video Transmission for Vehicles: A Soft Actor-Critic DRL Approach. | IEEE Internet Things J. | 2024 |
24 | Xiangyu Gao, Yaping Sun, Hao Chen 0013, Xiaodong Xu 0001, Shuguang Cui | Joint Computing, Pushing, and Caching Optimization for Mobile-Edge Computing Networks via Soft Actor-Critic Learning. | IEEE Internet Things J. | 2024 |
24 | Kaiqing Bu, Yan Liu, Fuli Wang | Adaptation Entropy Regularization Actor-Critic for Process Operation Performance Assessment. | IEEE Trans. Ind. Informatics | 2024 |
24 | Fang Fu, Xianpeng Wei, Zhicai Zhang, Laurence T. Yang, Lin Cai 0001, Jia Luo 0003, Zhe Zhang 0010, Chenmeng Wang | Age of Information Minimization for UAV-Assisted Internet of Things Networks: A Safe Actor-Critic With Policy Distillation Approach. | IEEE Trans. Netw. Sci. Eng. | 2024 |
24 | Mohammad Cheraghinia, Seyed Hamed Rastegar, Vahid Shah-Mansouri, Hamed Kebriaei, Kun Zhu 0001, Dusit Niyato | Toward a Virtual Edge Service Provider: Actor-Critic Learning to Incentivize the Computation Nodes. | IEEE Trans. Netw. Sci. Eng. | 2024 |
24 | Ran Wang, Ye Tian, Kenji Kashima | Density estimation based soft actor-critic: deep reinforcement learning for static output feedback control with measurement noise. | Adv. Robotics | 2024 |
24 | Sicen Li, Yiming Pang, Panju Bai, Jiawei Li, Zhaojin Liu, Shihao Hu, Liquan Wang, Gang Wang | Learning Locomotion for Quadruped Robots via Distributional Ensemble Actor-Critic. | IEEE Robotics Autom. Lett. | 2024 |
24 | Mélodie Daniel, Aly Magassouba, Miguel Aranda, Laurent Lequièvre, Juan Antonio Corrales Ramon, Roberto Iglesias Rodriguez, Youcef Mezouar | Multi Actor-Critic DDPG for Robot Action Space Decomposition: A Framework to Control Large 3D Deformation of Soft Linear Objects. | IEEE Robotics Autom. Lett. | 2024 |
24 | Lulu Zhang, Huaguang Zhang, Xiaohui Yue, Tianbiao Wang | Actor-Critic Optimal Control for Semi-Markovian Jump Systems With Time Delay. | IEEE Trans. Circuits Syst. II Express Briefs | 2024 |
24 | Long Zhang 0003, Xingliang Jia, Ni Tian, Choong Seon Hong, Zhu Han 0001 | When Visible Light Communication Meets RIS: A Soft Actor-Critic Approach. | IEEE Wirel. Commun. Lett. | 2024 |
24 | Le Thanh Tan, Martin Reisslein, Sachin Shetty | Multi-Timescale Actor-Critic Learning for Computing Resource Management With Semi-Markov Renewal Process Mobility. | IEEE Trans. Intell. Transp. Syst. | 2024 |
24 | Lu Dong 0002, Zichen He, Chunwei Song, Xin Yuan, Haichao Zhang | Multi-robot social-aware cooperative planning in pedestrian environments using attention-based actor-critic. | Artif. Intell. Rev. | 2024 |
24 | Jiacun Wang, GuiPeng Xi, Xiwang Guo, Shujin Qin, Henry Han | Multi-Objective Advantage Actor-Critic Algorithm for Hybrid Disassembly Line Balancing with Multi-Skilled Workers. | Inf. | 2024 |
24 | Xiao Wang 0034, Dazi Li | Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy. | Neurocomputing | 2024 |
24 | Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha | Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic. | CoRR | 2024 |
24 | Alberto Sinigaglia, Niccolò Turcato, Alberto Dalla Libera, Ruggero Carli, Gian Antonio Susto | Exploiting Estimation Bias in Deep Double Q-Learning for Actor-Critic Methods. | CoRR | 2024 |
24 | Jinxuan Chen, Mustafa Özger, Cicek Cavdar | Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles. | CoRR | 2024 |
24 | Samuel Chun-Hei Lam, Justin A. Sirignano, Ziheng Wang | Weak Convergence Analysis of Online Neural Actor-Critic Algorithms. | CoRR | 2024 |
24 | Yuan Lin, Xiao Liu, Zishun Zheng, Liyao Wang | Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space. | CoRR | 2024 |
24 | Shengchao Yan, Lukas König, Wolfram Burgard | Single-Agent Actor Critic for Decentralized Cooperative Driving. | CoRR | 2024 |
24 | Tianying Ji, Yongyuan Liang, Yan Zeng 0002, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun 0001, Huazhe Xu | ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization. | CoRR | 2024 |
24 | Jiarui Wang, Mahyar Fazlyab | Actor-Critic Physics-informed Neural Lyapunov Control. | CoRR | 2024 |
24 | Michal Nauman, Michal Bortkiewicz, Mateusz Ostaszewski, Piotr Milos, Tomasz Trzcinski, Marek Cygan | Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning. | CoRR | 2024 |
24 | Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu 0003, Melih Kandemir | Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty. | CoRR | 2024 |
24 | Luca Grillotti, Maxence Faldor, Borja G. León, Antoine Cully | Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics. | CoRR | 2024 |
24 | Tobias Enders, James Harrison, Maximilian Schiffer | Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts. | CoRR | 2024 |
24 | Nikhil Kumar Singh 0004, Indranil Saha | Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences. | CoRR | 2024 |
24 | Honghao Wei, Xiyue Peng, Xin Liu, Arnob Ghosh | Adversarially Trained Actor Critic for offline CMDPs. | CoRR | 2024 |
24 | Michal Nauman, Mateusz Ostaszewski, Marek Cygan | A Case for Validation Buffer in Pessimistic Actor-Critic. | CoRR | 2024 |
24 | Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang 0001, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller | Offline Actor-Critic Reinforcement Learning Scales to Large Models. | CoRR | 2024 |
24 | Michael Kölle 0001, Mohamad Hgog, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Stein 0001, Claudia Linnhoff-Popien | Quantum Advantage Actor-Critic for Reinforcement Learning. | CoRR | 2024 |
24 | Clément Gaspard, Grégoire Passault, Mélodie Daniel, Olivier Ly | FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting. | CoRR | 2024 |
24 | Hamed Rahimi Nohooji, Abolfazl Zaraki, Holger Voos | Actor-critic learning based PID control for robotic manipulators. | Appl. Soft Comput. | 2024 |
24 | Marta Monaci, Valerio Agasucci, Giorgio Grani | An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents. | Eur. J. Oper. Res. | 2024 |
24 | Guilherme Piêgas Koslovski, Kleiton Pereira, Paulo Roberto Albuquerque | DAG-based workflows scheduling using Actor-Critic Deep Reinforcement Learning. | Future Gener. Comput. Syst. | 2024 |
24 | Xiaohong Nian, MengMeng Li, Haibo Wang, Yalei Gong, Hongyun Xiong | Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm. | Appl. Intell. | 2024 |
24 | Wei Li, Si Li, Huaguang Shi, Wenhao Yan, Yi Zhou 0004 | UAV-enabled fair offloading for MEC networks: a DRL approach based on actor-critic parallel architecture. | Appl. Intell. | 2024 |
Displaying results #1 - #100 of 1303 (100 per page).
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Open data: data released under the ODC-BY 1.0 license.