The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for phrase Policy-gradient (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1999-2003 (20) 2004-2005 (22) 2006-2007 (25) 2008 (25) 2009-2010 (21) 2011-2013 (18) 2014-2015 (18) 2016-2017 (49) 2018 (80) 2019 (101) 2020 (133) 2021 (160) 2022 (171) 2023 (207) 2024 (57)
Publication types (Num. hits)
article(605) data(1) incollection(3) inproceedings(495) phdthesis(3)
Venues (Conferences, Journals, ...)
CoRR(374) ICML(46) NeurIPS(32) AAAI(26) IEEE Access(25) CDC(18) NIPS(17) AISTATS(15) IROS(13) ICLR(10) AAMAS(9) Appl. Intell.(9) Eng. Appl. Artif. Intell.(8) ICASSP(8) IEEE Trans. Neural Networks Le...(8) IJCNN(8) More (+10 of total 316)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 23 occurrences of 20 keywords

Results
Found 1107 publication records. Showing 1107 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
56Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya A New Natural Policy Gradient by Stationary Distribution Metric. Search on Bibsonomy ECML/PKDD (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF policy gradient reinforcement learning, Riemannian metric matrix, Markov decision process, natural gradient
51Maarten Peeters, Ville Könönen, Katja Verbeeck, Ann Nowé A Learning Automata Approach to Multi-agent Policy Gradient Learning. Search on Bibsonomy KES (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
48Nguyen Hoang Viet, Ngo Anh Vien, TaeChoong Chung Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks. Search on Bibsonomy ICNSC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
41Yutaka Nakamura, Takeshi Mori, Shin Ishii An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process. Search on Bibsonomy ICANN (2) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
40Daan Wierstra, Jürgen Schmidhuber Policy Gradient Critics. Search on Bibsonomy ECML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
39Abdeslam Boularias, Brahim Chaib-draa Predictive representations for policy gradient in POMDPs. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
35David Silver, Gerald Tesauro Monte-Carlo simulation balancing. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
34Dongbing Gu, Erfu Yang Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems. Search on Bibsonomy J. Intell. Robotic Syst. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF flocking behavior, policy gradient reinforcement learning, cooperative control, multi-agent reinforcement learning
33Jan Peters 0001, Sethu Vijayakumar, Stefan Schaal Natural Actor-Critic. Search on Bibsonomy ECML The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
31Shixiang Gu, Timothy P. Lillicrap, Zoubin Ghahramani, Richard E. Turner, Bernhard Schölkopf, Sergey Levine Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning. Search on Bibsonomy CoRR The full citation details ... 2017 DBLP  BibTeX  RDF
31Shixiang Gu, Tim Lillicrap, Richard E. Turner, Zoubin Ghahramani, Bernhard Schölkopf, Sergey Levine Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning. Search on Bibsonomy NIPS The full citation details ... 2017 DBLP  BibTeX  RDF
29Thomas Rückstieß, Martin Felder, Jürgen Schmidhuber State-Dependent Exploration for Policy Gradient Methods. Search on Bibsonomy ECML/PKDD (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
29Sertan Girgin, Philippe Preux Basis Expansion in Natural Actor Critic Methods. Search on Bibsonomy EWRL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
28Emmanuel Daucé A Model of Neuronal Specialization Using Hebbian Policy-Gradient with "Slow" Noise. Search on Bibsonomy ICANN (1) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
28Seiji Ishihara, Harukazu Igarashi Behavior Learning Based on a Policy Gradient Method: Separation of Environmental Dynamics and State Values in Policies. Search on Bibsonomy PRICAI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
28Jan Peters 0001, Stefan Schaal Policy Gradient Methods for Robotics. Search on Bibsonomy IROS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
28Ville Könönen Policy Gradient Method for Team Markov Games. Search on Bibsonomy IDEAL The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
28Sham M. Kakade Optimizing Average Reward Using Discounted Rewards. Search on Bibsonomy COLT/EuroCOLT The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
25Rui Yuan Stochastic Second Order Methods and Finite Time Analysis of Policy Gradient Methods. (Méthodes du second d'ordre stochastiques et analyse de temps fini des méthodes de policy-gradient). Search on Bibsonomy 2023   RDF
25Yanli Liu 0003, Kaiqing Zhang, Tamer Basar, Wotao Yin An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
25Ju-Seung Byun, Byungmoon Kim, Huamin Wang Proximal Policy Gradient: PPO with Policy Gradient. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
25Yanli Liu 0003, Kaiqing Zhang, Tamer Basar, Wotao Yin An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods. Search on Bibsonomy NeurIPS The full citation details ... 2020 DBLP  BibTeX  RDF
25Andrew Ilyas, Logan Engstrom, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms? Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
24Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, Frans A. Oliehoek When Do Off-Policy and On-Policy Policy Gradient Methods Align? Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
24Nicholas E. Corrado, Josiah P. Hanna On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
23Bikramjit Banerjee, Jing Peng Adaptive policy gradient in multiagent learning. Search on Bibsonomy AAMAS The full citation details ... 2003 DBLP  DOI  BibTeX  RDF gradient ascent learning, game theory, nash equilibria
22Jan Peters 0001, Jens Kober, Duy Nguyen-Tuong Policy Learning - A Unified Perspective with Applications in Robotics. Search on Bibsonomy EWRL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
22Mohammad Ghavamzadeh, Yaakov Engel Bayesian actor-critic algorithms. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
20Harukazu Igarashi, Kouji Nakamura, Seiji Ishihara Learning of soccer player agents using a policy gradient method: Coordination between kicker and receiver during free kicks. Search on Bibsonomy IJCNN The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
20Yu Hiei, Takeshi Mori, Shin Ishii Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
20Tomoya Tamei, Tomohiro Shibata Policy Gradient Learning of Cooperative Interaction with a Robot Using User's Biological Signals. Search on Bibsonomy ICONIP (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
20Andrea Cherubini, Francesca Giannone, Luca Iocchi, Pier Francesco Palamara An extended policy gradient algorithm for robot task learning. Search on Bibsonomy IROS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
20Nate Kohl, Peter Stone Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion. Search on Bibsonomy ICRA The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
20Yutaka Nakamura, Takeshi Mori, Shin Ishii Natural Policy Gradient Reinforcement Learning for a CPG Control of a Biped Robot. Search on Bibsonomy PPSN The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
20Frank Sehnke, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters 0001, Jürgen Schmidhuber Policy Gradients with Parameter-Based Exploration for Control. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
20Kristian Kersting, Kurt Driessens Non-parametric policy gradients: a unified treatment of propositional and relational domains. Search on Bibsonomy ICML The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
19Olivier Buffet, Alain Dutech, François Charpillet Shaping multi-agent systems with gradient reinforcement learning. Search on Bibsonomy Auton. Agents Multi Agent Syst. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Policy-gradient, Multi-agent systems, Reinforcement learning, Shaping, Partially observable Markov decision processes
19Stefana Anita, Gabriel Turinici On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
19Guangchen Lan, Han Wang, James Anderson, Christopher G. Brinton, Vaneet Aggarwal Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
19Guangchen Lan, Han Wang, James Anderson, Christopher G. Brinton, Vaneet Aggarwal Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
19Matilde Gargiani, Andrea Zanelli, Andrea Martinelli, Tyler H. Summers, John Lygeros PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
19Matilde Gargiani, Andrea Zanelli, Andrea Martinelli, Tyler H. Summers, John Lygeros PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation. Search on Bibsonomy ICML The full citation details ... 2022 DBLP  BibTeX  RDF
19Harshat Kumar, Dionysios S. Kalogerias, George J. Pappas, Alejandro Ribeiro Actor-only Deterministic Policy Gradient via Zeroth-order Gradient Oracles in Action Space. Search on Bibsonomy ISIT The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
19Chris Nota, Philip S. Thomas Is the Policy Gradient a Gradient? (PDF / PS) Search on Bibsonomy AAMAS The full citation details ... 2020 DBLP  BibTeX  RDF
19Chris Nota, Philip S. Thomas Is the Policy Gradient a Gradient? Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
19Peter Henderson 0002, Joshua Romoff, Joelle Pineau Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
18Lorenzo Sforni, Guido Carnevale, Ivano Notarnicola, Giuseppe Notarstefano Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
18Jonathan Viquerat, Régis Duvigneau, P. Meliga, Alexander Kuhnle, Elie Hachem Policy-based optimization: single-step policy gradient method seen as an evolution strategy. Search on Bibsonomy Neural Comput. Appl. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Yifei Zhou, Ayush Sekhari, Yuda Song 0001, Wen Sun 0002 Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Qinghua Liu, Gellért Weisz, András György 0001, Chi Jin 0001, Csaba Szepesvári Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Fengdi Che, Gautham Vasan, A. Rupam Mahmood Correcting discount-factor mismatch in on-policy policy gradient methods. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Yinbin Han, Meisam Razaviyayn, Renyuan Xu Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Yashaswini Murthy, R. Srikant 0001 On the Convergence of Natural Policy Gradient and Mirror Descent-Like Policy Methods for Average-Reward MDPs. Search on Bibsonomy CDC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
18Qinghua Liu, Gellért Weisz, András György 0001, Chi Jin 0001, Csaba Szepesvári Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
18Fengdi Che, Gautham Vasan, A. Rupam Mahmood Correcting discount-factor mismatch in on-policy policy gradient methods. Search on Bibsonomy ICML The full citation details ... 2023 DBLP  BibTeX  RDF
18Romain Laroche, Remi Tachet des Combes Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
18Carlo Alfano, Patrick Rebeschini Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
18Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
18Samuele Tosatto, João Carvalho, Jan Peters 0001 Batch Reinforcement Learning With a Nonparametric Off-Policy Policy Gradient. Search on Bibsonomy IEEE Trans. Pattern Anal. Mach. Intell. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
18Romain Laroche, Remi Tachet des Combes Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms. Search on Bibsonomy AISTATS The full citation details ... 2022 DBLP  BibTeX  RDF
18Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
18Ishaan Shah, David Halpern, Kavosh Asadi, Michael L. Littman Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
18Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay. Search on Bibsonomy ICTAI The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
18Samuele Tosatto, João Carvalho, Jan Peters 0001 Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
18Romina Abachi, Mohammad Ghavamzadeh, Amir-massoud Farahmand Policy-Aware Model Learning for Policy Gradient Methods. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
18Alekh Agarwal, Mikael Henaff, Sham M. Kakade, Wen Sun 0002 PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
18Seiji Ishihara, Harukazu Igarashi Policy Gradient Reinforcement Learning for Policy Represented by Fuzzy Rules: Application to Simulations of Speed Control of an Automobile. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
18Yixiang Wang, Feng Wu 0001 Policy Adaptive Multi-agent Deep Deterministic Policy Gradient. Search on Bibsonomy PRIMA The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
18Alekh Agarwal, Mikael Henaff, Sham M. Kakade, Wen Sun 0002 PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning. Search on Bibsonomy NeurIPS The full citation details ... 2020 DBLP  BibTeX  RDF
18Samuele Tosatto, João Carvalho, Hany Abdulsamad, Jan Peters 0001 A Nonparametric Off-Policy Policy Gradient. Search on Bibsonomy AISTATS The full citation details ... 2020 DBLP  BibTeX  RDF
18Yunhao Tang, Mingzhang Yin, Mingyuan Zhou Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
18Riashat Islam, Komal K. Teru, Deepak Sharma Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
18Hélène Plisnier, Denis Steckelmacher, Diederik M. Roijers, Ann Nowé The Actor-Advisor: Policy Gradient With Off-Policy Advice. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
18Yao Liu 0009, Adith Swaminathan, Alekh Agarwal, Emma Brunskill Off-Policy Policy Gradient with State Distribution Correction. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
18Yao Liu 0009, Adith Swaminathan, Alekh Agarwal, Emma Brunskill Off-Policy Policy Gradient with Stationary Distribution Correction. (PDF / PS) Search on Bibsonomy UAI The full citation details ... 2019 DBLP  BibTeX  RDF
18Ehsan Imani, Eric Graves 0002, Martha White An Off-policy Policy Gradient Theorem Using Emphatic Weightings. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
18Junta Wu, Huiyun Li Aggregated Multi-deep Deterministic Policy Gradient for Self-driving Policy. Search on Bibsonomy IOV The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
18Josiah P. Hanna, Peter Stone Towards a Data Efficient Off-Policy Policy Gradient. Search on Bibsonomy AAAI Spring Symposia The full citation details ... 2018 DBLP  BibTeX  RDF
18Ehsan Imani, Eric Graves 0002, Martha White An Off-policy Policy Gradient Theorem Using Emphatic Weightings. Search on Bibsonomy NeurIPS The full citation details ... 2018 DBLP  BibTeX  RDF
18Yan Yan, Quan Liu Policy Space Noise in Deep Deterministic Policy Gradient. Search on Bibsonomy ICONIP (2) The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
18Li Zhou 0006, Kevin Small, Oleg Rokhlenko, Charles Elkan End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient. Search on Bibsonomy CoRR The full citation details ... 2017 DBLP  BibTeX  RDF
18Shixiang Gu, Timothy P. Lillicrap, Zoubin Ghahramani, Richard E. Turner, Sergey Levine Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic. Search on Bibsonomy ICLR The full citation details ... 2017 DBLP  BibTeX  RDF
18Shixiang Gu, Timothy P. Lillicrap, Zoubin Ghahramani, Richard E. Turner, Sergey Levine Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic. Search on Bibsonomy CoRR The full citation details ... 2016 DBLP  BibTeX  RDF
18Lucas Lehnert, Doina Precup Policy Gradient Methods for Off-policy Control. Search on Bibsonomy CoRR The full citation details ... 2015 DBLP  BibTeX  RDF
18Tingting Zhao 0001, Gang Niu 0001, Ning Xie 0003, Jucheng Yang 0001, Masashi Sugiyama Regularized Policy Gradients: Direct Variance Reduction in Policy Gradient Estimation. Search on Bibsonomy ACML The full citation details ... 2015 DBLP  BibTeX  RDF
18Ujjwal Das Gupta, Erik Talvitie, Michael Bowling Policy Tree: Adaptive Representation for Policy Gradient. Search on Bibsonomy AAAI The full citation details ... 2015 DBLP  DOI  BibTeX  RDF
18Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya Learning a dynamic policy by using policy gradient: application to biped walking. Search on Bibsonomy Syst. Comput. Jpn. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
18Yutaka Nakamura, Takeshi Mori, Yoichi Tokita, Tomohiro Shibata, Shin Ishii Off-Policy Natural Policy Gradient Method for a Biped Walking Using a CPG Controller. Search on Bibsonomy J. Robotics Mechatronics The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
18Xi-Ren Cao Basic Ideas for Event-Based Optimization of Markov Systems. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF Markov decision processes (MDPs), performance potentials, policy gradients, aggregation, perturbation analysis, POMDPs, policy iteration
17Jooyoung Park, Jongho Kim, Daesung Kang An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. Search on Bibsonomy CIS (1) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
14Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Sang-Ho Hyon, Joshua G. Hale, Gordon Cheng Learning to acquire whole-body humanoid CoM movements to achieve dynamic tasks. Search on Bibsonomy ICRA The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
14Daniel Schneegaß, Steffen Udluft, Thomas Martinetz Improving Optimality of Neural Rewards Regression for Data-Efficient Batch Near-Optimal Policy Identification. Search on Bibsonomy ICANN (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
12Xu Li, Yuehui Ji, Yu Song 0004, Junjie Liu, Qiang Gao Modified deep deterministic policy gradient based on active disturbance rejection control for hypersonic vehicles. Search on Bibsonomy Neural Comput. Appl. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Bo Lyu, Yin Yang 0001, Yuting Cao, Pengcheng Wang, Jian Zhu, Jingfei Chang, Shiping Wen 0001 Efficient multi-objective neural architecture search framework via policy gradient algorithm. Search on Bibsonomy Inf. Sci. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Amirhossein Dolatabadi, Hussein Hassan Abdeltawab, Yasser Abdel-Rady I. Mohamed SFNAS-DDPG: A Biomass-Based Energy Hub Dynamic Scheduling Approach via Connecting Supervised Federated Neural Architecture Search and Deep Deterministic Policy Gradient. Search on Bibsonomy IEEE Access The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Haowei Shi, Jiadao Zou, Qingxue Zhang Efficient Massive-Device Orchestration Through Reinforcement Learning With Boosted Deep Deterministic Policy Gradient. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Pengcheng Dai, Wenwu Yu, He Wang 0006, Jiahui Jiang Applications in Traffic Signal Control: A Distributed Policy Gradient Decomposition Algorithm. Search on Bibsonomy IEEE Trans. Ind. Informatics The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Hao Zhang 0008, Yan Li, Zhuping Wang, Yi Ding, Huaicheng Yan 0001 Distributed Optimal Control of Nonlinear System Based on Policy Gradient With External Disturbance. Search on Bibsonomy IEEE Trans. Netw. Sci. Eng. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Haofei Li, Chen Chen 0006, Hangguan Shan, Pu Li, Yoong Choon Chang, Houbing Song Deep Deterministic Policy Gradient-Based Algorithm for Computation Offloading in IoV. Search on Bibsonomy IEEE Trans. Intell. Transp. Syst. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
12Shokichi Takakura, Kazuhiro Sato Structured Output Feedback Control for Linear Quadratic Regulator Using Policy Gradient Method. Search on Bibsonomy IEEE Trans. Autom. Control. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 1107 (100 per page; Change: )
Pages: [1][2][3][4][5][6][7][8][9][10][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license