Results
Found 35 publication records. Showing all 35 under the current facet selection.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
163 | Gerhard Weiß 0001 |
A Multiagent Variant of Dyna-Q. |
ICMAS ![In: 4th International Conference on Multi-Agent Systems, ICMAS 2000, Boston, MA, USA, July 10-12, 2000, pp. 461-462, 2000, IEEE Computer Society, 0-7695-0625-9. The full citation details ...](Pics/full.jpeg) |
2000 |
DBLP DOI BibTeX RDF |
|
160 | Gerhard Weiß 0001 |
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning. |
ATAL ![In: Intelligent Agents VII. Agent Theories Architectures and Languages, 7th International Workshop, ATAL 2000, Boston, MA, USA, July 7-9, 2000, Proceedings, pp. 320-330, 2000, Springer, 3-540-42422-9. The full citation details ...](Pics/full.jpeg) |
2000 |
DBLP DOI BibTeX RDF |
|
96 | Roman Zajdel |
Epoch-Incremental Queue-Dyna Algorithm. |
ICAISC ![In: Artificial Intelligence and Soft Computing - ICAISC 2008, 9th International Conference, Zakopane, Poland, June 22-26, 2008, Proceedings, pp. 1160-1170, 2008, Springer, 978-3-540-69572-1. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Dyna-Q, prioritized sweeping, reinforcement learning |
46 | Tarek Faycal, Claudio Zito |
Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees. |
CoRR ![In: CoRR abs/2201.04502, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
23 | Xuecheng Niu, Akinori Ito, Takashi Nose |
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. |
IEEE Access ![In: IEEE Access 12, pp. 46940-46952, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Xuecheng Niu, Akinori Ito, Takashi Nose |
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. |
CoRR ![In: CoRR abs/2402.00085, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Silvia Del Giorno, Federico D'Antoni, Vincenzo Piemonte, Mario Merone |
A New Glycemic closed-loop control based on Dyna-Q for Type-1-Diabetes. |
Biomed. Signal Process. Control. ![In: Biomed. Signal Process. Control. 81, pp. 104492, March 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Cheng Fan 0003, Zhaohui Wang, Kaichen Yang |
Energy-efficient underwater acoustic communication based on Dyna-Q with an adaptive action space. |
Phys. Commun. ![In: Phys. Commun. 61, pp. 102218, December 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Yubin Fu, Xiaochuan Ma, Chao Feng, Xingxuan Pei, Pengzhuo Li |
Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar. |
EURASIP J. Adv. Signal Process. ![In: EURASIP J. Adv. Signal Process. 2023(1), pp. 116, December 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Muleilan Pei, Hao An, Bo Liu 0066, Changhong Wang |
An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment. |
IEEE Trans. Syst. Man Cybern. Syst. ![In: IEEE Trans. Syst. Man Cybern. Syst. 52(7), pp. 4415-4425, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Marcos Maroto-Gómez, Rodrigo González, Álvaro Castro González, María Malfaz, Miguel Ángel Salichs |
Speeding-Up Action Learning in a Social Robot With Dyna-Q+: A Bioinspired Probabilistic Model Approach. |
IEEE Access ![In: IEEE Access 9, pp. 98381-98397, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Rui Zhang 0046, Zhenyu Wang 0001, Mengdan Zheng, Yangyang Zhao, Zhenhua Huang 0002 |
Emotion-sensitive deep dyna-Q learning for task-completion dialogue policy learning. |
Neurocomputing ![In: Neurocomputing 459, pp. 122-130, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Yuan Chai, Xiao-Jun Zeng |
A multi-objective Dyna-Q based routing in wireless mesh network. |
Appl. Soft Comput. ![In: Appl. Soft Comput. 108, pp. 107486, 2021. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Guanlin Wu, Wenqi Fang, Ji Wang 0002, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu 0001, Zheng Wang |
Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning. |
ACL/IJCNLP (Findings) ![In: Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, Online Event, August 1-6, 2021, pp. 1786-1795, 2021, Association for Computational Linguistics, 978-1-954085-54-1. The full citation details ...](Pics/full.jpeg) |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Fan Wang, Jie Gao 0002, Mushu Li, Lian Zhao |
Autonomous PEV Charging Scheduling Using Dyna-Q Reinforcement Learning. |
IEEE Trans. Veh. Technol. ![In: IEEE Trans. Veh. Technol. 69(11), pp. 12609-12620, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Yohei Hayamizu, Saeid Amiri, Kishan Chandan, Shiqi Zhang 0001, Keiki Takadama |
Guided Dyna-Q for Mobile Robot Exploration and Navigation. |
CoRR ![In: CoRR abs/2004.11456, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
23 | Yangyang Zhao, Zhenyu Wang 0001, Kai Yin, Rui Zhang 0046, Zhenhua Huang 0002, Pei Wang |
Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments. |
AAAI ![In: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020., pp. 9676-9684, 2020, AAAI Press, 978-1-57735-823-7. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Lixin Zou, Long Xia, Pan Du 0001, Zhuo Zhang, Ting Bai, Weidong Liu 0001, Jian-Yun Nie, Dawei Yin |
Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation. |
WSDM ![In: WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, Houston, TX, USA, February 3-7, 2020, pp. 816-824, 2020, ACM, 978-1-4503-6822-3. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Zhiwei Li 0003, Yu Lu, Yun Shi, Zengguang Wang, Wenxin Qiao, Yicen Liu |
A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks. |
Symmetry ![In: Symmetry 11(5), pp. 617, 2019. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang |
Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. |
AAAI ![In: The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019., pp. 7289-7296, 2019, AAAI Press, 978-1-57735-809-1. The full citation details ...](Pics/full.jpeg) |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Haobin Shi, Shike Yang, Kao-Shing Hwang, Jialin Chen, Mengkai Hu, Heng-sheng Zhang |
A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning. |
IEEE Access ![In: IEEE Access 6, pp. 37173-37184, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Iris Hwang |
Model Learning for Multistep Backward Prediction in Dyna-Q Learning. |
IEEE Trans. Syst. Man Cybern. Syst. ![In: IEEE Trans. Syst. Man Cybern. Syst. 48(9), pp. 1470-1481, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen |
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. |
CoRR ![In: CoRR abs/1808.09442, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang |
Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. |
CoRR ![In: CoRR abs/1811.07550, 2018. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
23 | Emanuele Vitolo, Alberto San Miguel, Javier Civera 0001, Cristian Mahulea |
Performance Evaluation of the Dyna-Q algorithm for Robot Navigation. |
CASE ![In: 14th IEEE International Conference on Automation Science and Engineering, CASE 2018, Munich, Germany, August 20-24, 2018, pp. 322-327, 2018, IEEE, 978-1-5386-3593-3. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen |
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. |
EMNLP ![In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, pp. 3813-3823, 2018, Association for Computational Linguistics, 978-1-948087-84-1. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Baolin Peng, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Kam-Fai Wong |
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. |
ACL (1) ![In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15-20, 2018, Volume 1: Long Papers, pp. 2182-2192, 2018, Association for Computational Linguistics, 978-1-948087-32-2. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Pheromone-Based Planning Strategies in Dyna-Q Learning. |
IEEE Trans. Ind. Informatics ![In: IEEE Trans. Ind. Informatics 13(2), pp. 424-435, 2017. The full citation details ...](Pics/full.jpeg) |
2017 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning. |
IEEE Trans. Cybern. ![In: IEEE Trans. Cybern. 45(5), pp. 964-976, 2015. The full citation details ...](Pics/full.jpeg) |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Yi-Jia Tseng, Kao-Shing Hwang, Wei-Cheng Jiang, Tsung-Chuan Huang, Song-Shyong Chen |
An Improved Dyna-Q Algorithm Based in Reverse Model Learning. |
ICSSE ![In: New Trends on System Sciences and Engineering - Proceedings of ICSSE 2015 [International Conference on System Science and Engineering, Morioka, Japan, July 6-8 2015], pp. 200-212, 2015, IOS Press, 978-1-61499-521-0. The full citation details ...](Pics/full.jpeg) |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Shao-Ming Hung, Sidney Nascimento Givigi, Aboelmagd Noureldin |
A Dyna-Q (Lambda) Approach to Flocking with Fixed-Wing UAVs in a Stochastic Environment. |
SMC ![In: 2015 IEEE International Conference on Systems, Man, and Cybernetics, Kowloon Tong, Hong Kong, October 9-12, 2015, pp. 1918-1923, 2015, IEEE, 978-1-4799-8697-2. The full citation details ...](Pics/full.jpeg) |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Hoang Huu Viet, Sang Hyeok An, TaeChoong Chung |
Dyna-Q-based vector direction for path planning problem of autonomous mobile robots in unknown environments. |
Adv. Robotics ![In: Adv. Robotics 27(3), pp. 159-173, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Adaptive Model Learning Based on Dyna-Q Learning. |
Cybern. Syst. ![In: Cybern. Syst. 44(8), pp. 641-662, 2013. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Wei-Han Wang |
Model-Based Indirect Learning Method Based on Dyna-Q Architecture. |
SMC ![In: IEEE International Conference on Systems, Man, and Cybernetics, Manchester, SMC 2013, United Kingdom, October 13-16, 2013, pp. 2540-2544, 2013, IEEE, 978-0-7695-5154-8. The full citation details ...](Pics/full.jpeg) |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Arthur Plínio de S. Braga, Aluízio F. R. Araújo |
A topological reinforcement learning agent for navigation. |
Neural Comput. Appl. ![In: Neural Comput. Appl. 12(3-4), pp. 220-236, 2003. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
Latent learning, Neural networks, Navigation, Reinforcement learning, Topological maps |