Results
Found 196 publication records. Showing all 196, according to the current facet selection.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
73 | Alasdair Macleod |
Game design through self-play experiments. |
Advances in Computer Entertainment Technology |
2005 |
DBLP DOI BibTeX RDF |
perudo, self-play experiments, reinforcement learning, computer games, game design |
57 | Ernst A. Heinz |
New Self-Play Results in Computer Chess. |
Computers and Games |
2000 |
DBLP DOI BibTeX RDF |
diminishing returns, search vs. knowledge, self-play |
52 | Thomas Philip Runarsson, Simon M. Lucas |
Coevolution versus self-play temporal difference learning for acquiring position evaluation in small-board go. |
IEEE Trans. Evol. Comput. |
2005 |
DBLP DOI BibTeX RDF |
|
26 | Vincent Conitzer, Tuomas Sandholm |
AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. |
Mach. Learn. |
2007 |
DBLP DOI BibTeX RDF |
Learning in games, Game theory, Nash equilibrium |
24 | Bikramjit Banerjee, Jing Peng |
Unifying Convergence and No-Regret in Multiagent Learning. |
LAMAS |
2005 |
DBLP DOI BibTeX RDF |
|
22 | Hongyi Guo, Zhihan Liu, Yufeng Zhang, Zhaoran Wang 0001 |
Can Large Language Models Play Games? A Case Study of A Self-Play Approach. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
22 | Chin-Wing Leung, Shuyue Hu, Ho-fung Leung |
Self-Play or Group Practice: Learning to Play Alternating Markov Game in Multi-Agent System. |
ICPR |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Marco A. Wiering |
Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning. |
J. Intell. Learn. Syst. Appl. |
2010 |
DBLP DOI BibTeX RDF |
|
19 | Andriy Burkov, Brahim Chaib-draa |
Anytime Self-play Learning to Satisfy Functional Optimality Criteria. |
ADT |
2009 |
DBLP DOI BibTeX RDF |
|
19 | Ari Shapiro, Gil Fuchs, Robert Levinson |
Learning a Game Strategy Using Pattern-Weights and Self-play. |
Computers and Games |
2002 |
DBLP DOI BibTeX RDF |
|
18 | Bikramjit Banerjee, Jing Peng |
Generalized multiagent learning with performance bound. |
Auton. Agents Multi Agent Syst. |
2007 |
DBLP DOI BibTeX RDF |
Multiagent reinforcement learning, Game theory |
17 | Francisco S. Melo, Manuel C. Lopes |
Convergence of Independent Adaptive Learners. |
EPIA Workshops |
2007 |
DBLP DOI BibTeX RDF |
|
17 | Graham E. Farr, David R. Powell |
Unsupervised Learning in Metagame. |
Australian Joint Conference on Artificial Intelligence |
1999 |
DBLP DOI BibTeX RDF |
|
17 | Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo 0002, Xiaolong Wang |
Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Doran Chakraborty, Peter Stone |
Online Multiagent Learning against Memory Bounded Adversaries. |
ECML/PKDD (1) |
2008 |
DBLP DOI BibTeX RDF |
|
13 | Andriy Burkov, Abdeslam Boularias, Brahim Chaib-draa |
Competition and Coordination in Stochastic Games. |
Canadian AI |
2007 |
DBLP DOI BibTeX RDF |
|
13 | Qian Luo, Tien-Ping Tan, Yi Su, Zhanggen Jin |
MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network. |
IEEE Trans. Games |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Daphne Cornelisse, Eugene Vinitsky |
Human-compatible driving partners through data-regularized self-play reinforcement learning. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Jake Levi, Chris Lu 0001, Timon Willi, Christian Schröder de Witt, Jakob N. Foerster |
The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Xiaoxi Wang |
Balancing the AI Strength of Roles in Self-Play Training with Regret Matching+. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu |
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu |
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Dan Qiao, Yu-Xiang Wang |
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Jingxiao Chen, Weiji Xie, Weinan Zhang 0001, Yong Yu 0001, Ying Wen 0001 |
Offline Fictitious Self-Play for Competitive Games. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao |
Learning Diverse Risk Preferences in Population-Based Self-Play. |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Upasana Biswas, Lin Guan, Subbarao Kambhampati |
On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. |
HRI (Companion) |
2024 |
DBLP DOI BibTeX RDF |
|
13 | Daniel Bairamian, Philippe Marcotte, Joshua Romoff, Gabriel Robert, Derek Nowrouzezahrai |
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play. |
AAMAS |
2024 |
DBLP BibTeX RDF |
|
13 | Qi Wang 0044, Yuqing He, Chunlei Tang |
Mastering construction heuristics with self-play deep reinforcement learning. |
Neural Comput. Appl. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Dennis J. N. J. Soemers, Spyridon Samothrakis, Éric Piette, Matthew Stephenson |
Extracting tactics learned from self-play in general games. |
Inf. Sci. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Xiaoshu Guan, Huabin Sun, Rongrong Hou, Yang Xu, Yuequan Bao, Hui Li |
A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy. |
Reliab. Eng. Syst. Saf. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Bo Li 0004, Jingyi Huang, Shuangxia Bai, Zhigang Gan, Shiyang Liang, Evgeny Sergeevich Neretin, Shouwen Yao |
Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning. |
CAAI Trans. Intell. Technol. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Weiren Kong, Deyun Zhou, Ying Zhou, Yiyang Zhao |
Hierarchical reinforcement learning from competitive self-play for dual-aircraft formation air combat. |
J. Comput. Des. Eng. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yuxuan Chen, Li Zhang 0045, Shijian Li, Xili Chen, Gang Pan 0001, Zhijie Pan |
RM-FSP: Regret minimization optimizes neural fictitious self-play. |
Neurocomputing |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yuan Gao, Junfeng Chen, Xi Chen 0051, Chongyang Wang, Junjie Hu 0003, Fuqin Deng, Tin Lun Lam |
Asymmetric Self-Play-Enabled Intelligent Heterogeneous Multirobot Catching System Using Deep Multiagent Reinforcement Learning. |
IEEE Trans. Robotics |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yi Wang, Hui Tang, Lichao Huang, Lulu Pan, Lixiang Yang, Huanming Yang, Feng Mu, Meng Yang |
Self-play reinforcement learning guides protein engineering. |
Nat. Mac. Intell. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yi Wang, Hui Tang, Lichao Huang, Lulu Pan, Lixiang Yang, Huanming Yang, Feng Mu, Meng Yang |
Author Correction: Self-play reinforcement learning guides protein engineering. |
Nat. Mac. Intell. |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li 0002, Yiqin Yang, Jun Yang 0028, Bin Liang 0001, Qianchuan Zhao |
Learning Diverse Risk Preferences in Population-based Self-play. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Revan MacQueen, James R. Wright |
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Sichao Xiong, Yigit Ihlamur |
Founder-GPT: Self-play to evaluate the Founder-Idea fit. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Daniel Bairamian, Philippe Marcotte, Joshua Romoff, Gabriel Robert, Derek Nowrouzezahrai |
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jeremiah Zhe Liu, Krishnamurthy (Dj) Dvijotham, Jihyeon Lee, Quan Yuan, Martin Strobel 0001, Balaji Lakshminarayanan, Deepak Ramachandran |
Pushing the Accuracy-Group Robustness Frontier with Introspective Self-play. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu |
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Ahmet Semih Tasbas, Safa Onur Sahin, Nazim Kemal Ure |
Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yao Fu, Hao Peng 0018, Tushar Khot, Mirella Lapata |
Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Dexun Li, Wenjun Li, Pradeep Varakantham |
Diversity Induced Environment Design via Self-Play. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Huan Rong, Victor S. Sheng, Tinghuai Ma, Yang Zhou 0001, Mznah Al-Rodhaan |
A Self-play and Sentiment-Emphasized Comment Integration Framework Based on Deep Q-Learning in a Crowdsourcing Scenario : Extended Abstract. |
ICDE |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Poorna Syama Sundar, Manjunath Vasam, Ajin George Joseph |
Monotonic Model Improvement Self-Play Algorithm for Adversarial Games. |
CDC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Shaoqin He, Yang Gao, Baofeng Zhang, Hui Chang, Xinchen Zhang |
Advancing Air Combat Tactics with Improved Neural Fictitious Self-play Reinforcement Learning. |
ICIC (5) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yngvi Björnsson, Róbert Leó Þormar Jónsson, Sigurjón Ingi Jónsson |
Expediting Self-Play Learning in AlphaZero-Style Game-Playing Agents. |
ECAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Chaohao Hu, Yunlong Cai, Weidong Li, Hongfei Li |
Fictitious Self-Play Reinforcement Learning with Expanding Value Estimation. |
RICAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Revan MacQueen, James R. Wright |
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Cristina Cutajar, Josef Bajada |
Mastering the Card Game of Jaipur Through Zero-Knowledge Self-Play Reinforcement Learning and Action Masks. |
AI*IA |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Shohei Ohsawa |
Truthful Self-Play. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Jeremiah Zhe Liu, Krishnamurthy (Dj) Dvijotham, Jihyeon Lee, Quan Yuan, Balaji Lakshminarayanan, Deepak Ramachandran |
Pushing the Accuracy-Group Robustness Frontier with Introspective Self-play. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu |
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play. |
AAMAS |
2023 |
DBLP BibTeX RDF |
|
13 | Chaitanya Kharyal, Tanmay Kumar Sinha, Sai Krishna Gottipati, Fatemeh Abdollahi, Srijita Das 0001, Matthew E. Taylor |
Do As You Teach: A Multi-Teacher Approach to Self-Play in Deep Reinforcement Learning. |
AAMAS |
2023 |
DBLP BibTeX RDF |
|
13 | Masahiro Shioda, Takeshi Ito |
Improving Mini-Shogi Engine Using Self-play and Possibility of White's Advantage. |
J. Inf. Sci. Eng. |
2022 |
DBLP BibTeX RDF |
|
13 | Xiaoyang Wang 0005, Jonathan D. Thomas, Robert J. Piechocki, Shipra Kapoor, Raúl Santos-Rodríguez, Arjun Parekh |
Self-play learning strategies for resource assignment in Open-RAN networks. |
Comput. Networks |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Kang Li, Bo Jiu, Wenqiang Pu, Hongwei Liu 0001, Xiaojun Peng |
Neural Fictitious Self-Play for Radar Antijamming Dynamic Game With Imperfect Information. |
IEEE Trans. Aerosp. Electron. Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Bruno Brandão, Telma Woerle de Lima, Anderson Soares, Luckeciano C. Melo, Marcos R. O. A. Máximo |
Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play. |
IEEE Access |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Daniel Hernández 0008, Kevin Denamganaï, Sam Devlin, Spyridon Samothrakis, James Alfred Walker |
A Comparison of Self-Play Algorithms Under a Generalized Framework. |
IEEE Trans. Games |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Huan Rong, Victor S. Sheng, Tinghuai Ma, Yang Zhou 0001, Mznah Al-Rodhaan |
A Self-Play and Sentiment-Emphasized Comment Integration Framework Based on Deep Q-Learning in a Crowdsourcing Scenario. |
IEEE Trans. Knowl. Data Eng. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Mycal Tucker, Yilun Zhou, Julie A. Shah |
Latent Space Alignment Using Adversarially Guided Self-Play. |
Int. J. Hum. Comput. Interact. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Tianchi Huang, Rui-Xiao Zhang, Lifeng Sun |
Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services. |
IEEE Trans. Multim. |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Stephen McAleer, John B. Lanier, Kevin A. Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm |
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Wei Xiong 0015, Han Zhong 0001, Chengshuai Shi, Cong Shen 0001, Tong Zhang 0001 |
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Qi Liu 0049, Zihuiwen Ye, Tao Yu, Phil Blunsom, Linfeng Song |
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Licheng Wu, Qifei Wu, Hongming Zhong, Xiali Li |
Mastering "Gongzhu" with Self-play Deep Reinforcement Learning. |
ICCSIP |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Oren Neumann, Claudius Gros |
Size Scaling in Self-Play Reinforcement Learning. |
ESANN |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Yanran Xu, Kangxin He, Shu Hu, Hui Li |
A reinforcement learning framework based on regret minimization for approximating best response in fictitious self-play. |
HPCC/DSS/SmartCity/DependSys |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Stephen Obonyo, Nicolas Jouandeau, Dickson Owuor |
Designing RNA Sequences by Self-play. |
IJCCI |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Evgeny Kusmenko, Maximilian Münker, Matthias Nadenau, Bernhard Rumpe |
A Model-Driven Generative Self Play-Based Toolchain for Developing Games and Players. |
GPCE |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Qi Liu 0049, Zihuiwen Ye, Tao Yu, Linfeng Song, Phil Blunsom |
Augmenting Multi-Turn Text-to-SQL Datasets with Self-Play. |
EMNLP (Findings) |
2022 |
DBLP DOI BibTeX RDF |
|
13 | Yuqing Du, Pieter Abbeel, Aditya Grover |
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
13 | Wei Xiong 0015, Han Zhong 0001, Chengshuai Shi, Cong Shen 0001, Tong Zhang 0001 |
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. |
ICML |
2022 |
DBLP BibTeX RDF |
|
13 | Jan-Alexander Posth, Piotr Kotlarz, Branka Hadji Misheva, Jörg Osterrieder, Peter Schwendner |
The Applicability of Self-Play Algorithms to Trading and Forecasting Financial Markets. |
Frontiers Artif. Intell. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Huitao Wang, Ruopeng Yang, Changsheng Yin, Xiaofei Zou, Xuefeng Wang |
Research on the Difficulty of Mobile Node Deployment's Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning. |
Wirel. Commun. Mob. Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Shanqi Liu, Junjie Cao, Yujie Wang, Wenzhou Chen, Yong Liu 0007 |
Self-play reinforcement learning with comprehensive critic in computer games. |
Neurocomputing |
2021 |
DBLP DOI BibTeX RDF |
|
13 | OpenAI, Matthias Plappert, Raul Sampedro, Tao Xu, Ilge Akkaya, Vineet Kosaraju, Peter Welinder, Ruben D'Sa, Arthur Petron, Henrique Pondé de Oliveira Pinto, Alex Paino, Hyeonwoo Noh, Lilian Weng, Qiming Yuan, Casey Chu, Wojciech Zaremba |
Asymmetric self-play for automatic goal discovery in robotic manipulation. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Weizhe Chen 0001, Zihan Zhou 0002, Yi Wu 0013, Fei Fang 0001 |
Temporal Induced Self-Play for Stochastic Bayesian Games. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Arkady Arkhangorodsky, Scot Fang, Victoria Knight, Ajay Nagesh, Maria Ryskina, Kevin Knight |
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Xiaoyang Wang 0005, Jonathan D. Thomas, Robert J. Piechocki, Shipra Kapoor, Raúl Santos-Rodríguez, Arjun Parekh |
Self-play Learning Strategies for Resource Assignment in Open-RAN Networks. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Daochen Zha, Jingru Xie, Wenye Ma, Sheng Zhang, Xiangru Lian, Xia Hu 0001, Ji Liu |
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Shohei Ohsawa |
Unbiased Self-Play. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Anthony DiGiovanni, Ethan C. Zell |
Survey of Self-Play in Reinforcement Learning. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Yuxuan Chen, Li Zhang 0045, Shijian Li, Gang Pan 0001 |
Optimize Neural Fictitious Self-Play in Regret Minimization Thinking. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Wanqi Xue, Youzhi Zhang 0001, Shuxin Li, Xinrun Wang, Bo An 0001, Chai Kiat Yeo |
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Luis Perez |
Mastering Terra Mystica: Applying Self-Play to Multi-agent Cooperative Board Games. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Panagiotis Tigas, Tyson Hosmer |
Spatial Assembly: Generative Architecture With Reinforcement Learning, Self Play and Tree Search. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
13 | Qi Wang 0044, Yongsheng Hao, Jie Cao 0011 |
Learning to traverse over graphs with a Monte Carlo tree search-based self-play framework. |
Eng. Appl. Artif. Intell. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Zhixiao Sun, Haiyin Piao, Zhen Yang, Yiyang Zhao, Guang Zhan, Deyun Zhou, Guanglei Meng, Hechang Chen, Xing Chen, Bohao Qu, Yuanjie Lu |
Multi-agent hierarchical policy gradient for Air Combat Tactics emergence via self-play. |
Eng. Appl. Artif. Intell. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Li Zhang 0045, Yuxuan Chen, Wei Wang, Ziliang Han, Shijian Li, Zhijie Pan, Gang Pan 0001 |
A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games. |
Frontiers Comput. Sci. |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Michael Groth, Pascal Freier, Matthias Schumann |
Using Self-Play within Deep Q Learning to improve real-time Production Scheduling. |
AMCIS |
2021 |
DBLP BibTeX RDF |
|
13 | Jun Ma 0034, Shunyi Yao, Guangda Chen, Jiakai Song, Jianmin Ji |
Distributed Reinforcement Learning with Self-Play in Parameterized Action Space. |
SMC |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Dogukan Arslan, Gülsen Eryigit |
Evaluation of Wizard-of-Oz and Self-Play Data Collection Techniques for Turkish Goal-Oriented Dialogue Agents. |
INISTA |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Pankaj Khanchandani, Oliver Richter, Lukas Rusch, Roger Wattenhofer |
Learning Algorithms with Self-Play: A New Approach to the Distributed Directory Problem. |
ICTAI |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Qinghua Liu, Tiancheng Yu, Yu Bai 0017, Chi Jin 0001 |
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play. |
ICML |
2021 |
DBLP BibTeX RDF |
|
13 | Daochen Zha, Jingru Xie, Wenye Ma, Sheng Zhang, Xiangru Lian, Xia Hu 0001, Ji Liu |
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning. |
ICML |
2021 |
DBLP BibTeX RDF |
|
13 | Wanqi Xue, Youzhi Zhang 0001, Shuxin Li, Xinrun Wang, Bo An 0001, Chai Kiat Yeo |
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play. |
IJCAI |
2021 |
DBLP DOI BibTeX RDF |
|
13 | Weizhe Chen 0001, Zihan Zhou 0002, Yi Wu 0013, Fei Fang 0001 |
Temporal Induced Self-Play for Stochastic Bayesian Games. |
IJCAI |
2021 |
DBLP DOI BibTeX RDF |
|
Displaying results #1 - #100 of 196 (100 per page).