The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for bandits with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1993-2001 (15) 2003-2007 (25) 2008 (19) 2009 (26) 2010 (19) 2011 (43) 2012 (43) 2013 (59) 2014 (76) 2015 (104) 2016 (114) 2017 (149) 2018 (203) 2019 (274) 2020 (432) 2021 (492) 2022 (532) 2023 (505) 2024 (133)
Publication types (Num. hits)
article(1723) book(2) incollection(1) inproceedings(1508) phdthesis(29)
Venues (Conferences, Journals, ...)
CoRR(1423) NeurIPS(216) ICML(191) AISTATS(141) AAAI(97) COLT(83) NIPS(55) UAI(44) IJCAI(43) AAMAS(34) ALT(33) J. Mach. Learn. Res.(24) CDC(22) ISIT(22) ICLR(16) RecSys(16) More (+10 of total 370)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 50 occurrences of 39 keywords

Results
Found 3263 publication records. Showing 3263 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
89José Niño-Mora Characterization and computation of restless bandit marginal productivity indices. Search on Bibsonomy VALUETOOLS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF marginal productivity index, restless bandits, Markov decision processes, block algorithms, index policies
57José Niño-Mora Computing an index policy for bandits with switching penalties. Search on Bibsonomy VALUETOOLS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF bandits, restless, switching delays, Markov decision processes, switching costs, index policies
46José Niño-Mora Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue. Search on Bibsonomy Queueing Syst. Theory Appl. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Multiclass queue, Multi-queue switch, Delay-sensitive, Loss-sensitive, Restless bandits, Work-cost analysis, Index policies, Bias optimality, Scheduling, Conservation laws, Finite buffers
44Nicolas Galichet Contributions to Multi-Armed Bandits: Risk-Awareness and Sub-Sampling for Linear Contextual Bandits. (Contributions aux bandits manchots : gestion du risque et sous-échantillonnage pour les bandits contextuels linéaires). Search on Bibsonomy 2015   RDF
33Louis Faury Variance-sensitive confidence intervals for parametric and offline bandits. (Intervalles de confiance sensibles à la variance : Applications aux bandits paramétriques et bandits hors ligne). Search on Bibsonomy 2021   RDF
32Jan Poland FPL Analysis for Adaptive Bandits. Search on Bibsonomy SAGA The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
32Tadhg O'Meara, Ahmed Patel A Topic-Specific Web Robot Model Based on Restless Bandits. Search on Bibsonomy IEEE Internet Comput. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
25Álvaro Fialho, Marc Schoenauer, Michèle Sebag Fitness-AUC bandit adaptive strategy selection vs. the probability matching one within differential evolution: an empirical comparison on the bbob-2010 noiseless testbed. Search on Bibsonomy GECCO (Companion) The full citation details ... 2010 DBLP  DOI  BibTeX  RDF adaptive strategy selection, comparison-based, roc area under curve, benchmarking, black-box optimization, multi-armed bandits
25Álvaro Fialho, Marc Schoenauer, Michèle Sebag Toward comparison-based adaptive operator selection. Search on Bibsonomy GECCO The full citation details ... 2010 DBLP  DOI  BibTeX  RDF ROC area under curve, adaptive operator selection, parameter control, multi-armed bandits
25Moshe Babaioff, Robert D. Kleinberg, Aleksandrs Slivkins Truthful mechanisms with implicit payment computation. Search on Bibsonomy EC The full citation details ... 2010 DBLP  DOI  BibTeX  RDF single-parameter mechanism design, truthful auctions, multi-armed bandits
25Alina Beygelzimer, John Langford 0001 The offset tree for learning with partial labels. Search on Bibsonomy KDD The full citation details ... 2009 DBLP  DOI  BibTeX  RDF associative reinforcement learning, contextual bandits, interactive learning
25Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivkins Characterizing truthful multi-armed bandit mechanisms: extended abstract. Search on Bibsonomy EC The full citation details ... 2009 DBLP  DOI  BibTeX  RDF single-parameter auctions, mechanism design, online learning, multi-armed bandits, truthful mechanisms
25Dimitris E. Koulouriotis, A. S. Xanthopoulos A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems. Search on Bibsonomy Oper. Res. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Exploitation-exploration trade-off, Evolutionary algorithm, Multi-armed bandits, Heuristic techniques
25Paat Rusmevichientong, David P. Williamson An adaptive algorithm for selecting profitable keywords for search-based advertising services. Search on Bibsonomy EC The full citation details ... 2006 DBLP  DOI  BibTeX  RDF search-based advertising, adaptive algorithms, online optimization, multi-armed bandits
25Dimitris Bertsimas The achievable region method in the optimal control of queueing systems; formulations, bounds and policies. Search on Bibsonomy Queueing Syst. Theory Appl. The full citation details ... 1995 DBLP  DOI  BibTeX  RDF multiarmed bandits, optimization, policies, Queueing networks, bounds, loss networks
22Arnab Maiti, Ross Boczar, Kevin G. Jamieson, Lillian J. Ratliff Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits. Search on Bibsonomy AISTATS The full citation details ... 2024 DBLP  BibTeX  RDF
22Arnab Maiti, Ross Boczar, Kevin G. Jamieson, Lillian J. Ratliff Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
22Dirk van der Hoeven, Lukas Zierahn, Tal Lancewicki, Aviv Rosenberg 0002, Nicolò Cesa-Bianchi A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
22Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama Asymptotically Optimal Thompson Sampling Based Policy for the Uniform Bandits and the Gaussian Bandits. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
22Zongqi Wan, Zhijie Zhang, Tongyang Li, Jialin Zhang 0001, Xiaoming Sun 0001 Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
22Dirk van der Hoeven, Lukas Zierahn, Tal Lancewicki, Aviv Rosenberg 0002, Nicolò Cesa-Bianchi A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs. Search on Bibsonomy COLT The full citation details ... 2023 DBLP  BibTeX  RDF
22Neetu Singh, Sandeep Kumar Singh 0001 An Empirical Assessment of the Performance of Multi-Armed Bandits and Contextual Multi-Armed Bandits in Handling Cold-Start Bugs. Search on Bibsonomy IC3 The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
22Kimang Khun Algorithms for Markovian bandits: Indexability and Learning. (Des algorithmes pour les bandits markoviens: indexabilité et apprentissage). Search on Bibsonomy 2023   RDF
22Haipeng Luo, Mengxiao Zhang, Peng Zhao 0006, Zhi-Hua Zhou Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
22Zongqi Wan, Zhijie Zhang, Tongyang Li, Jialin Zhang 0001, Xiaoming Sun 0001 Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
22Haipeng Luo, Mengxiao Zhang, Peng Zhao 0006, Zhi-Hua Zhou Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits. Search on Bibsonomy COLT The full citation details ... 2022 DBLP  BibTeX  RDF
22Camille-Sovanneary Gauthier List recommendations with multi-armed bandits. (Recommandation de listes d'items par bandits manchots). Search on Bibsonomy 2022   RDF
22Hiba Dakdouk Massive multi-player multi-armed bandits for internet of things networks. (Bandits massifs multi-bras multi-joueurs pour les réseaux de l'internet des objets). Search on Bibsonomy 2022   RDF
22Geovani Rizk Stochastic Graphical Bilinear Bandits. (Bandits Bilinéaires Graphiques Stochastiques). Search on Bibsonomy 2022   RDF
22Dorian Baudry Non-Parametric Algorithms for Multi-Armed Bandits. (Algorithmes Non-Paramétriques de Bandits Multi-Bras). Search on Bibsonomy 2022   RDF
22Chen Yan 0002 Close-to-opimal policies for Markovian bandits. (Politiques quasi-optimales de bandits Markoviens). Search on Bibsonomy 2022   RDF
22Chen Yan Close-to-opimal policies for Markovian bandits. (Politiques quasi-optimales de bandits Markoviens). Search on Bibsonomy 2022   RDF
22Julia Kreutzer, David Vilar, Artem Sokolov Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
22Julia Kreutzer, David Vilar, Artem Sokolov Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits. Search on Bibsonomy EMNLP (Findings) The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
22Shinji Ito Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits. Search on Bibsonomy NeurIPS The full citation details ... 2021 DBLP  BibTeX  RDF
22Saeed Masoudian, Yevgeny Seldin Improved Analysis of the Tsallis-INF Algorithm in Stochastically Constrained Adversarial Bandits and Stochastic Bandits with Adversarial Corruptions. Search on Bibsonomy COLT The full citation details ... 2021 DBLP  BibTeX  RDF
22Réda Alami Bandits à Mémoire pour la prise de décision en environnement dynamique. Application à l'optimisation des réseaux de télécommunications. (Memory Bandits for decision making in dynamical environments. Application to network optimization). Search on Bibsonomy 2021   RDF
22Baihan Lin, Guillermo A. Cecchi, Djallel Bouneffouf 0001, Jenna M. Reinen, Irina Rish Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL. Search on Bibsonomy CoRR The full citation details ... 2020 DBLP  BibTeX  RDF
22Baihan Lin, Guillermo A. Cecchi, Djallel Bouneffouf 0001, Jenna M. Reinen, Irina Rish Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL. Search on Bibsonomy HBAI@IJCAI The full citation details ... 2020 DBLP  DOI  BibTeX  RDF
22Pierre Perrault Efficient Learning in Stochastic Combinatorial Semi-Bandits. (Apprentissage Efficient dans les Problèmes de Semi-Bandits Stochastiques Combinatoires). Search on Bibsonomy 2020   RDF
22David Cortes Adapting multi-armed bandits policies to contextual bandits scenarios. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
22Aditya Gopalan, Prashanth L. A., Michael C. Fu 0001, Steven I. Marcus Weighted Bandits or: How Bandits Learn Distorted Values That Are Not Expected. Search on Bibsonomy AAAI The full citation details ... 2017 DBLP  DOI  BibTeX  RDF
22Pratik Gajane Bandits Multi-bras avec retour d'information non-conventionnelle. (Multi-Armed Bandits with Unconventional Feedback). Search on Bibsonomy 2017   RDF
22Aditya Gopalan, Prashanth L. A., Michael C. Fu 0001, Steven I. Marcus Weighted bandits or: How bandits learn distorted values that are not expected. Search on Bibsonomy CoRR The full citation details ... 2016 DBLP  BibTeX  RDF
22Robin Allesiardo Bandits Manchots sur Flux de Données Non Stationnaires. (Multi-armed bandits for non-stationary data streams). Search on Bibsonomy 2016   RDF
22Nir Ailon, Thorsten Joachims, Zohar Shay Karnin Reducing Dueling Bandits to Cardinal Bandits. Search on Bibsonomy CoRR The full citation details ... 2014 DBLP  BibTeX  RDF
22Aaron Segal, Bryan Ford, Joan Feigenbaum Catching Bandits and Only Bandits: Privacy-Preserving Intersection Warrants for Lawful Surveillance. Search on Bibsonomy FOCI The full citation details ... 2014 DBLP  BibTeX  RDF
22Nir Ailon, Zohar Shay Karnin, Thorsten Joachims Reducing Dueling Bandits to Cardinal Bandits. Search on Bibsonomy ICML The full citation details ... 2014 DBLP  BibTeX  RDF
22Yaqin Zhou, Xiang-Yang Li 0001 Multi-Armed Bandits With Combinatorial Strategies Under Stochastic Bandits. Search on Bibsonomy CoRR The full citation details ... 2013 DBLP  BibTeX  RDF
21Sudipto Guha, Kamesh Munagala, Peng Shi 0002 Approximation algorithms for restless bandit problems. Search on Bibsonomy SODA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
21Frédéric de Mesmay, Arpad Rimmel, Yevgen Voronenko, Markus Püschel Bandit-based optimization on graphs with application to library performance tuning. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
21Vivek Raghunathan, Vivek S. Borkar, Min Cao, P. R. Kumar 0001 Index Policies for Real-Time Multicast Scheduling for Wireless Broadcast Systems. Search on Bibsonomy INFOCOM The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
21José Niño-Mora Marginal Productivity Index Policies for Admission Control and Routing to Parallel Multi-server Loss Queues with Reneging. Search on Bibsonomy NET-COOP The full citation details ... 2007 DBLP  DOI  BibTeX  RDF loss queues, admission control, dynamic routing, multi-server, parallel queues, index policies
21Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarwal Multi-armed bandit problems with dependent arms. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
21Rick Neal No parking!: (and other library technology quandaries). Search on Bibsonomy SIGUCCS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF usage statistics, notification, parking
21Max-Olivier Hongler, Fabrice Dusonchet Optimal Stopping and Gittins' Indices for Piecewise Deterministic Evolution Processes. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF dynamic allocation of jobs, piecewise-deterministic processes, continuous time Gittins' indices, optimal stopping
21Michael Condict, Dejan S. Milojicic, Franklin Reynolds, Don Bolinger Towards a world-wide civilization of objects. Search on Bibsonomy ACM SIGOPS European Workshop The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
11Baihan Lin Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook. Search on Bibsonomy Expert Syst. Appl. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Xuchuang Wang, Hong Xie 0004, John C. S. Lui Analyzing Queueing Problems via Bandits With Linear Reward & Nonlinear Workload Fairness. Search on Bibsonomy IEEE Trans. Mob. Comput. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Qiyu Kang, Wee Peng Tay, Rui She, Sijie Wang, Xiaoqian Liu, Yuán-Ruì Yáng Multi-armed linear bandits with latent biases. Search on Bibsonomy Inf. Sci. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Zahraa Khais Shahid, Saguna Saguna, Christer Åhlund Multiarmed Bandits for Sleep Recognition of Elderly Living in Single-Resident Smart Homes. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Fengjiao Li, Xingyu Zhou 0001, Bo Ji 0001 Distributed Linear Bandits With Differential Privacy. Search on Bibsonomy IEEE Trans. Netw. Sci. Eng. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Hyun-Suk Lee, Do-Yup Kim, Kyungsik Min Universal Dynamic Pilot Allocation for Beam Alignment Based on Multi-Armed Bandits. Search on Bibsonomy IEEE Wirel. Commun. Lett. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Wenting Liu, Jinlong Lei, Peng Yi 0001, Yiguang Hong No-regret learning for repeated non-cooperative games with lossy bandits. Search on Bibsonomy Autom. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Rahul Singh 0001, Fang Liu 0020, Yin Sun, Ness B. Shroff Multi-armed bandits with dependent arms. Search on Bibsonomy Mach. Learn. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yingyan Zeng, Xiaoyu Chen, Ran Jin Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing. Search on Bibsonomy ACM Trans. Intell. Syst. Technol. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yuriy Dorn, Nikita Kornilov, Nikolay Kutuzov, Alexander Nazin, Eduard Gorbunov, Alexander V. Gasnikov Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits. Search on Bibsonomy Comput. Manag. Sci. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yingkai Li, Yining Wang 0001, Yuan Zhou 0007 Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits. Search on Bibsonomy IEEE Trans. Inf. Theory The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yasong Feng, Zengfeng Huang, Tianyu Wang 0008 Lipschitz Bandits With Batched Feedback. Search on Bibsonomy IEEE Trans. Inf. Theory The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran, Tara Javidi, Arya Mazumdar Competing Bandits in Non-Stationary Matching Markets. Search on Bibsonomy IEEE Trans. Inf. Theory The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran Model Selection for Generic Contextual Bandits. Search on Bibsonomy IEEE Trans. Inf. Theory The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Mengxiao Zhang, Haipeng Luo Contextual Multinomial Logit Bandits with General Value Functions. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Viraj Nadkarni, D. Manjunath, Sharayu Moharir Influencing Bandits: Arm Selection for Preference Shaping. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Khaled Eldowa, Nicolò Cesa-Bianchi, Alberto Maria Metelli, Marcello Restelli Information Capacity Regret Bounds for Bandits with Mediator Feedback. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Hannes Nilsson, Rikard Johansson, Niklas Åkerblom, Morteza Haghir Chehreghani Tree Ensembles for Contextual Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yuxiao Wen, Yanjun Han, Zhengyuan Zhou Stochastic contextual bandits with graph feedback: from independence number to MAS number. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Zhiwei Wang, Huazheng Wang, Hongning Wang Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Andrey Pudovikov Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noise. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Sambhav Solanki, Shweta Jain 0002, Sujit Gujar Fairness and Privacy Guarantees in Federated Contextual Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Ruiqi Zhang, Yuexiang Zhai, Andrea Zanette Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Zhiyong Wang, Jize Xie, Yi Chen, John C. S. Lui, Dongruo Zhou Variance-Dependent Regret Bounds for Non-stationary Linear Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Junwen Yang, Tianyuan Jin, Vincent Y. F. Tan Multi-Armed Bandits with Abstention. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Nikola Pavlovic, Sudeep Salgia, Qing Zhao 0001 Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Stephen Pasteris, Alberto Rumi, Maximilian Thiessen, Shota Saito, Atsushi Miyauchi 0001, Fabio Vitale, Mark Herbster Bandits with Abstention under Expert Advice. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Aldo Pacchiano, Mohammad Ghavamzadeh, Peter L. Bartlett Contextual Bandits with Stage-wise Constraints. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Zirui Yan, Dennis Wei, Dmitriy A. Katz-Rogozhnikov, Prasanna Sattigeri, Ali Tajer Causal Bandits with General Causal Models and Interventions. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Jincheng Mei, Zixin Zhong, Bo Dai 0001, Alekh Agarwal, Csaba Szepesvári, Dale Schuurmans Stochastic Gradient Succeeds for Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Fang Kong, Shuai Li 0010 Improved Bandits in Many-to-one Matching Markets with Incentive Compatibility. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Avrim Blum, Kavya Ravichandran Nearly-tight Approximation Guarantees for the Improving Multi-Armed Bandits Problem. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Ethan Blaser, Chuanhao Li, Hongning Wang Federated Linear Contextual Bandits with Heterogeneous Clients. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Quan Nguyen, Nishant A. Mehta Near-optimal Per-Action Regret Bounds for Sleeping Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Kwang-Sung Jun, Jungtaek Kim Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Archit Sood, Shweta Jain 0002, Sujit Gujar Fairness of Exposure in Online Restless Multi-armed Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Steven Bilaj, Sofien Dhouib, Setareh Maghsudi Meta Learning in Bandits within Shared Affine Subspaces. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel Covariance-Adaptive Least-Squares Algorithm for Stochastic Combinatorial Semi-Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Rahul N. R, Vaibhav Katewa Transfer in Sequential Multi-armed Bandits via Reward Samples. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Yige Hong, Qiaomin Xie, Yudong Chen 0001, Weina Wang 0001 Unichain and Aperiodicity are Sufficient for Asymptotic Optimality of Average-Reward Restless Bandits. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
11Joe Suk, Arpit Agarwal Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
Displaying result #1 - #100 of 3263 (100 per page; Change: )
Pages: [1][2][3][4][5][6][7][8][9][10][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license