| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 1 | Gergely Neu, András György, Csaba Szepesvári |
The adversarial stochastic shortest path problem with unknown transition probabilities.  |
Journal of Machine Learning Research - Proceedings Track  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári |
Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits.  |
Journal of Machine Learning Research - Proceedings Track  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Sylvain Gelly, Levente Kocsis, Marc Schoenauer, Michèle Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud |
The grand challenge of computer Go: Monte Carlo tree search and extensions.  |
Commun. ACM  |
2012 |
DBLP DOI BibTeX RDF |
|
| 1 | Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvári |
PAC-Bayesian Policy Evaluation for Reinforcement Learning  |
CoRR  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári |
X-Armed Bandits.  |
Journal of Machine Learning Research  |
2011 |
DBLP BibTeX RDF |
|
| 1 | István Szita, Csaba Szepesvári |
Agnostic KWIK learning and efficient approximate reinforcement learning.  |
Journal of Machine Learning Research - Proceedings Track  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Yasin Abbasi-Yadkori, Csaba Szepesvári |
Regret Bounds for the Adaptive Control of Linear Quadratic Systems.  |
Journal of Machine Learning Research - Proceedings Track  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Gábor Bartók, Dávid Pál, Csaba Szepesvári |
Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments.  |
Journal of Machine Learning Research - Proceedings Track  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Amir Massoud Farahmand, Csaba Szepesvári |
Model selection in reinforcement learning.  |
Machine Learning  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | András Antos, Gábor Bartók, Dávid Pál, Csaba Szepesvári |
Toward a Classification of Finite Partial-Monitoring Games  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári |
Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Arash Afkanpour, Csaba Szepesvári, Michael H. Bowling |
Alignment Based Kernel Learning with a Continuous Set of Base Kernels  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | András Antos, Gábor Bartók, Csaba Szepesvári |
Non-trivial two-armed partial-monitoring games are bandits  |
CoRR  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Pallavi Arora, Csaba Szepesvári, Rong Zheng |
Sequential learning for optimal monitoring of multi-channel wireless networks.  |
INFOCOM  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann (eds.) |
Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings  |
ALT  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann |
Editors' Introduction.  |
ALT  |
2011 |
DBLP DOI BibTeX RDF |
|
| 1 | Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvári |
PAC-Bayesian Policy Evaluation for Reinforcement Learning.  |
UAI  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári |
Improved Algorithms for Linear Stochastic Bandits.  |
NIPS  |
2011 |
DBLP BibTeX RDF |
|
| 1 | András Antos, Varun Grover, Csaba Szepesvári |
Active learning in heteroscedastic noise.  |
Theor. Comput. Sci.  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Gábor Bartók, Csaba Szepesvári, Sandra Zilles |
Models of active learning in group-structured state spaces.  |
Inf. Comput.  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Péter Torma, András György, Csaba Szepesvári |
A Markov-Chain Monte Carlo Approach to Simultaneous Localization and Mapping.  |
Journal of Machine Learning Research - Proceedings Track  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Barnabás Póczos, Sergey Kirshner, Csaba Szepesvári |
REGO: Rank-based Estimation of Renyi Information using Euclidean Graph Optimization.  |
Journal of Machine Learning Research - Proceedings Track  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Dávid Pál, Barnabás Póczos, Csaba Szepesvári |
Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs  |
CoRR  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári |
X-Armed Bandits  |
CoRR  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Csaba Szepesvári |
Algorithms for Reinforcement Learning  |
|
2010 |
DOI RDF |
|
| 1 | Yasin Abbasi-Yadkori, Joseph Modayil, Csaba Szepesvári |
Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning.  |
IROS  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, Csaba Szepesvári |
Model-based reinforcement learning with nearly tight exploration complexity bounds.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton |
Toward Off-Policy Learning Control with Function Approximation.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Liuyang Li, Barnabás Póczos, Csaba Szepesvári, Russell Greiner |
Budgeted Distribution Learning of Belief Net Parameters.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Gábor Bartók, Dávid Pál, Csaba Szepesvári |
Toward a Classification of Finite Partial-Monitoring Games.  |
ALT  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Gergely Neu, András György, Csaba Szepesvári |
The Online Loop-free Stochastic Shortest-Path Problem.  |
COLT  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Sarah Filippi, Olivier Cappé, Aurélien Garivier, Csaba Szepesvári |
Parametric Bandits: The Generalized Linear Case.  |
NIPS  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Gergely Neu, András György, Csaba Szepesvári, András Antos |
Online Markov Decision Processes under Bandit Feedback.  |
NIPS  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Amir Massoud Farahmand, Rémi Munos, Csaba Szepesvári |
Error Propagation for Approximate Policy and Value Iteration.  |
NIPS  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Dávid Pál, Barnabás Póczos, Csaba Szepesvári |
Estimation of Renyi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs.  |
NIPS  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári |
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits.  |
Theor. Comput. Sci.  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Yuxi Li, Csaba Szepesvári, Dale Schuurmans |
Learning Exercise Policies for American Options.  |
Journal of Machine Learning Research - Proceedings Track  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Gergely Neu, Csaba Szepesvári |
Training parsers by inverse reinforcement learning.  |
Machine Learning  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári |
LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS.  |
CDC  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Amir Massoud Farahmand, Azad Shademan, Martin Jägersand, Csaba Szepesvári |
Model-based and model-free reinforcement learning for visual servoing.  |
ICRA  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora |
Fast gradient-descent methods for temporal-difference learning with linear function approximation.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Barnabás Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner, Nathan R. Sturtevant |
Learning when to stop thinking and do something!  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Jean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko, Csaba Szepesvári |
Workshop summary: On-line learning with limited feedback.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Alireza Farhangfar, Russell Greiner, Csaba Szepesvári |
Learning to segment from a few well-selected training images.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Yaoliang Yu, Yuxi Li, Dale Schuurmans, Csaba Szepesvári |
A General Projection Property for Distribution Families.  |
NIPS  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton |
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.  |
NIPS  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári |
Multi-Step Dyna Planning for Policy Evaluation and Control.  |
NIPS  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Rémi Munos, Csaba Szepesvári |
Finite-Time Bounds for Fitted Value Iteration.  |
Journal of Machine Learning Research  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | András Antos, Csaba Szepesvári, Rémi Munos |
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path.  |
Machine Learning  |
2008 |
DBLP DOI BibTeX RDF |
Bellman-residual minimization, Least-squares temporal difference learning, Off-policy learning, Finite-sample bounds, Reinforcement learning, Nonparametric regression, Policy iteration, Least-squares regression |
| 1 | Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor |
Regularized Fitted Q-Iteration: Application to Planning.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Volodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert |
Empirical Bernstein stopping.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Gábor Bartók, Csaba Szepesvári, Sandra Zilles |
Active Learning of Group-Structured Environments.  |
ALT  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | András Antos, Varun Grover, Csaba Szepesvári |
Active Learning in Multi-armed Bandits.  |
ALT  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Alejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner |
Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstraction.  |
UAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling |
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.  |
UAI  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Richard S. Sutton, Csaba Szepesvári, Hamid Reza Maei |
A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation.  |
NIPS  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor |
Regularized Policy Iteration.  |
NIPS  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári |
Online Optimization in X-Armed Bandits.  |
NIPS  |
2008 |
DBLP BibTeX RDF |
|
| 1 | István Bíró, Zoltán Szamonek, Csaba Szepesvári |
Sequence Prediction Exploiting Similary Information.  |
IJCAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | András György, Levente Kocsis, Ivett Szabó, Csaba Szepesvári |
Continuous Time Associative Bandit Problems.  |
IJCAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Amir Massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert |
Manifold-adaptive dimension estimation.  |
ICML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári |
Tuning Bandit Algorithms in Stochastic Environments.  |
ALT  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Gergely Neu, Csaba Szepesvári |
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods.  |
UAI  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Peter Auer, Ronald Ortner, Csaba Szepesvári |
Improved Rates for the Stochastic Continuum-Armed Bandit Problem.  |
COLT  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | András Antos, Rémi Munos, Csaba Szepesvári |
Fitted Q-iteration in continuous action-space MDPs.  |
NIPS  |
2007 |
DBLP BibTeX RDF |
|
| 1 | Levente Kocsis, Csaba Szepesvári |
Universal parameter optimisation in games based on SPSA.  |
Machine Learning  |
2006 |
DBLP DOI BibTeX RDF |
SPSA, Stochastic gradient ascent, Learning, Games, Poker |
| 1 | Péter Torma, Csaba Szepesvári |
Local Importance Sampling: A Novel Technique to Enhance Particle Filtering.  |
Journal of Multimedia  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Levente Kocsis, Csaba Szepesvári, Mark H. M. Winands |
RSPSA: Enhanced Parameter Optimization in Games.  |
ACG  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Levente Kocsis, Csaba Szepesvári |
Bandit Based Monte-Carlo Planning.  |
ECML  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | András Antos, Csaba Szepesvári, Rémi Munos |
Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path.  |
COLT  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Zoltán Szamonek, Csaba Szepesvári |
X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown.  |
ICDM  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári, Rémi Munos |
Finite time bounds for sampling based fitted value iteration.  |
ICML  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári, András Kocsor, Kornél Kovács |
Kernel Machine Based Feature Extraction Algorithms for Regression Problems.  |
ECAI  |
2004 |
DBLP BibTeX RDF |
|
| 1 | András Kocsor, Kornél Kovács, Csaba Szepesvári |
Margin Maximizing Discriminant Analysis.  |
ECML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Péter Torma, Csaba Szepesvári |
Enhancing Particle Filters Using Local Likelihood Sampling.  |
ECCV  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári, William D. Smart |
Interpolation-based Q-learning.  |
ICML  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári |
Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results.  |
AAAI  |
2004 |
DBLP BibTeX RDF |
|
| 1 | M. French, Csaba Szepesvári, Eric Rogers |
LQ performance bounds for adaptive output feedback controllers for functionally uncertain nonlinear systems.  |
Automatica  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | M. French, Csaba Szepesvári, Eric Rogers |
An Asymptotic Scaling Analysis of LQ Performance for an Approximate Adaptive Control Design.  |
MCSS  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | András Lörincz, György Hévízi, Csaba Szepesvári |
Ockham's Razor Modeling of the Matrisome Channels of the Basal Ganglia Thalamocortical Loops.  |
Int. J. Neural Syst.  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári |
Efficient approximate planning in continuous space Markovian Decision Problems.  |
AI Commun.  |
2001 |
DBLP BibTeX RDF |
|
| 1 | Zsolt Kalmár, Csaba Szepesvári, András Lörincz |
Modular Reinforcement Learning: A Case Study in a Robot Domain.  |
Acta Cybern.  |
2000 |
DBLP BibTeX RDF |
|
| 1 | Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári |
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms.  |
Machine Learning  |
2000 |
DBLP DOI BibTeX RDF |
|
| 1 | György Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári |
FlexVoice: A Parametric Approach to High-Quality Speech Synthesis.  |
TSD  |
2000 |
DBLP DOI BibTeX RDF |
|
| 1 | Zsolt Kalmár, Zsolt Marczell, Csaba Szepesvári, András Lörincz |
Parallel and robust skeletonization built on self-organizing elements.  |
Neural Networks  |
1999 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári, Michael L. Littman |
A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms.  |
Neural Computation  |
1999 |
DBLP DOI BibTeX RDF |
|
| 1 | János Murvai, Kristian Vlahovicek, Endre Barta, Csaba Szepesvári, Cristina Acatrinei, Sándor Pongor |
The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments.  |
Nucleic Acids Research  |
1999 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári |
Non-Markovian Policies in Sequential Decision Problems.  |
Acta Cybern.  |
1998 |
DBLP BibTeX RDF |
|
| 1 | Zsolt Kalmár, Csaba Szepesvári, András Lörincz |
Module-Based Reinforcement Learning: Experiments with a Real Robot.  |
Machine Learning  |
1998 |
DBLP DOI BibTeX RDF |
|
| 1 | Zsolt Kalmár, Csaba Szepesvári, András Lörincz |
Module-Based Reinforcement Learning: Experiments with a Real Robot.  |
Auton. Robots  |
1998 |
DBLP DOI BibTeX RDF |
|
| 1 | Zoltán Gábor, Zsolt Kalmár, Csaba Szepesvári |
Multi-criteria Reinforcement Learning.  |
ICML  |
1998 |
DBLP BibTeX RDF |
|
| 1 | Csaba Szepesvári, Szabolcs Cimmer, András Lörincz |
Neurocontroller using dynamic state feedback for compensatory control.  |
Neural Networks  |
1997 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári |
Learning and Exploitation Do Not Conflict Under Minimax Optimality.  |
ECML  |
1997 |
DBLP DOI BibTeX RDF |
self-optimizing systems, reinforcement learning, dynamic games |
| 1 | Zsolt Kalmár, Csaba Szepesvári, András Lörincz |
Module Based Reinforcement Learning: An Application to a Real Robot.  |
EWLR  |
1997 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári |
The Asymptotic Convergence-Rate of Q-learning.  |
NIPS ![In: Advances in Neural Information Processing Systems 10, [NIPS Conference, Denver, Colorado, USA, 1997], 1997, The MIT Press, 0-262-10076-2. The full citation details ...](Pics/full.jpeg) |
1997 |
DBLP BibTeX RDF |
|
| 1 | Tibor Fomin, Tamás Rozgonyi, Csaba Szepesvári, András Lörincz |
Self-Organizing Multi-Resolution Grid for Motion Planning and Control.  |
Int. J. Neural Syst.  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Csaba Szepesvári, András Lörincz |
Approximate geometry representations and sensory fusion.  |
Neurocomputing  |
1996 |
DBLP DOI BibTeX RDF |
|
| 1 | Csaba Szepesvári, András Lörincz |
Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers.  |
ICANN  |
1996 |
DBLP DOI BibTeX RDF |
|
| 1 | Michael L. Littman, Csaba Szepesvári |
A Generalized Reinforcement-Learning Model: Convergence and Applications.  |
ICML  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Csaba Szepesvári, László Balázs, András Lörincz |
Topology Learning Solved by Extended Objects: A Neural Network Model.  |
Neural Computation  |
1994 |
DBLP DOI BibTeX RDF |
|