| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 1 | István Szita, Csaba Szepesvári |
Agnostic KWIK learning and efficient approximate reinforcement learning.  |
Journal of Machine Learning Research - Proceedings Track  |
2011 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Csaba Szepesvári |
Model-based reinforcement learning with nearly tight exploration complexity bounds.  |
ICML  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Marc J. V. Ponsen, Pieter Spronck |
Effective and Diverse Adaptive Game AI.  |
IEEE Trans. Comput. Intellig. and AI in Games  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version  |
CoRR  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Guillaume Chaslot, Pieter Spronck |
Monte-Carlo Tree Search in Settlers of Catan.  |
ACG  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Optimistic initialization and greediness lead to polynomial time learning in factored MDPs.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Thomas J. Walsh, Istvan Szita, Carlos Diuk, Michael L. Littman |
Exploring compact reinforcement-learning representations with linear regression.  |
UAI  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Factored Value Iteration Converges.  |
Acta Cybern.  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Guillaume Chaslot, Mark H. M. Winands, Istvan Szita, H. Jaap van den Herik |
Cross-Entropy for Monte-Carlo Tree Search.  |
ICGA Journal  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
The many faces of optimism - Extended version  |
CoRR  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Factored Value Iteration Converges  |
CoRR  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Online variants of the cross-entropy method  |
CoRR  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Guillaume Chaslot, Sander Bakkes, Istvan Szita, Pieter Spronck |
Monte-Carlo Tree Search: A New Framework for Game AI.  |
AIIDE  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
The many faces of optimism: a unifying approach.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man.  |
J. Artif. Intell. Res. (JAIR)  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs  |
CoRR  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Learning Tetris Using the Noisy Cross-Entropy Method.  |
Neural Computation  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, Viktor Gyenes, András Lörincz |
Reinforcement Learning with Echo State Networks.  |
ICANN  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Applying Policy Iteration for Training Recurrent Neural Networks  |
CoRR  |
2004 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Kalman Filter Control Embedded into the Reinforcement Learning Framework.  |
Neural Computation  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | András Lörincz, Imre Pólik, Istvan Szita |
Event-learning and robust policy heuristics.  |
Cognitive Systems Research  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Bálint Takács, Istvan Szita, András Lörincz |
Temporal plannability by variance of the episode length  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Reinforcement Learning with Linear Function Approximation and LQ control Converges  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, András Lörincz |
Kalman filter control in the reinforcement learning framework  |
CoRR  |
2003 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Bálint Takács, András Lörincz |
MDPs: Learning in Varying Environments.  |
Journal of Machine Learning Research  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Bálint Takács, András Lörincz |
Searching for Plannable Domains can Speed up Reinforcement Learning  |
CoRR  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Istvan Szita, Bálint Takács, András Lörincz |
Reinforcement Learning Integrated with a Non-Markovian Controller.  |
ECAI  |
2002 |
DBLP BibTeX RDF |
|