|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 6 occurrences of 5 keywords
|
|
|
Results
Found 2 publication records. Showing 2 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Xi-Ren Cao |
From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. |
Discret. Event Dyn. Syst. |
2003 |
DBLP DOI BibTeX RDF |
gradient-based policy iteration, perturbation realization, TD(), Q-learning, Poisson equations, Potentials |
1 | Marco A. Wiering, Jürgen Schmidhuber |
Speeding up Q(lambda)-Learning. |
ECML |
1998 |
DBLP DOI BibTeX RDF |
TD(), online Q(), Reinforcement learning, Q-learning, lazy learning |
Displaying result #1 - #2 of 2 (100 per page; Change: )
|
|