A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning.
Resource URI: https://dblp.l3s.de/d2r/resource/publications/journals/deds/ChoiR06
Property                       Value
dcterms:bibliographicCitation  <http://dblp.uni-trier.de/rec/bibtex/journals/deds/ChoiR06>
dc:creator                     <https://dblp.l3s.de/d2r/resource/authors/Benjamin_Van_Roy>
dc:creator                     <https://dblp.l3s.de/d2r/resource/authors/David_Choi>
foaf:homepage                  <http://dx.doi.org/doi.org%2F10.1007%2Fs10626-006-8134-8>
foaf:homepage                  <https://doi.org/10.1007/s10626-006-8134-8>
dc:identifier                  DBLP journals/deds/ChoiR06 (xsd:string)
dc:identifier                  DOI doi.org%2F10.1007%2Fs10626-006-8134-8 (xsd:string)
dcterms:issued                 2006 (xsd:gYear)
swrc:journal                   <https://dblp.l3s.de/d2r/resource/journals/deds>
rdfs:label                     A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning. (xsd:string)
foaf:maker                     <https://dblp.l3s.de/d2r/resource/authors/Benjamin_Van_Roy>
foaf:maker                     <https://dblp.l3s.de/d2r/resource/authors/David_Choi>
swrc:number                    2 (xsd:string)
swrc:pages                     207-239 (xsd:string)
owl:sameAs                     <http://bibsonomy.org/uri/bibtexkey/journals/deds/ChoiR06/dblp>
owl:sameAs                     <http://dblp.rkbexplorer.com/id/journals/deds/ChoiR06>
rdfs:seeAlso                   <http://dblp.uni-trier.de/db/journals/deds/deds16.html#ChoiR06>
rdfs:seeAlso                   <https://doi.org/10.1007/s10626-006-8134-8>
dc:subject                     Dynamic programming; Kalman filter; Optimal stopping; Queueing; Recursive least-squares; Reinforcement learning; Temporal-difference learning (xsd:string)
dc:title                       A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning. (xsd:string)
dc:type                        <http://purl.org/dc/dcmitype/Text>
rdf:type                       swrc:Article
rdf:type                       foaf:Document
swrc:volume                    16 (xsd:string)