From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. | D2R Server publishing the DBLP Bibliography Database, hosted at L3S Research Center

Property	Value
dcterms:bibliographicCitation	<http://dblp.uni-trier.de/rec/bibtex/journals/deds/Cao03a>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Xi-Ren_Cao>
foaf:homepage	<http://dx.doi.org/doi.org%2F10.1023%2FA%3A1022188803039>
foaf:homepage	<https://doi.org/10.1023/A:1022188803039>
dc:identifier	DBLP journals/deds/Cao03a (xsd:string)
dc:identifier	DOI doi.org%2F10.1023%2FA%3A1022188803039 (xsd:string)
dcterms:issued	2003 (xsd:gYear)
swrc:journal	<https://dblp.l3s.de/d2r/resource/journals/deds>
rdfs:label	From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. (xsd:string)
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Xi-Ren_Cao>
swrc:number	1-2 (xsd:string)
swrc:pages	9-39 (xsd:string)
owl:sameAs	<http://bibsonomy.org/uri/bibtexkey/journals/deds/Cao03a/dblp>
owl:sameAs	<http://dblp.rkbexplorer.com/id/journals/deds/Cao03a>
rdfs:seeAlso	<http://dblp.uni-trier.de/db/journals/deds/deds13.html#Cao03a>
rdfs:seeAlso	<https://doi.org/10.1023/A:1022188803039>
dc:subject	Potentials; Poisson equations; gradient-based policy iteration; perturbation realization; Q-learning; TD() (xsd:string)
dc:title	From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. (xsd:string)
dc:type	<http://purl.org/dc/dcmitype/Text>
rdf:type	swrc:Article
rdf:type	foaf:Document
swrc:volume	13 (xsd:string)