[RDF data]
Home | Example Publications
PropertyValue
dcterms:bibliographicCitation <http://dblp.uni-trier.de/rec/bibtex/journals/deds/Cao03a>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Xi-Ren_Cao>
foaf:homepage <http://dx.doi.org/doi.org%2F10.1023%2FA%3A1022188803039>
foaf:homepage <https://doi.org/10.1023/A:1022188803039>
dc:identifier DBLP journals/deds/Cao03a (xsd:string)
dc:identifier DOI doi.org%2F10.1023%2FA%3A1022188803039 (xsd:string)
dcterms:issued 2003 (xsd:gYear)
swrc:journal <https://dblp.l3s.de/d2r/resource/journals/deds>
rdfs:label From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. (xsd:string)
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Xi-Ren_Cao>
swrc:number 1-2 (xsd:string)
swrc:pages 9-39 (xsd:string)
owl:sameAs <http://bibsonomy.org/uri/bibtexkey/journals/deds/Cao03a/dblp>
owl:sameAs <http://dblp.rkbexplorer.com/id/journals/deds/Cao03a>
rdfs:seeAlso <http://dblp.uni-trier.de/db/journals/deds/deds13.html#Cao03a>
rdfs:seeAlso <https://doi.org/10.1023/A:1022188803039>
dc:subject Potentials; Poisson equations; gradient-based policy iteration; perturbation realization; Q-learning; TD() (xsd:string)
dc:title From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning. (xsd:string)
dc:type <http://purl.org/dc/dcmitype/Text>
rdf:type swrc:Article
rdf:type foaf:Document
swrc:volume 13 (xsd:string)