[RDF data]
Home | Example Publications
PropertyValue
dcterms:bibliographicCitation <http://dblp.uni-trier.de/rec/bibtex/journals/corr/abs-2011-06752>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/He_Ba>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Jiajun_Fan>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Jianye_Hao>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Xian_Guo>
foaf:homepage <https://arxiv.org/abs/2011.06752>
dc:identifier DBLP journals/corr/abs-2011-06752 (xsd:string)
dcterms:issued 2020 (xsd:gYear)
swrc:journal <https://dblp.l3s.de/d2r/resource/journals/corr>
rdfs:label Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning. (xsd:string)
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/He_Ba>
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Jiajun_Fan>
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Jianye_Hao>
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Xian_Guo>
owl:sameAs <http://bibsonomy.org/uri/bibtexkey/journals/corr/abs-2011-06752/dblp>
owl:sameAs <http://dblp.rkbexplorer.com/id/journals/corr/abs-2011-06752>
rdfs:seeAlso <http://dblp.uni-trier.de/db/journals/corr/corr2011.html#abs-2011-06752>
rdfs:seeAlso <https://arxiv.org/abs/2011.06752>
dc:title Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning. (xsd:string)
dc:type <http://purl.org/dc/dcmitype/Text>
rdf:type swrc:Article
rdf:type foaf:Document
swrc:volume abs/2011.06752 (xsd:string)