Property Value
dcterms:bibliographicCitation <http://dblp.uni-trier.de/rec/bibtex/journals/pami/TosattoCP22>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Jan_Peters_0001>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Jo%E2%88%9A%C2%A3o_Carvalho>
dc:creator <https://dblp.l3s.de/d2r/resource/authors/Samuele_Tosatto>
foaf:homepage <http://dx.doi.org/doi.org%2F10.1109%2FTPAMI.2021.3088063>
foaf:homepage <https://doi.org/10.1109/TPAMI.2021.3088063>
dc:identifier DBLP journals/pami/TosattoCP22 (xsd:string)
dc:identifier DOI doi.org%2F10.1109%2FTPAMI.2021.3088063 (xsd:string)
dcterms:issued 2022 (xsd:gYear)
swrc:journal <https://dblp.l3s.de/d2r/resource/journals/pami>
rdfs:label Batch Reinforcement Learning With a Nonparametric Off-Policy Policy Gradient. (xsd:string)
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Jan_Peters_0001>
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Jo%E2%88%9A%C2%A3o_Carvalho>
foaf:maker <https://dblp.l3s.de/d2r/resource/authors/Samuele_Tosatto>
swrc:number 10 (xsd:string)
swrc:pages 5996-6010 (xsd:string)
owl:sameAs <http://bibsonomy.org/uri/bibtexkey/journals/pami/TosattoCP22/dblp>
owl:sameAs <http://dblp.rkbexplorer.com/id/journals/pami/TosattoCP22>
rdfs:seeAlso <http://dblp.uni-trier.de/db/journals/pami/pami44.html#TosattoCP22>
rdfs:seeAlso <https://doi.org/10.1109/TPAMI.2021.3088063>
dc:title Batch Reinforcement Learning With a Nonparametric Off-Policy Policy Gradient. (xsd:string)
dc:type <http://purl.org/dc/dcmitype/Text>
rdf:type swrc:Article
rdf:type foaf:Document
swrc:volume 44 (xsd:string)