Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback. | D2R Server publishing the DBLP Bibliography Database, hosted at L3S Research Center

Property	Value
dcterms:bibliographicCitation	<http://dblp.uni-trier.de/rec/bibtex/journals/corr/abs-2109-07054>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/David_Halpern>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Ishaan_Shah>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Kavosh_Asadi>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Michael_L._Littman>
foaf:homepage	<https://arxiv.org/abs/2109.07054>
dc:identifier	DBLP journals/corr/abs-2109-07054 (xsd:string)
dcterms:issued	2021 (xsd:gYear)
swrc:journal	<https://dblp.l3s.de/d2r/resource/journals/corr>
rdfs:label	Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback. (xsd:string)
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/David_Halpern>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Ishaan_Shah>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Kavosh_Asadi>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Michael_L._Littman>
owl:sameAs	<http://bibsonomy.org/uri/bibtexkey/journals/corr/abs-2109-07054/dblp>
owl:sameAs	<http://dblp.rkbexplorer.com/id/journals/corr/abs-2109-07054>
rdfs:seeAlso	<http://dblp.uni-trier.de/db/journals/corr/corr2109.html#abs-2109-07054>
rdfs:seeAlso	<https://arxiv.org/abs/2109.07054>
dc:title	Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback. (xsd:string)
dc:type	<http://purl.org/dc/dcmitype/Text>
rdf:type	swrc:Article
rdf:type	foaf:Document
swrc:volume	abs/2109.07054 (xsd:string)