Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization. | D2R Server publishing the DBLP Bibliography Database, hosted at L3S Research Center

Property	Value
dcterms:bibliographicCitation	<http://dblp.uni-trier.de/rec/bibtex/journals/corr/abs-2102-11055>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Jyun-Li_Lin>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Ping-Chun_Hsieh>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Shang-Hsuan_Yang>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Wei_Hung>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Xi_Liu_0011>
foaf:homepage	<https://arxiv.org/abs/2102.11055>
dc:identifier	DBLP journals/corr/abs-2102-11055 (xsd:string)
dcterms:issued	2021 (xsd:gYear)
swrc:journal	<https://dblp.l3s.de/d2r/resource/journals/corr>
rdfs:label	Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization. (xsd:string)
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Jyun-Li_Lin>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Ping-Chun_Hsieh>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Shang-Hsuan_Yang>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Wei_Hung>
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Xi_Liu_0011>
owl:sameAs	<http://bibsonomy.org/uri/bibtexkey/journals/corr/abs-2102-11055/dblp>
owl:sameAs	<http://dblp.rkbexplorer.com/id/journals/corr/abs-2102-11055>
rdfs:seeAlso	<http://dblp.uni-trier.de/db/journals/corr/corr2102.html#abs-2102-11055>
rdfs:seeAlso	<https://arxiv.org/abs/2102.11055>
dc:title	Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization. (xsd:string)
dc:type	<http://purl.org/dc/dcmitype/Text>
rdf:type	swrc:Article
rdf:type	foaf:Document
swrc:volume	abs/2102.11055 (xsd:string)