Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards. | D2R Server publishing the DBLP Bibliography Database, hosted at L3S Research Center

Property	Value
dcterms:bibliographicCitation	<http://dblp.uni-trier.de/rec/bibtex/conf/nips/Moulos20>
dc:creator	<https://dblp.l3s.de/d2r/resource/authors/Vrettos_Moulos>
foaf:homepage	<https://proceedings.neurips.cc/paper/2020/hash/597c7b407a02cc0a92167e7a371eca25-Abstract.html>
dc:identifier	DBLP conf/nips/Moulos20 (xsd:string)
dcterms:issued	2020 (xsd:gYear)
rdfs:label	Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards. (xsd:string)
foaf:maker	<https://dblp.l3s.de/d2r/resource/authors/Vrettos_Moulos>
dcterms:partOf	<https://dblp.l3s.de/d2r/resource/publications/conf/nips/2020>
owl:sameAs	<http://bibsonomy.org/uri/bibtexkey/conf/nips/Moulos20/dblp>
owl:sameAs	<http://dblp.rkbexplorer.com/id/conf/nips/Moulos20>
rdfs:seeAlso	<http://dblp.uni-trier.de/db/conf/nips/neurips2020.html#Moulos20>
rdfs:seeAlso	<https://proceedings.neurips.cc/paper/2020/hash/597c7b407a02cc0a92167e7a371eca25-Abstract.html>
swrc:series	<https://dblp.l3s.de/d2r/resource/conferences/nips>
dc:title	Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards. (xsd:string)
dc:type	<http://purl.org/dc/dcmitype/Text>
rdf:type	swrc:InProceedings
rdf:type	foaf:Document