The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Publications of "Hengshuai Yao" ( http://dblp.L3S.de/Authors/Hengshuai_Yao )

  Author page on DBLP  Author page in RDF  Community of Hengshuai Yao in ASPL-2

Publication years (Num. hits)
2006-2018 (16) 2019 (10)
Publication types (Num. hits)
article(11) inproceedings(15)
Venues (Conferences, Journals, ...)
CoRR(11) AAAI(3) ICML(2) NIPS(2) AAMAS(1) ADPRL(1) CDC(1) IJCAI(1) IMSCCS (2)(1) ISAIM(1) ITSC(1) WWW (Companion Volume)(1)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 2 occurrences of 2 keywords

Results
Found 27 publication records. Showing 26 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
1Khurram Javed, Hengshuai Yao, Martha White Is Fast Adaptation All You Need? Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
1Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White Hill Climbing on Value Estimates for Search-control in Dyna. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
1Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yaoliang Yu Distributional Reinforcement Learning for Efficient Exploration. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
1Borislav Mavrin, Hengshuai Yao, Linglong Kong Deep Reinforcement Learning with Decorrelation. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
1Nazmus Sakib, Hengshuai Yao, Hong Zhang Reinforcing Classical Planning for Adversary Driving Scenarios. Search on Bibsonomy CoRR The full citation details ... 2019 DBLP  BibTeX  RDF
1Shangtong Zhang, Hengshuai Yao QUOTA: The Quantile Option Architecture for Reinforcement Learning. Search on Bibsonomy AAAI The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
1Shangtong Zhang, Hengshuai Yao ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. Search on Bibsonomy AAAI The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
1Borislav Mavrin, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yaoliang Yu Distributional Reinforcement Learning for Efficient Exploration. Search on Bibsonomy ICML The full citation details ... 2019 DBLP  BibTeX  RDF
1Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White Hill Climbing on Value Estimates for Search-control in Dyna. Search on Bibsonomy IJCAI The full citation details ... 2019 DBLP  DOI  BibTeX  RDF
1Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong Exploration in the Face of Parametric and Intrinsic Uncertainties. Search on Bibsonomy AAMAS The full citation details ... 2019 DBLP  BibTeX  RDF
1Donglai Zhu, Hengshuai Yao, Bei Jiang, Peng Yu Negative Log Likelihood Ratio Loss for Deep Neural Network Classification. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
1Shangtong Zhang, Borislav Mavrin, Linglong Kong, Bo Liu, Hengshuai Yao QUOTA: The Quantile Option Architecture for Reinforcement Learning. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
1Shangtong Zhang, Hao Chen, Hengshuai Yao ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
1Donglai Zhu, Hao Chen, Hengshuai Yao, Masoud S. Nosrati, Peyman Yadmellat, Yunfei Zhang Practical Issues of Action-conditioned Next Image Prediction. Search on Bibsonomy CoRR The full citation details ... 2018 DBLP  BibTeX  RDF
1Donglai Zhu, Hao Chen, Hengshuai Yao, Masoud S. Nosrati, Peyman Yadmellat, Yunfei Zhang Practical Issues of Action-Conditioned Next Image Prediction. Search on Bibsonomy ITSC The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
1Hengshuai Yao, Csaba Szepesvári, Bernardo Ávila Pires, Xinhua Zhang Pseudo-MDPs and factored linear action models. Search on Bibsonomy ADPRL The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Chi-Hoon Lee, Hengshuai Yao, Xu He, Su Han Chan, JieYang Chang, Farzin Maghoul Learning to predict trending queries: classification - based. Search on Bibsonomy WWW (Companion Volume) The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
1Hengshuai Yao, Csaba Szepesvári, Richard S. Sutton, Joseph Modayil, Shalabh Bhatnagar Universal Option Models. Search on Bibsonomy NIPS The full citation details ... 2014 DBLP  BibTeX  RDF
1Hengshuai Yao, Dale Schuurmans Reinforcement Ranking Search on Bibsonomy CoRR The full citation details ... 2013 DBLP  BibTeX  RDF
1Hengshuai Yao Discovering and Leveraging the Most Valuable Links for Ranking Search on Bibsonomy CoRR The full citation details ... 2012 DBLP  BibTeX  RDF
1Hengshuai Yao, Csaba Szepesvári Approximate Policy Iteration with Linear Action Models. Search on Bibsonomy AAAI The full citation details ... 2012 DBLP  BibTeX  RDF
1Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. Search on Bibsonomy CDC The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
1Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári Multi-Step Dyna Planning for Policy Evaluation and Control. Search on Bibsonomy NIPS The full citation details ... 2009 DBLP  BibTeX  RDF
1Hengshuai Yao, Zhi-Qiang Liu Minimal Residual Approaches for Policy Evaluation in Large Sparse Markov Chains. Search on Bibsonomy ISAIM The full citation details ... 2008 DBLP  BibTeX  RDF
1Hengshuai Yao, Zhi-Qiang Liu Preconditioned temporal difference learning. Search on Bibsonomy ICML The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
1Hengshuai Yao, Diao Dongcui, Zengqi Sun Historical Temporal Difference Learning: Some Initial Results. Search on Bibsonomy IMSCCS (2) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Multi-step Prediction, Reinforcement Learning, Temporal Difference Learning
Displaying result #1 - #26 of 26 (100 per page; Change: )
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license