|
|
|
|
Venues (Conferences, Journals, ...)
|
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 188 occurrences of 109 keywords
|
|
|
|
|
Results
Found 229 publication records. Showing 229 according to the selection in the facets
| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 2 | R. Manimegalai, E. Siva Soumya, V. Muralidharan, Balaraman Ravindran, V. Kamakoti, D. Bhatia |
Placement and Routing for 3D-FPGAs Using Reinforcement Learning and Support Vector Machines.  |
VLSI Design  |
2005 |
DBLP DOI BibTeX RDF |
Three-Dimensional FPGA, Reinforcement Learning (RL), Two-opt algorithm, Support Vector Machines (SVMs), Placement and Routing |
| 2 | John G. Vlachogiannis, Nikos D. Hatziargyriou |
Reinforcement Learning (RL) to Optimal Reconfiguration of Radial Distribution System (RDS).  |
SETN  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthijs Snel, Shimon Whiteson |
Multi-task evolutionary shaping without pre-specified representations.  |
GECCO  |
2010 |
DBLP DOI BibTeX RDF |
genetic algorithms, feature selection, reinforcement learning, shaping |
| 1 | Keiji Kamei, Masumi Ishikawa |
Skill Transfer of a Mobile Robot Obtained by Reinforcement Learning to a Different Mobile Robot.  |
Brain-Inspired Information Technology  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Ian Fasel, Michael Quinlan, Peter Stone |
A task specification language for bootstrap learning.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
human computer interaction, reinforcement learning |
| 1 | Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, Jan Ramon, Kurt Driessens |
Learning with whom to communicate using relational reinforcement learning.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
relational reinforcement learning, multi-agent systems, reinforcement learning |
| 1 | Verena Heidrich-Meisner, Christian Igel |
Uncertainty handling CMA-ES for reinforcement learning.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
covariance matrix adaptation evolution strategy, direct policy search, reinforcement learning, uncertainty handling |
| 1 | Jae-Yoon Jung, James A. Reggia |
Evolving an autonomous agent for non-Markovian reinforcement learning.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
descriptive encoding, genetic programming, reinforcement learning, evolution strategy |
| 1 | Huaqing Min, Jiaan Zeng, Ronghua Luo |
Fuzzy CMAC with automatic state partition for reinforcementlearning.  |
GEC Summit  |
2009 |
DBLP DOI BibTeX RDF |
automatic state partition, fuzzy CMAC, reinforcement learning |
| 1 | Jia Rao, Xiangping Bu, Cheng-Zhong Xu, Le Yi Wang, Gang George Yin |
VCONF: a reinforcement learning approach to virtual machines auto-configuration.  |
ICAC  |
2009 |
DBLP DOI BibTeX RDF |
cloud computing, virtual machines, reinforcement learning, autonomic computing |
| 1 | J. Zico Kolter, Andrew Y. Ng |
Near-Bayesian exploration in polynomial time.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Gerhard Neumann, Wolfgang Maass, Jan Peters |
Learning complex motions by sequencing simpler motion templates.  |
ICML  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Roy Chaoming Hsu, Cheng-Ting Liu, Kuan-Chieh Wang, Wei-Ming Lee |
QoS-Aware Power Management for Energy Harvesting Wireless Sensor Network Utilizing Reinforcement Learning.  |
CSE  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawara |
Constructing action set from basis functions for reinforcement learning of robot control.  |
ICRA  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Daniel Kudenko, Marek Grzes |
Knowledge-Based Reinforcement Learning for Data Mining.  |
ADMI  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Alexander Hans, Steffen Udluft |
Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data.  |
ICANN  |
2009 |
DBLP DOI BibTeX RDF |
Reinforcement learning, uncertainty, model-based, Bayesian modeling |
| 1 | Reinaldo A. C. Bianchi, Raquel Ros, Ramon López de Mántaras |
Improving Reinforcement Learning by Using Case Based Heuristics.  |
ICCBR  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | S. Mostapha Kalami Heris, Mohammad-Bagher Naghibi Sistani, Naser Pariz |
Using Control Theory for Analysis of Reinforcement Learning and Optimal Policy Properties in Grid-World Problems.  |
ICIC  |
2009 |
DBLP DOI BibTeX RDF |
Discrete-Time Control Systems, Dynamic Programming, Reinforcement Learning, Markov Decision Process, Stochastic Control |
| 1 | Tobias Jung, Peter Stone |
Feature Selection for Value Function Approximation Using Bayesian Model Selection.  |
ECML/PKDD  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Ana Iglesias, Paloma Martínez, Ricardo Aler, Fernando Fernández |
Learning teaching strategies in an Adaptive and Intelligent Educational System through Reinforcement Learning.  |
Appl. Intell.  |
2009 |
DBLP DOI BibTeX RDF |
Adaptive and Intelligent Educational Systems, Learning pedagogical strategies, Reinforcement Learning, Intelligent tutoring systems, Applied artificial intelligence |
| 1 | Jiang Zhu, Jun Wang, Tao Luo, Shaoqian Li |
Adaptive transmission scheduling over fading channels for energy-efficient cognitive radio networks by reinforcement learning.  |
Telecommunication Systems  |
2009 |
DBLP DOI BibTeX RDF |
Energy-efficient networks, Reinforcement learning, Cognitive radio, Markov decision process, Cross-layer design |
| 1 | Hiroshi Tsujino, Johane Takeuchi, Osamu Shouno |
Basal Ganglia Models for Autonomous Behavior Learning.  |
Creating Brain-Like Intelligence  |
2009 |
DBLP DOI BibTeX RDF |
modular learning system, input space selection, reinforcement learning, system architecture, execution timing, spiking neuron, reward, basal ganglia |
| 1 | Jae-Yoon Jung, James A. Reggia |
Nested evolution of an autonomous agent using descriptive encoding.  |
GECCO  |
2008 |
DBLP DOI BibTeX RDF |
descriptive encoding, reinforcement learning, neuroevolution |
| 1 | Jan Hendrik Metzen, Frank Kirchner, Mark Edgington, Yohannes Kassahun |
Towards efficient online reinforcement learning using neuroevolution.  |
GECCO  |
2008 |
DBLP DOI BibTeX RDF |
reinforcement learning, online-learning, neuroevolution |
| 1 | Joseph Reisinger, Peter Stone, Risto Miikkulainen |
Online kernel selection for Bayesian reinforcement learning.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Tsuyoshi Ueno, Motoaki Kawanabe, Takeshi Mori, Shin-ichi Maeda, Shin Ishii |
A semiparametric statistical approach to model-free policy evaluation.  |
ICML  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Zhengqing Hu, Chen-Khong Tham |
CCMAC: coordinated cooperative MAC for wireless LANs.  |
MSWiM  |
2008 |
DBLP DOI BibTeX RDF |
concurrent transmission, MAC, cooperative communication |
| 1 | Julien Perez, Cécile Germain-Renaud, Balázs Kégl, Charles Loomis |
Grid Differentiated Services: A Reinforcement Learning Approach.  |
CCGRID  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Eduardo Rodrigues Gomes, Ryszard Kowalczyk |
Non-symmetric Preferences in the IPA Market with Reinforcement Learning.  |
IAT  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Hua-Qing Min, Jia-An Zeng, Jian Chen, Jin-Hui Zhu |
A Study of Reinforcement Learning in a New Multiagent Domain.  |
IAT  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Teck-Hou Teng, Ah-Hwee Tan |
Cognitive Agents Integrating Rules and Reinforcement Learning for Context-Aware Decision Support.  |
IAT  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthew Frampton, Oliver Lemon |
Using dialogue acts to learn better repair strategies for spoken dialogue systems.  |
ICASSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Lasheng Yu, Alonso Marin, Fei Hong, Jian Lin |
Studies on Hierarchical Reinforcement Learning in Multi-Agent Environment.  |
ICNSC  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Byungchan Kim, Byungduk Kang, Shinsuk Park, Sungchul Kang |
Learning robot stiffness for contact tasks using the natural actor-critic.  |
ICRA  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Jun Morimoto, Sang-Ho Hyon, Christopher G. Atkeson, Gordon Cheng |
Low-dimensional feature extraction for humanoid locomotion using kernel dimension reduction.  |
ICRA  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Khan M. Iftekharuddin, Yaqin Li |
A biologically-inspired computational model for transformation invariant target recognition.  |
IJCNN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Lei Zheng, Siu-Yeung Cho, Chai Quek |
A memory-based reinforcement learning algorithm for partially observable Markovian decision processes.  |
IJCNN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Andres El-Fakdi, Marc Carreras |
Policy gradient based Reinforcement Learning for real autonomous underwater cable tracking.  |
IROS  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Mahbubur Rashid, Ioana Banicescu, Ricolindo Cariño |
Investigating a Dynamic Loop Scheduling with Reinforcement Learning Approach to Load Balancing in Scientific Applications.  |
ISPDC  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Yuan Xue, Yuewei Lin, Zhiyong Feng, Huying Cai, Cheng Chi |
Autonomic Joint Session Scheduling Strategies for Heterogeneous Wireless Networks.  |
WCNC  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Abhijit Gosavi |
On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning.  |
Winter Simulation Conference  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Carlos D. Paternina-Arboleda, Jairo R. Montoya-Torres, Aldo Fabregas-Ariza |
Simulation-optimization using a reinforcement learning approach.  |
Winter Simulation Conference  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Rajneesh Sharma, Madan Gopal |
A Markov Game-Adaptive Fuzzy Controller for Robot Manipulators.  |
IEEE T. Fuzzy Systems  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | B. Baddeley |
Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Daoyi Dong, Chunlin Chen, Han-Xiong Li, Tzyh Jong Tarn |
Quantum Reinforcement Learning.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Marco A. Wiering, Hado van Hasselt |
Ensemble Algorithms in Reinforcement Learning.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Bryan Auslander, Stephen Lee-Urban, Chad Hogg, Héctor Muñoz-Avila |
Recognizing the Enemy: Combining Reinforcement Learning with Strategy Selection Using Case-Based Reasoning.  |
ECCBR  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Francis Maes, Ludovic Denoyer, Patrick Gallinari |
Applications of Reinforcement Learning to Structured Prediction.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, José Rafael Magdalena Benedito, Juan Gómez-Sanchís |
Use of Reinforcement Learning in Two Real Applications.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Yu Hiei, Takeshi Mori, Shin Ishii |
Self-organized Reinforcement Learning Based on Policy Gradient in Nonstationary Environments.  |
ICANN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Daan Wierstra, Tom Schaul, Jan Peters, Jürgen Schmidhuber |
Episodic Reinforcement Learning by Logistic Reward-Weighted Regression.  |
ICANN  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Jianghao Li, Zhenbo Li, Jiapin Chen |
Reinforcement Learning Based Precise Positioning Method for a Millimeters-Sized Omnidirectional Mobile Microrobot.  |
ICIRA  |
2008 |
DBLP DOI BibTeX RDF |
precise positioning, mobile microrobot, electromagnetic micromotor, reinforcement learning |
| 1 | Hiroki Utsunomiya, Katsunari Shibata |
Contextual Behaviors and Internal Representations Acquired by Reinforcement Learning with a Recurrent Neural Network in a Continuous State and Action Space Task.  |
ICONIP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Andrea Bonarini, Claudio Caccia, Alessandro Lazaric, Marcello Restelli |
Batch Reinforcement Learning for Controlling a Mobile Wheeled Pendulum Robot.  |
IFIP AI  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Yong Duan, Baoxia Cui, Huaiqing Yang |
Robot Navigation Based on Fuzzy RL Algorithm.  |
ISNN  |
2008 |
DBLP DOI BibTeX RDF |
T-S fuzzy neural network, Reinforcement learning, Q-learning, Robot navigation |
| 1 | Arturo Servin, Daniel Kudenko |
Multi-Agent Reinforcement Learning for Intrusion Detection: A Case Study and Evaluation.  |
MATES  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Weiwei Wang 0002, Yang Gao 0001, Xingguo Chen, Shen Ge |
Reinforcement Learning with Markov Logic Networks.  |
MICAI  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Toshiyuki Yasuda, Kazuhiro Ohkura |
A Reinforcement Learning Technique with an Adaptive Action Generator for a Multi-robot System.  |
SAB  |
2008 |
DBLP DOI BibTeX RDF |
Autonomous Specialization, Action Search, Reinforcement Learning, Multi-Robot System |
| 1 | Eduardo Rodrigues Gomes, Ryszard Kowalczyk |
Individual and Social Behaviour in the IPA Market with RL.  |
SBIA  |
2008 |
DBLP DOI BibTeX RDF |
Market-based Resource Allocation, Reinforcement Learning, Multiagent Systems |
| 1 | Urban Richter, Holger Prothmann, Hartmut Schmeck |
Improving XCS Performance by Distribution.  |
SEAL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Erik Berglund, Joaquin Sitte, Gordon Wyeth |
Active audition using the parameter-less self-organising map.  |
Auton. Robots  |
2008 |
DBLP DOI BibTeX RDF |
Active audition, Self-organisation |
| 1 | Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa |
Accelerating autonomous learning by using heuristic selection of actions.  |
J. Heuristics  |
2008 |
DBLP DOI BibTeX RDF |
Reinforcement learning, Robot navigation, Action selection, Heuristic function |
| 1 | Minija Tamosiunaite, James Ainge, Tomas Kulvicius, Bernd Porr, Paul Dudchenko, Florentin Wörgötter |
Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning.  |
Journal of Computational Neuroscience  |
2008 |
DBLP DOI BibTeX RDF |
SARSA, Place field system, Weight decay, Reinforcement learning, Function approximation |
| 1 | Neville Mehta, Sriraam Natarajan, Prasad Tadepalli, Alan Fern |
Transfer in variable-reward hierarchical reinforcement learning.  |
Machine Learning  |
2008 |
DBLP DOI BibTeX RDF |
Average-reward learning, Multi-criteria learning, Transfer learning, Hierarchical reinforcement learning |
| 1 | David Vengerov |
A reinforcement learning framework for online data migration in hierarchical storage systems.  |
The Journal of Supercomputing  |
2008 |
DBLP DOI BibTeX RDF |
Self-optimizing systems, Multi-tier storage, Fuzzy rulebase, Reinforcement learning, Markov decision process, Cost functions, Data migration |
| 1 | Masoud Mahootchi, Hamid R. Tizhoosh, Kumaraswamy Ponnambalam |
Opposition Mining in Reservoir Management.  |
Oppositional Concepts in Computational Intelligence  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Maryam Shokri, Hamid R. Tizhoosh, Mohamed S. Kamel |
The Concept of Opposition and Its Use in Q-Learning and Q(lambda) Techniques.  |
Oppositional Concepts in Computational Intelligence  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Tariq Mahmood, Francesco Ricci |
Learning and adaptivity in interactive recommender systems.  |
ICEC  |
2007 |
DBLP DOI BibTeX RDF |
adaptivity, reinforcement learning, markov decision process, conversational recommender systems |
| 1 | Matthew Grounds, Daniel Kudenko |
Parallel reinforcement learning with linear function approximation.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
value function approximation, parallel algorithms, reinforcement learning |
| 1 | Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanathan |
Conditional random fields for multi-agent reinforcement learning.  |
ICML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Jin Zhou, Lu Yu, Shingo Mabu, Kotaro Hirasawa, Jinglu Hu, Sandor Markon |
Double-deck elevator systems using Genetic Network Programming with reinforcement learning.  |
IEEE Congress on Evolutionary Computation  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Karla Conn, Richard Alan Peters II |
Reinforcement Learning with a Supervisor for a Mobile Robot in a Real-world Environment.  |
CIRA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Lucian Busoniu, Damien Ernst, Bart De Schutter, Robert Babuska |
Fuzzy Approximation for Convergent Model-Based Reinforcement Learning.  |
FUZZ-IEEE  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Sudhakara P. Reddy, Raju S. Bapi, Chakravarthy Bhagvati, Bulusu Lakshmana Deekshatulu |
Concept Pre-digestion Method for Image Relevance Reinforcement Learning.  |
ICCTA  |
2007 |
DBLP DOI BibTeX RDF |
Concept Digestion Method, Reinforcement Learning, Relevance Feedback, Q-Learning |
| 1 | Ju Jiang, Mohamed S. Kamel |
Pitch Control of an Aircraft with Aggregated Reinforcement Learning Algorithms.  |
IJCNN  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Anton Maximilian Schäfer, Daniel Schneegaß, Volkmar Sterzing, Steffen Udluft |
A Neural Reinforcement Learning Approach to Gas Turbine Control.  |
IJCNN  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Kimura Kimura |
Reinforcement learning in multi-dimensional state-action space using random rectangular coarse coding and Gibbs sampling.  |
IROS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Wooyoung Kwon, Il Hong Suh, Sanghoon Lee 0002, Young-Jo Cho |
Fast reinforcement learning using stochastic shortest paths for a mobile robot.  |
IROS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat |
Hysteretic q-learning : an algorithm for decentralized reinforcement learning in cooperative multi-agent teams.  |
IROS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Jun Morimoto, Christopher G. Atkeson, Gen Endo, Gordon Cheng |
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression.  |
IROS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Ju Jiang, Mohamed S. Kamel |
Aggregation of tiling-based reinforcement learning algorithms.  |
SMC  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Xianghai Wu, Jonathan Kofman, Hamid R. Tizhoosh |
Active exploratory q-learning for large problems.  |
SMC  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Gerald Tesauro |
Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies.  |
IEEE Internet Computing  |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning, training, autonomic computing, systems management |
| 1 | Stefan Elfwing, Eiji Uchibe, Kenji Doya, Henrik I. Christensen |
Evolutionary Development of Hierarchical Learning Structures.  |
IEEE Trans. Evolutionary Computation  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Xin Xu, Dewen Hu, Xicheng Lu |
Kernel-Based Least Squares Policy Iteration for Reinforcement Learning.  |
IEEE Transactions on Neural Networks  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Wipawee Usaha, Javier A. Barria |
Reinforcement Learning for Resource Allocation in LEO Satellite Networks.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Amanda M. Whitbrook, Uwe Aickelin, Jonathan M. Garibaldi |
Idiotypic Immune Networks in Mobile-Robot Control.  |
IEEE Transactions on Systems, Man, and Cybernetics, Part B  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Lucian Busoniu, Damien Ernst, Bart De Schutter, Robert Babuska |
Continuous-State Reinforcement Learning with Fuzzy Approximation.  |
Adaptive Agents and Multi-Agents Systems  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Matthew Grounds, Daniel Kudenko |
Parallel Reinforcement Learning with Linear Function Approximation.  |
Adaptive Agents and Multi-Agents Systems  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Arturo Servin, Daniel Kudenko |
Multi-agent Reinforcement Learning for Intrusion Detection.  |
Adaptive Agents and Multi-Agents Systems  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Joost Broekens |
Emotion and Reinforcement: Affective Facial Expressions Facilitate Robot Learning.  |
Artifical Intelligence for Human Computing  |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Affect, Human-in-the-Loop |
| 1 | Andrea Bonarini, Alessandro Lazaric, Marcello Restelli |
Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions.  |
AI*IA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Terran Lane, Martin Ridens, Scott Stevens |
Reinforcement Learning in Nonstationary Environment Navigation Tasks.  |
Canadian Conference on AI  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Mónica Climente-Martí, Teresa De Diego-Santos, N. Víctor Jiménez |
Validation of a Reinforcement Learning Policy for Dosage Optimization of Erythropoietin.  |
Australian Conference on Artificial Intelligence  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Francis Maes, Ludovic Denoyer, Patrick Gallinari |
Sequence Labeling with Reinforcement Learning and Ranking Algorithms.  |
ECML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Daan Wierstra, Jürgen Schmidhuber |
Policy Gradient Critics.  |
ECML  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | José Antonio Martin H., Javier de Lope Asiaín |
A k-NN Based Perception Scheme for Reinforcement Learning.  |
EUROCAST  |
2007 |
DBLP DOI BibTeX RDF |
Collective Decision Making, Reinforcement Learning, k-Nearest-Neighbors |
| 1 | Pawel Wawrzynski |
Reinforcement Learning in Fine Time Discretization.  |
ICANNGA  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Daan Wierstra, Alexander Förster, Jan Peters, Jürgen Schmidhuber |
Solving Deep Memory POMDPs with Recurrent Policy Gradients.  |
ICANN  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Kazuyuki Hiraoka, Manabu Yoshida, Taketoshi Mishima |
Parallel Reinforcement Learning for Weighted Multi-criteria Model with Adaptive Margin.  |
ICONIP  |
2007 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 229 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ >>] |
|