The graphs summarize 1684 occurrences of 669 keywords
Results
Found 3379 publication records. Showing 3379 according to the selection in the facets.
| Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 4 | Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, Jan Ramon, Kurt Driessens |
Learning with whom to communicate using relational reinforcement learning.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
relational reinforcement learning, multi-agent systems, reinforcement learning |
| 4 | Dongbing Gu, Erfu Yang |
Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems.  |
Journal of Intelligent and Robotic Systems  |
2007 |
DBLP DOI BibTeX RDF |
flocking behavior, policy gradient reinforcement learning, cooperative control, multi-agent reinforcement learning |
| 4 | Jing Shen, Guochang Gu, Haibo Liu |
Multi-Agent Hierarchical Reinforcement Learning by Integrating Options into MAXQ.  |
IMSCCS  |
2006 |
DBLP DOI BibTeX RDF |
MAXQ, Options, hierarchical reinforcement learning, multi-agent reinforcement learning |
| 4 | Carlos Diuk, Alexander L. Strehl, Michael L. Littman |
A hierarchical approach to efficient reinforcement learning in deterministic domains.  |
AAMAS  |
2006 |
DBLP DOI BibTeX RDF |
factored representations, reinforcement learning, hierarchical reinforcement learning, sample complexity |
| 3 | Sina Meraji, Wei Zhang 0034, Carl Tropper |
Brief announcement: a reinforcement learning approach for dynamic load-balancing of parallel digital logic simulation.  |
SPAA  |
2010 |
DBLP DOI BibTeX RDF |
digital logic simulation, reinforcement learning, dynamic load-balancing, time warp, verilog |
| 3 | Jartuwat Rajruangrabin, Dan O. Popa |
Reinforcement learning of interface mapping for interactivity enhancement of robot control in assistive environments.  |
PETRA  |
2010 |
DBLP DOI BibTeX RDF |
reinforcement learning, human-robot interface |
| 3 | Pitoyo Hartono, Sachiko Kakita |
Fast reinforcement learning for simple physical robots.  |
Memetic Computing  |
2009 |
DBLP DOI BibTeX RDF |
Neural network, Reinforcement learning, Autonomous robot, Competitive learning |
| 3 | Jiang Zhu, Jun Wang, Tao Luo, Shaoqian Li |
Adaptive transmission scheduling over fading channels for energy-efficient cognitive radio networks by reinforcement learning.  |
Telecommunication Systems  |
2009 |
DBLP DOI BibTeX RDF |
Energy-efficient networks, Reinforcement learning, Cognitive radio, Markov decision process, Cross-layer design |
| 3 | Ana Iglesias, Paloma Martínez, Ricardo Aler, Fernando Fernández |
Learning teaching strategies in an Adaptive and Intelligent Educational System through Reinforcement Learning.  |
Appl. Intell.  |
2009 |
DBLP DOI BibTeX RDF |
Adaptive and Intelligent Educational Systems, Learning pedagogical strategies, Reinforcement Learning, Intelligent tutoring systems, Applied artificial intelligence |
| 3 | Martin Riedmiller, Thomas Gabel, Roland Hafner, Sascha Lange |
Reinforcement learning for robot soccer.  |
Auton. Robots  |
2009 |
DBLP DOI BibTeX RDF |
Learning mobile robots, Autonomous learning robots, Batch reinforcement learning, RoboCup, Neural control |
| 3 | Mehrtash Tafazzoli Harandi, Majid Nili Ahmadabadi, Babak Nadjar Araabi |
Optimal Local Basis: A Reinforcement Learning Approach for Face Recognition.  |
International Journal of Computer Vision  |
2009 |
DBLP DOI BibTeX RDF |
Feature selection, Face recognition, Reinforcement learning |
| 3 | C. van Reeuwijk |
Maestro: a self-organizing peer-to-peer dataflow framework using reinforcement learning.  |
HPDC  |
2009 |
DBLP DOI BibTeX RDF |
peer to peer, reinforcement learning, self organizing |
| 3 | Jae-Yoon Jung, James A. Reggia |
Evolving an autonomous agent for non-Markovian reinforcement learning.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
descriptive encoding, genetic programming, reinforcement learning, evolution strategy |
| 3 | Hisashi Handa |
EDA-RL: estimation of distribution algorithms for reinforcement learning problems.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning problems, estimation of distribution algorithms, conditional random fields |
| 3 | Rogier Koppejan, Shimon Whiteson |
Neuroevolutionary reinforcement learning for generalized helicopter control.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
neural networks, evolutionary computation, reinforcement learning, robot control |
| 3 | Verena Heidrich-Meisner, Christian Igel |
Uncertainty handling CMA-ES for reinforcement learning.  |
GECCO  |
2009 |
DBLP DOI BibTeX RDF |
covariance matrix adaptation evolution strategy, direct policy search, reinforcement learning, uncertainty handling |
| 3 | Koji Iwamura, Norihisa Mayumi, Yoshitaka Tanimizu, Nobuhiro Sugimura |
A Study on Real-Time Scheduling for Holonic Manufacturing Systems - Determination of Utility Values Based on Multi-agent Reinforcement Learning.  |
HoloMAS  |
2009 |
DBLP DOI BibTeX RDF |
Coordination, Real-time Scheduling, Holonic Manufacturing Systems, Multi-agent Reinforcement Learning |
| 3 | S. Mostapha Kalami Heris, Mohammad-Bagher Naghibi Sistani, Naser Pariz |
Using Control Theory for Analysis of Reinforcement Learning and Optimal Policy Properties in Grid-World Problems.  |
ICIC  |
2009 |
DBLP DOI BibTeX RDF |
Discrete-Time Control Systems, Dynamic Programming, Reinforcement Learning, Markov Decision Process, Stochastic Control |
| 3 | Pengcheng Zhang, Xin Xu, Chunming Liu, Qiping Yuan |
Reinforcement Learning Control of a Real Mobile Robot Using Approximate Policy Iteration.  |
ISNN  |
2009 |
DBLP DOI BibTeX RDF |
Approximate policy iteration, Approximate dynamic programming, Reinforcement learning, Mobile robots, Path following |
| 3 | Seyed Jalal Kazemitabar, Hamid Beigy |
Using Strongly Connected Components as a Basis for Autonomous Skill Acquisition in Reinforcement Learning.  |
ISNN  |
2009 |
DBLP DOI BibTeX RDF |
hierarchical reinforcement learning, strongly connected components, skill acquisition |
| 3 | Julio H. Zaragoza, Eduardo F. Morales |
A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots.  |
MICAI  |
2009 |
DBLP DOI BibTeX RDF |
Relational Reinforcement Learning, Continuous Actions, Robotics |
| 3 | Peter Vamplew, Richard Dazeley, Ewan Barker, Andrei Kelarev |
Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks.  |
Australasian Conference on Artificial Intelligence  |
2009 |
DBLP DOI BibTeX RDF |
scalarisation, reinforcement learning, Pareto fronts, multiobjective |
| 3 | Jun Wang, Carl Tropper |
Selecting GVT interval for time-warp-based distributed simulation using reinforcement learning technique.  |
SpringSim  |
2009 |
DBLP DOI BibTeX RDF |
GVT, distributed VLSI simulation, n-armed bandit, reinforcement learning, time warp, parallel and distributed simulation |
| 3 | Roy Chaoming Hsu, Cheng-Ting Liu, Wei-Ming Lee |
Reinforcement Learning-Based Dynamic Power Management for Energy Harvesting Wireless Sensor Network.  |
IEA/AIE  |
2009 |
DBLP DOI BibTeX RDF |
Energy Neutrality, Wireless Sensor Network, Reinforcement Learning, Energy Harvesting, Dynamic Power Management |
| 3 | Takeshi Mori, Shin Ishii |
An Additive Reinforcement Learning.  |
ICANN  |
2009 |
DBLP DOI BibTeX RDF |
approximation of value function, Reinforcement learning |
| 3 | Alexander Hans, Steffen Udluft |
Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data.  |
ICANN  |
2009 |
DBLP DOI BibTeX RDF |
Reinforcement learning, uncertainty, model-based, Bayesian modeling |
| 3 | Masumi Ishikawa, Kosuke Ueno |
Hierarchical Architecture with Modular Network SOM and Modular Reinforcement Learning.  |
ICANN  |
2009 |
DBLP DOI BibTeX RDF |
Modular network SOM, modular reinforcement learning, pursuit-evasion game, hierarchical architecture |
| 3 | Jia Rao, Xiangping Bu, Cheng-Zhong Xu, Le Yi Wang, Gang George Yin |
VCONF: a reinforcement learning approach to virtual machines auto-configuration.  |
ICAC  |
2009 |
DBLP DOI BibTeX RDF |
cloud computing, virtual machines, reinforcement learning, autonomic computing |
| 3 | Olga Yugay, Lee Tae Kyung, Franz I. S. Ko |
Reinforcement learning coordination with combined heuristics in multi-agent environment for university timetabling.  |
Int. Conf. Interaction Sciences  |
2009 |
DBLP DOI BibTeX RDF |
multi-agent systems, reinforcement learning, timetabling |
| 3 | José Antonio Martin H., Javier de Lope Asiaín |
Learning Autonomous Helicopter Flight with Evolutionary Reinforcement Learning.  |
EUROCAST  |
2009 |
DBLP DOI BibTeX RDF |
Autonomous Helicopter, Evolutionary Computation, Reinforcement Learning |
| 3 | Natalia Akchurina |
Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
computation of equilibria, multiagent reinforcement learning, algorithmic game theory, stochastic games |
| 3 | Ioannis Partalas, Grigorios Tsoumakas, Konstantinos Tzevanidis, Ioannis P. Vlahavas |
Transferring experience in reinforcement learning through task decomposition.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning, transfer learning |
| 3 | Shivaram Kalyanakrishnan, Peter Stone |
An empirical analysis of value function-based and policy search reinforcement learning.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
policy search, reinforcement learning, function approximation, temporal difference learning |
| 3 | Todd Hester, Peter Stone |
Generalized model learning for reinforcement learning in factored domains.  |
AAMAS  |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning, supervised learning |
| 3 | M. Gómez, L. Gayarre, Tomás Martínez-Marín, S. Sánchez, Daniel Meziat |
Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning*.  |
IWANN  |
2009 |
DBLP DOI BibTeX RDF |
Cell-Mapping, Dynamic Programming, Reinforcement Learning, Optimal Control, Q-Learning |
| 3 | Dusko Katic, Aleksandar D. Rodic, Miomir Vukobratovic |
Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning.  |
Journal of Intelligent and Robotic Systems  |
2008 |
DBLP DOI BibTeX RDF |
Biped locomotion, Integrated dynamic control, Actor-critic method, Reinforcement learning, Humanoid robots |
| 3 | Frédéric Goualard, Christophe Jermann |
A Reinforcement Learning Approach to Interval Constraint Propagation.  |
Constraints  |
2008 |
DBLP DOI BibTeX RDF |
Interval propagation, Reinforcement learning, Numerical constraints |
| 3 | Neville Mehta, Sriraam Natarajan, Prasad Tadepalli, Alan Fern |
Transfer in variable-reward hierarchical reinforcement learning.  |
Machine Learning  |
2008 |
DBLP DOI BibTeX RDF |
Average-reward learning, Multi-criteria learning, Transfer learning, Hierarchical reinforcement learning |
| 3 | David Vengerov |
A reinforcement learning framework for online data migration in hierarchical storage systems.  |
The Journal of Supercomputing  |
2008 |
DBLP DOI BibTeX RDF |
Self-optimizing systems, Multi-tier storage, Fuzzy rulebase, Reinforcement learning, Markov decision process, Cost functions, Data migration |
| 3 | Daniel Machado, Miguel Rocha |
getALife - An Artificial Life Environment for the Evaluation of Agent-Based Systems and Evolutionary Algorithms for Reinforcement Learning.  |
New Challenges in Applied Intelligence Technologies  |
2008 |
DBLP DOI BibTeX RDF |
Artificial Life simulators, Prey-predator systems, Evolutionary Algorithms for Reinforcement learning |
| 3 | Lior Kuyer, Shimon Whiteson, Bram Bakker, Nikos A. Vlassis |
Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs.  |
ECML/PKDD  |
2008 |
DBLP DOI BibTeX RDF |
coordination graphs, max-plus, reinforcement learning, multiagent systems, traffic control |
| 3 | Matthias Rungger, Hao Ding, Olaf Stursberg |
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning.  |
ABiALS  |
2008 |
DBLP DOI BibTeX RDF |
hybrid automaton, behavioral programming, artificial intelligence, Reinforcement learning, planning, hierarchical model |
| 3 | Jan Hendrik Metzen, Frank Kirchner, Mark Edgington, Yohannes Kassahun |
Towards efficient online reinforcement learning using neuroevolution.  |
GECCO  |
2008 |
DBLP DOI BibTeX RDF |
reinforcement learning, online-learning, neuroevolution |
| 3 | Erik J. Dries, Gilbert L. Peterson |
Scaling ant colony optimization with hierarchical reinforcement learning partitioning.  |
GECCO  |
2008 |
DBLP DOI BibTeX RDF |
swarm intelligence, ant colony optimization, hierarchical reinforcement learning |
| 3 | Jian Fan, Minrui Fei, LiKang Shao, Feng Huang |
A Novel Multi-robot Coordination Method Based on Reinforcement Learning.  |
ICIC  |
2008 |
DBLP DOI BibTeX RDF |
behavior weight, role transformation, reinforcement learning, multi-robot |
| 3 | Mohammad Kashki, Youssef Lotfy Abdel-Magid, Mohammad Ali Abido |
A Reinforcement Learning Automata Optimization Approach for Optimum Tuning of PID Controller in AVR System.  |
ICIC  |
2008 |
DBLP DOI BibTeX RDF |
reinforcement learning automata, CARLA, evolutionary computations, PID |
| 3 | Erik Kuefler, Tzu-Yi Chen |
On Using Reinforcement Learning to Solve Sparse Linear Systems.  |
ICCS  |
2008 |
DBLP DOI BibTeX RDF |
reinforcement learning, iterative methods, preconditioners |
| 3 | Robby Goetschalckx, Scott Sanner, Kurt Driessens |
Reinforcement Learning with the Use of Costly Features.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, José Rafael Magdalena Benedito, Juan Gómez-Sanchís |
Use of Reinforcement Learning in Two Real Applications.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Verena Heidrich-Meisner, Christian Igel |
Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Thomas Gabel, Martin Riedmiller |
Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Kirill Dyagilev, Shie Mannor, Nahum Shimkin |
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin |
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Francis Maes, Ludovic Denoyer, Patrick Gallinari |
Applications of Reinforcement Learning to Structured Prediction.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
|
| 3 | Peter Vamplew, John Yearwood, Richard Dazeley, Adam Berry |
On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts.  |
Australasian Conference on Artificial Intelligence  |
2008 |
DBLP DOI BibTeX RDF |
scalarisation, reinforcement learning, Pareto fronts, multiobjective |
| 3 | Nikolay Borissov, Arun Anandasivam, Niklas Wirström, Dirk Neumann |
Rational Bidding Using Reinforcement Learning.  |
GECON  |
2008 |
DBLP DOI BibTeX RDF |
Bid Generation, Service Provisioning and Usage, Grid Computing, Reinforcement learning |
| 3 | Hamid Boubertakh, Mohamed Tadjine, Pierre-Yves Glorennec |
A Simple Goal Seeking Navigation Method for a Mobile Robot Using Human Sense, Fuzzy Logic and Reinforcement Learning.  |
KES  |
2008 |
DBLP DOI BibTeX RDF |
Fuzzy logic, Reinforcement learning, Obstacle avoidance, Mobile robot navigation |
| 3 | François Klein, Christine Bourjot, Vincent Chevrier |
Contribution to the Control of a MAS's Global Behaviour: Reinforcement Learning Tools.  |
ESAW  |
2008 |
DBLP DOI BibTeX RDF |
experimental approach, global behaviour, MAS, reinforcement learning, Control, emergence |
| 3 | Cheng-Ting Liu, Roy Chaoming Hsu |
Adaptive Power Management Based on Reinforcement Learning for Embedded System.  |
IEA/AIE  |
2008 |
DBLP DOI BibTeX RDF |
adaptive power management, embedded system, reinforcement learning |
| 3 | Yolanda Sanz, Javier de Lope Asiaín, José Antonio Martin H. |
Applying Reinforcement Learning to Multi-robot Team Coordination.  |
HAIS  |
2008 |
DBLP DOI BibTeX RDF |
Coordination, Reinforcement Learning, Multi-robot Systems, Cooperative Behaviors |
| 3 | Xiong Li, Wei Chen, Zhenkun Zhai, Jie Wang |
The Application of Hybrid Distributed Reinforcement Learning Algorithm in RoboCup 2D Soccer Simulation System.  |
ICIRA  |
2008 |
DBLP DOI BibTeX RDF |
RoboCup 2D Soccer Simulation System, Hybrid Distributed Reinforcement Learning Algorithm, Multiagent |
| 3 | Jianghao Li, Zhenbo Li, Jiapin Chen |
Reinforcement Learning Based Precise Positioning Method for a Millimeters-Sized Omnidirectional Mobile Microrobot.  |
ICIRA  |
2008 |
DBLP DOI BibTeX RDF |
precise positioning, mobile microrobot, electromagnetic micromotor, reinforcement learning |
| 3 | Toshiyuki Yasuda, Kazuhiro Ohkura |
A Reinforcement Learning Technique with an Adaptive Action Generator for a Multi-robot System.  |
SAB  |
2008 |
DBLP DOI BibTeX RDF |
Autonomous Specialization, Action Search, Reinforcement Learning, Multi-Robot System |
| 3 | Matthieu Geist, Olivier Pietquin, Gabriel Fricout |
Bayesian Reward Filtering.  |
EWRL  |
2008 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Function Approximation, Bayesian Filtering |
| 3 | M. Mainegra Hing, Aart van Harten, P. C. Schuur |
Reinforcement learning versus heuristics for order acceptance on a single resource.  |
J. Heuristics  |
2007 |
DBLP DOI BibTeX RDF |
Order acceptance, Artificial neural networks, Reinforcement learning, Markov decision process, Opportunity costs, Decisions under uncertainty |
| 3 | Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani |
On the use of hybrid reinforcement learning for autonomic resource allocation.  |
Cluster Computing  |
2007 |
DBLP DOI BibTeX RDF |
Policy learning, Resource allocation, Reinforcement learning, Performance management |
| 3 | Gerald Tesauro |
Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies.  |
IEEE Internet Computing  |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning, training, autonomic computing, systems management |
| 3 | Katja Verbeeck, Ann Nowé, Johan Parent, Karl Tuyls |
Exploring selfish reinforcement learning in repeated games with stochastic rewards.  |
Autonomous Agents and Multi-Agent Systems  |
2007 |
DBLP DOI BibTeX RDF |
Non-zero sum games, Learning automata, Multi-agent reinforcement learning |
| 3 | Olivier Buffet, Alain Dutech, François Charpillet |
Shaping multi-agent systems with gradient reinforcement learning.  |
Autonomous Agents and Multi-Agent Systems  |
2007 |
DBLP DOI BibTeX RDF |
Policy-gradient, Multi-agent systems, Reinforcement learning, Shaping, Partially observable Markov decision processes |
| 3 | I. S. Razo-Zapata, Julio Waissman Vilanova, Luis Enrique Ramos Velasco |
Reinforcement Learning in Continuous Systems: Wavelet Networks Approach.  |
Analysis and Design of Intelligent Systems using Soft Computing Techniques  |
2007 |
DBLP DOI BibTeX RDF |
adaptive wavelet networks, continuous systems, underactuated systems, Reinforcement learning |
| 3 | Mohammad Hossein Fazel Zarandi, Javid Jouzdani, I. Burhan Türksen |
Generalized Reinforcement Learning Fuzzy Control with Vague States.  |
Analysis and Design of Intelligent Systems using Soft Computing Techniques  |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning, fuzzy control, fuzzy systems |
| 3 | Luiz A. Celiberto, Jackson Paul Matsuura, Reinaldo A. C. Bianchi |
Heuristic Q-Learning Soccer Players: A New Reinforcement Learning Approach to RoboCup Simulation.  |
EPIA Workshops  |
2007 |
DBLP DOI BibTeX RDF |
RoboCup Simulation 2D, Reinforcement Learning, Cognitive Robotics |
| 3 | Omid Aghazadeh, Maziar Ahmad Sharbafi, Abolfazl Toroghi Haghighat |
Implementing Parametric Reinforcement Learning in Robocup Rescue Simulation.  |
RoboCup  |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Decision Making, Multi Agent Coordination |
| 3 | Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Helena Reali Costa, Reinaldo A. C. Bianchi |
Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents.  |
RoboCup  |
2007 |
DBLP DOI BibTeX RDF |
RoboCup Simulation 2D, Reinforcement Learning, Cognitive Robotics |
| 3 | Xiaobei Cheng, Jing Shen, Haibo Liu, Guochang Gu |
Multi-robot Cooperation Based on Hierarchical Reinforcement Learning.  |
International Conference on Computational Science  |
2007 |
DBLP DOI BibTeX RDF |
cooperation, multi-robot, hierarchical reinforcement learning |
| 3 | Daniel Lockery, James F. Peters |
Robotic Target Tracking with Approximation Space-Based Feedback During Reinforcement Learning.  |
RSFDGrC  |
2007 |
DBLP DOI BibTeX RDF |
rough sets, reinforcement learning, target tracking, Q-learning, Approximation space, monocular vision |
| 3 | Bernhard Hengst |
Safe State Abstraction and Reusable Continuing Subtasks in Hierarchical Reinforcement Learning.  |
Australian Conference on Artificial Intelligence  |
2007 |
DBLP DOI BibTeX RDF |
state abstraction, task hierarchies, decomposition, hierarchical reinforcement learning |
| 3 | Sudhakara P. Reddy, Raju S. Bapi, Chakravarthy Bhagvati, Bulusu Lakshmana Deekshatulu |
Concept Pre-digestion Method for Image Relevance Reinforcement Learning.  |
ICCTA  |
2007 |
DBLP DOI BibTeX RDF |
Concept Digestion Method, Reinforcement Learning, Relevance Feedback, Q-Learning |
| 3 | Kathryn Elizabeth Merrick, Mary Lou Maher |
Motivated reinforcement learning for adaptive characters in open-ended simulation games.  |
Advances in Computer Entertainment Technology  |
2007 |
DBLP DOI BibTeX RDF |
adaptive characters, motivated reinforcement learning, computer games, context-free grammar |
| 3 | Jinsong Leng, Colin Fyfe, Lakhmi C. Jain |
Reinforcement Learning of Competitive Skills with Soccer Agents.  |
KES  |
2007 |
DBLP DOI BibTeX RDF |
Agents, Reinforcement Learning, Decision Making |
| 3 | Francesco Bertoluzzo, Marco Corazza |
Making Financial Trading by Recurrent Reinforcement Learning.  |
KES  |
2007 |
DBLP DOI BibTeX RDF |
Financial trading system, recurrent reinforcement learning, no-hidden-layer perceptron model, returns weighted directional symmetry measure, gradient ascent technique, world financial market indices |
| 3 | Koichiro Morihiro, Haruhiko Nishimura, Teijiro Isokawa, Nobuyuki Matsui |
Reinforcement Learning Scheme for Grouping and Anti-predator Behavior.  |
KES  |
2007 |
DBLP DOI BibTeX RDF |
Anti-Predator, Reinforcement Learning, Grouping Behavior |
| 3 | Nima Taghipour, Ahmad Kardan, Saeed Shiry Ghidary |
Usage-based web recommendations: a reinforcement learning approach.  |
RecSys  |
2007 |
DBLP DOI BibTeX RDF |
machine learning, recommender systems, personalization, reinforcement learning, web usage mining |
| 3 | Jinsong Leng, Lakhmi C. Jain, Colin Fyfe |
Convergence Analysis on Approximate Reinforcement Learning.  |
KSEM  |
2007 |
DBLP DOI BibTeX RDF |
Approximate reinforcement learning, Agent, Convergence |
| 3 | Richardson Ribeiro, Alessandro L. Koerich, Fabrício Enembreck |
Noise Tolerance in Reinforcement Learning Algorithms.  |
IAT  |
2007 |
DBLP DOI BibTeX RDF |
Adaptive Autonomous Agents, Reinforcement Learning and Noise Tolerant Learning |
| 3 | J. Akilandeswari, N. P. Gopalan |
A Novel Design of Hidden Web Crawler Using Reinforcement Learning Based Agents.  |
APPT  |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Intelligent Agent, Web mining, Web Crawler, Hidden Web |
| 3 | José Antonio Martin H., Javier de Lope Asiaín |
A k-NN Based Perception Scheme for Reinforcement Learning.  |
EUROCAST  |
2007 |
DBLP DOI BibTeX RDF |
Collective Decision Making, Reinforcement Learning, k-Nearest-Neighbors |
| 3 | Sherief Abdallah, Victor R. Lesser |
Multiagent reinforcement learning and self-organization in a network of agents.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
reorganization, network, reinforcement learning, multiagent systems |
| 3 | Eduardo Rodrigues Gomes, Ryszard Kowalczyk |
Reinforcement learning with utility-aware agents for market-based resource allocation.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
market-based resource allocation, reinforcement learning |
| 3 | Matthew Grounds, Daniel Kudenko |
Parallel reinforcement learning with linear function approximation.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
value function approximation, parallel algorithms, reinforcement learning |
| 3 | Nicholas K. Jong, Peter Stone |
Model-based function approximation in reinforcement learning.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
models, reinforcement learning, function approximation |
| 3 | Michael Rovatsos, Alexandros Belesiotis |
Advice taking in multiagent reinforcement learning.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
multiagent reinforcement learning, communication |
| 3 | Mazda Ahmadi, Matthew E. Taylor, Peter Stone |
IFSA: incremental feature-set augmentation for reinforcement learning tasks.  |
AAMAS  |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning |
| 3 | Tom Croonenborghs, Kurt Driessens, Maurice Bruynooghe |
Learning Relational Options for Inductive Transfer in Relational Reinforcement Learning.  |
ILP  |
2007 |
DBLP DOI BibTeX RDF |
Relational Reinforcement Learning, Transfer Learning, Options |
| 3 | Quan Liu, Yang Gao 0001, Zhiming Cui, WangShu Yao, ZhongWen Chen |
An Tableau Automated Theorem Proving Method Using Logical Reinforcement Learning.  |
ISICA  |
2007 |
DBLP DOI BibTeX RDF |
logical reinforcement learning, tableau automated theorem proving, LOMDP |
| 3 | Toshiyuki Yasuda, Kazuhiro Ohkura |
Improving Search Efficiency in the Action Space of an Instance-Based Reinforcement Learning Technique for Multi-robot Systems.  |
ECAL  |
2007 |
DBLP DOI BibTeX RDF |
Autonomous Specialisation, Action Search, Reinforcement Learning, Multi-robot System |
| 3 | Güray Erus, Faruk Polat |
A layered approach to learning coordination knowledge in multiagent environments.  |
Appl. Intell.  |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement learning, Multiagent learning, Hierarchical reinforcement learning |
| 3 | Prasad Kulkarni, Dip Goswami, Prithwijit Guha, Ashish Dutta |
Path Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning.  |
Journal of Intelligent and Robotic Systems  |
2006 |
DBLP DOI BibTeX RDF |
PRM, statically stable biped robot, reinforcement learning, potential function |
| 3 | Yu Fei, Vincent W. S. Wong, Victor C. M. Leung |
Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning.  |
MONET  |
2006 |
DBLP DOI BibTeX RDF |
adaptive multimedia, QoS, reinforcement learning, mobile communication networks |
| 3 | Kurt Driessens, Jan Ramon, Thomas Gärtner |
Graph kernels and Gaussian processes for relational reinforcement learning.  |
Machine Learning  |
2006 |
DBLP DOI BibTeX RDF |
Reinforcement learning, Gaussian processes, Relational learning, Graph kernels |