The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for reward with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1967-1985 (17) 1986-1990 (17) 1991-1993 (24) 1994-1995 (22) 1996 (16) 1997-1998 (31) 1999 (27) 2000 (31) 2001 (49) 2002 (63) 2003 (54) 2004 (91) 2005 (98) 2006 (149) 2007 (170) 2008 (164) 2009 (123) 2010 (93) 2011 (81) 2012 (89) 2013 (107) 2014 (86) 2015 (105) 2016 (125) 2017 (159) 2018 (195) 2019 (268) 2020 (321) 2021 (403) 2022 (471) 2023 (585) 2024 (185)
Publication types (Num. hits)
article(2178) incollection(26) inproceedings(2189) mastersthesis(1) phdthesis(25)
Venues (Conferences, Journals, ...)
CoRR(908) NeuroImage(175) AAMAS(88) ICML(74) NeurIPS(73) AAAI(71) J. Cogn. Neurosci.(52) IJCNN(46) IJCAI(40) IEEE Access(38) ICRA(36) ICLR(34) HICSS(29) IROS(28) CogSci(24) AISTATS(22) More (+10 of total 1273)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 862 occurrences of 567 keywords

Results
Found 4420 publication records. Showing 4419 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
32Wei-lun Kao, Ravishankar K. Iyer, Dong Tang FINE: A Fault Injection and Monitoring Environment for Tracing the UNIX System Behavior under Faults. Search on Bibsonomy IEEE Trans. Software Eng. The full citation details ... 1993 DBLP  DOI  BibTeX  RDF FINE, fault injection and monitoring environment, UNIX system behavior, hardware-induced software errors, fault injector, analysis utilities, SunOS 4.1.2, transient Markov reward analysis, bus faults, CPU faults, pointer faults, software tools, Unix, program testing, system monitoring, software faults, software monitor, workload generator
32Jing Lei 0005, Roy D. Yates, Larry J. Greenstein A generic model for optimizing single-hop transmission policy of replenishable sensors. Search on Bibsonomy IEEE Trans. Wirel. Commun. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
32Kuang-Yuan Chen, Peter A. Lindsay Feedback of Delayed Rewards in XCS for Environments with Aliasing States. Search on Bibsonomy ACAL The full citation details ... 2009 DBLP  DOI  BibTeX  RDF aliasing states problem, credit assignment, maze problems, Learning Classifier Systems, XCS
32Adam J. Mersereau, Paat Rusmevichientong, John N. Tsitsiklis A structured multiarmed bandit problem and the greedy policy. Search on Bibsonomy CDC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
32Thach Huy Nguyen, Pornthep Rojanavasu, Ouen Pinngern Cost-Xensitive XCS Classifier System Addressing Imbalance Problems. Search on Bibsonomy FSKD (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
32Andrea Soltoggio, Peter Dürr, Claudio Mattiussi, Dario Floreano Evolving neuromodulatory topologies for reinforcement learning-like problems. Search on Bibsonomy IEEE Congress on Evolutionary Computation The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
32Esteban Arcaute, Adam Kirsch, Ravi Kumar 0001, David Liben-Nowell, Sergei Vassilvitskii On threshold behavior in query incentive networks. Search on Bibsonomy EC The full citation details ... 2007 DBLP  DOI  BibTeX  RDF query incentive networks, threshold phenomena, branching processes
32Jian (Denny) Lin, Albert M. K. Cheng Maximizing Guaranteed QoS in (m, k)-firm Real-time Systems. Search on Bibsonomy RTCSA The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
32Karel Sladký Risk-Sensitive Optimality Criteria in Markov Decision Processes. Search on Bibsonomy OR The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
32Dirk Thierens An adaptive pursuit strategy for allocating operator probabilities. Search on Bibsonomy GECCO The full citation details ... 2005 DBLP  DOI  BibTeX  RDF adaptive operator allocation, adaptive pursuit, non-stationary operator probabilities, multi-armed bandit, non-stationary environment
32Ayako Onzo, Ken Mogi Cognitive Process of Emotion Under Uncertainty. Search on Bibsonomy ICONIP The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
32Ann T. Tai, William H. Sanders, Leon Alkalai, Savio N. Chau, Kam S. Tso Performability Analysis of Guarded-Operation Duration: A Successive Model-Translation Approach. Search on Bibsonomy DSN The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
32Boudewijn R. Haverkort, Lucia Cloth, Holger Hermanns, Joost-Pieter Katoen, Christel Baier Model Checking Performability Properties. Search on Bibsonomy DSN The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
32Martin Ulrich, Alexander Rüger, Verena Durner, Georg Grön, Heiko Graf Reward is not reward: Differential impacts of primary and secondary rewards on expectation, outcome, and prediction error in the human brain's reward processing regions. Search on Bibsonomy NeuroImage The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
27Pooia Lalbakhsh, Bahram Zaeri, Ali Lalbakhsh, Mehdi N. Fesharaki AntNet with Reward-Penalty Reinforcement Learning. Search on Bibsonomy CICSyN The full citation details ... 2010 DBLP  DOI  BibTeX  RDF AntNet, Reward-penalty reinforcement Learning, Swarm intelligence, Ant Colony Optimization
27Marek Grzes, Daniel Kudenko Theoretical and Empirical Analysis of Reward Shaping in Reinforcement Learning. Search on Bibsonomy ICMLA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF reward shaping, heuristics, reinforcement learning
27Okan Yilmaz, Ing-Ray Chen Comparative Performance Analysis of CAC Reward Optimization Algorithms in Wireless Networks. Search on Bibsonomy AINA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF reward optimization, performance analysis, Admission control, mobile networks, QoS guarantees
27Manuel Lopes 0001, Francisco S. Melo, Luis Montesano Active Learning for Reward Estimation in Inverse Reinforcement Learning. Search on Bibsonomy ECML/PKDD (2) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
27Guiqing Zhang, Yinfeng Xu A Risk-Reward Competitive Analysis for the Newsboy Problem with Range Information. Search on Bibsonomy COCOA The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
27Simon Andrew Williamson, Enrico H. Gerding, Nicholas R. Jennings Reward shaping for valuing communications during multi-agent coordination. Search on Bibsonomy AAMAS (1) The full citation details ... 2009 DBLP  BibTeX  RDF decentralised POMDPs, communication, agents
27Qiang Shen 0001, Ruiqing Zhao, Wansheng Tang Modeling Random Fuzzy Renewal Reward Processes. Search on Bibsonomy IEEE Trans. Fuzzy Syst. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Clemens Moser, Jian-Jia Chen, Lothar Thiele Reward Maximization for Embedded Systems with Renewable Energies. Search on Bibsonomy RTCSA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Jiayu Gong, Xiliang Zhong, Cheng-Zhong Xu 0001 Energy and Timing Constrained System Reward Maximization on Wireless Networks. Search on Bibsonomy ICDCS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Andreas Huemer, David A. Elizondo, Mario Gongora 0001 A Reward-Value Based Constructive Method for the Autonomous Creation of Machine Controllers. Search on Bibsonomy ICANN (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Constructive Neural Network, Growing Machine Controllers, Reinforcement Learning, Spiking Neural Network
27Daan Wierstra, Tom Schaul, Jan Peters 0001, Jürgen Schmidhuber Episodic Reinforcement Learning by Logistic Reward-Weighted Regression. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Marek Grzes, Daniel Kudenko Multigrid Reinforcement Learning with Reward Shaping. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Akiyo Asahina, Junichiro Hirayama, Shin Ishii Interpreting Dopamine Activities in Stochastic Reward Tasks. Search on Bibsonomy ICONIP (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
27Chyng-Yang Jang Managing Fairness: Reward Distribution in a Self-organized Online Game Player Community. Search on Bibsonomy HCI (15) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Distributive justice, Fairness, Online community, MMORPG
27Ronald Ortner Pseudometrics for State Aggregation in Average Reward Markov Decision Processes. Search on Bibsonomy ALT The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Kenji Doya Designing the Reward System: Computational and Biological Principles. Search on Bibsonomy FOCI The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Fang He, Souren Paul Time Pressure and Reward Inspiration as Outcome Controls for Information Sharing in Problem-Solving Virtual Teams. Search on Bibsonomy HICSS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Ambuj Tewari, Peter L. Bartlett Bounded Parameter Markov Decision Processes with Average Reward Criterion. Search on Bibsonomy COLT The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Adam Ponzi Neural Network Model of Forward Shift of CA1 Place Fields Towards Reward Location. Search on Bibsonomy ICONIP (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27N. Sato, Kishor S. Trivedi Accurate and efficient stochastic reliability analysis of composite services using their compact Markov reward model representations. Search on Bibsonomy IEEE SCC The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
27Anne Remke, Boudewijn R. Haverkort, Lucia Cloth A versatile infinite-state Markov reward model to study bottlenecks in 2-hop ad hoc networks. Search on Bibsonomy QEST The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Jian Li, Laiwan Chan Reward Adjustment Reinforcement Learning for Risk-averse Asset Allocation. Search on Bibsonomy IJCNN The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat Reward Function and Initial Values: Better Choices for Accelerated Goal-Directed Reinforcement Learning. Search on Bibsonomy ICANN (1) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Colin Molter, Naoyuki Sato, Utku Salihoglu, Yoko Yamaguchi How Reward Can Induce Reverse Replay of Behavioral Sequences in the Hippocampus. Search on Bibsonomy ICONIP (1) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Hyungil Ahn, Rosalind W. Picard Affective-Cognitive Learning and Decision Making: A Motivational Reward Framework for Affective Agents. Search on Bibsonomy ACII The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
27Raúl Rodríguez-Colín, Jesús Ariel Carrasco-Ochoa, José Francisco Martínez Trinidad Reward-Punishment Editing for Mixed Data. Search on Bibsonomy CIARP The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
27Jie Yang 0005, Lei Shu 0001, Xiaoling Wu 0004, Jinsung Cho, Sungyoung Lee, Sangman Han ETRI-QM: Reward Oriented Query Model for Wireless Sensor Networks. Search on Bibsonomy EUC The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
27Tomás Brázdil, Antonín Kucera 0001 Computing the Expected Accumulated Reward and Gain for a Subclass of Infinite Markov Chains. Search on Bibsonomy FSTTCS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
27Vinu Vijay Kumar, Rashi Verma, John C. Lach, Joanne Bechta Dugan A Markov Reward Model for Reliable Synchronous Dataflow System Design. Search on Bibsonomy DSN The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Annalisa Franco, Davide Maltoni, Loris Nanni Reward-Punishment Editing. Search on Bibsonomy ICPR (4) The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Karel Sladký, Nico M. van Dijk Total Reward Variance in Discrete and Continuous Time Markov Chains. Search on Bibsonomy OR The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
27Frédéric Kaplan, Pierre-Yves Oudeyer Maximizing Learning Progress: An Internal Reward System for Development. Search on Bibsonomy Embodied Artificial Intelligence The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
27Orit Hazzan Computer science students' conception of the relationship between reward (grade) and cooperation. Search on Bibsonomy ITiCSE The full citation details ... 2003 DBLP  DOI  BibTeX  RDF software engineering, cooperation, teamwork
27Ming Zu, Albert Mo Kim Cheng Real-Time Scheduling of Hierarchical Reward-Based Tasks. Search on Bibsonomy IEEE Real Time Technology and Applications Symposium The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
27Sándor Rácz, Béla P. Tóth, Miklós Telek MRMSolve: A Tool for Transient Analysis of Large Markov Reward Models. Search on Bibsonomy Computer Performance Evaluation / TOOLS The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
27Susann C. Allmaier, David Kreische Parallel Approaches to the Numerical Transient Analysis of Stochastic Reward Nets. Search on Bibsonomy ICATPN The full citation details ... 1999 DBLP  DOI  BibTeX  RDF
27Aad P. A. van Moorsel, Latha A. Kant, William H. Sanders Computation of the Asymptotic Bias and Variance for Simulation of Markov Reward Models. Search on Bibsonomy Annual Simulation Symposium The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
24Eleni Vasilaki, Stefano Fusi, Xiao-Jing Wang, Walter Senn Learning flexible sensori-motor mappings in a complex network. Search on Bibsonomy Biol. Cybern. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Reward-modulated, Hebbian, Visuomotor task, Learning, Multilayer
24Clemens Moser, Jian-Jia Chen, Lothar Thiele Optimal service level allocation in environmentally powered embedded systems. Search on Bibsonomy SAC The full citation details ... 2009 DBLP  DOI  BibTeX  RDF energy harvesting systems, embedded systems, reward maximization, solar cells
24Luoyi Fu, Xinbing Wang, Qian Zhang 0001 Unified fixed point analysis of IEEE 802.11(e) WLAN under saturated and unsaturated conditions. Search on Bibsonomy IWCMC The full citation details ... 2009 DBLP  DOI  BibTeX  RDF fixed point analysis, renewal-reward theorem, unsaturated condition
24Oscar Díaz-Alcántara U-Training. A Framework to Create Ubiquitous Training Portals for Higher Education Teachers. Search on Bibsonomy ICIW The full citation details ... 2008 DBLP  DOI  BibTeX  RDF U-Training, Metric Control Programme, Reward Management Programme, e-Learning
24Xiaoxian He, Yunlong Zhu, Kunyuan Hu, Ben Niu 0002 A Swarm-Based Learning Method Inspired by Social Insects. Search on Bibsonomy ICIC (2) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Neighbor-Information-Reference (NIR) learning, i-interval neighbor, discounted reward, swarm intelligence, Q-learning
24Hiroaki Wagatsuma, Yoko Yamaguchi Context-Dependent Adaptive Behavior Generated in the Theta Phase Coding Network. Search on Bibsonomy ICONIP (2) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF amygdala, prefrontal cortex, theta phase precession, reward-evaluation, Khepera-robot, cognitive map, hippocampus, action-selection, place cells
24Levente Bodrog, Gábor Horváth 0002, Sándor Rácz, Miklós Telek A tool support for automatic analysis based on the tagged customer approach. Search on Bibsonomy QEST The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Tagged customer approach, Numerical analysis, Markov reward models
24Mourad Rabah, Karama Kanoun Performability Evaluation of Multipurpose Multiprocessor Systems: The "Separation of Concerns" Approach. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 2003 DBLP  DOI  BibTeX  RDF Dependability and performability evaluation, multipurpose multiprocessors systems, distributed shared memory, clustered systems, stochastic reward nets, modular modeling
24Cosmin Rusu, Rami G. Melhem, Daniel Mossé Maximizing rewards for real-time applications with energy constraints. Search on Bibsonomy ACM Trans. Embed. Comput. Syst. The full citation details ... 2003 DBLP  DOI  BibTeX  RDF reward-based, scheduling, real-time, operating systems, Power management
24Aaron Wilson, Margaret M. Burnett, Laura Beckwith, Orion Granatir, Ledah Casburn, Curtis R. Cook, Mike Durham, Gregg Rothermel Harnessing curiosity to increase correctness in end-user programming. Search on Bibsonomy CHI The full citation details ... 2003 DBLP  DOI  BibTeX  RDF forms/3, surprise-explain-reward strategy, assertions, end-user software engineering, curiosity
24Wing Ho A. Yuen, Roy D. Yates, Chi Wan Sung Effect of node mobility on highway mobile infostation networks. Search on Bibsonomy MSWiM The full citation details ... 2003 DBLP  DOI  BibTeX  RDF highway network, mobile infostation, renewal reward theory, ad hoc network, mobility, renewal processes, infostation
24Dong Chen, Selvamuthu Dharmaraja, Dongyan Chen, Lei Li, Kishor S. Trivedi, Raphael R. Some, Allen P. Nikora Reliability and Availability Analysis for the JPL Remote Exploration and Experimentation System. Search on Bibsonomy DSN The full citation details ... 2002 DBLP  DOI  BibTeX  RDF Fault-tolerance, Distributed systems, Markov chains, Transient faults, Hierarchical modeling, Fault trees, Dependability modeling, Stochastic reward nets
24Mira Park 0001, Jesse S. Jin, Laurence S. Wilson Fast Content-Based Image Retrieval Using Quasi-Gabor Filter and Reduction of Image Feature Dimension. Search on Bibsonomy SSIAI The full citation details ... 2002 DBLP  DOI  BibTeX  RDF Quasi-Gabor Filter, 2D FFT, Reward-Punishment algorithm, Feature Dimension Reduction, Content-Based Image Retrieval
24Anne M. P. Canuto, Gareth Howells 0001, Michael C. Fairhurst An Investigation of the Effects of Variable Vigilance within the RePART Neuro-Fuzzy Network. Search on Bibsonomy J. Intell. Robotic Syst. The full citation details ... 2000 DBLP  DOI  BibTeX  RDF reward/punishment parameter, RePART, fuzzy multi-layer perceptron, radial RAM, variable vigilance parameter, fuzzy ARTMAP, handwritten numeral recognition
24Osman Abul, Faruk Polat, Reda Alhajj Function approximation based multi-agent reinforcement learning. Search on Bibsonomy ICTAI The full citation details ... 2000 DBLP  DOI  BibTeX  RDF multi-agent based domain independent coordination mechanisms, coordination information, reward distribution, region-wide joint rewards, Adversarial Food-Collecting World, multi-agent environments, multi-agent systems, learning (artificial intelligence), function approximation, function approximation, state transitions, multi-agent reinforcement learning
24Mario Dal Cin, Gábor Huszerl, Konstantinos Kosmidis Quantitative Evaluation of Dependability Critical Systems Based on Guarded Statechart Models. Search on Bibsonomy HASE The full citation details ... 1999 DBLP  DOI  BibTeX  RDF stochasic reward nets, Embedded systems, statecharts, dependability analysis
24Paulo J. L. Adeodato, John G. Taylor Stability analysis of pRAM reinforcement learning. Search on Bibsonomy SBRN The full citation details ... 1997 DBLP  DOI  BibTeX  RDF pRAM networks, RAM-based neural networks, penalty/reward ratio, neural net chip, pattern recognition, stability, reinforcement learning, noise, generalisation, neural chips, time domain, basins of attraction
24Hsing Mei Scheduling dependent real-time multimedia tasks on distributed systems. Search on Bibsonomy COMPSAC The full citation details ... 1995 DBLP  DOI  BibTeX  RDF real-time multimedia task scheduling, periodic dependent real-time multimedia tasks, Multimedia Task Graph model, system reward value, scheduling, Quality of Services, distributed systems, real-time systems, resource allocation, distributed processing, software quality, synchronisation, processor scheduling, deadlines, multimedia computing, execution time, average response time, synchronization methods
24Lorrie A. Tomek, Jogesh K. Muppala, Kishor S. Trivedi Modeling Correlation in Software Recovery Blocks. Search on Bibsonomy IEEE Trans. Software Eng. The full citation details ... 1993 DBLP  DOI  BibTeX  RDF software recovery blocks, software fault-tolerance technique, successive acceptance tests, correct module outputs, pairwise correlation, beta-binomial density, Stochastic Reward Network, Stochastic Petri Net Package, SPNP, Petri nets, fault tolerant computing, software reliability, software reliability, statistical analysis, correlation, Markov models, stochastic modeling, system recovery, stochastic Petri nets, recovery blocks, functional specification
24Anja Austermann, Seiji Yamada, Kotaro Funakoshi, Mikio Nakano How do users interact with a pet-robot and a humanoid. Search on Bibsonomy CHI Extended Abstracts The full citation details ... 2010 DBLP  DOI  BibTeX  RDF asimo, robots, user studies, human-robot interaction, aibo
24Minija Tamosiunaite, James Ainge, Tomas Kulvicius, Bernd Porr, Paul Dudchenko, Florentin Wörgötter Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning. Search on Bibsonomy J. Comput. Neurosci. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF SARSA, Place field system, Weight decay, Reinforcement learning, Function approximation
24Akshat Verma, Rohit Jain, Sugata Ghosal A utility-based unified disk scheduling framework for shared mixed-media services. Search on Bibsonomy ACM Trans. Storage The full citation details ... 2008 DBLP  DOI  BibTeX  RDF GSP, shortest path, disk scheduling, Profit maximization
24Patrick D. Roberts, Roberto A. Santiago, Gerardo Lafferriere An implementation of reinforcement learning based on spike timing dependent plasticity. Search on Bibsonomy Biol. Cybern. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Learning, Computational neuroscience, Synaptic plasticity, Spiking neuron model
24Wenlong Ni, Wei Wayne Li, Mansoor Alam Optimal Call Admission Control Policy for the RCS Schemes in Wireless Networks. Search on Bibsonomy ICC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Guomin Zhang, Jianping Yin, En Zhu, Ling Mao On the Selection of Multi Optimal Imaging Frames in Single Time Slot for Earth Observation Satellite. Search on Bibsonomy ICYCS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Haibo Wang, Hans-Peter Schwefel, Thomas Skjødeberg Toftegaard History-Based Adaptive Modulation for a Downlink Multicast Channel in OFDMA Systems. Search on Bibsonomy WCNC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Eduardo Rodrigues Gomes, Ryszard Kowalczyk Non-symmetric Preferences in the IPA Market with Reinforcement Learning. Search on Bibsonomy IAT The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Paul Chorley, Anil K. Seth Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning. Search on Bibsonomy SAB The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Nguyen Hoang Viet, Ngo Anh Vien, TaeChoong Chung Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks. Search on Bibsonomy ICNSC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Tuan Zea Tan, Gary Kee Khoon Lee, Shie-Yui Liong, Tian Kuay Lim, Jiawei Chu, Terence Hung Rainfall intensity prediction by a spatial-temporal ensemble. Search on Bibsonomy IJCNN The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24N. Boris Margolin, Brian Neil Levine Informant: Detecting Sybils Using Incentives. Search on Bibsonomy Financial Cryptography The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Sirinart Tangruamsub, Proadpran Punyabukkana, Atiwong Suchato Thai Speech Keyword Spotting using Heterogeneous Acoustic Modeling. Search on Bibsonomy RIVF The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Sudipto Guha, Kamesh Munagala Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards. Search on Bibsonomy FOCS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richard Maclin Building Relational World Models for Reinforcement Learning. Search on Bibsonomy ILP The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Eiji Uchibe, Kenji Doya Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents. Search on Bibsonomy ICONIP (2) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Parosh Aziz Abdulla, Noomene Ben Henda, Richard Mayr, Sven Sandberg Eager Markov Chains. Search on Bibsonomy ATVA The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
24Paolo Arena, Luigi Fortuna, Mattia Frasca, Luca Patané, Marco Pavone Towards autonomous adaptive behavior in a bio-inspired CNN-controlled robot. Search on Bibsonomy ISCAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
24Christos Dimitrakakis Nearly Optimal Exploration-Exploitation Decision Thresholds. Search on Bibsonomy ICANN (1) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
24Naoki Abe, Alan W. Biermann, Philip M. Long Reinforcement Learning with Immediate Rewards and Linear Hypotheses. Search on Bibsonomy Algorithmica The full citation details ... 2003 DBLP  DOI  BibTeX  RDF Immediate rewards, Reinforcement learning, Online algorithms, Online learning, Decision theory, Dialogue systems, Computational learning theory
24Shie Mannor, Nahum Shimkin On-Line Learning with Imperfect Monitoring. Search on Bibsonomy COLT The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
24Christopher H. Brooks, Edmund H. Durfee Using Landscape Theory to Measure Learning Difficulty for Adaptive Agents. Search on Bibsonomy Adaptive Agents and Multi-Agents Systems The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Andrei Z. Broder, Michael Mitzenmacher Optmial plans for aggregation. Search on Bibsonomy PODC The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Natwar Modani, Parul A. Mittal, Amit Anil Nanavati, Biplav Srivastava Series of Dynamic Targeted Recommendations. Search on Bibsonomy EC-Web The full citation details ... 2002 DBLP  DOI  BibTeX  RDF recommender systems, E-commerce, targeting
24Marcus Hutter Self-Optimizing and Pareto-Optimal Policies in General Environments Based on Bayes-Mixtures. Search on Bibsonomy COLT The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Christel Baier, Boudewijn R. Haverkort, Holger Hermanns, Joost-Pieter Katoen Automated Performance and Dependability Evaluation Using Model Checking. Search on Bibsonomy Performance The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Weidong Zhou, Richard J. Coggins Computational Models of the Amygdala and the Orbitofrontal Cortex: A Hierarchical Reinforcement Learning System for Robotic Control. Search on Bibsonomy Australian Joint Conference on Artificial Intelligence The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Hiroyuki Okada, Hiroshi Yamakawa, Takashi Omori Two Dimensional Evaluation Reinforcement Learning. Search on Bibsonomy IWANN (1) The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
24Charles Lee Isbell Jr., Christian R. Shelton, Michael J. Kearns, Satinder Singh 0001, Peter Stone A social reinforcement learning agent. Search on Bibsonomy Agents The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
24Sheng-Tzong Cheng, Chi-Ming Chen, Ing-Ray Chen Dynamic Quota-Based Admission Control with Sub-Rating in Multimedia Servers. Search on Bibsonomy Multim. Syst. The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
Displaying result #101 - #200 of 4419 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license