The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for reward with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1967-1985 (17) 1986-1990 (17) 1991-1993 (24) 1994-1995 (22) 1996 (16) 1997-1998 (31) 1999 (27) 2000 (31) 2001 (49) 2002 (63) 2003 (54) 2004 (91) 2005 (98) 2006 (149) 2007 (170) 2008 (164) 2009 (123) 2010 (93) 2011 (81) 2012 (89) 2013 (107) 2014 (86) 2015 (105) 2016 (125) 2017 (159) 2018 (195) 2019 (268) 2020 (321) 2021 (403) 2022 (471) 2023 (585) 2024 (185)
Publication types (Num. hits)
article(2178) incollection(26) inproceedings(2189) mastersthesis(1) phdthesis(25)
Venues (Conferences, Journals, ...)
CoRR(908) NeuroImage(175) AAMAS(88) ICML(74) NeurIPS(73) AAAI(71) J. Cogn. Neurosci.(52) IJCNN(46) IJCAI(40) IEEE Access(38) ICRA(36) ICLR(34) HICSS(29) IROS(28) CogSci(24) AISTATS(22) More (+10 of total 1273)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 862 occurrences of 567 keywords

Results
Found 4420 publication records. Showing 4419 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
19Özgür Simsek, Andrew G. Barto An intrinsic reward mechanism for efficient exploration. Search on Bibsonomy ICML The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
19Radu Jurca, Boi Faltings Using CHI-scores to reward honest feedback from repeated interactions. Search on Bibsonomy AAMAS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF honest reporting, reputation mechanisms
19Koen Vanhoof, Pieter Pauwels, József Dombi 0001, Tom Brijs, Geert Wets Penalty-Reward Analysis with Uninorms: A Study of Customer (Dis)Satisfaction. Search on Bibsonomy Intelligent Data Mining The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
19Huaxiang Zhang 0001, Xiyu Liu 0001, Peide Liu Fuzzy Reward Modeling for Run-Time Peer Selection in Peer-to-Peer Networks. Search on Bibsonomy FSKD (1) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
19Lili Ding, Chunlin Xin, Jian Chen A Risk-Reward Competitive Analysis of the Bahncard Problem. Search on Bibsonomy AAIM The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
19Gustavo Deco The Computational Neuroscience ofVisual Cognition: Attention, Memory and Reward. Search on Bibsonomy WAPCV The full citation details ... 2004 DBLP  DOI  BibTeX  RDF biased competition, visual attention, computational neuroscience, theoretical model
19Hock-Hai Teo, W. Wan, L. Li Volunteering Personal Information on the Internet: Effects of Reputation, Privacy Initiatives, and Reward on Online Consumer Behavior. Search on Bibsonomy HICSS The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
19Shie Mannor Reinforcement Learning for Average Reward Zero-Sum Games. Search on Bibsonomy COLT The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
19Enrique Campos-Náñez, Stephen D. Patek On improving the performance of simulation-based algorithms for average reward processes with application to network pricing. Search on Bibsonomy WSC The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
19Hakan Aydin, Rami G. Melhem, Daniel Mossé Tolerating faults while maximizing reward. Search on Bibsonomy ECRTS The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
19Haïscam Abdallah, Moulaye Hamza Sensitivity Analysis of the Expected Accumulated Reward Using Uniformization and IRK3 Methods. Search on Bibsonomy NAA The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
19Srinivasan Ramani, Kishor S. Trivedi, Balakrishnan Dasarathy Performance Analysis of the CORBA Event Service using Stochastic Reward Nets. Search on Bibsonomy SRDS The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
19Tomoaki Sakaguchi, Arun K. Somani Hierarchical Stochastic Reward Net Solver Package. Search on Bibsonomy Computer Performance Evaluation (Tools) The full citation details ... 1998 DBLP  DOI  BibTeX  RDF
19S. Das Gupta An Algorithm to Estimate Sub-Optimal Present Values for Unichain Markov Processes with Alternative Reward Structures. Search on Bibsonomy Optimization Techniques The full citation details ... 1973 DBLP  DOI  BibTeX  RDF
16Jiayu Gong, Xiliang Zhong, Cheng-Zhong Xu 0001 Maximizing Rewards in Wireless Networks with Energy and Timing Constraints for Periodic Data Streams. Search on Bibsonomy IEEE Trans. Mob. Comput. The full citation details ... 2010 DBLP  DOI  BibTeX  RDF power-aware packet scheduling, embedded systems, wireless networks, Reward maximization
16Menghui Yang, Zhituo Li, Weikang Yang, Tonghong Li Analysis of Software Rejuvenation in Clustered Computing System with Dependency Relation between Nodes. Search on Bibsonomy CIT The full citation details ... 2010 DBLP  DOI  BibTeX  RDF Clustered computing System, Software rejuvenation, Software aging, Stochastic Reward Net
16Yoshinori Takata, Ryo Hashimoto, Ryoichi Shinkuma, Tatsuro Takahashi, Naoki Yoshinaga 0002, Satoko Itaya, Shinichi Doi, Keiji Yamada Incentive Rewarding Method for Information Propagation in Social Networks. Search on Bibsonomy SAINT The full citation details ... 2010 DBLP  DOI  BibTeX  RDF incentive reward, social networks, Information propagation
16Steffen Reidt, Mudhakar Srivatsa, Shane Balfe The fable of the bees: incentivizing robust revocation decision making in ad hoc networks. Search on Bibsonomy CCS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF bees, partially available, suicide, trust authority, game, incentive, revocation, reward
16Clemens Moser, Jian-Jia Chen, Lothar Thiele Power management in energy harvesting embedded systems with discrete service levels. Search on Bibsonomy ISLPED The full citation details ... 2009 DBLP  DOI  BibTeX  RDF energy harvesting systems, embedded systems, power management, reward maximization, solar cells
16Masumi Ishikawa, Takao Hagiwara, Naoyuki Yamamoto, Fumiko Kiriake Brain-Inspired Emergence of Behaviors in Mobile Robots by Reinforcement Learning with Internal Rewards. Search on Bibsonomy HIS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF brain-inspired, internal reward, reinforcement learning, mobile robot
16Kenneth Treharne, Darius Pfitzner, Richard Leibbrandt, David M. W. Powers A lean online approach to human factors research. Search on Bibsonomy PETRA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF participation reward, web delivery
16Laura Beckwith, Cory Kissinger, Margaret M. Burnett, Susan Wiedenbeck, Joseph Lawrance, Alan F. Blackwell, Curtis R. Cook Tinkering and gender in end-user programmers' debugging. Search on Bibsonomy CHI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF surprise-explain-reward, tinkering, debugging, gender, end-user programming, end-user software engineering, self-efficacy
16Laura Beckwith, Margaret M. Burnett, Susan Wiedenbeck, Curtis R. Cook, Shraddha Sorte, Michelle Hastings Effectiveness of end-user debugging software features: are there gender issues? Search on Bibsonomy CHI The full citation details ... 2005 DBLP  DOI  BibTeX  RDF surprise-explain-reward, debugging, gender, end-user programming, end-user software engineering
16Ilias Maglogiannis, Elias P. Zafiropoulos, Agapios N. Platis, George A. Gravvanis Computing the Success Factors in Consistent Acquisition and Recognition of Objects in Color Digital Images by Explicit Preconditioning. Search on Bibsonomy J. Supercomput. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF digital image acquisition, color measurement, approximate inverses, computer vision, parallel computations, Bayesian networks, Markov modeling, camera calibration, preconditioning, reproducibility, Markov Reward Models
16T. J. Robertson, Shrinu Prabhakararao, Margaret M. Burnett, Curtis R. Cook, Joseph R. Ruthruff, Laura Beckwith, Amit Phalgune Impact of interruption style on end-user debugging. Search on Bibsonomy CHI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF surprise-explain-reward, debugging, interruptions, end-user programming, end-user software engineering
16Robert L. Axtell Non-cooperative dynamics of multi-agent teams. Search on Bibsonomy AAMAS The full citation details ... 2002 DBLP  DOI  BibTeX  RDF equal division, increasing returns, multi-agent coalitions, proportional reward, stationary distribution of group sizes, transient groups, unstable Nash equilibria, team formation
16Marco Gribaudo, Matteo Sereno, András Horváth, Andrea Bobbio Fluid Stochastic Petri Nets Augmented with Flush-out Arcs: Modelling and Analysis. Search on Bibsonomy Discret. Event Dyn. Syst. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF stochastic reward models, fluid stochastic Petri nets, performance analysis, Petri nets
16Hairong Sun, Xinyu Zang, Kishor S. Trivedi Performance of broadcast and unknown server (BUS) in ATM LAN emulation. Search on Bibsonomy IEEE/ACM Trans. Netw. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF LAN emulation, broadcast and unknown server, stochastic petri net package, ATM, stochastic reward nets
16E. Vance Wilson, James R. Connolly Effects of group task pressure on perceptions of email and face-to-face communication effectiveness. Search on Bibsonomy GROUP The full citation details ... 2001 DBLP  DOI  BibTeX  RDF group task pressure, punishment, task importance, time pressure, computer-mediated communication, email, reward
16William Oliver, John Yu, Eric Metois The Singing Tree: Design of an Interactive Musical Interface. Search on Bibsonomy Symposium on Designing Interactive Systems The full citation details ... 1997 DBLP  DOI  BibTeX  RDF aural/visual feedback, music synthesis, musical interface design, reward-oriented feedback systems, voice analysis
16Oren Etzioni, Steve Hanks, Tao Jiang 0001, Richard M. Karp, Omid Madani, Orli Waarts Efficient Information Gathering on the Internet (extended abstract). Search on Bibsonomy FOCS The full citation details ... 1996 DBLP  DOI  BibTeX  RDF information providers, information access problem, reward model, Internet, Internet, approximation algorithm, cost model, information gathering, information sources
16Vincenzo Catania, Antonio Puliafito, Salvatore Riccobene, Lorenzo Vita Design and Performance Analysis of a Disk Array System. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 1995 DBLP  DOI  BibTeX  RDF disk array systems, declustering organization, response time distribution, Parallel I/O, stochastic reward nets
16Oliver C. Ibe, Hoon Choi, Kishor S. Trivedi Performance Evaluation of Client-Server Systems. Search on Bibsonomy IEEE Trans. Parallel Distributed Syst. The full citation details ... 1993 DBLP  DOI  BibTeX  RDF local areanetwork, parametric sensitivities, message interdependencies, request-replysystems, CSMA/CD network, distributedprocessing, performance evaluation, performance evaluation, distributed system, distributed systems, Petri nets, Markov chain, throughput, client-server systems, stochastic Petri nets, CSMA, file server, file servers, stochastic reward nets, token ring network, mean response time
16Mei-Chen Hsueh, Ravishankar K. Iyer, Kishor S. Trivedi Performability Modeling Based on Real Data: A Case Study. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 1988 DBLP  DOI  BibTeX  RDF semiMarkov process, measurement-based performability model, reward function, service rate, performance evaluation, multiprocessing systems, multiprocessor system, error rate
16David G. Furchtgott, John F. Meyer A Performability Solution Method for Degradable Nonrepairable Systems. Search on Bibsonomy IEEE Trans. Computers The full citation details ... 1984 DBLP  DOI  BibTeX  RDF reward models, Degradable performance, performability evaluation, performance evaluation, fault tolerance, reliability evaluation
16Álvaro Fialho, Marc Schoenauer, Michèle Sebag Toward comparison-based adaptive operator selection. Search on Bibsonomy GECCO The full citation details ... 2010 DBLP  DOI  BibTeX  RDF ROC area under curve, adaptive operator selection, parameter control, multi-armed bandits
16David Bartle, Samuel Rossoff, David Whittaker, Bruce Gooch, Kim Kerns, Jenny MacSween Cognitive games as therapy for children with FAS. Search on Bibsonomy SIGGRAPH Posters The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
16Jartuwat Rajruangrabin, Dan O. Popa Reinforcement learning of interface mapping for interactivity enhancement of robot control in assistive environments. Search on Bibsonomy PETRA The full citation details ... 2010 DBLP  DOI  BibTeX  RDF reinforcement learning, human-robot interface
16Wenlong Ni, Wei Wayne Li, Mansoor Alam Determination of optimal call admission control policy in wireless networks. Search on Bibsonomy IEEE Trans. Wirel. Commun. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Yih-Farn Chen, Yennun Huang, Rittwik Jana, Hongbo Jiang 0001, Michael Rabinovich, Jeremy Rahe, Bin Wei, Zhen Xiao Towards capacity and profit optimization of video-on-demand services in a peer-assisted IPTV platform. Search on Bibsonomy Multim. Syst. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF FTTN, Video-on-demand, IPTV, Content distribution network, P2P streaming
16Vadeerat Rinsurongkawong, Christoph F. Eick Change analysis in spatial datasets by interestingness comparison. Search on Bibsonomy ACM SIGSPATIAL Special The full citation details ... 2009 DBLP  DOI  BibTeX  RDF cluster models, region discovery, using polygon to discover changing patterns, spatial data mining, change analysis
16Kemal Altinkemer, Yasin Ozcelik Cash-back rewards versus equity-based electronic loyalty programs in e-commerce. Search on Bibsonomy Inf. Syst. E Bus. Manag. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Electronic loyalty programs, e-commerce models, Switching costs
16Dmitry G. Korzun, Andrei V. Gurtov A local equilibrium model for P2P resource ranking. Search on Bibsonomy SIGMETRICS Perform. Evaluation Rev. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Bill Lin 0001, Jun (Jim) Xu, Nan Hua, Hao Wang 0006, Haiquan (Chuck) Zhao A randomized interleaved DRAM architecture for the maintenance of exact statistics counters. Search on Bibsonomy SIGMETRICS Perform. Evaluation Rev. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Sipat Triukose, Zhihua Wen, Michael Rabinovich Content delivery networks: how big is big enough? Search on Bibsonomy SIGMETRICS Perform. Evaluation Rev. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Alma Riska, Erik Riedel Evaluation of disk-level workloads at different time scales. Search on Bibsonomy SIGMETRICS Perform. Evaluation Rev. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Alessandro Agnetis, Paolo Detti, Marco Pranzo, Manbir Singh Sodhi Sequencing unreliable jobs on parallel machines. Search on Bibsonomy J. Sched. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Indexable problems, Unsupervised manufacturing systems, Approximation algorithms, NP-hardness, Polymatroids
16Iván López-Bueno, Javier García 0001, Fernando Fernández 0001 Two Steps Reinforcement Learning in Continuous Reinforcement Learning Tasks. Search on Bibsonomy IWANN (1) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Jia Yuan Yu, Shie Mannor Piecewise-stationary bandit problems with side observations. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Gavin Taylor, Ronald Parr Kernelized value function approximation for reinforcement learning. Search on Bibsonomy ICML The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
16Steven de Jong, Karl Tuyls Learning to cooperate in a continuous tragedy of the commons. Search on Bibsonomy AAMAS (2) The full citation details ... 2009 DBLP  BibTeX  RDF punishment, tragedy of the commons, learning, complex networks
16Qing Zhao 0001, Bhaskar Krishnamachari, Keqin Liu On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance. Search on Bibsonomy IEEE Trans. Wirel. Commun. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Yanjie Li, Baoqun Yin, Hongsheng Xi Partially Observable Markov Decision Processes and Performance Sensitivity Analysis. Search on Bibsonomy IEEE Trans. Syst. Man Cybern. Part B The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Kenji Doya A New Natural Policy Gradient by Stationary Distribution Metric. Search on Bibsonomy ECML/PKDD (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF policy gradient reinforcement learning, Riemannian metric matrix, Markov decision process, natural gradient
16Andreas Karwath, Kristian Kersting, Niels Landwehr Boosting Relational Sequence Alignments. Search on Bibsonomy ICDM The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Golriz Rezaei, Michael Kirley Heterogeneous Payoffs and Social Diversity in the Spatial Prisoner's Dilemma game. Search on Bibsonomy SEAL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Ole-Christoffer Granmo A Bayesian Learning Automaton for Solving Two-Armed Bernoulli Bandit Problems. Search on Bibsonomy ICMLA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Tara Javidi, Bhaskar Krishnamachari, Qing Zhao 0001, Mingyan Liu Optimality of Myopic Sensing in Multi-Channel Opportunistic Access. Search on Bibsonomy ICC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Kousha Etessami, Dominik Wojtczak, Mihalis Yannakakis Recursive Stochastic Games with Positive Rewards. Search on Bibsonomy ICALP (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Kazuteru Miyazaki, Shigenobu Kobayashi Proposal of Exploitation-Oriented Learning PS-r#. Search on Bibsonomy IDEAL The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Keqin Liu, Qing Zhao 0001 Link throughput of multi-channel opportunistic access with limited sensing. Search on Bibsonomy ICASSP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Anja Austermann, Seiji Yamada Teaching a Pet Robot through Virtual Games. Search on Bibsonomy IVA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Linh Vu, Kristi A. Morgansen Modeling and analysis of dynamic decision making in sequential two-choice tasks. Search on Bibsonomy CDC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Andrea Nedic, Damon Tomlin, Philip Holmes, Deborah A. Prentice, Jonathan D. Cohen 0003 A simple decision task in a social context: Experiments, a model, and preliminary analyses of behavioral data. Search on Bibsonomy CDC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Wenlong Ni, Wei Wayne Li, Mansoor Alam Optimal Call Admission Control Policy In Wireless Networks. Search on Bibsonomy WCNC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Marek Grzes, Daniel Kudenko An Empirical Analysis of the Impact of Prioritised Sweeping on the DynaQ's Performance. Search on Bibsonomy ICAISC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Masakazu Takahashi, Toshiyuki Nakao, Kazuhiko Tsuda, Takao Terano Generating Dual-Directed Recommendation Information from Point-of-Sales Data of a Supermarket. Search on Bibsonomy KES (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Dual-Directed Recommendation, Collaborative Filtering System, Customer Preference, Recommendation Systems
16Fang He, Souren Paul An Empirical Investigation of the Roles of Outcome Controls and Psychological Factors in Collaboration Technology Supported Virtual Teams. Search on Bibsonomy HICSS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Adedoyin Maria Thompson, Bernd Porr, Christoph Kolodziejski, Florentin Wörgötter Second Order Conditioning in the Sub-cortical Nuclei of the Limbic System. Search on Bibsonomy SAB The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Dopamine, three factor ISO Learning, Conditioning, Hebbian learning
16Makoto Otsuka, Junichiro Yoshimoto, Kenji Doya Robust Population Coding in Free-Energy-Based Reinforcement Learning. Search on Bibsonomy ICANN (1) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
16Vishal A. Varma, Reha Uzsoy, Joseph F. Pekny, Gary E. Blau Lagrangian heuristics for scheduling new product development projects in the pharmaceutical industry. Search on Bibsonomy J. Heuristics The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Resource constrained multi-project scheduling, Dual block angular matrix, Lagrangian dual, Integer programming, Decomposition
16Lucia Cloth, Marijn R. Jongerden, Boudewijn R. Haverkort Computing Battery Lifetime Distributions. Search on Bibsonomy DSN The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Jan Peters 0001, Stefan Schaal Reinforcement Learning for Operational Space Control. Search on Bibsonomy ICRA The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Lu Fan, Hamish Taylor, Philip W. Trinder Mediator: a design framework for P2P MMOGs. Search on Bibsonomy NETGAMES The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Reng Yin, Hao Hu 0001, JiDong Ge, Jian Lu 0001 Quantitative Analysis of Value-Based Software Processes Using Decision-Based Stochastic Object Petri-Nets. Search on Bibsonomy APSEC The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Yuehai Wang, Mengmeng Zhang Multiple Moving Prey Pursuit Algorithm Based on the Changeable Alliance. Search on Bibsonomy SNPD (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Changeable Alliance, Multiple Moving Preys Cooperative Pursuit, Pursuit and Evasion Games
16Eduardo Rodrigues Gomes, Ryszard Kowalczyk Learning the IPA Market with Individual and Social Rewards. Search on Bibsonomy IAT The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Ying Wu 0004, Colin Fyfe, Pei Ling Lai Stochastic Weights Reinforcement Learning for Exploratory Data Analysis. Search on Bibsonomy ICANN (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarwal Multi-armed bandit problems with dependent arms. Search on Bibsonomy ICML The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak Benaskeur A Q-decomposition and bounded RTDP approach to resource allocation. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Q-decomposition, marginal revenue, heuristic search
16Praveen Paruchuri, Jonathan P. Pearce, Milind Tambe, Fernando Ordóñez, Sarit Kraus An efficient heuristic approach for security against multiple adversaries. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Bayesian and Stackelberg games, security of agent systems, game theory
16Rajiv T. Maheswaran, Craig Milo Rogers, Romeo Sanchez Distributed coordination in uncertain multiagent systems. Search on Bibsonomy AAMAS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
16Luis Alejandro Cortés, Petru Eles, Zebo Peng Quasi-Static Assignment of Voltages and Optional Cycles in Imprecise-Computation Systems With Energy Considerations. Search on Bibsonomy IEEE Trans. Very Large Scale Integr. Syst. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Jeremy Sproston, Susanna Donatelli Backward Bisimulation in Markov Chain Model Checking. Search on Bibsonomy IEEE Trans. Software Eng. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF model checking, verification, temporal logic, Markov processes
16Ran Cheng, Julita Vassileva Design and evaluation of an adaptive incentive mechanism for sustained educational online communities. Search on Bibsonomy User Model. User Adapt. Interact. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF Personalized rewards, Online communities, Virtual communities, Participation, Incentive mechanisms, Ratings
16Parosh Aziz Abdulla, Noomene Ben Henda, Richard Mayr, Sven Sandberg Limiting Behavior of Markov Chains with Eager Attractors. Search on Bibsonomy QEST The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Daniil Ryabko, Marcus Hutter Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence. Search on Bibsonomy ALT The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Fei Liu 0010, Guangzhou Zeng Multi-agent Cooperative Learning Research Based on Reinforcement Learning. Search on Bibsonomy CSCWD The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Ali Hamzeh, Adel Rahmani A Pattern Based Evolutionary Approach to Prediction Computation in XCSF. Search on Bibsonomy ICNC (1) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Shimon Whiteson, Peter Stone On-line evolutionary computation for reinforcement learning in stochastic domains. Search on Bibsonomy GECCO The full citation details ... 2006 DBLP  DOI  BibTeX  RDF neural networks, evolutionary computation, reinforcement learning, on-line learning
16Ji Wu 0006, Chaoqun Ye, Shiyao Jin Opponent Learning for Multi-agent System Simulation. Search on Bibsonomy RSKT The full citation details ... 2006 DBLP  DOI  BibTeX  RDF reinforcement learning, Markov decision processes, multi-agent simulation, Opponent modeling
16George Dimitri Konidaris, Andrew G. Barto An Adaptive Robot Motivational System. Search on Bibsonomy SAB The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
16Andrea Lockerd Thomaz, Guy Hoffman, Cynthia Breazeal Experiments in socially guided machine learning: understanding how humans geach. Search on Bibsonomy HRI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF socially guided agents, machine learning, human-robot interaction
16Javier Esparza, Antonín Kucera 0001, Richard Mayr Quantitative Analysis of Probabilistic Pushdown Automata: Expectations and Variances. Search on Bibsonomy LICS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
16Krishnendu Chatterjee, Thomas A. Henzinger, Marcin Jurdzinski Mean-Payoff Parity Games. Search on Bibsonomy LICS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
16Xin Xu, Tao Xie 0005 A Reinforcement Learning Approach for Host-Based Intrusion Detection Using Sequences of System Calls. Search on Bibsonomy ICIC (1) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
16Lei Shu 0001, Sungyoung Lee, Xiaoling Wu 0004, Jie Yang 0005 Maximizing System Value Among Interested Packets While Satisfying Time and Energy Constraints. Search on Bibsonomy ICN (1) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
16Jan Drugowitsch, Alwyn Barry XCS with eligibility traces. Search on Bibsonomy GECCO The full citation details ... 2005 DBLP  DOI  BibTeX  RDF eligibility traces, XCS, q-learning, LCS, temporal-difference learning
16Joannès Vermorel, Mehryar Mohri Multi-armed Bandit Algorithms and Empirical Evaluation. Search on Bibsonomy ECML The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
16Ali Hamzeh, Adel Rahmani An Evolutionary Function Approximation Approach to Compute Prediction in XCSF. Search on Bibsonomy ECML The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
Displaying result #301 - #400 of 4419 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][12][13][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license