Extending Q-learning to general adaptive multi-agent systems
Gerald Tesauro
NeurIPS 2003
Reinforcement learning (RL) is a promising new approach for automatically developing effective policies for real-time self-* management. RL has the potential to achieve superior performance to traditional methods while requiring less built-in domain knowledge. Several case studies from real and simulated systems-management applications demonstrate RL's promises and challenges. These studies show that standard online RL can learn effective policies in feasible training times. Moreover, a Hybrid RL approach can profit from any knowledge contained in an existing policy by training on the policy's observable behavior without needing to interface directly to such knowledge. © 2007 IEEE.
Gerald Tesauro
NeurIPS 2003
Rajarshi Das, James E. Hanson, et al.
IJCAI 2001
Gerald Tesauro, Nicholas K. Jong, et al.
Cluster Computing
Irina Rish, Gerald Tesauro
IM 2007