School of Computer Science, carnegie Mellon
Extended Menu Contact Info Directory About SCS SCS New Buildings Info Giving to SCS SCS Dean's Advisory Board

SCS DISTINGUISHED LECTURE SERIES
4:00 PM - Wean Hall 7500
3:45 PM Distinguished Donuts - Outside the Hall

SCS Doctoral Dissertation Award Lectures

Michael Bowling
Assistant Professor
Department of Computing Science
University of Alberta

Multiagent Learning in the Presence of Limited Agents

Learning to act in a multiagent environment is a challenging problem. Optimal behavior for one agent depends upon the behavior of the other agents, which may be learning as well. Multiagent environments are therefore non-stationary, violating the traditional assumption underlying single-agent learning. In addition, agents in complex tasks may have limitations, such as unintended physical constraints or designer-imposed approximations of the task that make learning tractable. Limitations prevent agents from acting optimally, which complicates the already challenging problem. A learning agent must effectively compensate for its own limitations while exploiting the limitations of the other agents. My dissertation examines these two challenges.

In this talk I focus on just two contributions from my thesis: the WoLF principle and the GraWoLF algorithm. I show that the WoLF variable learning rate causes learning to converge to optimal responses in settings of simultaneous learning. I demonstrate this converging effect both theoretically in a subclass of single-state games and empirically in a variety of multiple-state domains. I then describe GraWoLF, a combination of policy gradient techniques and the WoLF principle. I show compelling results of applying this algorithm to a card game with an intractably large state space as well as an adversarial robot task. These results demonstrate that WoLF-based algorithms can effectively learn in the presence of other learning agents, and do so even in complex tasks with limited agents.

Speaker Bio:

Michael Bowling is an assistant professor at the University of Alberta in Edmonton, Canada. He recently completed his Ph.D. in Computer Science at Carnegie Mellon University in the area of artificial intelligence. He has been actively involved in the emerging field of multiagent learning. In addition to his thesis contributions, he has given several invited talks on multiagent learning and co-presented a tutorial at last year's International Joint Conference on Artificial Intelligence. He has also participated extensively in the RoboCup initiative, an international, autonomous, robot soccer competition. In 1998, he was the student leader of the Carnegie Mellon small-size robot team, which won the RoboCup world championship. His research interests include multiagent learning, multiagent planning, game theory, reinforcement learning, mobile robots, and computer game environments.

<< Back