Online Calibrated Forecasts.  Efficiency versus Universality for Learning in Games

 

Professor Jeff Shamma

University of California - Los Angeles

Mechanical and Aerospace Engineering Department

 

Abstract -- We provide a simple learning process that enables an agent to forecast a sequence of outcomes. The forecasting scheme, termed tracking forecast,  is based on tracking the past observations while emphasizing recent outcomes. As opposed to other forecasting schemes, we sacrifice universality in favor of a significantly reduced computational burden. We show that if the sequence of outcomes has certain properties---it has some internal (hidden) state that does not change too fast---then the tracking forecast is "weakly calibrated" so that the forecast appears to be correct most of the time. For binary outcomes, this result holds without any internal state assumptions. We consider learning in a repeated strategic game where each player attempts to compute some forecast of the opponent actions and play a best response to it. We show that if one of the players uses tracking forecast, while the other players uses a standard learning algorithm (such as exponential regret matching or smooth fictitious play), then the player using the tracking forecast obtains the best response to the acttual play of the other players. We further show that if both players use a tracking forecast, then under certain conditions on the game matrix, a convergence to a Nash equilibrium is possible with positive probability for a larger class of games than smooth fictitious play.

 

Bio: Jeff Shamma is a Professor of Mechanical and Aerospace Engineering at UCLA. He received a PhD in Systems Science and Engineering in 1988 from the Massachusetts Institute of Technology, Department of Mechanical Engineering. He previously held faculty positions at the University of Minnesota, Minneapolis, and the University of Texas, Austin, before joining UCLA in 1999. He is a past recipient of the American Automatic Control Council Eckman Award and a Fellow of The IEEE. His research interest is feedback control and systems theory.

 

 

Friday, February 17, 2006

3:30 – 4:30 p.m.

 1500 EECS