Spring 2005 Seminar

 
 
       
  Machine Learning Seminar Series
 
 
  Seminar Schedule (Seminar Organizer: Prof. Ziv Bar-Joseph)
 

 

ML/Google Seminars

Machine Learning Lunchtime Chats

 

 

Date: February 9, 2005
Time: 10:00 AM - 11:00 AM
Location: 3305 Newell-Simon Hall
Speaker: Susan Murphy H.E. Robbins Professor of Statistics & Research Professor,
Title: A Finite Sample Upper Bound on the Generalization Error for Q-Learning
Abstract: This talk presents our recent work in two areas: large-scale text categorization and adaptive filtering. The first analyzes the scaling problem in automated text categorization with very large taxonomies via hierarchical decomposition, and evaluates Support Vector Machines and k-nearest neighbor classifiers on the full domain of Yahoo! categories (132,199 categories in both the training and test sets). The second part introduces the research challenges in semi-supervised learning for classification with non-stationary topics or events, with extremely sparse training examples at the start and incremental relevance feedback on biased samples during the filtering process. Our cross-benchmark evaluation with regularized logistic regression and Rocchio-style classifiers concludes on the-state-of-the-art solutions: using relevance feedback on .04~0.6% documents yielded 54% cost (error) reduction and 21% utility increase, compared to the best system without relevance feedback.