Fall 2007 Seminar

 
 
       
  Machine Learning Seminar Series
 
 
  Seminar Schedule (Seminar Organizer: Prof. Ziv Bar-Joseph)
 

 

ML/Google Seminars

Machine Learning Lunchtime Chats

 

 

Date: October 15, 2007
Time: 11:00 AM - 12:00 PM
Location: 1507 Newell-Simon Hall
Speaker: Abraham Bernstein Professor
Title: Towards Intelligent Assistance for a Data Mining Process
Abstract: A data mining (DM) process involves multiple stages. A simple, but typical, process might include preprocessing data, applying a data-mining algorithm, and post-processing the mining results. There are many possible choices for each stage, and only some combinations are valid. Because of the large space and non-trivial interactions, both novices and data-mining specialists need assistance in composing and selecting DM processes. Extending notions developed for statistical expert systems we present a prototype Intelligent Discovery Assistant (IDA), which provides users with (i) systematic enumerations of valid DM processes, in order that important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid processes by different criteria, to facilitate the choice of DM processes to execute. We use the prototype to show that an IDA can indeed provide useful enumerations and effective rankings in the context of simple classification processes. We discuss how an IDA could be an important tool for knowledge sharing among a team of data miners. Furthermore, we illustrate the claims with a comprehensive demonstration of cost-sensitive classification using a more involved process and data from the 1998 KDDCUP competition. Finally, we discuss, how new technologies arising in the Semantic Web domain might help to build IDAs more efficiently. Specifically, we briefly discuss how semantic data and data mining operator descriptions can help to leverage off-the-shelf semantic web service functionality to build easy to use IDAs.