Following is a more detailed description of the event. Please note that not all events are open to the general public. If you need additional information relating to an event, please communicate with the contact person for the event or email www@isye.gatech.edu.
GUEST LECTURER
Professor Bin Yu
AFFILIATION
Statistics Department, UC Berkeley
ABSTRACT
Information technology advances are making data collection possible in most if not all fields of science and engineering and beyond. Often data reduction or feature selection is the first step towards solving current IT age problems. However, data reduction through model selection or l_0 constrained LS optimization leads to a combinatorial search which is computationally infeasible for massive data problems. A computationally efficient alternative is the l_1 constrained LS optimization or Lasso optimization.
In this talk, we first study the model selection property of Lasso in linear regression models. We show that an Irrepresentable Condition on the design matrix is almost necessary and sufficient for the model selection consistency of Lasso for fixed p and p >> n cases, provided that the true model is sparse. Moreover, we describe the Boosted Lasso (BLasso)algorithm which produces an approximation to the complete regularization path of Lasso. BLasso consists of both a forward step and a backward step. The forward step is similar to Boosting and Forward Stagewise Fitting, but the backward step is new and crucial for BLasso to approximate the Lasso path in all situations. For cases with finite number of base learners, when the step size goes to zero, the BLasso path is shown to converge to the Lasso path. Finally, the Blasso algorithm is extended to give an approximate path for the case of a convex loss function plus a convex penalty.
DATE & TIME
Tuesday, October 17, 2006 -- 11:00 AM
DURATION
1 hour
LOCATION
Executive classroom of Main building
CONTACT PERSON
Yajun Mei