Lab Home | Phone | Search | ||||||||
|
||||||||
This talk focuses on data integration problems where the predictor variables for some response variable partition into known subsets. This type of data is often referred to as multi-view data, and each subset of the predictors is called a data view. Accounting for data views can add practical value in terms of both interpretation and predictive performance. Many existing approaches for multi-view data rely on view-agreement principles, strong smoothness assumptions, or regularization penalties. The former approaches can be sensitive to modest noise in the response or predictor variables, while the latter approach is linear and can usually be out-performed. We develop semiparametric data integration methods to span key tradeoffs including the bias-variance tradeoff on prediction error, the possibility that the data may be fully viewed with no appreciable view relationships, and the use of sparse anchor point methods to implicitly detect and use manifolds (i.e., possibly non-elliptical structures) within views if they enhance performance. Theoretical results help justify the new technique, and its effectiveness and computational feasibility are demonstrated empirically. Host: Mike Hamada |