George Mason University
CSI/Statistics Colloquium Series
Seminar Announcement
Analysis of Superlarge Industrial Datasets
David Banks
National Institute of Standards and Technology
ABSTRACT
It is common to encounter superlarge, highly multivariate data in
industrial applications. Such data pose four special problems:
  1. compression
  2. preanalysis
  3. analysis
  4. indexing
This talk concentrates on the second and third problems, preanalysis and analysis, using
examples from PPG and semiconductor manufacture. The key points
will be to review a protocol for preparing a superlarge dataset
for analysis, to describe a comparative survey of new-wave
nonparametric techniques that apply to such data, and to urge
the estimation of local dimensionality as a tool in these
analyses. Additionally, there will be a brief discussion of the
hard question of index creation, with some comments about possible
solution strategies.
Friday, March 5, 1999
George W. Johnson Center, Assembly Room D
Seminar at 10:45 a.m.
Refreshments at 10:30 a.m.
Information about the Statistics Colloquium Series, including
directions, and current and past schedules, is available at
www.science.gmu.edu/statseminars