Proceedings of the 2002 SIAM International Conference on Data Mining

Each link below is to a PDF of the paper as it was submitted. Papers are listed in program order. PDF file names represent the Proceedings (DM and year 02), followed by order in printed version (e.g. 001) and first author's last name and first initial..
Part I: Visualization and Applications
3 Visualizing Clustering Results
Ian Davidson
19 VizCluster:
An Interactive Visualization Approach to Cluster Analysis and Its Application
on Microarray Data
Li Zhang, Chun Tang, Yong Shi, Yuqing Song, Aidong Zhang, and Murali Ramanathan
41 Ensemble-Based
Adaptive Intrusion Detection
Wei Fan and Salvatore J. Stolfo
59 Instance
Selection Techniques for Memory-Based Collaborative Filtering
Kai Yu, Xiaowei Xu, Jianhua Tao, Martin Ester, and Hans-Peter Kriegel
Part II: Mining Large Data Sets
77 Shared
Memory Parallelization of Data Mining Algorithms: Techniques, Programming Interface,
and Performance
Ruoming Jin and Gagan Agrawal
95 A Data
Parallel Approach for Large-Scale Gaussian Process Modeling
Arindam Choudhury, Prasanth B. Nair, and Andy J. Keane
112 Efficient Filtering
of Large Dataset―A User-Centric Paradigm
Yi Xia, Wei Wang, Jiong Yang, Philip Yu, and Richard Muntz
128 Why the Information
Explosion Can Be Bad for Data Mining, and How Data Fusion Provides a Way Out
Peter van der Putten, Joost N. Kok, and Amar Gupta
Part III: Mining Sequential and Structured Patterns
141 On the Optimal Clustering
of Sequential Data
Cheng-Ru Lin and Ming-Syan Chen
158 Efficient Substructure
Discovery from Large Semi-structured Data
Tatsuya Asai, Kenji Abe, Shinji Kawasoe, Hiroki Arimura, Hiroshi Sakamoto,
and Setsuo Arikawa
175 Discovering Frequent
Substructures from Hierarchical Semi-structured Data
Gao Cong, Lan Yi, Bing Liu, and Ke Wang
Part IV: Time Series Analysis
195 Iterative Deepening
Dynamic Time Warping for Time Series
Selina Chu, Eamonn Keogh, David Hart, and Michael Pazzani
213 Extracting Precursor
Rules from Time Series―A Classical Statistical Viewpoint
João B. D. Cabrera and Raman K. Mehra
229 Autoregressive Tree
Models for Time-Series Analysis
C. Meek, D. M. Chickering, and D. Heckerman
Part V: Support Vector Machine and Neural Networks
247 Incremental Support
Vector Machine Classification
Glenn Fung and Olvi L. Mangasarian
261 A Pattern Search
Method for Model Selection of Support Vector Regression
Michinari Momma and Kristin P. Bennett
275 Explicit Thermodynamic
Properties Using Radial Basis Functions Neural Networks
Olivier Adam and Olivier Léonard
Part VI: Clustering
299 Cluster Selection in Divisive Clustering Algorithms
Sergio M. Savaresi, Daniel L. Boley, Sergio Bittanti, and Giovanna Gazzaniga
315 A Clustering Technique
for Mining Data from Text Tables
Hasan Davulcu, Saikat Mukherjee, and I. V. Ramakrishnan
333 On Scaling Up Balanced Clustering Algorithms
Arindam Banerjee and Joydeep Ghosh
Part VII: Classification and Decision Tables
353 Efficient Local Flexible Nearest Neighbor Classification
Carlotta Domeniconi and Dimitrios Gunopulos
370 Approximate Splitting for Ensembles of Trees Using Histograms
Chandrika Kamath, Erick Cantú-Paz, and David Littau
384 The Power of Second-Order Decision Tables
R. Hewett and J. Leuchner
Part VIII: Causality Rules and Relation Learning
403 Mining Relationship between Triggering and Consequential Events in a Short Transaction Database
Chang-Hung Lee, Philip S. Yu, and Ming-Syan Chen
420 Learning Simple Relations: Theory and Applications
Pavel Berkhin and Jonathan D. Becher
437 A Framework for Scalable Cost-Sensitive Learning Based on Combing Probabilities and Benefits
Wei Fan, Haixun Wang, Philip Yu, and Sal Stolfo
Part IX: Mining Frequent Patterns
457 CHARM: An Efficient Algorithm for Closed Itemset Mining
Mohammed J. Zaki and Ching-Jui Hsiao
474 Evaluating the Performance of Association Mining Methods in 3-D Medical Image Databases
Vasileios Megalooikonomou
494 Mining Frequent Itemsets in Evolving Databases
A.A. Veloso, W. Meira, Jr., M. B. de Carvalho, B. Pôssas, S. Parthasarathy,
and M. Javeed Zaki
511 Discovering Fully Dependent Patterns
Feng Liang, Sheng Ma, and Joseph L. Hellerstein
Part X: Applications
531 One Step Evolutionary Mining of Context Sensitive Associations and Web Navigation Patterns
O. Nasraoui and R. Krishnapuram
548 MedMeSH Summarizer: Text Mining for Gene Clusters
P. Kankar, S. Adak, A. Sarkar, K. Murari, and G. Sharma
566 Segmented Regression Estimators for Massive Data Sets
Ramesh Natarajan and Edwin Pednault
583 Collusion in the U. S. Crop Insurance Program: Applied Data Mining
Bertis B. Little, Walter L. Johnston, Jr., Ashley C. Lovell, Roderick M. Rejesus,
and Steve A. Steed
