Proceedings: Data Mining 2002

Proceedings of the 2002 SIAM International Conference on Data Mining


Each link below is to a PDF of the paper as it was submitted. Papers are listed in program order. PDF file names represent the Proceedings (DM and year 02), followed by order in printed version (e.g. 001) and first author's last name and first initial..

Preface

Part I: Visualization and Applications

3 Visualizing Clustering Results
Ian Davidson

19 VizCluster: An Interactive Visualization Approach to Cluster Analysis and Its Application on Microarray Data
Li Zhang, Chun Tang, Yong Shi, Yuqing Song, Aidong Zhang, and Murali Ramanathan

41 Ensemble-Based Adaptive Intrusion Detection
Wei Fan and Salvatore J. Stolfo

59 Instance Selection Techniques for Memory-Based Collaborative Filtering
Kai Yu, Xiaowei Xu, Jianhua Tao, Martin Ester, and Hans-Peter Kriegel

Part II: Mining Large Data Sets

77 Shared Memory Parallelization of Data Mining Algorithms: Techniques, Programming Interface, and Performance
Ruoming Jin and Gagan Agrawal

95 A Data Parallel Approach for Large-Scale Gaussian Process Modeling
Arindam Choudhury, Prasanth B. Nair, and Andy J. Keane

112 Efficient Filtering of Large Dataset―A User-Centric Paradigm
Yi Xia, Wei Wang, Jiong Yang, Philip Yu, and Richard Muntz

128 Why the Information Explosion Can Be Bad for Data Mining, and How Data Fusion Provides a Way Out
Peter van der Putten, Joost N. Kok, and Amar Gupta

Part III: Mining Sequential and Structured Patterns

141 On the Optimal Clustering of Sequential Data
Cheng-Ru Lin and Ming-Syan Chen

158 Efficient Substructure Discovery from Large Semi-structured Data
Tatsuya Asai, Kenji Abe, Shinji Kawasoe, Hiroki Arimura, Hiroshi Sakamoto, and Setsuo Arikawa

175 Discovering Frequent Substructures from Hierarchical Semi-structured Data
Gao Cong, Lan Yi, Bing Liu, and Ke Wang

Part IV: Time Series Analysis

195 Iterative Deepening Dynamic Time Warping for Time Series
Selina Chu, Eamonn Keogh, David Hart, and Michael Pazzani

213 Extracting Precursor Rules from Time Series―A Classical Statistical Viewpoint
João B. D. Cabrera and Raman K. Mehra

229 Autoregressive Tree Models for Time-Series Analysis
C. Meek, D. M. Chickering, and D. Heckerman

Part V: Support Vector Machine and Neural Networks

247 Incremental Support Vector Machine Classification
Glenn Fung and Olvi L. Mangasarian

261 A Pattern Search Method for Model Selection of Support Vector Regression
Michinari Momma and Kristin P. Bennett

275 Explicit Thermodynamic Properties Using Radial Basis Functions Neural Networks
Olivier Adam and Olivier Léonard

Part VI: Clustering

299 Cluster Selection in Divisive Clustering Algorithms
Sergio M. Savaresi, Daniel L. Boley, Sergio Bittanti, and Giovanna Gazzaniga

315 A Clustering Technique for Mining Data from Text Tables
Hasan Davulcu, Saikat Mukherjee, and I. V. Ramakrishnan

333 On Scaling Up Balanced Clustering Algorithms
Arindam Banerjee and Joydeep Ghosh

Part VII: Classification and Decision Tables

353 Efficient Local Flexible Nearest Neighbor Classification
Carlotta Domeniconi and Dimitrios Gunopulos

370 Approximate Splitting for Ensembles of Trees Using Histograms
Chandrika Kamath, Erick Cantú-Paz, and David Littau

384 The Power of Second-Order Decision Tables
R. Hewett and J. Leuchner

Part VIII: Causality Rules and Relation Learning

403 Mining Relationship between Triggering and Consequential Events in a Short Transaction Database
Chang-Hung Lee, Philip S. Yu, and Ming-Syan Chen

420 Learning Simple Relations: Theory and Applications
Pavel Berkhin and Jonathan D. Becher

437 A Framework for Scalable Cost-Sensitive Learning Based on Combing Probabilities and Benefits
Wei Fan, Haixun Wang, Philip Yu, and Sal Stolfo

Part IX: Mining Frequent Patterns

457 CHARM: An Efficient Algorithm for Closed Itemset Mining
Mohammed J. Zaki and Ching-Jui Hsiao

474 Evaluating the Performance of Association Mining Methods in 3-D Medical Image Databases
Vasileios Megalooikonomou

494 Mining Frequent Itemsets in Evolving Databases
A.A. Veloso, W. Meira, Jr., M. B. de Carvalho, B. Pôssas, S. Parthasarathy, and M. Javeed Zaki

511 Discovering Fully Dependent Patterns
Feng Liang, Sheng Ma, and Joseph L. Hellerstein

Part X: Applications

531 One Step Evolutionary Mining of Context Sensitive Associations and Web Navigation Patterns
O. Nasraoui and R. Krishnapuram

548 MedMeSH Summarizer: Text Mining for Gene Clusters
P. Kankar, S. Adak, A. Sarkar, K. Murari, and G. Sharma

566 Segmented Regression Estimators for Massive Data Sets
Ramesh Natarajan and Edwin Pednault

583 Collusion in the U. S. Crop Insurance Program: Applied Data Mining
Bertis B. Little, Walter L. Johnston, Jr., Ashley C. Lovell, Roderick M. Rejesus, and Steve A. Steed

Renew SIAM · Contact Us · Site Map · Join SIAM · My Account
Facebook Twitter Flickr Youtube