Time |
Event |
Location |
Wednesday, April 19, 2006 |
|
|
|
5:00 PM - 7:00 PM |
Registration Opens |
Ballroom Foyer |
|
|
|
Thursday, April 20, 2006 |
|
|
|
7:00 AM - 4:00 PM |
Registration |
Tiffany & Cartier
(Ballroom Level) |
7:00 AM - 4:00 PM |
Internet Cafe |
Embassy Room
(Conference Level) |
7:30 AM - 8:15 AM |
Continental Breakfast |
Chesapeake Suites
(Conference Level) |
8:15 AM - 8:30 AM |
Welcoming Remarks |
Waterford/Lalique
(Ballroom Level) |
8:30 AM - 9:45 AM |
Keynote Speaker
Andreas Buja, Wharton School, University of Pennsylvania |
Waterford/Lalique
(Ballroom Level) |
9:45 AM - 10:00 AM |
Coffee Break & Poster Session 1 |
Chesapeake Suites
(Conference Level) |
10:00 AM - 12:00 PM |
Theory
Area Under ROC Optimization Using a Ramp Function
Alan Herschtal Bhavani Raskutti Peter K. Campbell
On the Necessary and Sufficient Conditions of a Meaningful Distance Function for High Dimensional Data Space
Chih-Ming Hsu and Ming-Syan Chen
CPM: A Covariance-Preserving Projection Method
Jieping Ye, Tao Xiong, and Ravi Janardan
Transform Regression and the Kolmogorov Supposition Theorem
Edwin Pednault
|
Haverford
(Ballroom Level) |
10:00 AM - 12:00PM |
Tutorial 1
Randomized Algorithms for Matrices and Massive Data Sets
Petros Drineas and Michael W. Mahoney
|
Waterford
(Ballroom Level) |
10:00 AM - 12:00PM |
Enterprise Applications
A Latent Dirichelt Model for Unsupervised Entity Resolution
Indrajit Bhattacharya, Lise Getoor
Deriving Private Information from Randomly Perturbed Ratings
Sheng Zhang, James Ford, Fillia Makedon
Name Reference Resolution in Organizational Email Archives
Christopher P. Diehl, Lise Getoor and Galileo Namata
Automated Knowledge Discovery From Simulators
M.C. Burl, D. DeCoste, B.L. Enke, D. Mazzoni, W.J. Merline, L. Scharenbroich
|
Lalique
(Ballroom Level |
12:00 PM - 1:30 PM |
Lunch |
Attendees on their own |
1:30 PM - 2:45 PM |
Keynote Speaker
Latanya Sweeney, Carnegie Mellon University |
Waterford/Lalique
(Ballroom Level) |
2:45 PM - 3:15 PM |
Coffee Break & Poster Session 1 |
Chesapeake Suites
(Conference Level) |
3:15 PM - 4:45 PM |
Anomalies and Outliers
Mining for Outliers in Sequential Databases
Pei Sun, Sanjay Chawla and Bavani Arunasalam
Mining Control Flow Abnormality for Logic Error Isolation
Chao Liu, Xifeng Yan, Jiawei Han
Scan Detection: A Data Mining Approach
Gyorgy Simon, Hui Xiong, Eric Eilertson, Vipin Kumar
|
Haverford
(Ballroom Level) |
3:15 PM - 4:45 PM |
Network Relations
Learning Bayesian Networks from Incomplete Data: An Efficient Method for Generating Approximate Predictive Distributions
Carsten Riggelsen
Efficient Markov Network Structure Discovery from Independence Tests
Facundo Bromberg, Dimitris Margaritis, Vasant Honavar
k-Means Clustering Over a Large Dynamic Network
Souptik Datta, Chris Giannella, Hillol Kargupta
|
Lalique |
4:45 PM - 5:00 PM |
Intermission |
Attendees on their own |
5:00 PM - 6:30 PM |
Prototype Generation
Adapting K-Medians to Generate Normalized Cluster Centers
Benjamin J. Anderson, Deborah S. Gross, David R. Musicant, Anna M. Ritz, Thomas G. Smith, Leah E. Steinberg
Advanced Prototype Machines: Exploring Prototypes for Classification
Hans-Peter Kriegel and Matthias Schubert
Towards Semantic XML Clustering
Andrea Tagarelli, Sergio Greco
|
Haverford |
5:00 PM - 6:30 PM |
Applications in Biology
A Semantic Approach for Mining Hidden Links from Complementary and Non-interactive Biomedical Literature
Xiaohua Hu, Xiaodan Zhang, Illhoi Yoo, Yanqing Zhang
Representation is Everything: Towards Efficient and Adaptable Similarity Measures for Biological Data
Charu Aggarwal
Mining Frequent Agreement Subtrees in Phylogenetic Databases
Sen Zhang, Jason T.L. Wang
|
Lalique |
6:30 PM - 8:30 PM |
Welcome Reception & Poster Display |
Chesapeake Suites |
8:30 PM |
(All Poster Session 1 posters DOWN) |
|
Friday, April 21, 2006 |
7:30 AM |
(Poster Session 2 posters UP) |
|
7:00 AM - 4:00 PM |
Registration |
Tiffany & Cartier
(Ballroom Level) |
7:00 AM - 4:00 PM |
Internet Cafe |
Embassy Room
(Conference Level) |
7:30 AM - 8:15 AM |
Continental Breakfast |
Chesapeake Suites
(Conference Level) |
8:15 AM - 8:30 AM |
Remarks |
Waterford/Lalique |
8:30 AM - 9:45 AM |
Keynote Speaker
Stanley Wasserman, Indiana University |
Waterford/Lalique |
9:45 AM - 10:00 AM |
Coffee Break & Poster Session 2 |
Chesapeake Suites |
10:00 AM - 12:00 PM |
Clustering
Trend Relational Analysis and Grey-Fuzzy Clustering Method
Zhijie Chen, Weizhen Chen, Qile Chen,and Mian-Yun Chen
Joint Cluster Analysis of Attribute Data and Relationship Data: The Connected k-Center Problem
Martin Ester, Rong Ge, Byron J. Gao, Zengjian Hu, Boaz Ben-Moshe
Weighted Clustering Ensembles
Muna Al-Razgan and Carlotta Domeniconi
Clustering in the Presence of Bridge Nodes
Jerry Scripps, Pang-Ning Tan
|
Haverford |
10:00 AM - 12:00 PM |
Tutorial 2
From Unsupervised to Semi-supervised Learning: Algorithms and evaluation approaches
Dimitrios Gunopulos, Michalis Vazirgiannis, and Maria Halkidi
|
Waterford |
10:00 AM - 12:00 PM |
Pattern Mining
Mining Interesting Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach
Hongyan Liu, Jiawei Han, Dong Xin, Zheng Shao
Mining Frequent Patterns by Differential Refinement of Clustered Bitmaps
Jianwei Li, Alok Choudhary, Nan Jiang, and Wei-keng Liao
Discovery of Co-evoluting Spatial Co-located Event Sets
Jin Soung Yoo and Shashi Shekhar
Efficient Algorithms for Sequence Segmentation
Evimaria Terzi and Panayiotis Tsaparas
|
Lalique |
12:00 PM - 1:30 PM |
Lunch |
Attendees on their own |
1:30 PM - 2:45 PM |
Keynote Speaker
Steven Salzberg, University of Maryland |
Waterford/Lalique |
2:45 PM - 3:15 PM |
Coffee Break & Poster Session 2 |
Chesepeake Suites |
3:15 PM - 4:45 PM |
Temporal Data and Random Walks
Density-Based Clustering Over an Evolving Data Stream with Noise
Feng Cao, Martin Ester, Weining Qian, Aoying Zhou
A Random Walk Method for Text Classification
Yunpeng Xu, Xing Yi, Changshui Zhang
Efficient Mining of Temporally Annotated Sequences
F. Giannotti, M. Nanni, D. Pedreschi
|
Haverford |
3:15 PM - 4:45 PM |
Special Session
Mining On the Run: Self-Organizing Dynamic Data for Enhanced Search
Dr. H. Van Dyke Parunak, Altarum Org
Picking a Pooling
Richard Rohwer*, Ken Kreutz-Delgado** & Brandon Burge** *HNC Software, LLC / Fair Isaac Corp., **University of California at San Diego
|
Waterford |
3:15 PM - 4:45 PM |
Dimension Reduction and Coupling
A Framework for Local Supervised Dimensionality Reduction of High Dimensional Data
Charu Aggarwal
Segmentation and Dimensionality Reduction
Ella Bingham, Aristides Gionis, Niina Haiminen, Heli Hiisilä, Heikki Mannila, Evimaria Terzi
Probabilistic Multi-State Split-Merge Algorithm for Coupling Parameter Estimates
Juan K. Lin
|
Lalique |
4:45 PM - 5:00 PM |
Intermission |
Attendees on their own |
5:00 PM - 6:30 PM |
Item Sets
Item Sets that Compress
Arno Siebes and Jilles Vreeken and Matthijs van Leeuwen
Mining Approximate Frequent Item sets in the Presence of Noise: Algorithms and Analysis
Jinze Liu, Susan Paulsen, Xing Xu, Wei Wang, Andrew Nobel, Jan Prins
Mining Frequent Closed Item sets Out of Core
Claudio Lucchese, Salvatore Orlando, Raffaele Perego
|
Haverford |
5:00 PM - 6:30 PM |
Special Session 2: Cont.
SurveyAnalyst: a Text Mining Tool for Survey Analysis
Dan Moldovan, Christine Clark, Daniel Hodges, John Prange Language Computer Corporation
ARGUS: Novelty Detection and Profile Tracking from Massive Data
Eugene Fink, and
Jaime Carbonell,
Carnegie Mellon University
|
Waterford |
5:00 PM - 6:30 PM |
Collaborative Mining
Local L-2 Thresholding Based Data Mining in Peer-to-Peer Networks
Ran Wolff, Kanishka Bhaduri, Hillol Kargupta
Collaborative Information Extraction and Mining from Multiple Web Documents
Tak-Lam Wong, Wai Lam, Shing-Kit Chan
Collaborative Document Clustering
Khaled Hammouda and Mohamed Kamel
|
Lalique |
|
(All posters DOWN) |
|
Saturday, April 22, 2006 |
|
|
|
7:00 AM - 3:00 PM |
Registration |
Tiffany & Cartier
(Ballroom Level) |
7:00 AM - 3:00 PM |
Internet Cafe |
Embassy Room
(Conference Level) |
7:30 AM - 8:00 AM |
Continental Breakfast |
Chesapeake Suites
(Conference Level) |
8:00 AM - 10:00 AM |
Workshops |
See below for locations |
10:00 AM - 10:30 AM |
Coffee Break |
Chesapeake Suites |
10:30 AM - 12:00 PM |
Workshops Continue |
See below for locations |
12:00 PM - 1:30 PM |
Lunch |
Attendees on their own |
1:30 PM - 3:00 PM |
Workshops Continue |
See below for locations |
3:00 PM - 3:30 PM |
Coffee Break |
Chesapeake Suites |
3:30 PM - 5:00 PM |
Workshops Continue |
See below for locations |
Summarizing Data Clusters: Description Formats, Problems and Algorithms
Positive Borders or Negative Bordes: What to Make Lossless Generator-based Representations Concise
Bayesian K-Means as a Maximization-Expectation Algorithm
A framework for clustering Massive Text and Categorical Data
Streams
Cone Cluster Labeling for Support Vector Clustering
Semi-supervised Clustering with Partial Background Information
A New Privacy Preserving Distributed k-Clustering Algorithm
ODAC: Hierarchical Clustering of Time-Series Data Streams
Detecting the Change of Clustering Structure in Categorical
Data Streams
Dissimilarity Measures for Detecting Hepatoxicity in Clinical
Trial Data
Transducive De-noising, Dimensionality Reduction and Clustering
Using Bregman Regression
Robust Estimation for Mixture of Probability Tables Based on
Beta-Likelihoods
Fast Optimal Bandwidth Selection for Kernel Density Estimation
Risk Sensitive Learning via Expected Shortfall Minimization
On Approximate Solutions to Support Vector Machines
Confidence Estimation Methods for Partially Supervised
Information Extraction
Inference of Node Replacement Recursive Graph Grammars
Learning from Incomplete Ratings Using Non-negative Matrix
Factorization
Health Monitoring of a shaft transmission system via hybrid models of PCR and PLS
Modeling Evolutionary and Relational Behaviors for Community-Based Dynamic Recommendation
A Systematic Cross-Comparison of Sequence Classifiers
Data Enhanced Predictive Modeling for Sales Targeting
Graph Based Methods for Orbit Classification
Mining and Validation of WEb Transaction Data
Profiling Protein Families from Partially Aligned Sequences
Personalized Knowledge Discovery: Mining Novvel Association Rules from Text
A Novel Framework for Incorporating Labeled Examples into Anomaly Detection
Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data
Using Compression to Identify Classes of Inauthentic Text
Fast Mining of Distance-Based Outliers in High Dimensional Data Sets
Spatial Weighted Outlier Detection
Fast Adaptive Summarization of Heterogeneous Data Sets
WIP: Mining Weighted Interesting Patterns with a Strong Weight and/or Support Affinity
Discovering Frequent Tree Patterns over Data Streams
Finding Sequential Pattern from a Massive Number of Spatio-Temporal Events
Mining Minimal Contrast Subgraph Patterns