Welcome to D
SIGMOD 2003
PODS 2003
SIGMOD-RECOR
ADBIS
CIDR 2003
CIKM 2003
DASFAA 2003
Data Enginee
DEBS
DMKD 2003
DOLAP 2003
DPDJ 2003
ER
GIS 2003
Hypertext 20
ICDE 2003
ICDM 2003
ICDT 2003
JCDL 2003
KRDB 2003
MIR 2003
MIS 2003
MMDB 2003
RIDE 2003
SBBD 2003
SIGIR 2003
SIGIR-FORUM
SIGKDD 2003
<<< = SIGKDD'03 Pa>>>
SIGKDD-EXP
SSDBM 2003
TIME 2003
TODS
VLDB 2003
VLDB Journal
WIDM 2003

SIGKDD03 - Research Track


Finding recent frequent itemsets adaptively over online data streams Generating English summaries of time series data using the Gricean maxims Mining unexpected rules by pushing user dynamics Accurate decision trees for mining high-speed data streams Nantonac collaborative filtering: recommendation based on order responses Improving spatial locality of programs via data mining Distributed multivariate regression based on influential observations Graph-based anomaly detection Screening and interpreting multi-item associations based on log-linear modeling Maximizing the spread of influence through a social network Navigating massive data sets via local clustering Mining concept-drifting data streams using ensemble classifiers CLOSET+: searching for the best strategies for mining frequent closed itemsets Applications of sampling and fractional factorial designs to model-free data squashing Towards systematic design of distance functions for data mining applications Mining phenotypes and informative genes from gene expression data Cross-training: learning probabilistic mappings between topics Mining data records in Web pages Privacy-preserving k-means clustering over vertically partitioned data PaintingClass: interactive construction, visualization and exploration of decision trees Aggregation-based feature invention and relational concept classes Natural communities in large linked networks Efficient data reduction with EASE Playing hide-and-seek with correlations Generative model-based clustering of directional data PROXIMUS: a framework for analyzing very high dimensional discrete-attributed datasets Understanding captions in biomedical publications Mining high dimensional data for classifier knowledge Mining viewpoint patterns in image databases A Web page prediction model based on click-stream tree representation of user behavior Visualizing changes in the structure of data for exploratory feature selection Style mining of electronic messages for multiple authorship discrimination: first results Fragments of order Indexing multi-dimensional time-series with support for multiple distance measures Translation-invariant mixture models for curve clustering Interactive exploration of coherent patterns in time-series gene expression data CloseGraph: mining closed frequent graph patterns On computing, storing and querying frequent patterns Learning relational probability trees An iterative hypothesis-testing strategy for pattern discovery XRules: an effective structural classifier for XML data Eliminating noisy information in Web pages for data mining Information-theoretic co-clustering Efficiently handling feature redundancy in high-dimensional data
  • Lei Yu

  • Huan Liu

Efficient decision tree construction on streaming data Distributed cooperative mining for information consortia SEWeP: using site semantics and a taxonomy to enhance the Web personalization process To buy or not to buy: mining airfare data to minimize ticket purchase price Weighted Association Rule Mining using weighted support and significance framework Using randomized response techniques for privacy-preserving data mining Probabilistic discovery of time series motifs Correlating synchronous and asynchronous data streams Classifying large data sets using SVMs with hierarchical clusters Time and sample efficient discovery of Markov blankets and direct causal relations Adaptive duplicate detection using learnable string similarity measures Efficient elastic burst detection in data streams Algorithms for estimating relative importance in networks Empirical comparisons of various voting methods in bagging A bag of paths model for measuring structural similarity in Web documents Assessment and pruning of hierarchical model based clustering Fast vertical mining using diffsets Online novelty detection on temporal sequences Extracting semantics from data cubes using cube transversals and closures Experiments with random projections for machine learning Mining distance-based outliers in near linear time with randomization and a simple pruning rule On detecting differences between groups Carpenter: finding closed patterns in long biological datasets Inverted matrix: efficient discovery of frequent items in large datasets in the context of interactive mining A two-way visualization method for clustered data
Return to SIGKDD03 session listing



©2004 Association for Computing Machinery