Ebook: Knowledge Discovery in Databases: PKDD 2007: 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw, Poland, September 17-21, 2007. Proceedings
- Tags: Artificial Intelligence (incl. Robotics), Database Management, Information Storage and Retrieval, Probability and Statistics in Computer Science, Document Preparation and Text Processing, Mathematical Logic and Formal Languages
- Series: Lecture Notes in Computer Science 4702
- Year: 2007
- Publisher: Springer-Verlag Berlin Heidelberg
- Edition: 1
- Language: English
- pdf
The two premier annual European conferences in the areas of machine learning and data mining have been collocated ever since the ?rst joint conference in Freiburg, 2001. The European Conference on Machine Learning (ECML) traces its origins to 1986, when the ?rst European Working Session on Learning was held in Orsay, France. The European Conference on Principles and Practice of KnowledgeDiscoveryinDatabases(PKDD) was?rstheldin1997inTrondheim, Norway. Over the years, the ECML/PKDD series has evolved into one of the largest and most selective international conferences in machine learning and data mining. In 2007, the seventh collocated ECML/PKDD took place during September 17–21 on the centralcampus of WarsawUniversityand in the nearby Staszic Palace of the Polish Academy of Sciences. The conference for the third time used a hierarchical reviewing process. We nominated 30 Area Chairs, each of them responsible for one sub-?eld or several closely related research topics. Suitable areas were selected on the basis of the submission statistics for ECML/PKDD 2006 and for last year’s International Conference on Machine Learning (ICML 2006) to ensure a proper load balance amongtheAreaChairs.AjointProgramCommittee(PC)wasnominatedforthe two conferences, consisting of some 300 renowned researchers, mostly proposed by the Area Chairs. This joint PC, the largest of the series to date, allowed us to exploit synergies and deal competently with topic overlaps between ECML and PKDD. ECML/PKDD 2007 received 592 abstract submissions. As in previous years, toassistthereviewersandtheAreaChairsintheir?nalrecommendationauthors had the opportunity to communicate their feedback after the reviewing phase.
This book constitutes the refereed proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007, held in Warsaw, Poland, in September 2007, co-located with ECML 2007, the 18th European Conference on Machine Learning.
The 28 revised full papers and 35 revised short papers presented together with abstracts of four invited talks were carefully reviewed and selected from 592 papers submitted to both ECML and PKDD. The papers present original results on leading-edge subjects of knowledge discovery from conventional and complex data and address all current issues in the area.
This book constitutes the refereed proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007, held in Warsaw, Poland, in September 2007, co-located with ECML 2007, the 18th European Conference on Machine Learning.
The 28 revised full papers and 35 revised short papers presented together with abstracts of four invited talks were carefully reviewed and selected from 592 papers submitted to both ECML and PKDD. The papers present original results on leading-edge subjects of knowledge discovery from conventional and complex data and address all current issues in the area.
Content:
Front Matter....Pages -
Learning, Information Extraction and the Web....Pages 1-1
Putting Things in Order: On the Fundamental Role of Ranking in Classification and Probability Estimation....Pages 2-3
Mining Queries....Pages 4-4
Adventures in Personalized Information Access....Pages 5-5
Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning....Pages 6-17
Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction....Pages 18-29
A Graphical Model for Content Based Image Suggestion and Feature Selection....Pages 30-41
Efficient AUC Optimization for Classification....Pages 42-53
Finding Transport Proteins in a General Protein Database....Pages 54-66
Classification of Web Documents Using a Graph-Based Model and Structural Patterns....Pages 67-78
Context-Specific Independence Mixture Modelling for Protein Families....Pages 79-90
An Algorithm to Find Overlapping Community Structure in Networks....Pages 91-102
Privacy Preserving Market Basket Data Analysis....Pages 103-114
Feature Extraction from Sensor Data Streams for Real-Time Human Behaviour Recognition....Pages 115-126
Generating Social Network Features for Link-Based Classification....Pages 127-139
An Empirical Comparison of Exact Nearest Neighbour Algorithms....Pages 140-151
Site-Independent Template-Block Detection....Pages 152-163
Statistical Model for Rough Set Approach to Multicriteria Classification....Pages 164-175
Classification of Anti-learnable Biological and Synthetic Data....Pages 176-187
Improved Algorithms for Univariate Discretization of Continuous Features....Pages 188-199
Efficient Weight Learning for Markov Logic Networks....Pages 200-211
Classification in Very High Dimensional Problems with Handfuls of Examples....Pages 212-223
Domain Adaptation of Conditional Probability Models Via Feature Subsetting....Pages 224-235
Learning to Detect Adverse Traffic Events from Noisily Labeled Data....Pages 236-247
IKNN: Informative K-Nearest Neighbor Pattern Classification....Pages 248-264
Finding Outlying Items in Sets of Partial Rankings....Pages 265-276
Speeding Up Feature Subset Selection Through Mutual Information Relevance Filtering....Pages 277-287
A Comparison of Two Approaches to Classify with Guaranteed Performance....Pages 288-299
Towards Data Mining Without Information on Knowledge Structure....Pages 300-311
Relaxation Labeling for Selecting and Exploiting Efficiently Non-local Dependencies in Sequence Labeling....Pages 312-323
Bridged Refinement for Transfer Learning....Pages 324-335
Flexible Grid-Based Clustering....Pages 336-349
Polyp Detection in Endoscopic Video Using SVMs....Pages 350-357
A Density-Biased Sampling Technique to Improve Cluster Representativeness....Pages 358-365
Expectation Propagation for Rating Players in Sports Competitions....Pages 366-373
Efficient Closed Pattern Mining in Strongly Accessible Set Systems (Extended Abstract) ....Pages 374-381
Discovering Emerging Patterns in Spatial Databases: A Multi-relational Approach....Pages 382-389
Realistic Synthetic Data for Testing Association Rule Mining Algorithms for Market Basket Databases....Pages 390-397
Learning Multi-dimensional Functions: Gas Turbine Engine Modeling....Pages 398-405
Constructing High Dimensional Feature Space for Time Series Classification....Pages 406-413
A Dynamic Clustering Algorithm for Mobile Objects....Pages 414-421
A Method for Multi-relational Classification Using Single and Multi-feature Aggregation Functions....Pages 422-429
MINI: Mining Informative Non-redundant Itemsets....Pages 430-437
Stream-Based Electricity Load Forecast....Pages 438-445
Automatic Hidden Web Database Classification....Pages 446-453
Pruning Relations for Substructure Discovery of Multi-relational Databases....Pages 454-461
The Most Reliable Subgraph Problem....Pages 462-470
Matching Partitions over Time to Reliably Capture Local Clusters in Noisy Domains....Pages 471-478
Searching for Better Randomized Response Schemes for Privacy-Preserving Data Mining....Pages 479-486
Pre-processing Large Spatial Data Sets with Bayesian Methods....Pages 487-497
Tag Recommendations in Folksonomies....Pages 498-505
Providing Na?ve Bayesian Classifier-Based Private Recommendations on Partitioned Data....Pages 506-514
Multi-party, Privacy-Preserving Distributed Data Mining Using a Game Theoretic Framework....Pages 515-522
Multilevel Conditional Fuzzy C-Means Clustering of XML Documents....Pages 523-531
Uncovering Fraud in Direct Marketing Data with a Fraud Auditing Case Builder....Pages 532-539
Real Time GPU-Based Fuzzy ART Skin Recognition....Pages 540-547
A Cooperative Game Theoretic Approach to Prototype Selection....Pages 548-555
Dynamic Bayesian Networks for Real-Time Classification of Seismic Signals....Pages 556-564
Robust Visual Mining of Data with Error Information....Pages 565-572
An Effective Approach to Enhance Centroid Classifier for Text Categorization....Pages 573-580
Automatic Categorization of Human-Coded and Evolved CoreWar Warriors....Pages 581-588
Utility-Based Regression....Pages 589-596
Multi-label Lazy Associative Classification....Pages 597-604
Visual Exploration of Genomic Data....Pages 605-612
Association Mining in Large Databases: A Re-examination of Its Measures....Pages 613-620
Semantic Text Classification of Emergent Disease Reports....Pages 621-628
Back Matter....Pages 629-637
....Pages -