11. PAKDD 2007: Nanjing, China
Zhi-Hua Zhou, Hang Li, Qiang Yang (Eds.): Advances in Knowledge Discovery and Data Mining, 11th Pacific-Asia Conference, PAKDD 2007, Nanjing, China, May 22-25, 2007, Proceedings. Springer 2007 Lecture Notes in Computer Science ISBN 978-3-540-71700-3
Keynote Speeches
Jiawei Han: Research Frontiers in Advanced Data Mining Technologies and Applications. 1-5
Geoffrey I. Webb: Finding the Real Patterns. 6
Xindong Wu: Class Noise vs Attribute Noise: Their Impacts, Detection and Cleansing. 7-8
Regular Papers
Bill Andreopoulos, Aijun An, Xiaogang Wang: Hierarchical Density-Based Clustering of Categorical Data and a Simplification. 11-22
Johannes Aßfalg, Hans-Peter Kriegel, Alexey Pryakhin, Matthias Schubert: Multi-represented Classification Based on Confidence Estimation. 23-34
Liefeng Bo, Ling Wang, Licheng Jiao: Selecting a Reduced Set for Building Sparse Support Vector Regression in the Primal. 35-46

Dihua Guo, Hui Xiong, Vijay Atluri, Nabil R. Adam: Semantic Feature Selection for Object Discovery in High-Resolution Remote Sensing Imagery. 71-83

Jian Zhou, Haruhiko Shirai, Isamu Takahashi, Jousuke Kuroiwa, Tomohiro Odaka, Hisakazu Ogura: A Hybrid Command Sequence Model for Anomaly Detection. 108-118
Kwanghoon Kim, Clarence A. Ellis: sigma - Algorithm : Structured Workflow Process Mining Through Amalgamating Temporal Workcases. 119-130
Min-Woo Lee, Dong-Chul Park, Yunsik Lee: Multiscale BiLinear Recurrent Neural Network for Prediction of MPEG Video Traffic. 131-137
Ming Leng, Songnian Yu: An Effective Multi-level Algorithm Based on Ant Colony Optimization for Bisecting Graph. 138-149
Zhi Li, Hong Ma, Yongbing Mei: A Unifying Method for Outlier and Change Detection from Data Streams Based on Local Polynomial Fitting. 150-161
Shizhong Liao, Lei Jia: Simultaneous Tuning of Hyperparameter and Parameter for Support Vector Machines. 162-172
Zhiwu Lu, Xiaoqing Lu, Zhiyuan Ye: Entropy Regularization, Automatic Model Selection, and Unsupervised Image Segmentation. 173-182
Yinglong Ma, Beihong Jin, Yuancheng Li, Kehe Wu: A Timing Analysis Model for Ontology Evolutions Based on Distributed Environments. 183-192
Weidong Mao, Shannon Kelly: An Optimum Random Forest Model for Prediction of Genetic Susceptibility to Complex Diseases. 193-204
Mohammad M. Masud, Latifur Khan, Bhavani M. Thuraisingham: Feature Based Techniques for Auto-Detection of Novel Email Worms. 205-216
Byung-Jae Min, Dong-Chul Park, Hwan-Soo Choi: Multiresolution-Based BiLinear Recurrent Neural Network. 217-223
Laurence A. F. Park, Kotagiri Ramamohanarao: Query Expansion Using a Collection Dependent Probabilistic Latent Semantic Thesaurus. 224-235
Bernhard Pfahringer, Claire Leschi, Peter Reutemann: Scaling Up Semi-supervised Learning: An Efficient and Effective LLGC Variant. 236-247
Rafael Ramirez, Montserrat Puiggros: A Machine Learning Approach to Detecting Instantaneous Cognitive States from fMRI Data. 248-259
Xingzhi Sun, Ming Chang, Xue Li, Maria E. Orlowska: Discovering Correlated Items in Data Streams. 260-271
Chih-Hua Tai, Bi-Ru Dai, Ming-Syan Chen: Incremental Clustering in Geography and Optimization Spaces. 272-283
Kazuko Takahashi, Hiroya Takamura, Manabu Okumura: Estimation of Class Membership Probabilities in the Document Classification. 284-295
Zhouxuan Teng, Wenliang Du: A Hybrid Multi-group Privacy-Preserving Approach for Building Decision Trees. 296-307
Chao Wang, Jie Lu, Guangquan Zhang: A Constrained Clustering Approach to Duplicate Detection Among Relational Data. 308-319
Jinlong Wang, Congfu Xu, Gang Li, Zhenwen Dai, Guojing Luo: Understanding Research Field Evolving and Trend with Dynamic Bayesian Networks. 320-331
Shiming Xiang, Feiping Nie, Yangqiu Song, Changshui Zhang, Chunxia Zhang: Embedding New Data Points for Manifold Learning Via Coordinate Propagation. 332-343
Wenxin Yang, Junping Zhang: Spectral Clustering Based Null Space Linear Discriminant Analysis (SNLDA). 344-354
Wei-Feng Zhang, Dao-Qing Dai, Hong Yan: On a New Class of Framelet Kernels for Support Vector Regression and Regularization Networks. 355-366
Yu-Jie Zheng, Zhibo Guo, Jian Yang, Xiaojun Wu, Jing-Yu Yang: DLDA/QR: A Robust Direct LDA Algorithm for Face Recognition and Its Theoretical Foundation. 379-387
Feida Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu: gPrune: A Constraint Pushing Framework for Graph Pattern Mining. 388-400
Short Papers
Ridzwan Aminuddin, Ridzwan Suri, Kuiyu Chang, Zaki Zainudin, Qi He, Ee-Peng Lim: Modeling Anticipatory Event Transitions. 401-408
Turgay Tugay Bilgin, A. Yilmaz Çamurcu: A Modified Relationship Based Clustering Framework for Density Based Clustering and Outlier Filtering on High Dimensional Datasets. 409-416
Sam Chao, Yiping Li, Ming-Chui Dong: Supportive Utility of Irrelevant Features in Data Preprocessing. 425-432
Yue Chen, Jiankui Guo, Yaqin Wang, Yun Xiong, Yangyong Zhu: Incremental Mining of Sequential Patterns Using Prefix Tree. 433-440
Zhenyu Chen, Jianping Li: A Multiple Kernel Support Vector Machine Scheme for Simultaneous Feature Selection and Rule-Based Classification. 441-448
Victor Cheng, Chun-hung Li: Combining Supervised and Semi-supervised Classifier for Personalized Spam Filtering. 449-456
Yusheng Cheng, Yousheng Zhang, Xuegang Hu, Xiaoyao Jiang: Qualitative Simulation and Reasoning with Feature Reduction Based on Boundary Conditional Entropy of Knowledge. 457-464
Deng-Yiv Chiu, Kong-Ling Hsieh: A Hybrid Incremental Clustering Method-Combining Support Vector Machine and Enhanced Clustering by Committee Clustering Algorithm. 465-472
Huizhong Duan, Shenghua Bao, Yong Yu: CCRM: An Effective Algorithm for Mining Commodity Information from Threaded Chinese Customer Reviews. 473-480
Qiguo Duan, Duoqian Miao, Kaimin Jin: A Rough Set Approach to Classifying Web Page Without Negative Examples. 481-488
Mengling Feng, Guozhu Dong, Jinyan Li, Yap-Peng Tan, Limsoon Wong: Evolution and Maintenance of Frequent Pattern Space When Transactions Are Removed. 489-497
Chun Che Fung, Kien-Ping Chung: Establishing Semantic Relationship in Inter-query Learning for Content-Based Image Retrieval Systems. 498-506
Maoguo Gong, Licheng Jiao, Ling Wang, Liefeng Bo: Density-Sensitive Evolutionary Clustering. 507-514
Pengfei Han, Xiuzhen Zhang, Raymond S. Norton, Zhi-Ping Feng: Reducing Overfitting in Predicting Intrinsically Unstructured Proteins. 515-522
Tu Bao Ho, Canh Hao Nguyen, Saori Kawasaki, Katsuhiko Takabayashi: Temporal Relations Extraction in Mining Hepatitis Data. 523-530
Guoping Hu, Dan Liu, Qingfeng Liu, Ren-Hua Wang: Supervised Learning Approach to Optimize Ranking Function for Chinese FAQ-Finder. 531-538
Minlie Huang, Xiaoyan Zhu: Combining Convolution Kernels Defined on Heterogeneous Sub-structures. 539-546
Huidong Jin, Jie Chen, Hongxing He, Christine M. O'Keefe: Privacy-Preserving Sequential Pattern Release. 547-554
Wei Jin, Rohini K. Srihari, Xin Wu: Mining Concept Associations for Knowledge Discovery Through Concept Chain Queries. 555-562
Doo Kie Kim, Dong Hyawn Kim, Seong Kyu Chang, Sang Kil Chang: Capability Enhancement of Probabilistic Neural Network for the Design of Breakwater Armor Blocks. 563-570
Kono Kim, Yeohoon Yoon, Harksoo Kim, Jungyun Seo: Named Entity Recognition Using Acyclic Weighted Digraphs: A Semi-supervised Statistical Method. 571-578
Petra Kralj, Nada Lavrac, Dragan Gamberger, Antonija Krstacic: Contrast Set Mining Through Subgroup Discovery Applied to Brain Ischaemina Data. 579-586
Hye-Chung Kum, Joong Hyuk Chang, Wei Wang: Intelligent Sequential Mining Via Alignment: Optimization Techniques for Very Large DB. 587-597
Yongle Lü, Rongling Lang: A Hybrid Prediction Method Combining RBF Neural Network and FAR Model. 598-605
Sang-Hyuk Lee, Jinho Kim, Se-Hwan Jang, Jong-Bae Park, Young-Hwan Jeon, Sung-Yong Sohn: An Advanced Fuzzy C-Mean Algorithm for Regional Clustering of Interconnected Systems. 606-615
Song-Jae Lee, Dong-Chul Park: Centroid Neural Network with Bhattacharyya Kernel for GPDF Data Clustering. 616-622
Yuxia Lei, Yan Wang, Baoxiang Cao, Jiguo Yu: Concept Interconnection Based on Many-Valued Context Analysis. 623-630
Verayuth Lertnattee, Thanaruk Theeramunkong: Text Classification for Thai Medicinal Web Pages. 631-638
Jiuyong Li, Xiaodi Huang, Clinton Selke, Jianming Yong: A Fast Algorithm for Finding Correlation Clusters in Noise Data. 639-647
Ming Li, De-San Yang: Application of Discrimination Degree for Attributes Reduction in Concept Lattice. 648-655
Xiaohui Li, Yan Huang: A Language and a Visual Interface to Specify Complex Spatial Patterns. 656-663
Yangyang Li, Licheng Jiao: Quantum-Inspired Immune Clonal Multiobjective Optimization Algorithm. 672-679
Zhiyong Li, Weilin Wu: Phase Space Reconstruction Based Classification of Power Disturbances Using Support Vector Machines. 680-687
Hongbo Liu, Jiaxin Wang, Yannan Zhao, Zehong Yang: Mining the Impact Factors of Threads and Participators on Usenet Using Link Analysis. 688-695
Jinfu Liu, Qinghua Hu, Daren Yu: Weighted Rough Set Learning: Towards a Subjective Approach. 696-703
Jun Liu, Kotagiri Ramamohanarao: Multiple Self-Splitting and Merging Competitive Learning Algorithm. 704-711
Xinguo Lu, Yaping Lin, Haijun Wang, Si-wang Zhou, Xiaolong Li: A Novel Relative Space Based Gene Feature Extraction and Cancer Recognition. 712-719
Ithipan Methasate, Thanaruk Theeramunkong: Experiments on Kernel Tree Support Vector Machines for Text Categorization. 720-727
Hoong Kee Ng, Kang Ning, Hon Wai Leong: A New Approach for Similarity Queries of Biological Sequences in Databases. 728-736
Tomonobu Ozaki, Takenao Ohkawa: Efficiently Mining Closed Constrained Frequent Ordered Subtrees by Using Border Information. 745-752
Nam Hun Park, Won Suk Lee: Approximate Trace of Grid-Based Clusters over High Dimensional Data Streams. 753-760
Guang Qiu, Jiajun Bu, Chun Chen, Peng Huang, Keke Cai: Syntactic Impact on Sentence Similarity Measure in Archive-Based QA System. 769-776
Issei Sato, Hiroshi Nakagawa: Semi-structure Mining Method for Text Mining with a Chunk-Based Dependency Structure. 777-784
Jinhui Tang, Xian-Sheng Hua, Yan Song, Guo-Jun Qi, Xiuqing Wu: Kernel-Based Linear Neighborhood Propagation for Semantic Video Annotation. 793-800
Fengzhan Tian, Feng Liu, Zhihai Wang, Jian Yu: Learning Bayesian Networks with Combination of MRMR Criterion and EMI Method. 801-808
Jin Tian, Minqiang Li, Fuzan Chen: A Cooperative Coevolution Algorithm of RBFNN for Classification. 809-816
Cheng-Fa Tsai, Chia-Chen Yen: ANGEL: A New Effective and Efficient Hybrid Clustering Technique for Large Databases. 817-824
Hsiao-Ping Tsai, De-Nian Yang, Wen-Chih Peng, Ming-Syan Chen: Exploring Group Moving Pattern for an Energy-Constrained Object Tracking Sensor Network. 825-832
Chi-Yao Tseng, Jen-Wei Huang, Ming-Syan Chen: ProMail: Using Progressive Email Social Network for Spam Detection. 833-840
Kuralmani Vellaisamy, Jinyan Li: Multidimensional Decision Support Indicator (mDSI) for Time Series Stock Trend Prediction. 841-848
Cuiru Wang, Hejin Yuan, Jun Liu, Tao Zhou, Huiling Lu: A Novel Support Vector Machine Ensemble Based on Subtractive Clustering Analysis. 849-856
Limin Wang, Chunhong Cao, Xiongfei Li, Haijun Li: Finding the Optimal Feature Representations for Bayesian Network Learning. 865-870
Shulin Wang, Ji Wang, Huowang Chen, Shutao Li: Feature Extraction and Classification of Tumor Based on Wavelet Package and Support Vector Machines. 871-878
Weixing Wang: Image Classification and Segmentation for Densely Packed Aggregates. 887-894
Ling-Yin Wei, Man-Kwan Shan: Mining Temporal Co-orientation Pattern from Spatio-temporal Databases. 895-903
Yimin Wen, Bao-Liang Lu: Incremental Learning of Support Vector Machines by Classifier Combining. 904-911
Daya C. Wimalasuriya, Sridhar Ramachandran, Dejing Dou: Clustering Zebrafish Genes Based on Frequent-Itemsets and Frequency Levels. 912-920
Jung-Im Won, Sang-Kyoon Hong, Jeehee Yoon, Sanghyun Park, Sang-Wook Kim: A Practical Method for Approximate Subsequence Search in DNA Databases. 921-931
Hu Wu, Yongji Wang, Xiaoyong Huai: AttributeNets: An Incremental Learning Method for Interpretable Classification. 940-947
Jing Wu, Pin Zhang, Zhang Xiong, Hao Sheng: Mining Personalization Interest and Navigation Patterns on Portal. 948-955
Peng Wu, Yuehui Chen: Grammar Guided Genetic Programming for Flexible Neural Trees Optimization. 964-971
Shu Wu, Qingshan Jiang, Joshua Zhexue Huang: A New Initialization Method for Clustering Categorical Data. 972-980

Bin Xu, Danny Z. Chen: Density-Based Data Clustering Algorithms for Lower Dimensions Using Space-Filling Curves. 997-1005
Limin Xu, Zhenmin Tang, Keke He, Bo Qian: Transformation-Based GMM with Improved Cluster Algorithm for Speaker Identification. 1006-1014
Shengliang Xu, Shenghua Bao, Yong Yu, Yunbo Cao: Using Social Annotations to Smooth the Language Model for IR. 1015-1021
Dongyi Ye, Zhaojiong Chen, Jiankun Liao: A New Algorithm for Minimum Attribute Reduction Based on Binary Particle Swarm Optimization with Vaccination. 1029-1036
Luh Yen, François Fouss, Christine Decaestecker, Pascal Francq, Marco Saerens: Graph Nodes Clustering Based on the Commute-Time Kernel. 1037-1045
Ying Yin, Yuhai Zhao, Bin Zhang: Identifying Synchronous and Asynchronous Co-regulations from Time Series Gene Expression Data. 1046-1054
Ting Yu, Simeon J. Simoff, Donald Stokes: Incorporating Prior Domain Knowledge into a Kernel Based Feature Selection Algorithm. 1064-1071
Bin Zhang, Wen Jun Yin, Ming Xie, Jin Dong: Geo-spatial Clustering with Non-spatial Attributes and Geographic Non-overlapping Constraint: A Penalized Spatial Distance Measure. 1072-1079
Chengqi Zhang, Xiaofeng Zhu, Jilian Zhang, Yongsong Qin, Shichao Zhang: GBKII: An Imputation Method for Missing Values. 1080-1087
Lijuan Zhang, Zhoujun Li, Huowang Chen: An Effective Gene Selection Method Based on Relevance Analysis and Discernibility Matrix. 1088-1095
Nan Zhang: Towards Comprehensive Privacy Protection in Data Clustering. 1096-1104
Xueping Zhang, Jiayao Wang, Mingguang Wu, Yi Cheng: A Novel Spatial Clustering with Obstacles Constraints Based on Particle Swarm Optimization and K-Medoids. 1105-1113
Qiang Zhao, Hua Chen, Zhi Geng: Structural Learning About Independence Graphs from Multiple Databases. 1122-1130
Renliang Zhao, Jiatian Li: An Effective Method For Calculating Natural Adjacency Relation in Spatial Database. 1131-1139
Weidong Zhao, Weihui Dai, Chunbin Tang: K-Centers Algorithm for Clustering Mixed Type Data. 1140-1147



