22. ICDE 2006: Atlanta, Georgia, USA
Ling Liu, Andreas Reuter, Kyu-Young Whang, Jianjun Zhang (Eds.): Proceedings of the 22nd International Conference on Data Engineering, ICDE 2006, 3-8 April 2006, Atlanta, GA, USA. IEEE Computer Society 2006
Introduction
Message From The Chairs.
General Chairs.
Program Committee Members.
External Reviewers.
Research Session 1: Views
Václav Lín, Vasilis Vassalos, Prodromos Malakasiotis: MiniCount: Efficient Rewriting of COUNT-Queries Using Views. 1

Research Session 2: Data Warehouse (1)
Dong Xin, Zheng Shao, Jiawei Han, Hongyan Liu: C-Cubing: Efficient Computation of Closed Cubes by Aggregation-Based Checking. 4
Surajit Chaudhuri, Venkatesh Ganti, Raghav Kaushik: A Primitive Operator for Similarity Joins in Data Cleaning. 5
Research Session 3: Query Processing and Uncertainty Reasoning
Anish Das Sarma, Omar Benjelloun, Alon Y. Halevy, Jennifer Widom: Working Models for Uncertain Data. 7
Sudipto Guha, Nick Koudas, Divesh Srivastava, Xiaohui Yu: Reasoning About Approximate Match Query Results. 8
Christian Böhm, Alexey Pryakhin, Matthias Schubert: The Gauss-Tree: Efficient Object Identification in Databases of Probabilistic Feature Vectors. 9
Research Session 4: Indexing and Optimization
Evangelos Kanoulas, Yang Du, Tian Xia, Donghui Zhang: Finding Fastest Paths on A Road Network with Speed Patterns. 10
Ira Assent, Andrea Wenning, Thomas Seidl: Approximation Techniques for Indexing the Earth Mover's Distance in Multimedia Databases. 11
Research Session 5: XML and Semi-Structured Data

Christopher Re, Jérôme Siméon, Mary F. Fernández: A Complete and Efficient Algebraic Compiler for XQuery. 14
Nuwee Wiwatwattana, H. V. Jagadish, Laks V. S. Lakshmanan, Divesh Srivastava: Making Designer Schemas with Colors. 15
Research Session 6: Data Mining and Optimization

Ruoming Jin, Gagan Agrawal: Systematic Approach for Optimizing Complex Mining Tasks on Multiple Databases. 17
Ruoming Jin, Leonid Glimcher, Chris Jermaine, Gagan Agrawal: New Sampling-Based Estimators for OLAP Queries. 18
Reza Sherkat, Davood Rafiei: Efficiently Evaluating Order Preserving Similarity Queries over Historical Market-Basket Data. 19
Research Session 7: Query Processing and Query Optimization
Research Session 8: Data Privacy and Security


Fatih Emekçi, Divyakant Agrawal, Amr El Abbadi, Aziz Gulbeden: Privacy Preserving Query Processing Using Third Parties. 27
Research Session 9: Data Integration and Database Interoperability
Amit Chandel, P. C. Nagesh, Sunita Sarawagi: Efficient Batch Top-k Search for Dictionary-based Entity Recognition. 28
Periklis Andritsos, Ariel Fuxman, Renée J. Miller: Clean Answers over Dirty Databases: A Probabilistic Approach. 30
Research Session 10: Web Services and Applications
Ken Q. Pu, Vagelis Hristidis, Nick Koudas: Syntactic Rule Based Approach to Web Service Composition. 31
Fan Yang, Jayavel Shanmugasundaram, Mirek Riedewald, Johannes Gehrke: Hilda: A High-Level Language for Data-Driven Web Applications. 32
Huiming Qu, Alexandros Labrinidis, Daniel Mossé: UNIT: User-centric Transaction Management in Web-Database Systems. 33
Research Session 11: Temporal and Spatial Data Management (1)
H. V. Jagadish, Beng Chin Ooi, Quang Hieu Vu, Rong Zhang, Aoying Zhou: VBI-Tree: A Peer-to-Peer Framework for Supporting Multi-Dimensional Indexing Schemes. 34
David B. Lomet, Roger S. Barga, Mohamed F. Mokbel, German Shegalov, Rui Wang, Yunyue Zhu: Transaction Time Support Inside a Database Engine. 35
Research Session 12: Query Optimization and Data Structures


Utkarsh Srivastava, Peter J. Haas, Volker Markl, Marcel Kutsch, Tam Minh Tran: ISOMER: Consistent Histogram Construction Using Query Feedback. 39
Research Session 13: Distributed and Peer to Peer Data Management
Nikos Ntarmos, Peter Triantafillou, Gerhard Weikum: Counting at Large: Efficient Cardinality Estimation in Internet-Scale Data Networks. 40
Philippe Cudré-Mauroux, Karl Aberer, Andras Feher: Probabilistic Message Passing in Peer Data Management Systems. 41
Benjamin Arai, Gautam Das, Dimitrios Gunopulos, Vana Kalogeraki: Approximating Aggregation Queries in Peer-to-Peer Networks. 42
Stratos Idreos, Christos Tryfonopoulos, Manolis Koubarakis: Distributed Evaluation of Continuous Equi-join Queries over Large Structured Overlay Networks. 43
Research Session 14: Web Queries
Wensheng Wu, AnHai Doan, Clement T. Yu: WebIQ: Learning from the Web to Match Deep-Web Query Interfaces. 44
Eduard C. Dragut, Wensheng Wu, A. Prasad Sistla, Clement T. Yu, Weiyi Meng: Merging Source Query Interfaces on Web Databases. 46
Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma: Query Selection Techniques for Efficient Crawling of Structured Web Sources. 47
Research Session 15: Stream Processing (1)
David Chu, Amol Deshpande, Joseph M. Hellerstein, Wei Hong: Approximate Data Collection in Sensor Networks using Probabilistic Models. 48
Peter R. Pietzuch, Jonathan Ledlie, Jeffrey Shneidman, Mema Roussopoulos, Matt Welsh, Margo I. Seltzer: Network-Aware Operator Placement for Stream-Processing Systems. 49
Xin Zhou, Hetal Thakkar, Carlo Zaniolo: Unifying the Processing of XML Streams and Relational Data Streams. 50
Ying Zhang, Xuemin Lin, Jian Xu, Flip Korn, Wei Wang: Space-efficient Relative Error Order Sketch over Data Streams. 51
Research Session 16: XML and XPath
Steven Bird, Yi Chen, Susan B. Davidson, Haejoong Lee, Yifeng Zheng: Designing and Evaluating an XPath Dialect for Linguistic Queries. 52

Research Session 17: Stream Processing (2)
Yongluan Zhou, Beng Chin Ooi, Kian-Lee Tan, Feng Yu: Adaptive Reorganization of Coherency-Preserving Dissemination Tree for Streaming Data. 55
Frederick Reiss, Joseph M. Hellerstein: Declarative Network Monitoring with an Underprovisioned Query Processor. 56
Graham Cormode, S. Muthukrishnan, Wei Zhuang: What's Different: Distributed, Continuous Monitoring of Duplicate-Resilient Aggregates on Data Streams. 57
Research Session 18: Database System Internals and Performance
Jennifer L. Beckmann, Alan Halverson, Rajasekar Krishnamurthy, Jeffrey F. Naughton: Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format. 58
Marcin Zukowski, Sándor Héman, Niels Nes, Peter A. Boncz: Super-Scalar RAM-CPU Cache Compression. 59
Bianca Schroeder, Mor Harchol-Balter, Arun Iyengar, Erich M. Nahum, Adam Wierman: How to Determine a Good Multi-Programming Level for External Scheduling. 60
Research Session 19: XML Database and XML Query Optimization
Ning Zhang, M. Tamer Özsu, Ashraf Aboulnaga, Ihab F. Ilyas: XSEED: Accurate and Fast Cardinality Estimation for XPath Queries. 61
Cheng Luo, Zhewei Jiang, Wen-Chi Hou, Feng Yan, Chih-Fang Wang: Estimating XML Structural Join Size Quickly and Economically. 62
Research Session 20: Skyline Query Processing
Surajit Chaudhuri, Nilesh N. Dalvi, Raghav Kaushik: Robust Cardinality and Cost Estimation for Skyline Operator. 64
Zhiyong Huang, Christian S. Jensen, Hua Lu, Beng Chin Ooi: Skyline Queries Against Mobile Lightweight Devices in MANETs. 66
Research Session 21: Query Management
Shouke Qin, Weining Qian, Aoying Zhou: Approximately Processing Multi-granularity Aggregate Queries over Data Streams. 67
Adam Silberstein, Rebecca Braynard, Carla Schlatter Ellis, Kamesh Munagala, Jun Yang: A Sampling-Based Approach to Optimizing Top-k Queries in Sensor Networks. 68
Research Session 22: Ubiquitous Data Management
Yingqi Xu, Wang-Chien Lee, Jianliang Xu, Gail Mitchell: Processing Window Queries in Wireless Sensor Networks. 70
Christian S. Jensen, Dan Lin, Beng Chin Ooi, Rui Zhang: Effective Density Queries on Continuously Moving Objects. 71
Nikos Mamoulis, Kit Hung Cheng, Man Lung Yiu, David W. Cheung: Efficient Aggregation of Ranked Inputs. 72
Research Session 23: Graph Databases and Algorithms
Jianyong Wang, Zhiping Zeng, Lizhu Zhou: CLAN: An Algorithm for Mining Closed Cliques from Large Dense Graph Databases. 73
Haixun Wang, Hao He, Jun Yang, Philip S. Yu, Jeffrey Xu Yu: Dual Labeling: Answering Graph Reachability Queries in Constant Time. 75
Research Session 24: Nearest Neighbor Search in Temporal and Spatial Databases
Research Session 25: Stream Query Optimization

Praveen Rao, Bongki Moon: SketchTree: Approximate Tree Pattern Counts over Streaming Labeled Trees. 80
Feifei Li, Ching Chang, George Kollios, Azer Bestavros: Characterizing and Exploiting Reference Locality in Data Stream Applications. 81
Research Session 26: Advanced Query Processing
Floris Geerts, Anastasios Kementsietsidis, Diego Milano: MONDRIAN: Annotating and Querying Databases through Colors and Blocks. 82
Hector Gonzalez, Jiawei Han, Xiaolei Li, Diego Klabjan: Warehousing and Analyzing Massive RFID Data Sets. 83
Research Session 27: Temporal and Spatial Management (2)
Kien A. Hua, Ning Yu, Danzhou Liu: Query Decomposition: A Multiple Neighborhood Approach to Relevance Feedback Processing in Content-based Image Retrieval. 84
Subramanian Arumugam, Chris Jermaine: Closest-Point-of-Approach Join for Moving Object Histories. 86
Research Session 28: Scientific and Biological Databases and Bioinformatics
Sandeep Tata, Jignesh M. Patel, James S. Friedman, Anand Swaroop: Declarative Querying for Biological Sequences. 87
Xifeng Yan, Feida Zhu, Jiawei Han, Philip S. Yu: Searching Substructures with Superimposed Distance. 88
Xin Xu, Ying Lu, Anthony K. H. Tung, Wei Wang: Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles. 89
Industry Session 1: Databases, Services, and Processes
Stefan Seltzsam, Daniel Gmach, Stefan Krompass, Alfons Kemper: AutoGlobe: An Automatic Administration Concept for Service-Oriented Database Applications. 90
Asuman Dogac, Veli Bicer, Alper Okcan: Collaborative Business Process Support in IHE XDS through ebXML Business Processes. 91
Rakesh Agrawal, Christopher M. Johnson, Jerry Kiernan, Frank Leymann: Taming Compliance with Sarbanes-Oxley Internal Controls Using Database Technology. 92
Industry Session 2: RDF, Ontologies, Metadata


Eugene Inseok Chong, Souripriya Das, George Eadon, Jagannathan Srinivasan: Supporting Keyword Columns with Ontology-based Referential Constraints in DBMS. 95
Fusheng Wang, Peiya Liu, John Pearson, Fred Azar, Gerald Madlmayr: Experiment Management with Metadata-based Integration for Collaborative Scientific Research. 96
Industry Session 3: Querying and Logging
Sunil Jigyasu, Sujeet Banerjee, Vinayak R. Borkar, Michael J. Carey, Kanad Dixit, Anil Malkani, Sachin Thatte: SQL to XQuery Translation in the AquaLogic Data Services Platform. 97
A. Kumaran, Pavan K. Chowdary, Jayant R. Haritsa: On Pushing Multilingual Query Operators into Relational Engines. 98
Antti-Pekka Liedes, Antoni Wolski: SIREN: A Memory-Conserving, Snapshot-Consistent Checkpoint Algorithm for in-Memory Databases. 99
Industry Session 4: Data Warehousing and Analysis
Mohamed Y. Eltabakh, Ramy Eltarras, Walid G. Aref: Space-Partitioning Trees in PostgreSQL: Realization and Performance. 100
Ganesh Ramakrishnan, Sachindra Joshi, Sumit Negi, Raghu Krishnapuram, Sreeram Balakrishnan: Automatic Sales Lead Generation from Web Data. 101
Wen-Syan Li, Daniel C. Zilio, Vishal S. Batra, Mahadevan Subramanian, Calisto Zuzarte, Inderpal Narang: Load Balancing for Multi-tiered Database Systems through Autonomic Placement of Materialized Views. 102
Advanced Seminar 1
Paolo Atzeni: Schema and Data Translation. 103
Advanced Seminar 2
Advanced Seminar 3
Johannes Gehrke: Models and Methods for Privacy-Preserving Data Analysis and Publishing. 105
Advanced Seminar 4
Jiawei Han, Xifeng Yan, Philip S. Yu: Mining, Indexing, and Similarity Search in Graphs and Complex Structures. 106
Advanced Seminar 5
Anastassia Ailamaki, Naga K. Govindaraju, Dinesh Manocha: Query Co-Processing on Commodity Hardware. 107
Poster Session 1: Data Integration, Data Mining, Data Warehousing
Michael D. Morse, Jignesh M. Patel, William I. Grosky: Efficient Continuous Skyline Computation. 108
Heng Tao Shen, Beng Chin Ooi, Kian-Lee Tan: SaveRF: Towards Efficient Relevance Feedback Search. 110
Charu C. Aggarwal, Chen Chen, Jiawei Han: On the Inverse Classification Problem and its Applications. 111
Yiping Ke, James Cheng, Wilfred Ng: MIC Framework: An Information-Theoretic Approach to Quantitative Association Rule Mining. 112
Xiaoming Jin, Xinqiang Zuo, Kwok-Yan Lam, Jianmin Wang, Jia-Guang Sun: Efficient Discovery of Emerging Frequent Patterns in Arbitrary Windows on Data Streams. 113
Hongyan Liu, Jiawei Han, Dong Xin, Zheng Shao: Top-Down Mining of Interesting Patterns from Very High Dimensional Data. 114
Poster Session 2: Data Privacy and Security

Sergei Evdokimov, Matthias Fischmann, Oliver Günther: Provable Security for Outsourcing Database Operations. 117
Basit Shafiq, Arjmand Samuel, Elisa Bertino, Arif Ghafoor: Technique for Optimal Adaptation of Time-Dependent Workflows with Security Constraints. 119
Poster Session 3: Web Data Management and Workflow
Wei Zhang, Clement T. Yu, Neil R. Smalheiser, Vetle I. Torvik: Segmentation of Publication Records of Authors from the Web. 120
Dou Shen, Jian-Tao Sun, Qiang Yang, Hui Zhao, Zheng Chen: Text Classification Improved through Automatically Extracted Sequences. 121
Weifeng Su, Jiying Wang, Frederick H. Lochovsky: Holistic Query Interface Matching using Parallel Schema Matching. 122
Stefanie Rinderle, Andreas Wombacher, Manfred Reichert: On the Controlled Evolution of Process Choreographies. 124
Poster Session 4: XML Data Management


Matthias Brantner, Carl-Christian Kanne, Guido Moerkotte, Sven Helmer: Algebraic Optimization of Nested XPath Expressions. 128
Maged El-Sayed, Elke A. Rundensteiner, Murali Mani: Incremental Maintenance of Materialized XQuery Views. 129
Zeeshan Sardar, Bettina Kemme: Don't be a Pessimist: Use Snapshot based Concurrency Control for XML. 130
Fusheng Wang, Xin Zhou, Carlo Zaniolo: Using XML to Build Efficient Transaction-Time Temporal Database Systems on Relational Databases. 131
Poster Session 5: Peer to Peer and Mobile Data Management


Prasanna Padmanabhan, Le Gruenwald: DREAM: A Data Replication Technique for Real-Time Mobile Ad-hoc Network Databases. 134
Damdinsuren Amarmend, Masayoshi Aritsugi, Yoshinari Kanamori: An Air Index for Data Access over Multiple Wireless Broadcast Channels. 135
Poster Session 6: Stream Processing and Optimization
Michael Cammert, Jürgen Krämer, Bernhard Seeger, Sonny Vaupel: An Approach to Adaptive Memory Management in Data Stream Systems. 137
John Hershberger, Nisheeth Shrivastava, Subhash Suri: Cluster Hull: A Technique for Summarizing Spatial Data Streams. 138
DongDong Zhang, Jianzhong Li, Kimutai Kimeli, Weiping Wang: Sliding Window based Multi-Join Algorithms over Distributed Data Streams. 139
Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong, Jennifer Widom: A Pipelined Framework for Online Cleaning of Sensor Data Streams. 140
Esther Ryvkina, Anurag Maskey, Mitch Cherniack, Stanley B. Zdonik: Revision Processing in a Stream Processing Engine: A High-Level Design. 141
Ahmed Ayad, Jeffrey F. Naughton, Stephen J. Wright, Utkarsh Srivastava: Approximating Streaming Window Joins Under CPU Limitations. 142
Poster Session 7: Sensor Network and Sensor Queries
Minji Wu, Jianliang Xu, Xueyan Tang, Wang-Chien Lee: Monitoring Top-k Query in Wireless Sensor Networks. 143
Vebjorn Ljosa, Arnab Bhattacharya, Ambuj K. Singh: LB-Index: A Multi-Resolution Index Structure for Images. 144
Adam Silberstein, Rebecca Braynard, Jun Yang: Energy-Efficient Continuous Isoline Queries in Sensor Networks. 145

Dina Q. Goldin: Faster In-Network Evaluation of Spatial Aggregation in Sensor Networks. 148
Johannes Aßfalg, Hans-Peter Kriegel, Peer Kröger, Peter Kunath, Alexey Pryakhin, Matthias Renz: Threshold Similarity Queries in Large Time Series Databases. 149
Poster Session 8: Database System Internals, Query Processing and Optimization
Lin Qiao, Balakrishna R. Iyer, Divyakant Agrawal, Amr El Abbadi: Automated Storage Management with QoS Guarantees. 150
Sourav S. Bhowmick, Sandeep Prakash: Every Click You Make, I Will Be Fetching It: Efficient XML Query Processing in RDMS Using GUI-driven Prefetching. 152
Bianca Schroeder, Mor Harchol-Balter, Arun Iyengar, Erich M. Nahum: Achieving Class-Based QoS for Transactional Workloads. 153
Flip Korn, S. Muthukrishnan, Yihua Wu: Fractal Modeling of IP Network Traffic at Streaming Speeds. 155
Demo Session 1: New Approaches in Data Engineering
Mohammed K. Jaber, Andrei Voronkov: UNIDOOR: a Deductive Object-Oriented Database Management System. 157
Luca Cabibbo, Ivan Panella, Riccardo Torlone: DaWaII: a Tool for the Integration of Autonomous Data Marts. 158
Francesco Bonchi, Fosca Giannotti, Claudio Lucchese, Salvatore Orlando, Raffaele Perego, Roberto Trasarti: ConQueSt: a Constraint-based Querying System for Exploratory Pattern Discovery. 159
Wei-Shinn Ku, Roger Zimmermann, Chi-Ngai Wan, Haojun Wang: MAPLE: A Mobile Scalable P2P Nearest Neighbor Query System for Location-based Services. 160
Shuigeng Zhou, Zheng Zhang, Weining Qian, Aoying Zhou: SIPPER: Selecting Informative Peers in Structured P2P Environment for Content-Based Retrieval. 161
Kun Gao, Stavros Harizopoulos, Ippokratis Pandis, Vladislav Shkapenyuk, Anastassia Ailamaki: Simultaneous Pipelining in QPipe: Exploiting Work Sharing Opportunities Across Queries. 162
Jagan Sankaranarayanan, Houman Alborzi, Hanan Samet: Enabling Query Processing on Spatial Networks. 163
Ying Chen, Andrew Rau-Chaplin, Frank K. H. A. Dehne, Todd Eavis, D. Green, E. Sithirasenan: cgmOLAP: Efficient Parallel Generation and Querying of Terabyte Size ROLAP Data Cubes. 164
Demo Session 2: New Data Engineering Applications
Anastasios Gounaris, Norman W. Paton, Rizos Sakellariou, Alvaro A. A. Fernandes, Jim Smith, Paul Watson: Practical Adaptation to Changing Resources in Grid Query Processing. 165
Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle, Matthias Renz: ViEWNet: Visual Exploration of Region-Wide Traffic Networks. 166
Chang-Tien Lu, Arnold P. Boedihardjo, Jinping Zheng: AITVS: Advanced Interactive Traffic Visualization System. 167
Michael Cammert, Christoph Heinz, Jürgen Krämer, Tobias Riemenschneider, Maxim Schwarzkopf, Bernhard Seeger, Alexander Zeiss: Stream Processing in Production-to-Business Software. 168
Jialie Shen, John Shepherd, Bin Cui, Kian-Lee Tan: HSI: A Novel Framework for Efficient Automated Singer Identification in Large Music Database. 169
Sriram Mohan, Jonathan Klinginsmith, Arijit Sengupta, Yuqing Wu: ACXESS - Access Control for XML with Enhanced Security Specifications. 171
Le Gruenwald, Percy Bernedo, Prasanna Padmanabhan: PETRANET: a Power Efficient Transaction Management Technique for Real-Time Mobile Ad-hoc Network Databases. 172



