17. CIKM 2008: Napa Valley, California, USA
James G. Shanahan, Sihem Amer-Yahia, Ioana Manolescu, Yi Zhang, David A. Evans, Aleksander Kolcz, Key-Sun Choi, Abdur Chowdhury (Eds.): Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, Napa Valley, California, USA, October 26-30, 2008. ACM 2008 ISBN 978-1-59593-991-3
Rakesh Agrawal: Humane data mining. 1-2
DB: faceted search, web query results presentation
Debabrata Dash, Jun Rao, Nimrod Megiddo, Anastasia Ailamaki, Guy M. Lohman: Dynamic faceted search for discovery-driven analysis. 3-12
Senjuti Basu Roy, Haidong Wang, Gautam Das, Ullas Nambiar, Mukesh K. Mohania: Minimum-effort driven dynamic faceted search in structured databases. 13-22
Gloria Bordogna, Alessandro Campi, Giuseppe Psaila, Stefania Ronchi: A language for manipulating clustered web documents results. 23-32
Shui-Lung Chuang, Kevin Chen-Chuan Chang: Integrating web query results: holistic schema matching. 33-42
IR: web search 1
Filip Radlinski, Madhu Kurup, Thorsten Joachims: How does clickthrough data reflect retrieval quality? 43-52
Marc Najork, Nick Craswell: Efficient and effective link analysis with precomputed salsa maps. 53-62
Lian'en Huang, Lei Wang, Xiaoming Li: Achieving both high precision and high recall in near-duplicate detection. 63-72
Zhicheng Dou, Ruihua Song, Xiaojie Yuan, Ji-Rong Wen: Are click-through data adequate for learning web search rankings? 73-82
KM: classification
Jian Huang, Omid Madani, C. Lee Giles: Error-driven generalist+experts (edge): a multi-stage ensemble framework for text categorization. 83-92
Yang Song, Lu Zhang, C. Lee Giles: A sparse gaussian processes classification framework for fast tag suggestions. 93-102
Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He: Transfer learning from multiple source domains via consensus regularization. 103-112
Industry research track
Casey Whitelaw, Alexander Kehlenbeck, Nemanja Petrovic, Lyle H. Ungar: Web-scale named entity recognition. 123-132
Roy J. Byrd, Mary S. Neff, Wilfried Teiken, Youngja Park, Keh-Shin F. Cheng, Stephen C. Gates, Karthik Visweswariah: Semi-automated logging of contact center telephone calls. 133-142
Gang Luo, Chunqiang Tang, Hao Yang, Xing Wei: MedSearch: a specialized search engine for medical information retrieval. 143-152
Roger B. Bradford: An empirical study of required dimensionality for large-scale latent semantic indexing applications. 153-162
DB: efficient maintenance and query optimization
Gang Luo, Philip S. Yu: Content-based filtering for efficient online materialized view maintenance. 163-172
Gang Qian, Yisheng Dong: A step towards incremental maintenance of the composed schema mapping. 173-182
Mumtaz Ahmad, Ashraf Aboulnaga, Shivnath Babu, Kamesh Munagala: Modeling and exploiting query interactions in database systems. 183-192
Humberto Luiz Razente, Maria Camila Nardini Barioni, Agma J. M. Traina, Christos Faloutsos, Caetano Traina Jr.: A novel optimization approach to efficiently process aggregate similarity queries in metric access methods. 193-202
IR: social search
Kerstin Bischoff, Claudiu S. Firan, Wolfgang Nejdl, Raluca Paiu: Can all tags be used for search? 193-202
Anna Ritchie, Stephen Robertson, Simone Teufel: Comparing citation contexts for information retrieval. 213-222
Fabian M. Suchanek, Milan Vojnovic, Dinan Gunawardena: Social tags: meaning and suggestions. 223-232
Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King: Mining social networks using heat diffusion processes for marketing candidates selection. 233-242
IR/KM: machine learning
Leonardo C. da Rocha, Fernando Mourão, Adriano M. Pereira, Marcos André Gonçalves, Wagner Meira Jr.: Exploiting temporal contexts in text classification. 243-252
Alessandro Moschitti: Kernel methods, syntax and semantics for relational text categorization. 253-262
George Forman: BNS feature scaling: an improved representation over tf-idf for svm text classification. 263-270
KM: link and graph mining

Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar, Ying Xu: Link privacy in social networks. 289-298
Chen Chen, Cindy Xide Lin, Xifeng Yan, Jiawei Han: On effective presentation of graph patterns: a structural representative approach. 299-308
Qiankun Zhao, Sourav S. Bhowmick, Xin Zheng, Kai Yi: Characterizing and predicting community members from evolutionary and heterogeneous networks. 309-318
KM: information filtering

Dongmei Jia, Wai Gen Yee, Ophir Frieder: Spam characterization and detection in peer-to-peer file-sharing systems. 329-338
Nish Parikh, Neel Sundaresan: Inferring semantic query relations from collective user behavior. 349-358
DB: stream processing
George A. Mihaila, Ioana Stanoi, Christian A. Lang: Anomaly-free incremental output in stream processing. 359-368
Abhishek Mukherji, Elke A. Rundensteiner, David C. Brown, Venkatesh Raghavan: SNIF TOOL: sniffing for patterns in continuous streams. 369-378
Giorgio Ghelli, Dario Colazzo, Carlo Sartiani: Linear time membership in a class of regular expressions with interleaving and counting. 389-398
IR: theory
Donald Metzler: Generalized inverse document frequency. 399-408
Derrick Coetzee: TinyLex: static n-gram index pruning with perfect recall. 409-418
David E. Losada, Leif Azzopardi, Mark Baillie: Revisiting the relationship between document length and relevance. 419-428
Lixin Shi, Jian-Yun Nie, Guihong Cao: Relating dependent indexes using dempster-shafer theory. 429-438
IR: query analysis
Claudia Hauff, Vanessa Murdock, Ricardo A. Baeza-Yates: Improved query difficulty prediction for the web. 439-448
Doug Downey, Susan T. Dumais, Daniel J. Liebling, Eric Horvitz: Understanding the relationship between searchers' queries and information goals. 449-458

KM: web mining
Xuanhui Wang, ChengXiang Zhai: Mining term association patterns from search logs for effective query reformulation. 479-488
Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan: Discovering leaders from community actions. 499-508
Pedro Domingos: Markov logic: a unifying language for knowledge and information management. 519
DB/industry: XML data integration and XML query optimization
Alex Thomo, Srinivasan Venkatesh: Rewriting of visibly pushdown languages for xml data integration. 521-530
Guangjun Xie, Qi Cheng, Jarek Gryz, Calisto Zuzarte: Some rewrite optimizations of DB2 XQuery navigation. 531-540
Bilel Gueni, Talel Abdessalem, Bogdan Cautis, Emmanuel Waller: Pruning nested XQuery queries. 541-550
Pawel Placek, Dimitri Theodoratos, Stefanos Souldatos, Theodore Dalamagas, Timos K. Sellis: A heuristic approach for checking containment of generalized tree-pattern queries. 551-560
IR: evaluation
Leif Azzopardi, Vishwa Vinay: Retrievability: an evaluation measure for higher order information access tasks. 561-570
William Webber, Alistair Moffat, Justin Zobel: Statistical power in retrieval experimentation. 571-580
Tetsuya Sakai: Comparing metrics across TREC and NTCIR: the robustness to system bias. 581-590
Kenneth A. Kinney, Scott B. Huffman, Juting Zhai: How evaluator domain expertise affects search result relevance judgments. 591-598
KM: statistical techniques
Christos Boutsidis, Jimeng Sun, Nikos Anerousis: Clustered subset selection and its applications on it service metrics. 599-608
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, Sebastiano Vigna: The query-flow graph: model and applications. 609-618
Arnold P. Boedihardjo, Chang-Tien Lu, Feng Chen: A framework for estimating complex probability density structures in data streams. 619-628
Pinar Donmez, Jaime G. Carbonell: Proactive learning: cost-sensitive active learning with multiple imperfect oracles. 619-628
Panel discussion
DB: indexing and physical query optimization
Josep Aguilar-Saborit, Mohammad Jalali, Dave Sharpe, Victor Muntés-Mulero: Exploiting pipeline interruptions for efficient memory allocation. 639-648
Marina Barsky, Ulrike Stege, Alex Thomo, Chris Upton: A new method for indexing genomes using on-disk suffix trees. 649-658
Vuk Ercegovac, Vanja Josifovski, Ning Li, Maurício R. Mediano, Eugene J. Shekita: Supporting sub-document updates and queries in an inverted index. 659-668
Wei Dong, Zhe Wang, William Josephson, Moses Charikar, Kai Li: Modeling LSH for performance tuning. 669-678
IR: web search 2
Mingjie Zhu, Shuming Shi, Nenghai Yu, Ji-Rong Wen: Can phrase indexing help to process non-phrase queries? 679-688
Julia Luxenburger, Shady Elbassuoni, Gerhard Weikum: Matching task profiles and user needs in personalized web search. 689-698
Rosie Jones, Kristina Lisa Klinkner: Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. 699-708
Hao Ma, Haixuan Yang, Irwin King, Michael R. Lyu: Learning latent semantic relations from clickthrough data for query suggestion. 709-718
IR: multilingual & multimedia
Kristen Parton, Kathleen McKeown, James Allan, Enrique Henestroza: Simultaneous multilingual search for translingual information retrieval. 719-728
Eduardo Valle, Matthieu Cord, Sylvie Philipp-Foliguet: High-dimensional descriptor indexing for large multimedia databases. 739-748
Yu-En Lu, Pietro Liò, Steven Hand: On low dimensional random projections and similarity search. 749-758
KM: data mining
Hanghang Tong, Yasushi Sakurai, Tina Eliassi-Rad, Christos Faloutsos: Fast mining of complex time-stamped events. 759-768
Darcy A. Davis, Nitesh V. Chawla, Nicholas Blumm, Nicholas A. Christakis, Albert-László Barabási: Predicting individual disease risk based on medical history. 769-778
Malika Mahoui, William John Teahan, Arvind Kumar Thirumalaiswamy Sekhar, Satyasaibabu Chilukuri: Identification of gene function using prediction by partial matching (PPM) language models. 779-786
KM: semantic techniques
Rodolfo Stecher, Claudia Niederée, Wolfgang Nejdl: Wildcards for lightweight information integration in virtual desktops. 797-806
Simona Colucci, Eugenio Di Sciascio, Francesco M. Donini, Eufemia Tinelli: Finding informative commonalities in concept collections. 807-817
Masahiro Ito, Kotaro Nakayama, Takahiro Hara, Shojiro Nishio: Association thesaurus construction methods based on link co-occurrence analysis for wikipedia. 817-826
Christian Hütter, Conny Kühne, Klemens Böhm: Peer production of structured knowledge -: an empirical study of ratings and incentive mechanisms. 827-842
DB: security and privacy
Venkatesan T. Chakaravarthy, Himanshu Gupta, Prasan Roy, Mukesh K. Mohania: Efficient techniques for document sanitization. 843-852
Haixun Wang, Jian Yin, Chang-Shing Perng, Philip S. Yu: Dual encryption for query integrity assurance. 863-872
Ahmed A. Ataullah, Ashraf Aboulnaga, Frank Wm. Tompa: Records retention in relational database systems. 873-882
IR: medley



Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai: Modeling hidden topics on document manifold. 911-920
IR: recommender systems
Jinwen Guo, Shengliang Xu, Shenghua Bao, Yong Yu: Tapping on the potential of q&a community by recommending answer providers. 921-930
Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King: SoRec: social recommendation using probabilistic matrix factorization. 931-940
Yun Chi, Shenghuo Zhu, Yihong Gong, Yi Zhang: Probabilistic polyadic factorization and its application to personalized recommendation. 941-950
Derry Tanti Wijaya, Stéphane Bressan: A random walk on the red carpet: rating movies with user reviews and pagerank. 951-960
KM: feature selection
Xiang Zhang, Feng Pan, Wei Wang: REDUS: finding reducible subspaces in high dimensional data. 961-970
Elsa Loekito, James Bailey: Mining influential attributes that capture class and group contrast behaviour. 971-980
Ying Liu, Lucian Vlad Lita, Radu Stefan Niculescu, Kun Bai, Prasenjit Mitra, C. Lee Giles: Real-time data pre-processing technique for efficient feature extraction in large scale datasets. 981-990
Panel discussion 2
David A. Evans, Susan Feldman, Ed H. Chi, Natasa Milic-Frayling, Igor Perisic: The social (open) workspace. 1529
W. Bruce Croft: Unsolved problems in search: (and how we approach them). 1001
IR: advertising & filtering
Andrei Z. Broder, Massimiliano Ciaramita, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Donald Metzler, Vanessa Murdock, Vassilis Plachouras: To swing or not to swing: learning when (not) to advertise. 1003-1012
Andrei Z. Broder, Peter Ciccolo, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Lance Riedel: Search advertising using web relevance feedback. 1013-1022
Yuefeng Li, Xujuan Zhou, Peter Bruza, Yue Xu, Raymond Y. K. Lau: A two-stage text mining model for information filtering. 1023-1032
Canhui Wang, Min Zhang, Liyun Ru, Shaoping Ma: Automatic online news topic ranking using media focus and user attention based on aging theory. 1033-1042
IR: blog


Ben He, Craig Macdonald, Jiyin He, Iadh Ounis: An effective statistical approach to blog post opinion retrieval. 1063-1072
KM: clustering
Chuan Duan, Jane Cleland-Huang, Bamshad Mobasher: A consensus based approach to constrained clustering of software requirements. 1073-1082
Ron Bekkerman, Martin Scholz: Data weaving: scaling up the state-of-the-art in data clustering. 1083-1092
Ira Assent, Ralph Krieger, Emmanuel Müller, Thomas Seidl: EDSC: efficient density-based subspace clustering. 1093-1102
Faris Alqadah, Raj Bhatnagar: An effective algorithm for mining 3-clusters in vertically partitioned data. 1103-1112
IR: enterprise search
Maryam Karimzadehgan, ChengXiang Zhai, Geneva G. Belford: Multi-aspect expertise matching for review assignment. 1113-1122
Barbara Poblete, Carlos Castillo, Aristides Gionis: Dr. Searcher and Mr. Browser: a unified hyperlink-click graph. 1123-1132
Pavel Serdyukov, Henning Rode, Djoerd Hiemstra: Modeling multi-step relevance propagation for expert finding. 1133-1142
Keke Chen, Rongqing Lu, C. K. Wong, Gordon Sun, Larry P. Heck, Belle L. Tseng: Trada: tree based ranking function adaptation. 1143-1152
IR: structured documents
Mir Sadek Ali, Mariano P. Consens, Gabriella Kazai, Mounia Lalmas: Structural relevance: a common basis for the evaluation of structured document retrieval. 1153-1162
Christian Kohlschütter, Wolfgang Nejdl: A densitometric approach to web page segmentation. 1173-1182
KM: text mining
Anup Chalamalla, Sumit Negi, L. Venkata Subramaniam, Ganesh Ramakrishnan: Identification of class specific discourse patterns. 1193-1202
Huajing Li, Zaiqing Nie, Wang-Chien Lee, C. Lee Giles, Ji-Rong Wen: Scalable community discovery on textual data with relations. 1203-1212
George Forman, Evan Kirshenbaum: Extremely fast text feature extraction for classification and indexing. 1221-1230
DB: mobile and distributed data management
Ken C. K. Lee, Josh Schiffman, Baihua Zheng, Wang-Chien Lee: Valid scope computation for location-dependent spatial query in mobile broadcast environments. 1231-1240
Linh Thai Nguyen, Wai Gen Yee, Ophir Frieder: Adaptive distributed indexing for structured peer-to-peer networks. 1241-1250
Jon Olav Hauglid, Kjetil Nørvåg: PROQID: partial restarts of queries in distributed databases. 1251-1260
IR: QA
Andrew Hickl: Answering questions with authority. 1261-1270
David Dominguez-Sal, Mihai Surdeanu, Josep Aguilar-Saborit, Josep-Lluis Larriba-Pey: Cache-aware load balancing for question answering. 1271-1280
Wei Zhou, Clement T. Yu, Weiyi Meng: A system for finding biological entities that satisfy certain conditions from texts. 1281-1290
KM: information extraction
Andrew Arnold, William W. Cohen: Intra-document structural frequency features for semi-supervised domain adaptation. 1291-1300
Ying Liu, Prasenjit Mitra, C. Lee Giles: Identifying table boundaries in digital documents via sparse line detection. 1311-1320
Poster session 1 database
Pawel Jurczyk, Li Xiong: Privacy-preserving data publishing for horizontally partitioned databases. 1321-1322
Haofen Wang, Thanh Tran, Chang Liu: CE2: towards a large scale hybrid search engine with integrated ranking support. 1323-1324
Ken C. K. Lee, Wang-Chien Lee, Baihua Zheng: ROAD: an efficient framework for location dependentspatial queries on road networks. 1327-1328
Maxim Kormilitsin, Rada Chirkova, Yahya Fathi, Matthias F. Stallmann: View and index selection for query-performance improvement: quality-centered algorithms and heuristics. 1329-1330
Shaoyi Yin, Philippe Pucheral, Xiaofeng Meng: PBFilter: indexing flash-resident data through partitioned summaries. 1333-1334
Poster session 1/industry
Gang Luo, Jeffrey F. Naughton, Curt J. Ellmann, Michael Watzke: Transaction reordering with application to synchronized scans. 1335-1336
Jason J. Soo, Rebecca Cathey, Ophir Frieder, Michlean J. Amir, Gideon Frieder: Yizkor books: a voice for the silent past. 1337-1338
Poster session 1/information retrieval
Mihai Stroe, Radu Berinde, Cosmin Negruseri, Dan Popovici: An approximate string matching approach for handling incorrectly typed urls. 1339-1340
Qiang Wang, Rui Li, Lei Chen, Jie Lian, M. Tamer Özsu: Speed up semantic search in p2p networks. 1341-1342
Xuerui Wang, Andrei Z. Broder, Marcus Fontoura, Vanja Josifovski: A note on search based forecasting of ad volume in contextual advertising. 1343-1344
Young-Min Kim, Jean-François Pessiot, Massih-Reza Amini, Patrick Gallinari: An extension of PLSA for document clustering. 1345-1346
Katsuya Masuda, Jun'ichi Tsujii: Nested region algebra extended with variables for tag-annotated text search. 1349-1350
Antti Ukkonen, Carlos Castillo, Debora Donato, Aristides Gionis: Searching the wikipedia with contextual information. 1351-1352
Binyamin Rosenfeld, Ronen Feldman, Lyle H. Ungar: Using sequence classification for filtering web pages. 1355-1356
Elif Aktolga, Marc-Allen Cartright, James Allan: Cross-document cross-lingual coreference retrieval. 1359-1360
Karane Vieira, Luciano Barbosa, Juliana Freire, Altigran Soares da Silva: Siphon++: a hidden-webcrawler for keyword-based interfaces. 1361-1362
Tonya Custis, Khalid Al-Kofahi: Investigating external corpus and clickthrough statistics for query expansion in the legal domain. 1363-1364
Monica Rogati, Yiming Yang, Jaime G. Carbonell: Corpus microsurgery: criteria optimization for medical cross-language ir. 1365-1366
Qingzhao Tan, Prasenjit Mitra, C. Lee Giles: Metadata extraction and indexing for map search in web documents. 1367-1368
Poster session 1/knowledge management
Qingliang Miao, Qiudan Li, Ruwei Dai: An integration strategy for mining product features and opinions. 1369-1370



Yu-Ru Lin, Hari Sundaram, Aisling Kelliher: Summarization of social activity over time: people, actions and concepts in dynamic networks. 1379-1380
Lizhen Qu, Christof Müller, Iryna Gurevych: Using tag semantic network for keyphrase extraction in blogs. 1381-1382
Nuno Cardoso, Mário J. Silva, Diana Santos: Handling implicit geographic evidence for geographic ir. 1383-1384
Richard Bache, Fabio Crestani: Estimating real-valued characteristics of criminals from their recorded crimes. 1385-1386
Jinfeng Zhuang, Steven C. H. Hoi, Aixin Sun, Rong Jin: Representative entry selection for profiling blogs. 1387-1388

Chi-Yao Tseng, Pin-Chieh Sung, Ming-Syan Chen: A novel email abstraction scheme for spam detection. 1393-1394
Pavan Kumar Vatturi, Werner Geyer, Casey Dugan, Michael J. Muller, Beth Brownholtz: Tag-based filtering for personalized bookmark recommendations. 1395-1396
Chunyu Yang, Yong Cao, Zaiqing Nie, Jie Zhou, Ji-Rong Wen: Closing the loop in webpage understanding. 1397-1398
Poster session 2 database
Bruce S. E. Chung, Wang-Chien Lee, Arbee L. P. Chen: Efficient processing of probabilistic spatio-temporal range queries over moving objects. 1399-1400
Nicolas Anciaux, Luc Bouganim, Harold van Heerde, Philippe Pucheral, Peter M. G. Apers: Data degradation: making private data less sensitive over time. 1401-1402
Dongfeng Chen, Rada Chirkova, Maxim Kormilitsin, Fereidoon Sadri, Timo J. Salo: Query optimization in xml-based information integration. 1405-1406
Marcel Karnstedt, Kai-Uwe Sattler, Michael Haß, Manfred Hauswirth, Brahmananda Sapkota, Roman Schmidt: Estimating the number of answers with guarantees for structured queries in p2p databases. 1407-1408
Pranav Vaidya, Jaehwan John Lee: Characterization of TPC-H queries for a column-oriented database on a dual-core amd athlon processor. 1411-1412
Poster session 2/information retrieval
Petteri Nurmi, Eemil Lagerspetz, Wray L. Buntine, Patrik Floréen, Joonas Kukkonen, Peter Peltonen: Natural language retrieval of grocery products. 1413-1414
Wei Zhang, Lifeng Jia, Clement T. Yu, Weiyi Meng: Improve the effectiveness of the opinion retrieval and opinion polarity classification. 1415-1416
Qiang Huang, Dawei Song: A latent variable model for query expansion using the hidden markov model. 1417-1418
Claudia Hauff, Djoerd Hiemstra, Franciska de Jong: A survey of pre-retrieval query performance predictors. 1419-1420
Jianhan Zhu, Dawei Song, Stefan M. Rüger, Xiangji Huang: Modeling document features for expert finding. 1421-1422
Raghavendra Udupa, K. Saravanan, A. Kumaran, Jagadeesh Jagarlamudi: Mining named entity transliteration equivalents from comparable corpora. 1423-1424
Vishwa Vinay, Natasa Milic-Frayling, Ingemar J. Cox: Estimating retrieval effectiveness using rank distributions. 1425-1426
Shouchun Chen, Fei Wang, Yangqiu Song, Changshui Zhang: Semi-supervised ranking aggregation. 1427-1428
Fabian Abel, Nicola Henze, Daniel Krause: Ranking in folksonomy systems: can context help? 1429-1430

Dingding Wang, Shenghuo Zhu, Tao Li, Yun Chi, Yihong Gong: Integrating clustering and multi-document summarization to improve document understanding. 1435-1436
Wisam Dakka, Luis Gravano, Panagiotis G. Ipeirotis: Answering general time sensitive queries. 1437-1438
Jiang-Ming Yang, Rui Cai, Feng Jing, Shuo Wang, Lei Zhang, Wei-Ying Ma: Search-based query suggestion. 1439-1440
Poster session 2/knowledge management

Fred S. Annexstein, Svetlana Strunjas: Collaborative partitioning with maximum user satisfaction. 1445-1446
Syed Khairuzzaman Tanbeer, Chowdhury Farhan Ahmed, Byeong-Soo Jeong, Young-Koo Lee: Efficient frequent pattern mining over data streams. 1447-1448
Xiaoming Fan, Jianyong Wang, Bing Lv, Lizhu Zhou, Wei Hu: GHOST: an effective graph-based framework for name distinction. 1449-1450
Gavin Shaw, Yue Xu, Shlomo Geva: Deriving non-redundant approximate association rules from hierarchical datasets. 1451-1452
Shuming Shi, Xiaokang Liu, Ji-Rong Wen: Pattern-based semantic class discovery with multi-membership support. 1453-1454
Faris Alqadah, Raj Bhatnagar: Detecting significant distinguishing sets among bi-clusters. 1455-1456
Fei Wang, Shouchun Chen, Changshui Zhang, Tao Li: Semi-supervised metric learning by maximizing constraint margin. 1457-1458
Rohan Choudhary, Sameep Mehta, Amitabha Bagchi: On quantifying changes in temporally evolving dataset. 1459-1460
Fidelia Ibekwe-Sanjuan, Eric SanJuan, Michael S. E. Vogeley: Decomposition of terminology graphs for domain knowledge acquisition. 1463-1464
Francisco M. Carrero, José Carlos Cortizo, José María Gómez, Manuel de Buenaga Rodríguez: In the development of a spanish metamap. 1465-1466
Leila Kaghazian, Dennis McLeod, Reza Sadri: Scalable complex pattern search in sequential data. 1467-1468
Chaitanya Chemudugunta, Padhraic Smyth, Mark Steyvers: Combining concept hierarchies and statistical topic models. 1469-1470
Poster session 3: database
Weifa Liang, Baichen Chen, Jeffrey Xu Yu: Energy-efficient skyline query processing and maintenance in sensor networks. 1471-1472
K. Selçuk Candan, Huiping Cao, Yan Qi, Maria Luisa Sapino: Table summarization with the help of domain lattices. 1473-1474
Xiao Pan, Jianliang Xu, Xiaofeng Meng: Protecting location privacy against location-dependent attack in mobile services. 1475-1476
Philon Nguyen, Nematollaah Shiri: Polyhedral transformation for indexed rank order correlation queries. 1477-1478
Matthias Böhm, Uwe Wloka, Dirk Habich, Wolfgang Lehner: Workload-based optimization of integration processes. 1479-1480
Poster session 3/information retrieval
Vitor R. Carvalho, Jonathan L. Elsas, William W. Cohen, Jaime G. Carbonell: Suppressing outliers in pairwise preference ranking. 1487-1488
Adele E. Howe, Ryan D. Forbes: Re-considering neighborhood-based collaborative filtering parameters in the context of new data. 1481-1482
Jianguo Lu: Efficient estimation of the size of text deep web data source. 1485-1486
Álvaro Zubizarreta, Pablo de la Fuente, José Manuel Cantera, Mario Arias, Jorge Cabrero Alonso, Guido García Bernardo, César Llamas, Jesús Vegas: A georeferencing multistage method for locating geographic context in web search. 1485-1486
Hiroyuki Toda, Norihito Yasuda, Yumiko Matsuura, Ryoji Kataoka: Incorporating place name extents into geo-ir ranking. 1489-1490
Paavo Arvola, Jaana Kekäläinen, Marko Junkkari: The effect of contextualization at different granularity levels in content-oriented xml retrieval. 1491-1492
Mandar Rahurkar, Silviu Cucerzan: Using the current browsing context to improve search relevance. 1493-1494
Mariam Daoud, Lynda Tamine-Lechani, Mohand Boughanem: Using a graph-based ontological user profile for personalizing search. 1495-1496
Yang Sun, Huajing Li, Isaac G. Councill, Wang-Chien Lee, C. Lee Giles: Measuring user preference changes in digital libraries. 1497-1498
Rifat Ozcan, Ismail Sengör Altingövde, Özgür Ulusoy: Utilization of navigational queries for result presentation and caching in search engines. 1499-1500
Alexander Yates, James Joseph, Ana-Maria Popescu, Alexander D. Cohn, Nick Sillick: SHOPSMART: product recommendations through technical specifications and user reviews. 1501-1502
Gabriella Kazai, Natasa Milic-Frayling: Trust, authority and popularity in social information retrieval. 1503-1504
Sreangsu Acharyya, Joydeep Ghosh: A spam resistant family of concavo-convex ranks for link analysis. 1505-1506
Poster session 3/knowledge management
Shenghua Bao, Bohai Yang, Ben Fei, Shengliang Xu, Zhong Su, Yong Yu: Boosting social annotations using propagation. 1507-1508
Yuefeng Li, Sheng-Tang Wu, Xiaohui Tao: Effective pattern taxonomy mining in text documents. 1509-1510
Kyung Soon Lee: Incorporating topical support documents into a small training set in text categorization. 1511-1512
Tanveer A. Faruquie, Sumit Negi, Anup Chalamalla, L. Venkata Subramaniam: Exploiting context to detect sensitive information in call center conversations. 1513-1514
Munmun De Choudhury, Hari Sundaram, Ajita John, Dorée D. Seligmann: Multi-scale characterization of social network dynamics in the blogosphere. 1515-1516
Zenglin Xu, Rong Jin, Kaizhu Huang, Michael R. Lyu, Irwin King: Semi-supervised text categorization by active search. 1517-1518
Jae Woo Lee, Won Suk Lee: A coarse-grain grid-based subspace clustering method for online multi-dimensional data streams. 1521-1522
Yanhua Chen, Lijun Wang, Ming Dong: A matrix-based approach for semi-supervised document co-clustering. 1523-1524
Jiahui Liu, Larry Birnbaum, Bryan Pardo: Categorizing blogger's interests based on short snippets of blog posts. 1525-1526



