27th SIGIR 2004: Sheffield, UK
- Mark Sanderson, Kalervo Järvelin, James Allan, Peter Bruza:
SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, UK, July 25-29, 2004. ACM 2004, ISBN 1-58113-881-4
- Gordon Bell, Jim Gemmell, Roger Lueder:
Challenges in using lifetime personal information stores. 1
Opening session
- Chirag Shah, W. Bruce Croft:
Evaluating high accuracy retrieval techniques. 2-9
- Einat Amitay, David Carmel, Ronny Lempel, Aya Soffer:
Scaling IR-system evaluation using term relevance sets. 10-17
- Fernando Diaz, Rosie Jones:
Using temporal profiles of queries for precision prediction. 18-24
Test collections
- Chris Buckley, Ellen M. Voorhees:
Retrieval evaluation with incomplete information. 25-32
- Mark Sanderson, Hideo Joho:
Forming test collections with no system pooling. 33-40
- Douglas W. Oard, Dagobert Soergel, David S. Doermann, Xiaoli Huang, G. Craig Murray, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, Samuel Gustman, James Mayfield, Liliya Kharevych, Stephanie M. Strassel:
Building an information retrieval test collection for spontaneous conversational speech. 41-48
Formal models-1
- Hui Fang, Tao Tao, ChengXiang Zhai:
A formal study of information retrieval heuristics. 49-56
- Ji-Rong Wen, Ni Lao, Wei-Ying Ma:
Probabilistic model for contextual retrieval. 57-63
- Ramesh Nallapati:
Discriminative models for information retrieval. 64-71
XML retrieval
- Gabriella Kazai, Mounia Lalmas, Arjen P. de Vries:
The overlap problem in content-oriented XML retrieval evaluation. 72-79
- Jaap Kamps, Maarten de Rijke, Börkur Sigurbjörnsson:
Length normalization in XML retrieval. 80-87
- Shaorong Liu, Qinghua Zou, Wesley W. Chu:
Configurable indexing and ranking for XML information retrieval. 88-95
Dimensionality reduction
- Xiaofei He, Deng Cai, Haifeng Liu, Wei-Ying Ma:
Locality preserving indexing for document representation. 96-103
- Effrosini Kokiopoulou, Yousef Saad:
Polynomial filtering in latent semantic indexing for information retrieval. 104-111
- Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu:
On scaling latent semantic indexing for large peer-to-peer systems. 112-121
Formal models-2
- John F. Canny:
GaP: a factor model for discrete data. 122-129
- Raymond Y. K. Lau, Peter Bruza, Dawei Song:
Belief revision for adaptive information retrieval. 130-137
- Weiguo Fan, Ming Luo, Li Wang, Wensi Xi, Edward A. Fox:
Tuning before feedback: combining ranking discovery and blind feedback for robust retrieval. 138-145
Cross-language information retrieval
- Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, Lee-Feng Chien:
Translating unknown queries with web corpora for cross-language information retrieval. 146-153
- Monica Rogati, Yiming Yang:
Resource selection for domain-specific cross-lingual IR. 154-161
- Ying Zhang, Phil Vines:
Using the web for automated translation extraction in cross-language information retrieval. 162-169
Language models
- Jianfeng Gao, Jian-Yun Nie, Guangyuan Wu, Guihong Cao:
Dependence language model for information retrieval. 170-177
- Djoerd Hiemstra, Stephen E. Robertson, Hugo Zaragoza:
Parsimonious language models for information retrieval. 178-185
- Xiaoyong Liu, W. Bruce Croft:
Cluster-based retrieval using language models. 186-193
- Oren Kurland, Lillian Lee:
Corpus structure, language models, and ad hoc information retrieval. 194-201
Clustering
- Wei Xu, Yihong Gong:
Document clustering by concept factorization. 202-209
- Hua-Jun Zeng, Qi-Cai He, Zheng Chen, Wei-Ying Ma, Jinwen Ma:
Learning to cluster web search results. 210-217
- Tao Li, Sheng Ma, Mitsunori Ogihara:
Document clustering via adaptive subspace iteration. 218-225
- Stefan Siersdorfer, Sergej Sizov:
Restrictive clustering and metaclustering for self-organizing document collections. 226-233
Text classification
- Dunja Mladenic, Janez Brank, Marko Grobelnik, Natasa Milic-Frayling:
Feature selection using linear classifier weights: interaction with classification models. 234-241
- Dou Shen, Zheng Chen, Qiang Yang, Hua-Jun Zeng, Benyu Zhang, Yuchang Lu, Wei-Ying Ma:
Web-page classification through summarization. 242-249
- Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovitch:
Parameterized generation of labeled datasets for text categorization based on a hierarchical directory. 250-257
Disambiguation
- Sang-Bum Kim, Hee-Cheol Seo, Hae-Chang Rim:
Information retrieval using word senses: root sense tagging approach. 258-265
- Shuang Liu, Fang Liu, Clement T. Yu, Weiyi Meng:
An effective approach to document retrieval via utilizing WordNet and recognizing phrases. 266-272
- Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer:
Web-a-where: geotagging web content. 273-280
Recognising and using named entities
- Li Zhang, Yue Pan, Tong Zhang:
Focused named entity recognition using machine learning. 281-288
- Wai Lam, Ruizhang Huang, Pik-Shan Cheung:
Learning phonetic similarity for matching named entity translations and mining new translations. 289-296
- Giridhar Kumaran, James Allan:
Text classification and named entities for new event detection. 297-304
Efficiency and scaling
- Fabrizio Silvestri, Salvatore Orlando, Raffaele Perego:
Assigning identifiers to documents to enhance the clustering property of fulltext indexes. 305-312
- Christos Tryfonopoulos, Manolis Koubarakis, Yannis Drougas:
Filtering algorithms for information retrieval models with named attributes and proximity operators. 313-320
- Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David A. Grossman, Ophir Frieder:
Hourly analysis of a very large topically categorized web query log. 321-328
Content-based filtering & collaborative filtering
- Matthew R. McLaughlin, Jonathan L. Herlocker:
A collaborative filtering algorithm and evaluation metric that accurately model the user experience. 329-336
- Rong Jin, Joyce Y. Chai, Luo Si:
An automatic weighting scheme for collaborative filtering. 337-344
- Yi Zhang:
Using bayesian priors to combine classifiers for adaptive filtering. 345-352
- Kai Yu, Volker Tresp, Shipeng Yu:
A nonparametric hierarchical bayesian framework for information filtering. 353-360
Image retrieval, users and usability
- Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu:
Automatic image annotation by using concept-sensitive salient objects for image content representation. 361-368
- Toni M. Rath, R. Manmatha, Victor Lavrenko:
A search engine for historical manuscript images. 369-376
- Diane Kelly, Nicholas J. Belkin:
Display time as implicit feedback: understanding task effects. 377-384
- Mingfang Wu, Gheorghe Muresan, Alistair McLean, Muh-Chyun (Morris) Tang, Ross Wilkinson, Yuelin Li, Hyuk-Jin Lee, Nicholas J. Belkin:
Human versus machine in the topic distillation task. 385-392
- Peter Willett:
Chemoinformatics: an application domain for information retrieval techniques. 393
Machine learning for IR
- Wensi Xi, Jesper Lind, Eric Brill:
Learning effective ranking functions for newsgroup search. 394-401
- Leah S. Larkey, Fangfang Feng, Margaret E. Connell, Victor Lavrenko:
Language-specific models in multilingual topic tracking. 402-409
- Dell Zhang, Wee Sun Lee:
Web taxonomy integration through co-bootstrapping. 410-417
Natural language processing
- Jinxi Xu, Ralph M. Weischedel, Ana Licuanan:
Evaluation of an extraction-based approach to answering definitional questions. 418-424
- Hai Leong Chieu, Yoong Keok Lee:
Query based event extraction along a timeline. 425-432
- Korinna Grabski, Tobias Scheffer:
Sentence completion. 433-439
Web structure
- Deng Cai, Xiaofei He, Ji-Rong Wen, Wei-Ying Ma:
Block-level link analysis. 440-447
- Vassilis Plachouras, Iadh Ounis:
Usefulness of hyperlink structure for query-biased topic distillation. 448-455
- Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma:
Block-based web search. 456-463
Posters
- William P. Doran, Nicola Stokes, Eamonn Newman, John Dunnion, Joe Carthy:
A hybrid statistical/linguistic model for generating news story gists. 464-465
- Mark Sanderson, Robert C. Pasley:
Image based gisting in CLIR. 466-467
- Edel Greevy, Alan F. Smeaton:
Classifying racist texts using a support vector machine. 468-469
- Azreen Azman, Iadh Ounis:
Discovery of aggregate usage profiles based on clustering information needs. 470-471
- Jie Lu, Jamie Callan:
Merging retrieval results in hierarchical peer-to-peer networks. 472-473
- Tetsuya Sakai, Yoshimi Saito, Yumi Ichimura, Tomoharu Kokubu, Makoto Koyama:
The effect of back-formulating questions in question answering evaluation. 474-475
- Jesse Montgomery, Luo Si, Jamie Callan, David A. Evans:
Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop "bf_numdocs" experiment suite. 476-477
- Laura A. Granka, Thorsten Joachims, Geri Gay:
Eye-tracking analysis of user behavior in WWW search. 478-479
- Raman Chandrasekar, Harr Chen, Simon Corston-Oliver, Eric Brill:
Subwebs for specialized search. 480-481
- Zhenmei Gu, Ming Luo:
Comparison of using passages and documents for blind relevance feedback in information retrieval. 482-483
- Paul D. Clough, Mark Sanderson:
Measuring pseudo relevance feedback & CLIR. 484-485
- Tao Tao, ChengXiang Zhai:
A two-stage mixture model for pseudo feedback. 486-487
- Eric Crestan, Claude de Loupy:
Natural language processing for browse help. 488-489
- James Mayfield, Paul McNamee:
Triangulation without translation. 490-491
- Smitha Sriram, Xuehua Shen, ChengXiang Zhai:
A session-based search engine. 492-493
- Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David A. Grossman, Ophir Frieder:
Evaluation of filtering current news search results. 494-495
- Eduard Hoenkamp, Dawei Song:
The document as an ergodic markov chain. 496-497
- Raymond J. D'Amore:
Expertise community detection. 498-499
- Dmitri Roussinov, Jose Antonio Robles-Flores:
Learning patterns to answer open domain questions on the web. 500-501
- Anton Leuski:
Email is a stage: discovering people roles from email archives. 502-503
- Gauri Shah, Tanveer Fathima Syeda-Mahmood:
Searching databases for semantically-related schemas. 504-505
- Chris Buckley:
Topic prediction based on comparative retrieval rankings. 506-507
- Elizabeth D. Liddy, Anne Diekema, Özgür Yilmazel:
Context-based question-answering evaluation. 508-509
- Yixing Sun, David J. Harper, Stuart N. K. Watt:
Design of an e-book user interface and visualizations to support reading for comprehension. 510-511
- David Hawking, Trystan Upstill, Nick Craswell:
Toward better weighting of anchors. 512-513
- Jiamin Ye, Alan F. Smeaton:
Aggregated feature retrieval for MPEG-7 via clustering. 514-515
- Andrés Corrada-Emmanuel, W. Bruce Croft:
Answer models for question answering passage retrieval. 516-517
- Harris Wu, Michael D. Gordon:
Collaborative filing in a document repository. 518-519
- Ryen W. White, Joemon M. Jose:
A study of topic similarity measures. 520-521
- Hui Yang, Tat-Seng Chua:
Effectiveness of web page classification on finding list answers. 522-523
- Ying Zhang, Phil Vines:
Detection and translation of OOV terms prior to query time. 524-525
- Yael Nemeth, Bracha Shapira, Meirav Taieb-Maimon:
Evaluation of the real and perceived value of automatic and interactive query expansion. 526-527
- Donna Harman, Chris Buckley:
The NRRC reliable information access (RIA) workshop. 528-529
- Ian Soboroff:
On evaluating web search with very few relevant documents. 530-531
- Qing Li, Byeong Man Kim, Donghai Guan, Duk whan Oh:
A music recommender based on audio features. 532-533
- Liping Ma, John Shepherd:
Information extraction using two-phase pattern discovery. 534-535
- Yue Lu, Li Zhang, Chew Lim Tan:
A search engine for imaged documents in PDF files. 536-537
- Yan Liu, Jaime G. Carbonell, Judith Klein-Seetharaman, Vanathi Gopalakrishnan:
Context sensitive vocabulary and its application in protein secondary structure prediction. 538-539
- Donald Metzler, Victor Lavrenko, W. Bruce Croft:
Formal multiple-bernoulli models for language modeling. 540-541
- Leif Azzopardi, Mark A. Girolami, Cornelis Joost van Rijsbergen:
User biased document language modelling. 542-543
- Kevyn Collins-Thompson, Jamie Callan:
Information retrieval for language tutoring: an overview of the REAP project. 544-545
- Yinghui Xu, Kyoji Umemura:
A unified model of literal mining and link analysis for ranking web resources. 546-547
- Xiaoyong Liu, W. Bruce Croft, Paul Oh, David M. Hart:
Automatic recognition of reading levels from user queries. 548-549
- Justin Basilico, Thomas Hofmann:
A joint framework for collaborative and content filtering. 550-551
- Hee-Soo Kim, Ikkyu Choi, Minkoo Kim:
Refining term weights of documents using term dependencies. 552-553
- Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke:
Multiple sources of evidence for XML retrieval. 554-555
- Yuen-Hsien Tseng, William John Teahan:
Verifying a Chinese collection for text categorization. 556-557
- Yih-Ling Hedley, Muhammad Younas, Anne E. James, Mark Sanderson:
Query-related data extraction of hidden web documents. 558-559
- Atsushi Fujii, Makoto Iwayama, Noriko Kando:
The patent retrieval task in the fourth NTCIR workshop. 560-561
- Ellen M. Voorhees:
Measuring ineffectiveness. 562-563
- Philip J. Cowans:
Information retrieval using hierarchical dirichlet processes. 564-565
- Abduelbaset Goweder, Massimo Poesio, Anne N. De Roeck:
Broken plural detection for arabic information retrieval. 566-567
- Rong Jin, Luo Si:
A study of methods for normalizing user ratings in collaborative filtering. 568-569
- Robert H. Warren, Ting Liu:
A review of relevance feedback experiments at the 2003 reliable information access (RIA) workshop. 570-571
- Bicheng Liu, David J. Harper, Stuart N. K. Watt:
Supporting federated information sharing communities. 572-573
- Kevyn Collins-Thompson, Jamie Callan, Egidio L. Terra, Charles L. A. Clarke:
The effect of document retrieval quality on factoid question answering performance. 574-575
- Trystan Upstill, Stephen E. Robertson:
Exploiting hyperlink recommendation evidence in navigational web search. 576-577
- D. S. Hunnisett, W. J. Teahan:
Context-based methods for text categorisation. 578-579
- Manu Aery, Sharma Chakravarthy:
eMailSift: mining-based approaches to email classification. 580-581
- Jack G. Conrad, Cindy P. Schriber:
Constructing a text corpus for inexact duplicate detection. 582-583
- Chris Buckley:
Why current IR engines fail. 584-585
- Manuel Zahariev:
Automatic sense disambiguation for acronyms. 586-587
- Gabriel Somlo, Adele E. Howe:
Filtering for personal web information agents. 588-589
- Michael G. Christel, Neema Moraveji, Chang Huang:
Evaluating content-based filters for image and video retrieval. 590-591