10. DRR 2003: Santa Clara, California, USA
Tapas Kanungo, Elisa H. Barney Smith, Jianying Hu, Paul B. Kantor (Eds.): Document Recognition and Retrieval X, 22-23 January 2003, Santa Clara, California, USA, Proceedings. SPIE Proceedings, SPIE 2003, ISBN 0-8194-4810-9
Invited Paper
Saharon Rosset, Ji Zhu, Trevor Hastie: Boosting and support vector machines as optimal separators. 1-7
Optical Character Recognition I
Jian Fan: Text extraction via an edge-bounded averaging and a parametric character model. 8-19
Tuan D. Pham: Applications of geostatistics and Markov models for logo recognition. 20-27
Bin Zhang, Sargur N. Srihari: Binary vector dissimilarity measures for handwriting identification. 28-38
Optical Character Recognition II
Hailong Liu, Xiaoqing Ding, Chi Fang: AdaBoost-based handwritten/printed discrimination on a single character. 39-46
Jongwoo Kim, Daniel X. Le, George R. Thoma: Automated labeling of bibliographic data extracted from biomedical online journals. 47-56
Hrishikesh Aradhye, James A. Herson, Gregory K. Myers: Syntax-directed content analysis of videotext: application to a map detection and recognition system. 57-66
Modeling and Error Analysis
Gerd Maderlechner, Peter Suda: Extraction of valid data sets in registers using recognition of invalidation lines. 67-72
Kristen Maria Summers: Document image improvement for OCR as a classification problem. 73-83
Susan E. Hauser, Jonathan Schlaifer, Tehseen F. Sabir, Dina Demner-Fushman, Scott Straughan, George R. Thoma: Correcting OCR text by association with historical datasets. 84-93
Roger D. Clements, Elisa H. Barney Smith: Speed-up of optical scanner characterization subsystem. 94-102
Thomas A. Nartker, Kazem Taghva, Ron Young, Julie Borsack, Allen Condit: OCR correction based on document level knowledge. 103-110
Information Retrieval
Kazem Taghva, Jeffrey S. Coombs: Do thesauri enhance rule-based categorization for OCR text? 111-119
Ahmad Fuad Rezaur Rahman, Yuliya Tarnikova, Hassan Alam: Exploring a hybrid of support vector machines (SVMs) and a heuristic-based system in classifying web pages. 120-127
Rong Jin, ChengXiang Zhai, Alexander G. Hauptmann: Information retrieval for OCR documents: a content-based probabilistic correction model. 128-135
Yunnan Wu, Daniel P. Lopresti: Resource-optimized delivery of web images to small-screen devices. 144-155
Layout Analysis
Xiaofan Lin: Header and footer extraction by page association. 164-171
Matthew Hurst, Dave Barney: Unconstrained invoice processing in the health insurance domain. 172-178
Song Mao, Azriel Rosenfeld, Tapas Kanungo: Document structure analysis algorithms: a literature survey. 197-207
Hisao Ogata, Shigeru Watanabe, Atsuhiro Imaizumi, Tsukasa Yasue, Naohiro Furukawa, Hiroshi Sako, Hiromichi Fujisawa: Form-type identification for banking applications and its implementation issues. 208-218
Multilingual OCR

Berrin A. Yanikoglu, Alisher Kholmatov: Turkish handwritten text recognition: a case of agglutinative languages. 227-233
Poster Session
Luigi Cinque, Stefano Levialdi, Alessio Malizia, Fabio De Rosa: Fermat theorem and elliptic color histogram features. 234-240
Guangshun Shi, Wumo Pan, Jianming Jin: Automatic information retrieval of Chinese business card. 241-248
Dahai Luan, Changsong Liu, Xiaoqing Ding: General Chinese document capture system with an improved error-rejecting module. 257-265
Hsin-Chang Yang, Chung-Hong Lee: Semantics-based image retrieval by text mining on environmental texts. 266-277
Misako Suwa, Satoshi Naoi: Separation algorithm of superimposed pattern using directional decomposition of an image. 278-285
Jacques Facon, Eduardo Akira Yonekura: Morphological postal envelope segmentation by co-occurrence matrix. 294-304
Hui Chao: Graphics extraction in a PDF document. 317-325