49. ACL Short Papers 2011: Portland, Oregon, USA
The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA - Short Papers. The Association for Computer Linguistics 2011 ISBN 978-1-932432-88-6
Front Matter.
Brian Roark, Richard Sproat, Izhak Shafran: Lexicographic Semirings for Exact Automata Encoding of Sequence Models. 1-5
Dmitriy Dligach, Martha Palmer: Good Seed Makes a Good Crop: Accelerating Active Learning Using Language Modeling. 6-10
Benjamin Van Durme, Ashwin Lall: Efficient Online Locality Sensitive Hashing via Reservoir Counting. 18-23
Greg Durrett, Dan Klein: An Empirical Investigation of Discounting in Cross-Domain Language Models. 24-29
Tetsuo Kiso, Masashi Shimbo, Mamoru Komachi, Yuji Matsumoto: HITS-based Seed Selection and Stop List Construction for Bootstrapping. 30-36
Omar Zaidan, Chris Callison-Burch: The Arabic Online Commentary Dataset: an Annotated Dataset of Informal Arabic with High Dialectal Content. 37-41
Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, Noah A. Smith: Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments. 42-47
Anders Søgaard: Semi-supervised condensed nearest neighbor for part-of-speech tagging. 48-52
Masato Hagiwara, Satoshi Sekine: Latent Class Transliteration based on Source Language Origin. 53-57
Jeffrey Heinz, Chetan Rawal, Herbert G. Tanner: Tier-based Strictly Local Constraints for Phonology. 58-64
Steffen Hedegaard, Jakob Grue Simonsen: Lost in Translation: Authorship Attribution using Frame Semantics. 65-70
Fei Liu, Fuliang Weng, Bingqing Wang, Yang Liu: Insertion, Deletion, or Substitution? Normalizing Text Messages without Pre-categorization nor Supervision. 71-76
Manoj Harpalani, Michael Hart, Sandesh Signh, Rob Johnson, Yejin Choi: Language of Vandalism: Improving Wikipedia Vandalism Detection via Stylometric Analysis. 83-88
Fabrizio Morbini, Kenji Sagae: Joint Identification and Segmentation of Domain-Specific Dialogue Acts for Conversational Dialogue Systems. 95-100
Richard Johansson, Alessandro Moschitti: Extracting Opinion Expressions and Their Polarities - Exploration of Pipelines and Joint Models. 101-106
Cecilia Ovesdotter Alm: Subjective Natural Language Problems: Motivations, Applications, Characterizations, and Implications. 107-112
Rivka Levitan, Agustín Gravano, Julia Hirschberg: Entrainment in Speech Preceding Backchannels. 113-117
Anna Margolis, Mari Ostendorf: Question Detection in Spoken Conversations Using Textual Conversations. 118-124
André Bittar, Pascal Amsili, Pascal Denis, Laurence Danlos: French TimeBank: An ISO-TimeML Annotated Reference Corpus. 130-134
Bo Pang, Ravi Kumar: Search in the Lost Sense of ``Query'': Question Formulation in Web Search Queries and its Temporal Changes. 135-140
Mehdi Manshadi, James F. Allen, Mary D. Swift: A Corpus of Scope-disambiguated English Text. 141-146
Jagadeesh Jagarlamudi, Hal Daumé III, Raghavendra Udupa: From Bilingual Dictionaries to Interlingual Document Representations. 147-152
Rafael E. Banchs, Haizhou Li: AM-FM: A Semantic Framework for Translation Quality Assessment. 153-158
Maoxi Li, Chengqing Zong, Hwee Tou Ng: Automatic Evaluation of Chinese Translation Output: Word-Level or Character-Level? 159-164

Jonathan H. Clark, Chris Dyer, Alon Lavie, Noah A. Smith: Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability. 176-181

Daniël de Kok, Barbara Plank, Gertjan van Noord: Reversible Stochastic Attribute-Value Grammars. 194-199
Colin Cherry, Shane Bergsma: Joint Training of Dependency Parsing Filters through Latent Support Vector Machines. 200-205
Hiroyuki Shindo, Akinori Fujino, Masaaki Nagata: Insertion Operator for Bayesian Tree Substitution Grammars. 206-211
Matt Post: Judging Grammaticality with Tree Substitution Grammar Derivations. 217-222
Hajime Morita, Tetsuya Sakai, Manabu Okumura: Query Snowball: A Co-occurrence-based Approach to Multi-document Summarization for Question Answering. 223-229
Margaret Mitchell, Aaron Dunlop, Brian Roark: Semi-Supervised Modeling for Prenominal Modifier Ordering. 236-241
Amjad Abu-Jbara, Barbara Rosario, Kent Lyons: Towards Style Transformation from Written-Style to Audio-Style. 248-253
Kapil Thadani, Kathleen McKeown: Optimal and Syntactically-Informed Decoding for Monolingual Phrase-Based Alignment. 254-259
Shasha Liao, Ralph Grishman: Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction. 260-265
Tara McIntosh, Lars Yencken, James R. Curran, Timothy Baldwin: Relation Guided Bootstrapping of Semantic Lexicons. 266-270
Oleksandr Kolomiyets, Steven Bethard, Marie-Francine Moens: Model-Portability Experiments for Textual Temporal Analysis. 271-276
Truc-Vien T. Nguyen, Alessandro Moschitti: End-to-End Relation Extraction Using Distant Supervision from External Semantic Repositories. 277-282
Emilia Apostolova, Noriko Tomuro, Dina Demner-Fushman: Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes. 283-287
Ryan Gabbard, Marjorie Freedman, Ralph M. Weischedel: Coreference for Learning to Extract Relations: Yes Virginia, Coreference Matters. 288-293
Qin Gao, Stephan Vogel: Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules. 294-298
Lonneke van der Plas, Paola Merlo, James Henderson: Scaling up Automatic Cross-Lingual Semantic Role Annotation. 299-304
Christian Rohrdantz, Annette Hautli, Thomas Mayer, Miriam Butt, Daniel A. Keim, Frans Plank: Towards Tracking Semantic Change by Visual Analytics. 305-310
Youngjun Kim, Ellen Riloff, Stéphane M. Meystre: Improving Classification of Medical Assertions in Clinical Notes. 311-316
Els Lefever, Véronique Hoste, Martine De Cock: ParaSense or How to Use Parallel Corpora for Word Sense Disambiguation. 317-322
Dirk Hovy, Ashish Vaswani, Stephen Tratz, David Chiang, Eduard H. Hovy: Models and Training for Unsupervised Preposition Sense Disambiguation. 323-328
Peter LoBue, Alexander Yates: Types of Common-Sense Knowledge Needed for Recognizing Textual Entailment. 329-334
Derya Ozkan, Louis-Philippe Morency: Modeling Wisdom of Crowds Using Latent Mixture of Discriminative Experts. 335-340
Marjorie Freedman, Alex Baron, Vasin Punyakanok, Ralph M. Weischedel: Language Use: What can it tell us? 341-345
Alexander Volokh, Günter Neumann: Automatic Detection and Correction of Errors in Dependency Treebanks. 346-350
Sarah Alkuhlani, Nizar Habash: A Corpus for Modeling Morpho-Syntactic Agreement in Arabic: Gender, Number and Rationality. 357-362
Saif Mohammad: Even the Abstract have Color: Consensus in Word-Colour Associations. 368-373
Wen Wang, Sibel Yaman, Kristin Precoda, Colleen Richey, Geoffrey Raymond: Detection of Agreement and Disagreement in Broadcast Conversations. 374-378
Shujian Huang, Stephan Vogel, Jiajun Chen: Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment. 379-383
Vicent Alabau, Alberto Sanchís, Francisco Casacuberta: Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation. 389-394
Houda Bouamor, Aurélien Max, Anne Vilnat: Monolingual Alignment by Edit Rate Computation on Sentential Paraphrase Pairs. 395-400
Hal Daumé III, Jagadeesh Jagarlamudi: Domain Adaptation for Machine Translation by Mining Unseen Words. 407-412
Tagyoung Chung, Licheng Fang, Daniel Gildea: Issues Concerning Decoding with Synchronous Context-free Grammar. 413-417
Bing Xiang, Abraham Ittycheriah: Discriminative Feature-Tied Mixture Modeling for Statistical Machine Translation. 424-428
Kevin Duh, Akinori Fujino, Masaaki Nagata: Is Machine Translation Ripe for Cross-Lingual Sentiment Classification? 429-433
Takashi Onishi, Masao Utiyama, Eiichiro Sumita: Reordering Constraint Based on Document-Level Context. 434-438
Viet Ha-Thuc, Nicola Cancedda: Confidence-Weighted Learning of Factored Discriminative Language Models. 439-444
Sankaranarayanan Ananthakrishnan, Rohit Prasad, Prem Natarajan: On-line Language Model Biasing for Statistical Machine Translation. 445-449
Wang Ling, Tiago Luís, João Graça, Isabel Trancoso, Luísa Coheur: Reordering Modeling using Weighted Alignment Matrices. 450-454
Kristina Toutanova, Michel Galley: Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity. 461-466
Manaal Faruqui, Sebastian Padó: "I Thou Thee, Thou Traitor": Predicting Formal vs. Informal Address in English Literature. 467-472
Bo Li, Éric Gaussier, Akiko N. Aizawa: Clustering Comparable Corpora For Bilingual Lexicon Extraction. 473-478
Ivan Vulic, Wim De Smet, Marie-Francine Moens: Identifying Word Translations from Comparable Corpora Using Latent Topic Models. 479-484
Yabin Zheng, Lixing Xie, Zhiyuan Liu, Maosong Sun, Yang Zhang, Liyun Ru: Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method. 485-490
Samuel Brody, Paul B. Kantor: Automatic Assessment of Coverage Quality in Intelligence Reports. 491-495
Or Biran, Samuel Brody, Noemie Elhadad: Putting it Simply: a Context-Aware Approach to Lexical Simplification. 496-501
Nitin Madnani, Martin Chodorow, Joel R. Tetreault, Alla Rozovskaya: They Can Help: Using Crowdsourcing to Improve the Evaluation of Grammatical Error Detection Systems. 508-513

Thomas Mueller, Hinrich Schuetze: Improved Modeling of Out-Of-Vocabulary Words Using Morphological Classes. 524-528
Graham Neubig, Yosuke Nakata, Shinsuke Mori: Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. 529-533
Yun Huang, Min Zhang, Chew Lim Tan: Nonparametric Bayesian Machine Transliteration with Synchronous Adaptor Grammars. 534-539
Donald Metzler, Eduard H. Hovy, Chunliang Zhang: An Empirical Evaluation of Data-Driven Paraphrase Generation Techniques. 546-551
Fumiyo Fukumoto, Yoshimi Suzuki: Identification of Domain-Specific Senses in a Machine-Readable Dictionary. 552-557
Eyal Shnarch, Jacob Goldberger, Ido Dagan: A Probabilistic Modeling Framework for Lexical Entailment. 558-563
Paula Carvalho, Luís Sarmento, Jorge Teixeira, Mário J. Silva: Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates. 564-568
Oscar Täckström, Ryan T. McDonald: Semi-supervised latent variable models for sentence-level sentiment analysis. 569-574
Roberto González-Ibáñez, Smaranda Muresan, Nina Wacholder: Identifying Sarcasm in Twitter: A Closer Look. 581-586
Muhammad Abdul-Mageed, Mona T. Diab, Mohammed Korayem: Subjectivity and Sentiment Analysis of Modern Standard Arabic. 587-591
Ahmed Hassan, Amjad Abu-Jbara, Rahul Jha, Dragomir R. Radev: Identifying the Semantic Orientation of Foreign Words. 592-597
Xipeng Qiu, Xuanjing Huang, Zhao Liu, Jinlong Zhou: Hierarchical Text Classification with Latent Concepts. 598-602
Wei-Bin Liang, Chung-Hsien Wu, Chia-Ping Chen: Semantic Information and Derivation Rules for Robust Dialogue Act Detection in a Spoken Dialogue System. 603-608
Siwei Wang, Gina-Anne Levow: Contrasting Multi-Lingual Prosodic Cues to Predict Verbal Feedback for Rapport. 614-619
Joel Lang: A Scalable Probabilistic Classifier for Language Modeling. 625-630
Jun Suzuki, Hideki Isozaki, Masaaki Nagata: Learning Condensed Feature Representations from Large Unsupervised Data Sets for Supervised Learning. 636-641
William M. Darling, Fei Song: Probabilistic Document Modeling for Syntax Removal in Text Summarization. 642-647
Xiaojiang Huang, Xiaojun Wan, Jianguo Xiao: Comparative News Summarization Using Linear Programming. 648-653
Nina Dethlefs, Heriberto Cuayáhuitl: Hierarchical Reinforcement Learning and Hidden Markov Models for Task-Oriented Natural Language Generation. 654-659
Mariët Theune, Ruud Koolen, Emiel Krahmer, Sander Wubben: Does Size Matter - How Much Data is Required to Train a REG Algorithm? 660-664

Nathan Bodenstab, Kristy Hollingshead, Brian Roark: Unary Constraints for Efficient Context-Free Parsing. 676-681
Anders Søgaard: Data point selection for cross-language adaptation of dependency parsers. 682-686

Eneko Agirre, Kepa Bengoetxea, Koldo Gojenola, Joakim Nivre: Improving Dependency Parsing with Semantic Classes. 699-703
Yoav Goldberg, Michael Elhadad: Joint Hebrew Segmentation and Parsing using a PCFGLA Lattice Parser. 704-709
Gholamreza Haffari, Marzieh Razavi, Anoop Sarkar: An Ensemble Model that Combines Syntactic and Semantic Clustering for Discriminative Dependency Parsing. 710-714
Muhua Zhu, Jingbo Zhu, Minghan Hu: Better Automatic Treebank Conversion Using A Feature-Based Approach. 715-719
Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura, Gen-ichiro Kikui: Entity Set Expansion using Topic information. 726-731



