default search action
Monojit Choudhury
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j15]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI. Lang. Resour. Evaluation 58(2): 427-458 (2024) - [c106]Navreet Kaur, Monojit Choudhury, Danish Pruthi:
Evaluating Large Language Models for Health-related Queries with Presuppositions. ACL (Findings) 2024: 14308-14331 - [c105]Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury:
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language We Prompt Them in. LREC/COLING 2024: 6330-6340 - [c104]Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury, Kalika Bali:
INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation. LREC/COLING 2024: 9097-9109 - [c103]Abhinav Rao, Atharva Naik, Sachin Vashistha, Somak Aditya, Monojit Choudhury:
Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks. LREC/COLING 2024: 16802-16830 - [c102]Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram:
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? EACL (Findings) 2024: 1051-1070 - [c101]Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury:
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test. EACL (1) 2024: 2882-2894 - [i50]Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury:
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test. CoRR abs/2402.02135 (2024) - [i49]Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Ashutosh Dwivedi, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury:
Towards Measuring and Modeling "Culture" in LLMs: A Survey. CoRR abs/2403.15412 (2024) - [i48]Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury:
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in. CoRR abs/2404.18460 (2024) - [i47]Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra:
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations. CoRR abs/2405.05378 (2024) - [i46]Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru:
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences. CoRR abs/2405.05572 (2024) - [i45]Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gaël de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam:
Benchmark Underestimates the Readiness of Multi-lingual Dialogue Agents. CoRR abs/2405.17840 (2024) - [i44]Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury:
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting. CoRR abs/2406.11661 (2024) - [i43]Abhinav Rao, Monojit Choudhury, Somak Aditya:
[WIP] Jailbreak Paradox: The Achilles' Heel of LLMs. CoRR abs/2406.12702 (2024) - 2023
- [c100]Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam:
X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents. ACL (Findings) 2023: 2773-2794 - [c99]Sunayana Sitaram, Monojit Choudhury, Barun Patra, Vishrav Chaudhary, Kabir Ahuja, Kalika Bali:
Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world. ACL (tutorial) 2023: 21-26 - [c98]Dipto Das, Parboti Roy, Carlos Toxtli, Kagonya Awori, Morgan Vigil-Hayes, Monojit Choudhury, Neha Kumar, Syed Ishtiaque Ahmed, Bryan C. Semaan:
Conceptualizing Indigeneity in Social Computing. CSCW Companion 2023: 501-505 - [c97]Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer. EACL 2023: 385-406 - [c96]Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury:
Fairness in Language Models Beyond English: Gaps and Challenges. EACL (Findings) 2023: 2061-2074 - [c95]Aniket Vashishtha, S. Sai Prasad, Payal Bajaj, Vishrav Chaudhary, Kate Cook, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Performance and Risk Trade-offs for Multi-word Text Prediction at Scale. EACL (Findings) 2023: 2181-2197 - [c94]Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji:
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance. EMNLP 2023: 671-686 - [c93]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN: Visual Document Understanding By Language-Image Network. EMNLP (Industry Track) 2023: 693-706 - [c92]Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury:
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs. EMNLP (Findings) 2023: 13370-13388 - [c91]Deepanway Ghosal, Somak Aditya, Monojit Choudhury:
Prover: Generating Intermediate Steps for NLI with Commonsense Knowledge Retrieval and Next-Step Prediction. IJCNLP (1) 2023: 872-884 - [e2]Melissa Densmore, Monojit Choudhury, Josiah Chavula:
Proceedings of the 6th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies, COMPASS 2023, Cape Town, South Africa, August 16-19, 2023. ACM 2023 [contents] - [i42]Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury:
Fairness in Language Models Beyond English: Gaps and Challenges. CoRR abs/2302.12578 (2023) - [i41]Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer. CoRR abs/2303.02357 (2023) - [i40]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN - Document Understanding By Language-Image Network. CoRR abs/2305.14218 (2023) - [i39]Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji:
LLM-powered Data Augmentation for Enhanced Crosslingual Performance. CoRR abs/2305.14288 (2023) - [i38]Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury:
Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks. CoRR abs/2305.14965 (2023) - [i37]Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam:
X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents. CoRR abs/2306.17674 (2023) - [i36]Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram:
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? CoRR abs/2309.07462 (2023) - [i35]Kumar Tanmay, Aditi Khandelwal, Utkarsh Agarwal, Monojit Choudhury:
Probing the Moral Development of Large Language Models through Defining Issues Test. CoRR abs/2309.13356 (2023) - [i34]Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury:
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs. CoRR abs/2310.07251 (2023) - [i33]Navreet Kaur, Monojit Choudhury, Danish Pruthi:
Evaluating Large Language Models for Health-related Queries with Presuppositions. CoRR abs/2312.08800 (2023) - 2022
- [c90]Anirudh Srinivasan, Gauri Kholkar, Rahul Kejriwal, Tanuja Ganu, Sandipan Dandapat, Sunayana Sitaram, Balakrishnan Santhanam, Somak Aditya, Kalika Bali, Monojit Choudhury:
LITMUS Predictor: An AI Assistant for Building Reliable, High-Performing and Fair Multilingual NLP Systems. AAAI 2022: 13227-13229 - [c89]Prashant Kodali, Anmol Goel, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru:
SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-Mixing. ACL (Findings) 2022: 472-480 - [c88]Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models. ACL (1) 2022: 5454-5467 - [c87]Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O'Neill, Kalika Bali, Monojit Choudhury:
Global Readiness of Language Technology for Healthcare: What Would It Take to Combat the Next Pandemic? COLING 2022: 4320-4335 - [c86]Harshita Diddee, Kalika Bali, Monojit Choudhury, Namrata Mukhija:
The Six Conundrums of Building and Deploying Language Technologies for Social Good. COMPASS 2022: 12-19 - [c85]Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury:
On the Calibration of Massively Multilingual Language Models. EMNLP 2022: 4310-4323 - [c84]Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Multilingual CheckList: Generation and Evaluation. AACL/IJCNLP (Findings) 2022: 282-295 - [c83]Deepanway Ghosal, Somak Aditya, Sandipan Dandapat, Monojit Choudhury:
Vector Space Interpolation for Query Expansion. AACL/IJCNLP (2) 2022: 405-410 - [c82]Ishani Mondal, Kalika Bali, Mohit Jain, Monojit Choudhury, Jacki O'Neill, Millicent Ochieng, Kagonya Awori, Keshet Ronen:
Language Patterns and Behaviour of the Peer Supporters in Multilingual Healthcare Conversational Forums. LREC 2022: 963-975 - [c81]Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer. NAACL-HLT (Findings) 2022: 1042-1055 - [c80]Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat:
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. NAACL-HLT 2022: 1369-1384 - [c79]Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali:
Too Brittle to Touch: Comparing the Stability of Quantization and Distillation towards Developing Low-Resource MT Models. WMT 2022: 870-885 - [i32]Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Abdullahi Salahudeen, Aremu Anuoluwapo, Alípio Jeorge, Pavel Brazdil:
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis. CoRR abs/2201.08277 (2022) - [i31]Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Multilingual CheckList: Generation and Evaluation. CoRR abs/2203.12865 (2022) - [i30]Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O. Neil, Kalika Bali, Monojit Choudhury:
Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic? CoRR abs/2204.02790 (2022) - [i29]Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models. CoRR abs/2205.06130 (2022) - [i28]Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat:
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. CoRR abs/2205.06350 (2022) - [i27]Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages. CoRR abs/2205.06356 (2022) - [i26]Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer. CoRR abs/2206.15010 (2022) - [i25]Deepanway Ghosal, Somak Aditya, Monojit Choudhury:
Generating Intermediate Steps for NLI with Next-Step Supervision. CoRR abs/2208.14641 (2022) - [i24]Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury:
On the Calibration of Massively Multilingual Language Models. CoRR abs/2210.12265 (2022) - [i23]Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali:
Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models. CoRR abs/2210.15184 (2022) - 2021
- [c78]Monojit Choudhury, Amit Deshpande:
How Linguistically Fair Are Multilingual Pre-Trained Language Models? AAAI 2021: 12710-12718 - [c77]Sebastin Santy, Anku Rani, Monojit Choudhury:
Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices. ACL/IJCNLP (Findings) 2021: 4704-4710 - [c76]Adithya Pratapa, Monojit Choudhury:
Comparing Grammatical Theories of Code-Mixing. W-NUT 2021: 158-167 - [c75]Sebastin Santy, Kalika Bali, Monojit Choudhury, Sandipan Dandapat, Tanuja Ganu, Anurag Shukla, Jahanvi Shah, Vivek Seshadri:
Language Translation as a Socio-Technical System: Case-Studies of Mixed-Initiative Interactions. COMPASS 2021: 156-172 - [c74]Mohd Sanad Zaki Rizvi, Anirudh Srinivasan, Tanuja Ganu, Monojit Choudhury, Sunayana Sitaram:
GCM: A Toolkit for Generating Synthetic Code-mixed Text. EACL (System Demonstrations) 2021: 205-211 - [c73]Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram:
On the Universality of Deep Contextual Language Models. ICON 2021: 106-119 - [c72]Saujas Vaduguru, Partho Sarthi, Monojit Choudhury, Dipti Sharma:
Stress Rules from Surface Forms: Experiments with Program Synthesis. ICON 2021: 619-628 - [c71]Amar Budhiraja, Ankur Sharma, Rahul Agrawal, Monojit Choudhury, Joyojeet Pal:
American Politicians Diverge Systematically, Indian Politicians do so Chaotically: Text Embeddings as a Window into Party Polarization. ICWSM 2021: 1054-1058 - [i22]Sebastin Santy, Anku Rani, Monojit Choudhury:
Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices. CoRR abs/2106.01105 (2021) - [i21]Saujas Vaduguru, Aalok Sathe, Monojit Choudhury, Dipti Misra Sharma:
Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems. CoRR abs/2106.06566 (2021) - [i20]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task. CoRR abs/2107.07229 (2021) - [i19]Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram:
On the Universality of Deep COntextual Language Models. CoRR abs/2109.07140 (2021) - [i18]Karthikeyan K, Aalok Sathe, Somak Aditya, Monojit Choudhury:
Analyzing the Effects of Reasoning Types on Cross-Lingual Transfer Performance. CoRR abs/2110.02386 (2021) - [i17]Namrata Mukhija, Monojit Choudhury, Kalika Bali:
Designing Language Technologies for Social Good: The Road not Taken. CoRR abs/2110.07444 (2021) - [i16]Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury:
Predicting the Performance of Multilingual NLP Models. CoRR abs/2110.08875 (2021) - [i15]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI. CoRR abs/2112.02333 (2021) - 2020
- [j14]Anshul Bawa, Pranav Khadpe, Pratik Joshi, Kalika Bali, Monojit Choudhury:
Do Multilingual Users Prefer Chat-bots that Code-mix? Let's Nudge and Find Out! Proc. ACM Hum. Comput. Interact. 4(CSCW): 041:1-041:23 (2020) - [j13]Anmol Panda, Ramaravind Kommiya Mothilal, Monojit Choudhury, Kalika Bali, Joyojeet Pal:
Topical Focus of Political Campaigns and its Impact: Findings from Politicians' Hashtag Use during the 2019 Indian Elections. Proc. ACM Hum. Comput. Interact. 4(CSCW): 053:1-053:14 (2020) - [j12]Somnath Banerjee, Monojit Choudhury, Kunal Chakma, Sudip Kumar Naskar, Amitava Das, Sivaji Bandyopadhyay, Paolo Rosso:
MSIR@FIRE: A Comprehensive Report from 2013 to 2016. SN Comput. Sci. 1(1): 55 (2020) - [c70]Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury:
GLUECoS: An Evaluation Benchmark for Code-Switched NLP. ACL 2020: 3575-3585 - [c69]Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali, Monojit Choudhury:
The State and Fate of Linguistic Diversity and Inclusion in the NLP World. ACL 2020: 6282-6293 - [c68]Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
A New Dataset for Natural Language Inference from Code-mixed Conversations. CodeSwitch@LREC 2020: 9-16 - [c67]Abhishek Srivastava, Kalika Bali, Monojit Choudhury:
Understanding Script-Mixing: A Case Study of Hindi-English Bilingual Twitter Users. CodeSwitch@LREC 2020: 36-44 - [c66]Anirudh Srinivasan, Sandipan Dandapat, Monojit Choudhury:
Code-mixed parse trees and how to find them. CodeSwitch@LREC 2020: 57-64 - [c65]Pratik Joshi, Somak Aditya, Aalok Sathe, Monojit Choudhury:
TaxiNLI: Taking a Ride up the NLU Hill. CoNLL 2020: 41-55 - [c64]Ashish Sharma, Monojit Choudhury, Tim Althoff, Amit Sharma:
Engagement Patterns of Peer-to-Peer Interactions on Mental Health Platforms. ICWSM 2020: 614-625 - [c63]Basil Abraham, Danish Goel, Divya Siddarth, Kalika Bali, Manu Chopra, Monojit Choudhury, Pratik Joshi, Preethi Jyothi, Sunayana Sitaram, Vivek Seshadri:
Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers. LREC 2020: 2819-2826 - [e1]Thamar Solorio, Monojit Choudhury, Kalika Bali, Sunayana Sitaram, Amitava Das, Mona T. Diab:
Proceedings of the The 4th Workshop on Computational Approaches to Code Switching, CodeSwitch@LREC 2020, Marseille, France, May, 2020. European Language Resources Association 2020, ISBN 979-10-95546-66-5 [contents] - [i14]Ashish Sharma, Monojit Choudhury, Tim Althoff, Amit Sharma:
Engagement Patterns of Peer-to-Peer Interactions on Mental Health Platforms. CoRR abs/2004.04999 (2020) - [i13]Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
A New Dataset for Natural Language Inference from Code-mixed Conversations. CoRR abs/2004.05051 (2020) - [i12]Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali, Monojit Choudhury:
The State and Fate of Linguistic Diversity and Inclusion in the NLP World. CoRR abs/2004.09095 (2020) - [i11]Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury:
GLUECoS : An Evaluation Benchmark for Code-Switched NLP. CoRR abs/2004.12376 (2020) - [i10]Pratik Joshi, Somak Aditya, Aalok Sathe, Monojit Choudhury:
TaxiNLI: Taking a Ride up the NLU Hill. CoRR abs/2009.14505 (2020)
2010 – 2019
- 2019
- [j11]Koustav Rudra, Ashish Sharma, Kalika Bali, Monojit Choudhury, Niloy Ganguly:
Identifying and Analyzing Different Aspects of English-Hindi Code-Switching in Twitter. ACM Trans. Asian Low Resour. Lang. Inf. Process. 18(3): 29:1-29:28 (2019) - [c62]Monojit Choudhury, Anirudh Srinivasan, Sandipan Dandapat:
Processing and Understanding Mixed Language Data. EMNLP/IJCNLP (2) 2019 - [c61]Sebastin Santy, Sandipan Dandapat, Monojit Choudhury, Kalika Bali:
INMT: Interactive Neural Machine Translation Prediction. EMNLP/IJCNLP (3) 2019: 103-108 - [c60]Jasabanta Patro, Sabyasachee Baruah, Vivek Gupta, Monojit Choudhury, Pawan Goyal, Animesh Mukherjee:
Characterizing the Spread of Exaggerated Health News Content over Social Media. HT 2019: 279-280 - [i9]Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali:
Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities. CoRR abs/1912.03457 (2019) - 2018
- [c59]Adithya Pratapa, Gayatri Bhat, Monojit Choudhury, Sunayana Sitaram, Sandipan Dandapat, Kalika Bali:
Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data. ACL (1) 2018: 1543-1553 - [c58]Sunit Sivasankaran, Brij Mohan Lal Srivastava, Sunayana Sitaram, Kalika Bali, Monojit Choudhury:
Phone Merging For Code-Switched Speech Recognition. CodeSwitch@ACL 2018: 11-19 - [c57]Anshul Bawa, Monojit Choudhury, Kalika Bali:
Accommodation of Conversational Code-Choice. CodeSwitch@ACL 2018: 82-91 - [c56]Adithya Pratapa, Monojit Choudhury, Sunayana Sitaram:
Word Embeddings for Code-Mixed Language Processing. EMNLP 2018: 3067-3072 - [c55]Anshul Bawa, Monojit Choudhury, Kalika Bali:
User Perception of Code-Switching Dialog Systems. ICON 2018: 166-174 - [c54]Silvana Hartmann, Monojit Choudhury, Kalika Bali:
An Integrated Representation of Linguistic and Social Functions of Code-Switching. LREC 2018 - [c53]Sunayana Sitaram, Varun Manjunath, Varun Bharadwaj, Monojit Choudhury, Kalika Bali, Michael Tjalve:
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach. LREC 2018 - [i8]Jasabanta Patro, Sabyasachee Baruah, Vivek Gupta, Monojit Choudhury, Pawan Goyal, Animesh Mukherjee:
Characterizing the spread of exaggerated news content over social media. CoRR abs/1811.07853 (2018) - 2017
- [c52]Shruti Rijhwani, Royal Sequiera, Monojit Choudhury, Kalika Bali, Chandra Shekhar Maddila:
Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique. ACL (1) 2017: 1971-1982 - [c51]Prabhat Agarwal, Ashish Sharma, Jeenu Grover, Mayank Sikka, Koustav Rudra, Monojit Choudhury:
I may talk in English but gaali toh Hindi mein hi denge : A study of English-Hindi code-switching and swearing pattern on social networks. COMSNETS 2017: 554-557 - [c50]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
All that is English may be Hindi: Enhancing language identification through automatic ranking of the likeliness of word borrowing in social media. EMNLP 2017: 2264-2274 - [c49]Moumita Basu, Saptarshi Ghosh, Kripabandhu Ghosh, Monojit Choudhury:
Overview of the FIRE 2017 track: Information Retrieval from Microblogs during Disasters (IRMiDis). FIRE (Working Notes) 2017: 28-33 - [c48]Monojit Choudhury, Kalika Bali, Sunayana Sitaram, Ashutosh Baheti:
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks. ICON 2017: 65-74 - [c47]Adithya Pratapa, Monojit Choudhury:
Quantitative Characterization of Code Switching Patterns in Complex Multi-Party Conversations: A Case Study on Hindi Movie Scripts. ICON 2017: 75-84 - [i7]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media. CoRR abs/1703.05122 (2017) - [i6]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media. CoRR abs/1707.08446 (2017) - 2016
- [j10]