


Vishrav Chaudhary
2020 – today
- 2023
- [c35] Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
  A Length-Extrapolatable Transformer. ACL (1) 2023: 14590-14604
- [c34] Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
  Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. ACL (1) 2023: 15354-15373
- [c33] Sunayana Sitaram, Monojit Choudhury, Barun Patra, Vishrav Chaudhary, Kabir Ahuja, Kalika Bali:
  Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world. ACL (tutorial) 2023: 21-26
- [c32] Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kiciman, Boi Faltings:
  Language Model Decoding as Likelihood-Utility Alignment. EACL (Findings) 2023: 1425-1440
- [c31] Aniket Vashishtha, S. Sai Prasad, Payal Bajaj, Vishrav Chaudhary, Kate Cook, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
  Performance and Risk Trade-offs for Multi-word Text Prediction at Scale. EACL (Findings) 2023: 2181-2197
- [c30] Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
  Magneto: A Foundation Transformer. ICML 2023: 36077-36092
- [i32] Jessica Huynh, Cathy Jiao, Prakhar Gupta, Shikib Mehri, Payal Bajaj, Vishrav Chaudhary, Maxine Eskénazi:
  Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation. CoRR abs/2301.12004 (2023)
- [i31] Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
  Language Is Not All You Need: Aligning Perception with Language Models. CoRR abs/2302.14045 (2023)
- [i30] Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
  DUBLIN - Document Understanding By Language-Image Network. CoRR abs/2305.14218 (2023)
- 2022
- [j4] Katharina Kann, Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, John E. Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Alberto Giménez Lugo, Ricardo Ramos, Iván Vladimir Meza Ruíz, Elisabeth Mager, Vishrav Chaudhary, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu:
  AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas. Frontiers Artif. Intell. 5 (2022)
- [j3] Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzmán, Angela Fan:
  The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation. Trans. Assoc. Comput. Linguistics 10: 522-538 (2022)
- [c29] Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán:
  OCR Improves Machine Translation for Low-Resource Languages. ACL (Findings) 2022: 1164-1174
- [c28] Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
  Alternative Input Signals Ease Transfer in Multilingual Machine Translation. ACL (1) 2022: 5291-5305
- [c27] Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John E. Ortega, Ricardo Ramos, Annette Rios, Iván Vladimir Meza Ruíz, Gustavo Giménez Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu, Katharina Kann:
  AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages. ACL (1) 2022: 6279-6299
- [c26] Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán:
  How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? AMTA 2022: 97-116
- [c25] Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
  Data Selection Curriculum for Neural Machine Translation. EMNLP (Findings) 2022: 1569-1582
- [c24] Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona T. Diab, Veselin Stoyanov, Xian Li:
  Few-shot Learning with Multilingual Generative Language Models. EMNLP 2022: 9019-9052
- [c23] Marina Fomicheva, Shuo Sun, Erick R. Fonseca, Chrysoula Zerva, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins:
  MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset. LREC 2022: 4963-4974
- [i29] Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán:
  OCR Improves Machine Translation for Low-Resource Languages. CoRR abs/2202.13274 (2022)
- [i28] Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
  Data Selection Curriculum for Neural Machine Translation. CoRR abs/2203.13867 (2022)
- [i27] Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán:
  How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? CoRR abs/2204.14268 (2022)
- [i26] Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
  Foundation Transformers. CoRR abs/2210.06423 (2022)
- [i25] Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kiciman, Boi Faltings, Robert West:
  Language Model Decoding as Likelihood-Utility Alignment. CoRR abs/2210.07228 (2022)
- [i24] Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
  Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. CoRR abs/2210.14867 (2022)
- [i23] Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel J. Orr, Lucia Zheng, Mert Yüksekgönül, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri S. Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda:
  Holistic Evaluation of Language Models. CoRR abs/2211.09110 (2022)
- [i22] Shuming Ma, Hongyu Wang, Shaohan Huang, Wenhui Wang, Zewen Chi, Li Dong, Alon Benhaim, Barun Patra, Vishrav Chaudhary, Xia Song, Furu Wei:
  TorchScale: Transformers at Scale. CoRR abs/2211.13184 (2022)
- [i21] Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
  A Length-Extrapolatable Transformer. CoRR abs/2212.10554 (2022)
- 2021
- [j2] Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Michael Auli, Armand Joulin:
  Beyond English-Centric Multilingual Machine Translation. J. Mach. Learn. Res. 22: 107:1-107:48 (2021)
- [c22] Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
  Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. ACL/IJCNLP (1) 2021: 802-812
- [c21] Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan:
  Multilingual Translation from Denoising Pre-Training. ACL/IJCNLP (Findings) 2021: 3450-3466
- [c20] Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia:
  Quality Estimation without Human-labeled Data. EACL 2021: 619-625
- [c19] Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán:
  WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia. EACL 2021: 1351-1361
- [c18] Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Lucia Specia, Francisco Guzmán:
  Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications. EMNLP (1) 2021: 5865-5875
- [c17] Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Veselin Stoyanov, Alexis Conneau:
  Self-training Improves Pre-training for Natural Language Understanding. NAACL-HLT 2021: 5408-5418
- [c16] Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
  Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88
- [c15] Guillaume Wenzek, Vishrav Chaudhary, Angela Fan, Sahir Gomez, Naman Goyal, Somya Jain, Douwe Kiela, Tristan Thrush, Francisco Guzmán:
  Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation. WMT@EMNLP 2021: 89-99
- [c14] Lucia Specia, Frédéric Blain, Marina Fomicheva, Chrysoula Zerva, Zhenhao Li, Vishrav Chaudhary, André F. T. Martins:
  Findings of the WMT 2021 Shared Task on Quality Estimation. WMT@EMNLP 2021: 684-725
- [i20] Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia:
  Quality Estimation without Human-labeled Data. CoRR abs/2102.04020 (2021)
- [i19] Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John E. Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir, Gustavo Alberto Giménez Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu, Katharina Kann:
  AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages. CoRR abs/2104.08726 (2021)
- [i18] Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
  Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. CoRR abs/2105.15071 (2021)
- [i17] Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzmán, Angela Fan:
  The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation. CoRR abs/2106.03193 (2021)
- [i16] Hongyu Gong, Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán:
  LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models. CoRR abs/2106.03379 (2021)
- [i15] Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia:
  Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications. CoRR abs/2109.08627 (2021)
- [i14] Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
  Alternative Input Signals Ease Transfer in Multilingual Machine Translation. CoRR abs/2110.07804 (2021)
- [i13] Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona T. Diab, Veselin Stoyanov, Xian Li:
  Few-shot Learning with Multilingual Language Models. CoRR abs/2112.10668 (2021)
- 2020
- [j1] Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia:
  Unsupervised Quality Estimation for Neural Machine Translation. Trans. Assoc. Comput. Linguistics 8: 539-555 (2020)
- [c13] Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov:
  Unsupervised Cross-lingual Representation Learning at Scale. ACL 2020: 8440-8451
- [c12] Denise Díaz, James Cross, Vishrav Chaudhary, Ahmed El-Kishky, Philipp Koehn:
  A Survey of Qualitative Error Analysis for Neural Machine Translation Systems. AMTA (2) 2020: 48-77
- [c11] Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn:
  CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs. EMNLP (1) 2020: 5960-5969
- [c10] Shuo Sun, Marina Fomicheva, Frédéric Blain, Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia:
  An Exploratory Study on Multilingual Quality Estimation. AACL/IJCNLP 2020: 366-377
- [c9] Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave:
  CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data. LREC 2020: 4003-4012
- [c8] Lucia Specia, Zhenhao Li, Juan Miguel Pino, Vishrav Chaudhary, Francisco Guzmán, Graham Neubig, Nadir Durrani, Yonatan Belinkov, Philipp Koehn, Hassan Sajjad, Paul Michel, Xian Li:
  Findings of the WMT 2020 Shared Task on Machine Translation Robustness. WMT@EMNLP 2020: 76-91
- [c7] Philipp Koehn, Vishrav Chaudhary, Ahmed El-Kishky, Naman Goyal, Peng-Jen Chen, Francisco Guzmán:
  Findings of the WMT 2020 Shared Task on Parallel Corpus Filtering and Alignment. WMT@EMNLP 2020: 726-742
- [c6] Lucia Specia, Frédéric Blain, Marina Fomicheva, Erick Rocha Fonseca, Vishrav Chaudhary, Francisco Guzmán, André F. T. Martins:
  Findings of the WMT 2020 Shared Task on Quality Estimation. WMT@EMNLP 2020: 743-764
- [c5] Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Vishrav Chaudhary, Mark Fishel, Francisco Guzmán, Lucia Specia:
  BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task. WMT@EMNLP 2020: 1010-1017
- [i12] Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia:
  Unsupervised Quality Estimation for Neural Machine Translation. CoRR abs/2005.10608 (2020)
- [i11] Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan:
  Multilingual Translation with Extensible Multilingual Pretraining and Finetuning. CoRR abs/2008.00401 (2020)
- [i10] Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau:
  Self-training Improves Pre-training for Natural Language Understanding. CoRR abs/2010.02194 (2020)
- [i9] Marina Fomicheva, Shuo Sun, Erick R. Fonseca, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins:
  MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset. CoRR abs/2010.04480 (2020)
- [i8] Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin:
  Beyond English-Centric Multilingual Machine Translation. CoRR abs/2010.11125 (2020)
2010 – 2019
- 2019
- [c4] Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato:
  Facebook AI's WAT19 Myanmar-English Translation Task Submission. WAT@EMNLP-IJCNLP 2019: 112-122
- [c3] Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
  The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. EMNLP/IJCNLP (1) 2019: 6097-6110
- [c2] Philipp Koehn, Francisco Guzmán, Vishrav Chaudhary, Juan Miguel Pino:
  Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions. WMT (3) 2019: 54-72
- [c1] Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn:
  Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings. WMT (3) 2019: 261-266
- [i7] Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
  Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. CoRR abs/1902.01382 (2019)
- [i6] Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn:
  Low-Resource Corpus Filtering using Multilingual Sentence Embeddings. CoRR abs/1906.08885 (2019)
- [i5] Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán:
  WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia. CoRR abs/1907.05791 (2019)
- [i4] Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato:
  Facebook AI's WAT19 Myanmar-English Translation Task Submission. CoRR abs/1910.06848 (2019)
- [i3] Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave:
  CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data. CoRR abs/1911.00359 (2019)
- [i2] Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov:
  Unsupervised Cross-lingual Representation Learning at Scale. CoRR abs/1911.02116 (2019)
- [i1] Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn:
  A Massive Collection of Cross-Lingual Web-Document Pairs. CoRR abs/1911.06154 (2019)
last updated on 2023-10-02 01:01 CEST by the dblp team
all metadata released as open data under CC0 1.0 license