default search action
Juan Pino 0001
Person information
- affiliation: Meta AI
- affiliation (former): University of Cambridge, Department of Engineering, UK
- affiliation (foremr): Carnegie Mellon University, Language Technologies Institute, Pittsburgh, PA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2012
- [j1]Juan Pino, Aurelien Waite, William Byrne:
Simple and Efficient Model Filtering in Statistical Machine Translation. Prague Bull. Math. Linguistics 98: 5-24 (2012)
Conference and Workshop Papers
- 2024
- [c56]HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang:
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception. ACL (1) 2024: 12896-12911 - 2023
- [c55]Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polak, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe:
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. ACL (demo) 2023: 400-411 - [c54]Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation for a Real-world Unwritten Language. ACL (Findings) 2023: 4969-4983 - [c53]Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. ACL (1) 2023: 10771-10784 - [c52]Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. ACL (1) 2023: 12441-12455 - [c51]Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. ACL (1) 2023: 15655-15680 - [c50]Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk:
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations. ACL (1) 2023: 16251-16269 - [c49]Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-To-Speech Translation with Multiple TTS Targets. ICASSP 2023: 1-5 - [c48]Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab:
Pre-training for Speech Translation: CTC Meets Optimal Transport. ICML 2023: 18667-18685 - [c47]Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolution. INTERSPEECH 2023: 3287-3291 - [c46]Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang:
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation. INTERSPEECH 2023: 4064-4068 - [c45]Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos:
Findings of the IWSLT 2023 Evaluation Campaign. IWSLT@ACL 2023: 1-61 - 2022
- [c44]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. ACL (1) 2022: 1488-1499 - [c43]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu:
Direct Speech-to-Speech Translation With Discrete Units. ACL (1) 2022: 3327-3339 - [c42]Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Miguel Pino:
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation. INTERSPEECH 2022: 1771-1775 - [c41]Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. INTERSPEECH 2022: 2278-2282 - [c40]Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. INTERSPEECH 2022: 5195-5199 - [c39]Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondrej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vera Kloudová, Surafel Melaku Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Miguel Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe:
Findings of the IWSLT 2022 Evaluation Campaign. IWSLT@ACL 2022: 98-157 - [c38]Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. NAACL-HLT 2022: 860-872 - 2021
- [c37]Hang Le, Juan Miguel Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier:
Lightweight Adapter Tuning for Multilingual Speech Translation. ACL/IJCNLP (2) 2021: 817-824 - [c36]Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Miguel Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models. ACL/IJCNLP (1) 2021: 827-838 - [c35]Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Miguel Pino, Emmanuel Dupoux:
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation. ACL/IJCNLP (1) 2021: 993-1003 - [c34]Yun Tang, Juan Miguel Pino, Xian Li, Changhan Wang, Dmitriy Genzel:
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task. ACL/IJCNLP (1) 2021: 4252-4261 - [c33]Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Pino:
fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit. EMNLP (Demos) 2021: 143-152 - [c32]Yun Tang, Juan Miguel Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel:
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks. ICASSP 2021: 6209-6213 - [c31]Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Miguel Pino:
Streaming Simultaneous Speech Translation with Augmented Memory Transformer. ICASSP 2021: 7523-7527 - [c30]Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau:
Large-Scale Self- and Semi-Supervised Learning for Speech Translation. Interspeech 2021: 2242-2246 - [c29]Changhan Wang, Anne Wu, Jiatao Gu, Juan Pino:
CoVoST 2 and Massively Multilingual Speech Translation. Interspeech 2021: 2247-2251 - [c28]Antonios Anastasopoulos, Ondrej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alex Waibel, Changhan Wang, Matthew Wiesner:
Findings of the IWSLT 2021 Evaluation Campaign. IWSLT 2021: 1-29 - [c27]Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Miguel Pino, Holger Schwenk, Naman Goyal:
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task. IWSLT 2021: 131-137 - [c26]Hongyu Gong, Yun Tang, Juan Miguel Pino, Xian Li:
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling. NeurIPS 2021: 2668-2681 - 2020
- [c25]Hang Le, Juan Miguel Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier:
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation. COLING 2020: 3520-3533 - [c24]Xutai Ma, Mohammad Javad Dousti, Changhan Wang, Jiatao Gu, Juan Miguel Pino:
SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation. EMNLP (Demos) 2020: 144-150 - [c23]Arya D. McCarthy, Liezl Puzon, Juan Miguel Pino:
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation. ICASSP 2020: 7924-7928 - [c22]Xutai Ma, Juan Miguel Pino, James Cross, Liezl Puzon, Jiatao Gu:
Monotonic Multihead Attention. ICLR 2020 - [c21]Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Miguel Pino:
Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq. AACL/IJCNLP (System Demonstrations) 2020: 33-39 - [c20]Xutai Ma, Juan Miguel Pino, Philipp Koehn:
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation. AACL/IJCNLP 2020: 582-587 - [c19]Juan Miguel Pino, Qiantong Xu, Xutai Ma, Mohammad Javad Dousti, Yun Tang:
Self-Training for End-to-End Speech Translation. INTERSPEECH 2020: 1476-1480 - [c18]Anne Wu, Changhan Wang, Juan Miguel Pino, Jiatao Gu:
Self-Supervised Representations Improve End-to-End Speech Translation. INTERSPEECH 2020: 1491-1495 - [c17]Changhan Wang, Juan Miguel Pino, Jiatao Gu:
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation. INTERSPEECH 2020: 4731-4735 - [c16]Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondrej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Miguel Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alexander Waibel, Changhan Wang:
FINDINGS OF THE IWSLT 2020 EVALUATION CAMPAIGN. IWSLT 2020: 1-34 - [c15]Changhan Wang, Juan Miguel Pino, Anne Wu, Jiatao Gu:
CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus. LREC 2020: 4197-4203 - [c14]Lucia Specia, Zhenhao Li, Juan Miguel Pino, Vishrav Chaudhary, Francisco Guzmán, Graham Neubig, Nadir Durrani, Yonatan Belinkov, Philipp Koehn, Hassan Sajjad, Paul Michel, Xian Li:
Findings of the WMT 2020 Shared Task on Machine Translation Robustness. WMT@EMNLP 2020: 76-91 - 2019
- [c13]Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. EMNLP/IJCNLP (1) 2019: 6097-6110 - [c12]Juan Pino, Liezl Puzon, Jiatao Gu, Xutai Ma, Arya D. McCarthy, Deepak Gopinath:
Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade. IWSLT 2019 - [c11]Paul Michel, Xian Li, Graham Neubig, Juan Miguel Pino:
On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models. NAACL-HLT (1) 2019: 3103-3114 - [c10]Philipp Koehn, Francisco Guzmán, Vishrav Chaudhary, Juan Miguel Pino:
Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions. WMT (3) 2019: 54-72 - [c9]Xian Li, Paul Michel, Antonios Anastasopoulos, Yonatan Belinkov, Nadir Durrani, Orhan Firat, Philipp Koehn, Graham Neubig, Juan Miguel Pino, Hassan Sajjad:
Findings of the First Shared Task on Machine Translation Robustness. WMT (2) 2019: 91-102 - 2013
- [c8]Juan Pino, Aurelien Waite, Tong Xiao, Adrià de Gispert, Federico Flego, William Byrne:
The University of Cambridge Russian-English System at WMT13. WMT@ACL 2013: 200-205 - 2010
- [c7]Adrià de Gispert, Juan Pino, William J. Byrne:
Hierarchical Phrase-Based Translation Grammars Extracted from Alignment Posterior Probabilities. EMNLP 2010: 545-554 - [c6]Juan Pino, Gonzalo Iglesias, Adrià de Gispert, Graeme W. Blackwood, Jamie Brunning, William Byrne:
The CUED HiFST System for the WMT10 Translation Shared Task. WMT@ACL 2010: 155-160 - 2009
- [c5]Juan Pino, Maxine Eskénazi:
An Application of Latent Semantic Analysis to Word Sense Discrimination for Words with Related and Unrelated Meanings. BEA@NAACL 2009: 43-46 - [c4]Juan Pino, Maxine Eskénazi:
Measuring Hint Level in Open Cloze Questions. FLAIRS 2009 - [c3]Juan Pino, Maxine Eskénazi:
Semi-automatic generation of cloze question distractors effect of students' L1. SLaTE 2009: 65-68 - [c2]Luís Marujo, José Lopes, Nuno J. Mamede, Isabel Trancoso, Juan Pino, Maxine Eskénazi, Jorge Baptista, Céu Viana:
Porting REAP to European Portuguese. SLaTE 2009: 69-72 - [c1]Luís Marujo, José Lopes, Nuno J. Mamede, Isabel Trancoso, Juan Pino, Maxine Eskénazi, Jorge Baptista, Céu Viana:
REAP.PT, a tutoring system for teaching Portuguese. SLaTE 2009
Editorship
- 2023
- [e3]Houda Bouamor, Juan Pino, Kalika Bali:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-060-8 [contents] - [e2]Houda Bouamor, Juan Pino, Kalika Bali:
Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-061-5 [contents] - 2022
- [e1]Tong Xiao, Juan Pino:
Machine Translation - 18th China Conference, CCMT 2022, Lhasa, China, August 6-10, 2022, Revised Selected Papers. Communications in Computer and Information Science 1671, Springer 2022, ISBN 978-981-19-7959-0 [contents]
Informal and Other Publications
- 2024
- [i48]Tu Anh Nguyen, Benjamin Muller, Bokai Yu, Marta R. Costa-jussà, Maha Elbayad, Sravya Popuri, Paul-Ambroise Duquenne, Robin Algayres, Ruslan Mavlyutov, Itai Gat, Gabriel Synnaeve, Juan Pino, Benoît Sagot, Emmanuel Dupoux:
SpiRit-LM: Interleaved Spoken and Written Language Model. CoRR abs/2402.05755 (2024) - [i47]HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang:
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception. CoRR abs/2403.14402 (2024) - [i46]Yejin Lee, Anna Y. Sun, Basil Hosmer, Bilge Acun, Can Balioglu, Changhan Wang, Charles David Hernandez, Christian Puhrsch, Daniel Haziza, Driss Guessous, Francisco Massa, Jacob Kahn, Jeffrey Wan, Jeremy Reizenstein, Jiaqi Zhai, Joe Isaacson, Joel Schlosser, Juan Pino, Kaushik Ram Sadagopan, Leonid Shamis, Linjian Ma, Min-Jae Hwang, Mingda Chen, Mostafa Elhoushi, Pedro Rodriguez, Ram Pasunuru, Scott Yih, Sravya Popuri, Xing Liu, Carole-Jean Wu:
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference. CoRR abs/2410.00215 (2024) - 2023
- [i45]Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab:
Pre-training for Speech Translation: CTC Meets Optimal Transport. CoRR abs/2301.11716 (2023) - [i44]Mohamed Anwar, Bowen Shi, Vedanuj Goswami, Wei-Ning Hsu, Juan Pino, Changhan Wang:
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation. CoRR abs/2303.00628 (2023) - [i43]Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-to-Speech Translation with Multiple TTS Targets. CoRR abs/2304.04618 (2023) - [i42]Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden D. Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. CoRR abs/2305.03101 (2023) - [i41]Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolutions. CoRR abs/2306.01084 (2023) - [i40]Hongyu Gong, Ning Dong, Sravya Popuri, Vedanuj Goswami, Ann Lee, Juan Pino:
Multilingual Speech-to-Speech Translation into Multiple Target Languages. CoRR abs/2307.08655 (2023) - [i39]Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023) - [i38]Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin N. Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Y. Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson:
Seamless: Multilingual Expressive and Streaming Speech Translation. CoRR abs/2312.05187 (2023) - 2022
- [i37]Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. CoRR abs/2204.02967 (2022) - [i36]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. CoRR abs/2204.05409 (2022) - [i35]Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. CoRR abs/2210.10191 (2022) - [i34]Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Miguel Pino, Benoît Sagot, Holger Schwenk:
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations. CoRR abs/2211.04508 (2022) - [i33]Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Miguel Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation For A Real-world Unwritten Language. CoRR abs/2211.06474 (2022) - [i32]Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. CoRR abs/2212.08055 (2022) - 2021
- [i31]Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Miguel Pino, Emmanuel Dupoux:
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation. CoRR abs/2101.00390 (2021) - [i30]Changhan Wang, Anne Wu, Juan Miguel Pino, Alexei Baevski, Michael Auli, Alexis Conneau:
Large-Scale Self- and Semi-Supervised Learning for Speech Translation. CoRR abs/2104.06678 (2021) - [i29]Hang Le, Juan Miguel Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier:
Lightweight Adapter Tuning for Multilingual Speech Translation. CoRR abs/2106.01463 (2021) - [i28]Hongyu Gong, Yun Tang, Juan Miguel Pino, Xian Li:
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling. CoRR abs/2106.10840 (2021) - [i27]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Miguel Pino, Wei-Ning Hsu:
Direct speech-to-speech translation with discrete units. CoRR abs/2107.05604 (2021) - [i26]Yun Tang, Juan Miguel Pino, Xian Li, Changhan Wang, Dmitriy Genzel:
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task. CoRR abs/2107.05782 (2021) - [i25]Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Miguel Pino, Holger Schwenk, Naman Goyal:
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task. CoRR abs/2107.06959 (2021) - [i24]Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Miguel Pino:
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit. CoRR abs/2109.06912 (2021) - [i23]Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Miguel Pino:
Incremental Speech Synthesis For Speech-To-Speech Translation. CoRR abs/2110.08214 (2021) - [i22]Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Kenneth Heafield, Phillip Koehn, Juan Miguel Pino:
Direct simultaneous speech to speech translation. CoRR abs/2110.08250 (2021) - [i21]Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. CoRR abs/2111.09296 (2021) - [i20]Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. CoRR abs/2112.08352 (2021) - 2020
- [i19]Changhan Wang, Juan Miguel Pino, Anne Wu, Jiatao Gu:
CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus. CoRR abs/2002.01320 (2020) - [i18]Arya D. McCarthy, Liezl Puzon, Juan Miguel Pino:
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation. CoRR abs/2002.12231 (2020) - [i17]Juan Miguel Pino, Qiantong Xu, Xutai Ma, Mohammad Javad Dousti, Yun Tang:
Self-Training for End-to-End Speech Translation. CoRR abs/2006.02490 (2020) - [i16]Changhan Wang, Juan Miguel Pino, Jiatao Gu:
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation. CoRR abs/2006.05474 (2020) - [i15]Anne Wu, Changhan Wang, Juan Miguel Pino, Jiatao Gu:
Self-Supervised Representations Improve End-to-End Speech Translation. CoRR abs/2006.12124 (2020) - [i14]Changhan Wang, Anne Wu, Juan Miguel Pino:
CoVoST 2: A Massively Multilingual Speech-to-Text Translation Corpus. CoRR abs/2007.10310 (2020) - [i13]Xutai Ma, Mohammad Javad Dousti, Changhan Wang, Jiatao Gu, Juan Miguel Pino:
SimulEval: An Evaluation Toolkit for Simultaneous Translation. CoRR abs/2007.16193 (2020) - [i12]Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Miguel Pino:
fairseq S2T: Fast Speech-to-Text Modeling with fairseq. CoRR abs/2010.05171 (2020) - [i11]Yun Tang, Juan Miguel Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel:
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks. CoRR abs/2010.11338 (2020) - [i10]Chau Tran, Changhan Wang, Yuqing Tang, Yun Tang, Juan Miguel Pino, Xian Li:
Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation. CoRR abs/2010.12829 (2020) - [i9]Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Miguel Pino:
Streaming Simultaneous Speech Translation with Augmented Memory Transformer. CoRR abs/2011.00033 (2020) - [i8]Hang Le, Juan Miguel Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier:
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation. CoRR abs/2011.00747 (2020) - [i7]Xutai Ma, Juan Miguel Pino, Philipp Koehn:
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation. CoRR abs/2011.02048 (2020) - 2019
- [i6]Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. CoRR abs/1902.01382 (2019) - [i5]Paul Michel, Xian Li, Graham Neubig, Juan Miguel Pino:
On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models. CoRR abs/1903.06620 (2019) - [i4]Xian Li, Paul Michel, Antonios Anastasopoulos, Yonatan Belinkov, Nadir Durrani, Orhan Firat, Philipp Koehn, Graham Neubig, Juan Miguel Pino, Hassan Sajjad:
Findings of the First Shared Task on Machine Translation Robustness. CoRR abs/1906.11943 (2019) - [i3]Juan Miguel Pino, Liezl Puzon, Jiatao Gu, Xutai Ma, Arya D. McCarthy, Deepak Gopinath:
Leveraging Out-of-Task Data for End-to-End Automatic Speech Translation. CoRR abs/1909.06515 (2019) - [i2]Xutai Ma, Juan Miguel Pino, James Cross, Liezl Puzon, Jiatao Gu:
Monotonic Multihead Attention. CoRR abs/1909.12406 (2019) - 2018
- [i1]Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Shanker Khudia, James Law, Parth Malani, Andrey Malevich, Nadathur Satish, Juan Miguel Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim M. Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao, Nadav Rotem, Sungjoo Yoo, Mikhail Smelyanskiy:
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications. CoRR abs/1811.09886 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-08 21:29 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint