Marta R. Costa-jussà
Marta Ruiz Costa-jussà
Person information
- affiliation: Universitat Politècnica de Catalunya, Barcelona, Spain
2020 – today
- 2024
- [c133]Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood:
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector. ACL (Findings) 2024: 5725-5734 - [c132]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Pushing the Limits of Zero-shot End-to-End Speech Translation. ACL (Findings) 2024: 14245-14267 - [c131]Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà:
SpeechAlign: A Framework for Speech Translation Alignment Evaluation. LREC/COLING 2024: 15137-15146 - [c130]Javier García Gilabert, Carlos Escolano, Marta R. Costa-jussà:
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation. EAMT (1) 2024: 37-58 - [c129]Marta R. Costa-jussà, David Dale, Maha Elbayad, Bokai Yu:
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation. EAMT (1) 2024: 360-372 - [c128]Javier Ferrando, Marta R. Costa-jussà:
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task. EMNLP (Findings) 2024: 10115-10125 - [c127]Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà:
Unveiling the Role of Pretraining in Direct Speech Translation. EMNLP 2024: 11259-11265 - [c126]David Dale, Marta R. Costa-jussà:
BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation. EMNLP (Findings) 2024: 16075-16085 - [i64]Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood:
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector. CoRR abs/2401.05060 (2024) - [i63]Christophe Ropers, David Dale, Prangthip Hansanti, Gabriel Mejia Gonzalez, Ivan Evtimov, Corinne Wong, Christophe Touret, Kristina Pereyra, Seohyun Sonia Kim, Cristian Canton-Ferrer, Pierre Andrews, Marta R. Costa-jussà:
Towards Red Teaming in Multimodal and Multilingual Translation. CoRR abs/2401.16247 (2024) - [i62]Tu Anh Nguyen, Benjamin Muller, Bokai Yu, Marta R. Costa-jussà, Maha Elbayad, Sravya Popuri, Paul-Ambroise Duquenne, Robin Algayres, Ruslan Mavlyutov, Itai Gat, Gabriel Synnaeve, Juan Pino, Benoît Sagot, Emmanuel Dupoux:
SpiRit-LM: Interleaved Spoken and Written Language Model. CoRR abs/2402.05755 (2024) - [i61]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Pushing the Limits of Zero-shot End-to-End Speech Translation. CoRR abs/2402.10422 (2024) - [i60]Javier Ferrando, Gabriele Sarti, Arianna Bisazza, Marta R. Costa-jussà:
A Primer on the Inner Workings of Transformer-based Language Models. CoRR abs/2405.00208 (2024) - [i59]Xiaoqing Ellen Tan, Prangthip Hansanti, Carleigh Wood, Bokai Yu, Christophe Ropers, Marta R. Costa-jussà:
Towards Massive Multilingual Holistic Bias. CoRR abs/2407.00486 (2024) - [i58]Eduardo Sánchez, Belen Alastruey, Christophe Ropers, Pontus Stenetorp, Mikel Artetxe, Marta R. Costa-jussà:
Linguini: A benchmark for language-agnostic linguistic reasoning. CoRR abs/2409.12126 (2024) - [i57]Javier Ferrando, Marta R. Costa-jussà:
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task. CoRR abs/2410.06496 (2024) - 2023
- [j57]Meysam Shamsi, Anthony Larcher, Loïc Barrault, Sylvain Meignier, Yevhenii Prokopalo, Marie Tahon, Ambuj Mehrish, Simon Petitrenaud, Olivier Galibert, Samuel Gaist, André Anjos, Sébastien Marcel, Marta R. Costa-jussà:
Towards lifelong human assisted speaker diarization. Comput. Speech Lang. 77: 101437 (2023) - [c125]David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà:
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better. ACL (1) 2023: 36-50 - [c124]Javier Ferrando, Gerard I. Gállego, Ioannis Tsiamas, Marta R. Costa-jussà:
Explaining How Transformers Use Context to Build Predictions. ACL (1) 2023: 5486-5513 - [c123]Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà:
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric. ACL (1) 2023: 9064-9079 - [c122]David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loïc Barrault, Marta R. Costa-jussà:
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation. EMNLP 2023: 638-653 - [c121]Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà:
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations. EMNLP (Findings) 2023: 8569-8588 - [c120]Marta R. Costa-jussà, Eric Michael Smith, Christophe Ropers, Daniel Licht, Jean Maillard, Javier Ferrando, Carlos Escolano:
Toxicity in Multilingual Machine Translation at Scale. EMNLP (Findings) 2023: 9570-9586 - [c119]Marta R. Costa-jussà, Pierre Andrews, Eric Michael Smith, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Daniel Licht, Carleigh Wood:
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale. EMNLP 2023: 14141-14156 - [c118]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Efficient Speech Translation with Dynamic Latent Perceivers. ICASSP 2023: 1-5 - [c117]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23. IWSLT@ACL 2023: 397-410 - [c116]Benjamin Muller, Belen Alastruey, Prangthip Hansanti, Elahe Kalbassi, Christophe Ropers, Eric Michael Smith, Adina Williams, Luke Zettlemoyer, Pierre Andrews, Marta R. Costa-jussà:
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages. WMT 2023: 536-550 - [i56]David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loïc Barrault, Marta R. Costa-jussà:
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation. CoRR abs/2305.11746 (2023) - [i55]Javier García Gilabert, Carlos Escolano, Marta R. Costa-jussà:
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation. CoRR abs/2305.11761 (2023) - [i54]Javier Ferrando, Gerard I. Gállego, Ioannis Tsiamas, Marta R. Costa-jussà:
Explaining How Transformers Use Context to Build Predictions. CoRR abs/2305.12535 (2023) - [i53]Marta R. Costa-jussà, Pierre Andrews, Eric Michael Smith, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Daniel Licht, Carleigh Wood:
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale. CoRR abs/2305.13198 (2023) - [i52]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23. CoRR abs/2306.01327 (2023) - [i51]Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023) - [i50]Benjamin Muller, Belen Alastruey, Prangthip Hansanti, Elahe Kalbassi, Christophe Ropers, Eric Michael Smith, Adina Williams, Luke Zettlemoyer, Pierre Andrews, Marta R. Costa-jussà:
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages. CoRR abs/2308.16871 (2023) - [i49]Eduardo Sánchez, Pierre Andrews, Pontus Stenetorp, Mikel Artetxe, Marta R. Costa-jussà:
Gender-specific Machine Translation with Large Language Models. CoRR abs/2309.03175 (2023) - [i48]Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà:
SpeechAlign: a Framework for Speech Translation Alignment Evaluation. CoRR abs/2309.11585 (2023) - [i47]Marta R. Costa-jussà, David Dale, Maha Elbayad, Bokai Yu:
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation. CoRR abs/2311.06532 (2023) - [i46]Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin N. Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Y. Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson:
Seamless: Multilingual Expressive and Streaming Speech Translation. CoRR abs/2312.05187 (2023) - 2022
- [j56]Carlos Escolano, Marta Ruiz Costa-jussà, José A. R. Fonollosa:
Multilingual Machine Translation: Deep Analysis of Language-Specific Encoder-Decoders. J. Artif. Intell. Res. 73: 1535-1552 (2022) - [c115]Marta R. Costa-jussà, Carlos Escolano, Christine Basta, Javier Ferrando, Roser Batlle, Ksenia Kharitonova:
Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters. AAAI 2022: 11855-11863 - [c114]Belen Alastruey, Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà:
On the Locality of Attention in Direct Speech Translation. ACL (student) 2022: 402-412 - [c113]Shailashree K. Sheshadri, Deepa Gupta, Marta Ruiz Costa-jussà:
Neural Machine Translation for Kashmiri to English and Hindi using Pre-trained Embeddings. OCIT 2022: 238-243 - [c112]Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà:
Measuring the Mixing of Contextual Information in the Transformer. EMNLP 2022: 8698-8714 - [c111]Javier Ferrando, Gerard I. Gállego, Belen Alastruey, Carlos Escolano, Marta R. Costa-jussà:
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer. EMNLP 2022: 8756-8769 - [c110]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation. INTERSPEECH 2022: 106-110 - [c109]Ioannis Tsiamas, Gerard I. Gállego, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà:
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022. IWSLT@ACL 2022: 265-276 - [c108]Marta R. Costa-jussà, Christine Basta, Gerard I. Gállego:
Evaluating Gender Bias in Speech Translation. LREC 2022: 2141-2147 - [c107]Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta Ruiz Costa-jussà:
Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation. NAACL-HLT (Student Research Workshop) 2022: 277-284 - [c106]Marta R. Costa-jussà, Christine Basta, Oriol Domingo, André Rubungo:
OccGen: Selection of Real-world Multilingual Parallel Data Balanced in Gender within Occupations. NeurIPS 2022 - [c105]David Ifeoluwa Adelani, Md Mahfuz Ibn Alam, Antonios Anastasopoulos, Akshita Bhagia, Marta R. Costa-jussà, Jesse Dodge, Fahim Faisal, Christian Federmann, Natalia Fedorova, Francisco Guzmán, Sergey Koshelev, Jean Maillard, Vukosi Marivate, Jonathan Mbuya, Alexandre Mourachko, Safiyyah Saleem, Holger Schwenk, Guillaume Wenzek:
Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages. WMT 2022: 773-800 - [e13]Helena Moniz, Lieve Macken, Andrew Rufener, Loïc Barrault, Marta R. Costa-jussà, Christophe Declercq, Maarit Koponen, Ellie Kemp, Spyridon Pilos, Mikel L. Forcada, Carolina Scarton, Joachim Van den Bogaert, Joke Daems, Arda Tezcan, Bram Vanroy, Margot Fonteyne:
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, EAMT 2022, Ghent, Belgium, June 1-3, 2022. European Association for Machine Translation 2022, ISBN 9789464597622 [contents] - [e12]Elizabeth Salesky, Marcello Federico, Marta Ruiz Costa-jussà:
Proceedings of the 19th International Conference on Spoken Language Translation, IWSLT@ACL 2022, Dublin, Ireland (in-person and online), May 26-27, 2022. Association for Computational Linguistics 2022, ISBN 978-1-955917-41-4 [contents] - [e11]Philipp Koehn, Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri:
Proceedings of the Seventh Conference on Machine Translation, WMT 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 7-8, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-29-6 [contents] - [i45]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà:
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation. CoRR abs/2202.04774 (2022) - [i44]Oriol Domingo, Marta R. Costa-jussà, Carlos Escolano:
A multi-task semi-supervised framework for Text2Graph & Graph2Text. CoRR abs/2202.06041 (2022) - [i43]Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà:
Measuring the Mixing of Contextual Information in the Transformer. CoRR abs/2203.04212 (2022) - [i42]Belen Alastruey, Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà:
On the Locality of Attention in Direct Speech Translation. CoRR abs/2204.09028 (2022) - [i41]Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta R. Costa-jussà:
Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation. CoRR abs/2205.07100 (2022) - [i40]Javier Ferrando, Gerard I. Gállego, Belen Alastruey, Carlos Escolano, Marta R. Costa-jussà:
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer. CoRR abs/2205.11631 (2022) - [i39]Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Y. Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loïc Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang:
No Language Left Behind: Scaling Human-Centered Machine Translation. CoRR abs/2207.04672 (2022) - [i38]Marta R. Costa-jussà, Eric Michael Smith, Christophe Ropers, Daniel Licht, Javier Ferrando, Carlos Escolano:
Toxicity in Multilingual Machine Translation at Scale. CoRR abs/2210.03070 (2022) - [i37]Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta Ruiz Costa-jussà:
Efficient Speech Translation with Dynamic Latent Perceivers. CoRR abs/2210.16264 (2022) - [i36]Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà:
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric. CoRR abs/2212.08486 (2022) - [i35]David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà:
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better. CoRR abs/2212.08597 (2022) - [i34]Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà:
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations. CoRR abs/2212.09699 (2022) - 2021
- [j55]Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa:
From bilingual to multilingual neural-based machine translation by incremental training. J. Assoc. Inf. Sci. Technol. 72(2): 190-203 (2021) - [j54]Jordi Armengol-Estapé, Marta R. Costa-jussà:
Semantic and syntactic information for neural machine translation. Mach. Transl. 35(1): 3-17 (2021) - [j53]Anna Jobin, Kingson Man, Antonio Damasio, Georgios Kaissis, Rickmer Braren, Julia Stoyanovich, Jay J. Van Bavel, Tessa V. West, Brent D. Mittelstadt, Jason Eshraghian, Marta R. Costa-jussà, Asaf Tzachor, Aimun A. B. Jamjoom, Mariarosaria Taddeo, Edoardo Sinibaldi, Yipeng Hu, Miguel A. Luengo-Oroz:
AI reflections in 2020. Nat. Mach. Intell. 3(1): 2-8 (2021) - [j52]Marta R. Costa-jussà:
Towards universal translation. Nat. Mach. Intell. 3(5): 376-377 (2021) - [j51]Christine Basta, Marta R. Costa-jussà, Noe Casas:
Extensive study on the underlying gender bias in contextualized word embeddings. Neural Comput. Appl. 33(8): 3371-3384 (2021) - [j50]Noe Casas, Marta R. Costa-jussà, José A. R. Fonollosa, Juan A. Alonso, Ramón Fanlo:
Linguistic knowledge-based vocabularies for Neural Machine Translation. Nat. Lang. Eng. 27(4): 485-506 (2021) - [c104]Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa, Carlos Segura:
Enabling Zero-Shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders. ASRU 2021: 694-701 - [c103]Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa, Mikel Artetxe:
Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders. EACL 2021: 944-948 - [c102]Javier Ferrando, Marta R. Costa-jussà:
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions. EMNLP (Findings) 2021: 434-443 - [c101]Carlos Escolano, Graciela Ojeda, Christine Basta, Marta R. Costa-jussà:
Multi-Task Learning for Improving Gender Accuracy in Neural Machine Translation. ICON 2021: 12-17 - [c100]Gerard I. Gállego, Ioannis Tsiamas, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà:
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021. IWSLT 2021: 110-119 - [c99]Christine Basta, Marta R. Costa-jussà:
Impact of COVID-19 in Natural Language Processing Publications: a Disaggregated Study in Gender, Contribution and Experience. LT-EDI@EACL 2021: 1-6 - [c98]Jordi Armengol-Estapé, Marta R. Costa-jussà, Carlos Escolano:
Enriching the Transformer with Linguistic Factors for Low-Resource Machine Translation. RANLP 2021: 73-78 - [c97]Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88 - [c96]Carlos Escolano, Ioannis Tsiamas, Christine Basta, Javier Ferrando, Marta R. Costa-jussà, José A. R. Fonollosa:
The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach. WMT@EMNLP 2021: 117-122 - [c95]Bardia Rafieian, Marta Ruiz Costa-jussà:
High Frequent In-domain Words Segmentation and Forward Translation for the WMT21 Biomedical Task. WMT@EMNLP 2021: 863-867 - [e10]Marcello Federico, Alex Waibel, Marta R. Costa-jussà, Jan Niehues, Sebastian Stüker, Elizabeth Salesky:
Proceedings of the 18th International Conference on Spoken Language Translation, IWSLT 2021, Bangkok, Thailand (online), August 5-6, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-74-9 [contents] - [e9]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz:
Proceedings of the Sixth Conference on Machine Translation, WMT@EMNLP 2021, Online Event, November 10-11, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-94-7 [contents] - [i33]Noe Casas, José A. R. Fonollosa, Marta R. Costa-jussà:
Sparsely Factored Neural Machine Translation. CoRR abs/2102.08934 (2021) - [i32]Christian Hardmeier, Marta R. Costa-jussà, Kellie Webster, Will Radford, Su Lin Blodgett:
How to Write a Bias Statement: Recommendations for Submissions to the Workshop on Gender Bias in NLP. CoRR abs/2104.03026 (2021) - [i31]Christine Basta, Marta R. Costa-jussà:
Impact of Gender Debiased Word Embeddings in Language Modeling. CoRR abs/2105.00908 (2021) - [i30]Gerard I. Gállego, Ioannis Tsiamas, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà:
UPC's Speech Translation System for IWSLT 2021. CoRR abs/2105.04512 (2021) - [i29]Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà:
Efficient Transformer for Direct Speech Translation. CoRR abs/2107.03069 (2021) - [i28]Javier Ferrando, Marta R. Costa-jussà:
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions. CoRR abs/2109.05853 (2021) - 2020
- [j49]Marta R. Costa-jussà, Cristina España-Bonet, Pascale Fung, Noah A. Smith:
Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction. Comput. Linguistics 46(2): 249-255 (2020) - [j48]Marta R. Costa-jussà:
AMALEU: Una Representación Universal del Lenguaje basada en Aprendizaje Automático. Proces. del Leng. Natural 65: 105-108 (2020) - [c94]Noe Casas, Marta R. Costa-jussà, José A. R. Fonollosa:
Combining Subword Representations into Word-level Representations in the Transformer Architecture. ACL (student) 2020: 66-71 - [c93]Magdalena Biesialska, Bardia Rafieian, Marta R. Costa-jussà:
Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources. ACL (student) 2020: 271-278 - [c92]Noe Casas, José A. R. Fonollosa, Marta R. Costa-jussà:
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation. SPNLP@EMNLP 2020: 1-10 - [c91]Magdalena Biesialska, Katarzyna Biesialska, Marta R. Costa-jussà:
Continual Lifelong Learning in Natural Language Processing: A Survey. COLING 2020: 6523-6541 - [c90]Magdalena Biesialska, Marta R. Costa-jussà:
Refinement of Unsupervised Cross-Lingual Word Embeddings. ECAI 2020: 1978-1981 - [c89]Marta R. Costa-jussà, Esther González, Asunción Moreno, Eudald Cumalat:
Abusive language in Spanish children and young teenager's conversations: data preparation and short text classification with contextual word embeddings. LREC 2020: 1533-1537 - [c88]Marta R. Costa-jussà, Pau Li Lin, Cristina España-Bonet:
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies. LREC 2020: 4081-4088 - [c87]Casimiro Pio Carrino, Marta R. Costa-jussà, José A. R. Fonollosa:
Automatic Spanish Translation of SQuAD Dataset for Multi-lingual Question Answering. LREC 2020: 5515-5523 - [c86]Loïc Barrault, Magdalena Biesialska, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubesic, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2020 Conference on Machine Translation (WMT20). WMT@EMNLP 2020: 1-55 - [c85]Loïc Barrault, Magdalena Biesialska, Marta R. Costa-jussà, Fethi Bougares, Olivier Galibert:
Findings of the First Shared Task on Lifelong Learning Machine Translation. WMT@EMNLP 2020: 56-64 - [c84]Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa:
The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT. WMT@EMNLP 2020: 134-138 - [c83]Luis A Menéndez-Salazar, Grigori Sidorov, Marta R. Costa-jussà:
The IPN-CIC team system submission for the WMT 2020 similar language task. WMT@EMNLP 2020: 409-413 - [c82]Pere Vergés Boncompte, Marta R. Costa-jussà:
Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages. WMT@EMNLP 2020: 447-450 - [e8]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri:
Proceedings of the Fifth Conference on Machine Translation, WMT@EMNLP 2020, Online, November 19-20, 2020. Association for Computational Linguistics 2020, ISBN 978-1-948087-81-0 [contents] - [d2]Marta R. Costa-jussà:
Catalan United Nations v1.0 test set. Zenodo, 2020 - [d1]Marta R. Costa-jussà:
WinoST: Evaluating Gender Bias in Speech Translation. Zenodo, 2020 - [i27]