Yossi Adi
2020 – today
- 2024
- [j8]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1,000+ Languages. J. Mach. Learn. Res. 25: 97:1-97:52 (2024) - [c56]Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi:
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. AAAI 2024: 6639-6647 - [c55]Guy Lorberbom, Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
Layer Collaboration in the Forward-Forward Algorithm. AAAI 2024: 14141-14148 - [c54]Alon Ziv, Itai Gat, Gaël Le Lan, Tal Remez, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Yossi Adi:
Masked Audio Generation using a Single Non-Autoregressive Transformer. ICLR 2024 - [c53]Jean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez:
An Independence-promoting Loss for Music Generation with Language Models. ICML 2024 - [i78]Alon Ziv, Itai Gat, Gaël Le Lan, Tal Remez, Felix Kreuk, Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
Masked Audio Generation using a Single Non-Autoregressive Transformer. CoRR abs/2401.04577 (2024) - [i77]Matanel Oren, Michael Hassid, Yossi Adi, Roy Schwartz:
Transformers are Multi-State RNNs. CoRR abs/2401.06104 (2024) - [i76]Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi:
The Larger the Better? Improved LLM Code-Generation via Budget Reallocation. CoRR abs/2404.00725 (2024) - [i75]Jean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez:
An Independence-promoting Loss for Music Generation with Language Models. CoRR abs/2406.02315 (2024) - [i74]Xuankai Chang, Jiatong Shi, Jinchuan Tian, Yuning Wu, Yuxun Tang, Yihan Wu, Shinji Watanabe, Yossi Adi, Xie Chen, Qin Jin:
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units. CoRR abs/2406.07725 (2024) - [i73]Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi:
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation. CoRR abs/2406.10970 (2024) - [i72]Shoval Messica, Yossi Adi:
NAST: Noise Aware Speech Tokenization for Speech Language Models. CoRR abs/2406.11037 (2024) - [i71]Guy Yariv, Idan Schwartz, Yossi Adi, Sagie Benaim:
Improving Visual Commonsense in Language Models via Multiple Image Generation. CoRR abs/2406.13621 (2024) - [i70]Arnon Turetzky, Or Tal, Yael Segal-Feldman, Yehoshua Dissen, Ella Zeldes, Amit Roth, Eyal Cohen, Yosi Shrem, Bronya Roni Chernyak, Olga Seleznova, Joseph Keshet, Yossi Adi:
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing. CoRR abs/2407.07566 (2024) - [i69]Amit Roth, Arnon Turetzky, Yossi Adi:
A Language Modeling Approach to Diacritic-Free Hebrew TTS. CoRR abs/2407.12206 (2024) - [i68]Simon Rouard, Yossi Adi, Jade Copet, Axel Roebel, Alexandre Défossez:
Audio Conditioning for Music Generation via Discrete Bottleneck Features. CoRR abs/2407.12563 (2024) - [i67]Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman:
Discrete Flow Matching. CoRR abs/2407.15595 (2024) - [i66]Shiran Aziz, Yossi Adi, Shmuel Peleg:
Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline. CoRR abs/2408.17434 (2024) - [i65]Robin San Roman, Pierre Fernandez, Antoine Deleforge, Yossi Adi, Romain Serizel:
Latent Watermarking of Audio Generative Models. CoRR abs/2409.02915 (2024) - [i64]Arnon Turetzky, Yossi Adi:
LAST: Language Model Aware Speech Tokenization. CoRR abs/2409.03701 (2024) - [i63]Gallil Maimon, Amit Roth, Yossi Adi:
A Suite for Acoustic Language Model Evaluation. CoRR abs/2409.07437 (2024) - 2023
- [j7]Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. Trans. Assoc. Comput. Linguistics 11: 250-266 (2023) - [j6]Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
High Fidelity Neural Audio Compression. Trans. Mach. Learn. Res. 2023 (2023) - [c52]Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration. CVPR 2023: 18796-18806 - [c51]Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux:
Generative Spoken Language Model based on continuous word-sized audio tokens. EMNLP 2023: 3008-3028 - [c50]Gallil Maimon, Yossi Adi:
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units. EMNLP (Findings) 2023: 8048-8061 - [c49]Ali Elkahky, Wei-Ning Hsu, Paden Tomasello, Tu Anh Nguyen, Robin Algayres, Yossi Adi, Jade Copet, Emmanuel Dupoux, Abdelrahman Mohamed:
Do Coarser Units Benefit Cluster Prediction-Based Speech Pre-Training? ICASSP 2023: 1-5 - [c48]Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. ICASSP 2023: 1-5 - [c47]Moshe Mandel, Or Tal, Yossi Adi:
AERO: Audio Super Resolution in the Spectral Domain. ICASSP 2023: 1-5 - [c46]Roy Sheffer, Yossi Adi:
I Hear Your True Colors: Image Guided Audio Generation. ICASSP 2023: 1-5 - [c45]Amitay Sicherman, Yossi Adi:
Analysing Discrete Self Supervised Speech Representation For Spoken Language Modeling. ICASSP 2023: 1-5 - [c44]Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. ICLR 2023 - [c43]Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. INTERSPEECH 2023: 4823-4827 - [c42]Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz:
Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. INTERSPEECH 2023: 5446-5450 - [c41]Itai Gat, Felix Kreuk, Tu Anh Nguyen, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling. IWSLT@ACL 2023: 465-477 - [c40]Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez:
Simple and Controllable Music Generation. NeurIPS 2023 - [c39]Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi:
Textually Pretrained Speech Language Models. NeurIPS 2023 - [c38]Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. NeurIPS 2023 - [c37]Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez:
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion. NeurIPS 2023 - [i62]Amitay Sicherman, Yossi Adi:
Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling. CoRR abs/2301.00591 (2023) - [i61]Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. CoRR abs/2301.10606 (2023) - [i60]Guy Lorberbom, Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
Layer Collaboration in the Forward-Forward Algorithm. CoRR abs/2305.12393 (2023) - [i59]Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi:
Textually Pretrained Speech Language Models. CoRR abs/2305.13009 (2023) - [i58]Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz:
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. CoRR abs/2305.13050 (2023) - [i57]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1,000+ Languages. CoRR abs/2305.13516 (2023) - [i56]Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez:
Simple and Controllable Music Generation. CoRR abs/2306.05284 (2023) - [i55]Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. CoRR abs/2306.15687 (2023) - [i54]Robin San-Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez:
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion. CoRR abs/2308.02560 (2023) - [i53]Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. CoRR abs/2308.05725 (2023) - [i52]Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton-Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, Gabriel Synnaeve:
Code Llama: Open Foundation Models for Code. CoRR abs/2308.12950 (2023) - [i51]Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi:
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. CoRR abs/2309.16429 (2023) - [i50]Po-Chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed:
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS. CoRR abs/2309.17020 (2023) - [i49]Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux:
Generative Spoken Language Model based on continuous word-sized audio tokens. CoRR abs/2310.05224 (2023) - 2022
- [j5]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing. IEEE J. Sel. Top. Signal Process. 16(6): 1329-1341 (2022) - [j4]Alexandre Défossez, Yossi Adi, Gabriel Synnaeve:
Differentiable Model Compression via Pseudo Quantization Noise. Trans. Mach. Learn. Res. 2022 (2022) - [c36]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu:
Direct Speech-to-Speech Translation With Discrete Units. ACL (1) 2022: 3327-3339 - [c35]Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. ACL (1) 2022: 8666-8681 - [c34]Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Discrete & Decomposed Representations. EMNLP 2022: 11200-11214 - [c33]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual Self-Training With Bootstrapped Remixing For Speech Enhancement. ICASSP 2022: 6947-6951 - [c32]Alon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan:
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies. ICLR 2022 - [c31]Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi:
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement. INTERSPEECH 2022: 1193-1197 - [c30]Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski:
Probing phoneme, language and speaker information in unsupervised speech representations. INTERSPEECH 2022: 1402-1406 - [c29]Shahaf Bassan, Yossi Adi, Jeffrey S. Rosenschein:
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors. INTERSPEECH 2022: 2423-2427 - [c28]Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg:
Deep Audio Waveform Prior. INTERSPEECH 2022: 2938-2942 - [c27]Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. INTERSPEECH 2022: 5195-5199 - [c26]Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. NAACL-HLT 2022: 860-872 - [c25]Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
On the Importance of Gradient Norm in PAC-Bayesian Bounds. NeurIPS 2022 - [c24]Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossi Adi, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
Stop: A Dataset for Spoken Task Oriented Semantic Parsing. SLT 2022: 991-998 - [i48]Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
textless-lib: a Library for Textless Spoken Language Processing. CoRR abs/2202.07359 (2022) - [i47]Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing. CoRR abs/2202.08862 (2022) - [i46]Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski:
Probing phoneme, language and speaker information in unsupervised speech representations. CoRR abs/2203.16193 (2022) - [i45]Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. CoRR abs/2203.16502 (2022) - [i44]Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. CoRR abs/2204.02967 (2022) - [i43]Alon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan:
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies. CoRR abs/2205.01324 (2022) - [i42]Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi:
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement. CoRR abs/2206.11000 (2022) - [i41]Shahaf Bassan, Yossi Adi, Jeffrey S. Rosenschein:
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors. CoRR abs/2207.00760 (2022) - [i40]Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg:
Deep Audio Waveform Prior. CoRR abs/2207.10441 (2022) - [i39]Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. CoRR abs/2209.15352 (2022) - [i38]Itai Gat, Felix Kreuk, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
On The Robustness of Self-Supervised Representations for Spoken Language Modeling. CoRR abs/2209.15483 (2022) - [i37]Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
On the Importance of Gradient Norm in PAC-Bayesian Bounds. CoRR abs/2210.06143 (2022) - [i36]Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
High Fidelity Neural Audio Compression. CoRR abs/2210.13438 (2022) - [i35]Felix Kreuk, Yaniv Taigman, Adam Polyak, Jade Copet, Gabriel Synnaeve, Alexandre Défossez, Yossi Adi:
Audio Language Modeling using Perceptually-Guided Discrete Representations. CoRR abs/2211.01223 (2022) - [i34]Roy Sheffer, Yossi Adi:
I Hear Your True Colors: Image Guided Audio Generation. CoRR abs/2211.03089 (2022) - [i33]Moshe Mandel, Or Tal, Yossi Adi:
AERO: Audio Super Resolution in the Spectral Domain. CoRR abs/2211.12232 (2022) - [i32]Gallil Maimon, Yossi Adi:
Speaking Style Conversion With Discrete Self-Supervised Units. CoRR abs/2212.09730 (2022) - [i31]Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement. CoRR abs/2212.11377 (2022) - 2021
- [j3]Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi:
SAGRNN: Self-Attentive Gated RNN For Binaural Speaker Separation With Interaural Cue Preservation. IEEE Signal Process. Lett. 28: 26-30 (2021) - [c23]Shahar Segal, Yossi Adi, Benny Pinkas, Carsten Baum, Chaya Ganesh, Joseph Keshet:
Fairness in the Eyes of the Data: Certifying Machine-Learning Models. AIES 2021: 926-935 - [c22]Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Pino:
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit. EMNLP (Demos) 2021: 143-152 - [c21]Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi:
Single Channel Voice Separation for Unknown Number of Speakers Under Reverberant and Noisy Settings. ICASSP 2021: 3730-3734 - [c20]Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman:
High Fidelity Speech Regeneration with Application to Speech Enhancement. ICASSP 2021: 7143-7147 - [c19]Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. Interspeech 2021: 3615-3619 - [i30]Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman:
High Fidelity Speech Regeneration with Application to Speech Enhancement. CoRR abs/2102.00429 (2021) - [i29]Kushal Lakhotia, Evgeny Kharitonov, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Benjamin Bolte, Tu Anh Nguyen, Jade Copet, Alexei Baevski, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Language Modeling from Raw Audio. CoRR abs/2102.01192 (2021) - [i28]Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. CoRR abs/2104.00355 (2021) - [i27]Alexandre Défossez, Yossi Adi, Gabriel Synnaeve:
Differentiable Model Compression via Pseudo Quantization Noise. CoRR abs/2104.09987 (2021) - [i26]Ori Kabeli, Yossi Adi, Zhenyu Tang, Buye Xu, Anurag Kumar:
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation. CoRR abs/2106.13493 (2021) - [i25]Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Miguel Pino, Wei-Ning Hsu:
Direct speech-to-speech translation with discrete units. CoRR abs/2107.05604 (2021) - [i24]Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. CoRR abs/2109.03264 (2021) - [i23]Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Miguel Pino:
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit. CoRR abs/2109.06912 (2021) - [i22]Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual self-training with bootstrapped remixing for speech enhancement. CoRR abs/2110.10103 (2021) - [i21]Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Decomposed and Discrete Representations. CoRR abs/2111.07402 (2021) - 2020
- [c18]Felix Kreuk, Yaniv Sheena, Joseph Keshet, Yossi Adi:
Phoneme Boundary Detection Using Learnable Segmental Features. ICASSP 2020: 8089-8093 - [c17]Eliya Nachmani, Yossi Adi, Lior Wolf:
Voice Separation with an Unknown Number of Multiple Speakers. ICML 2020: 7164-7175 - [c16]Adam Polyak, Lior Wolf, Yossi Adi, Yaniv Taigman:
Unsupervised Cross-Domain Singing Voice Conversion. INTERSPEECH 2020: 801-805 - [c15]Alexandre Défossez, Gabriel Synnaeve, Yossi Adi:
Real Time Speech Enhancement in the Waveform Domain. INTERSPEECH 2020: 3291-3295 - [c14]Felix Kreuk, Joseph Keshet, Yossi Adi:
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation. INTERSPEECH 2020: 3700-3704 - [c13]Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Towards Deep Neural Networks for Speech Steganography. INTERSPEECH 2020: 4656-4660 - [c12]Ben Goldberger, Guy Katz, Yossi Adi, Joseph Keshet:
Minimal Modifications of Deep Neural Networks using Verification. LPAR 2020: 260-278 - [i20]Felix Kreuk, Yaniv Sheena, Joseph Keshet, Yossi Adi:
Phoneme Boundary Detection using Learnable Segmental Features. CoRR abs/2002.04992 (2020) - [i19]Yossi Adi, Yaniv Nemcovsky, Alexander G. Schwing, Tamir Hazan:
On the generalization of bayesian deep nets for multi-class classification. CoRR abs/2002.09866 (2020) - [i18]