default search action
Najim Dehak
Person information
- affiliation: MIT, Cambridge, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j28]Deming Li, Ankur A. Butala, Laureano Moro-Velázquez, Trevor Meyer, Esther S. Oh, Chelsey Motley, Jesús Villalba, Najim Dehak:
Automating the analysis of eye movement for different neurodegenerative disorders. Comput. Biol. Medicine 170: 107951 (2024) - [j27]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Zelasko, Najim Dehak:
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1736-1749 (2024) - [j26]Magdalena Rybicka, Jesús Villalba, Thomas Thebaud, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3960-3973 (2024) - [j25]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Slowness Regularized Contrastive Predictive Coding for Acoustic Unit Discovery. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4277-4287 (2024) - [c137]Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro-Velázquez, Najim Dehak:
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline. LREC/COLING 2024: 7296-7306 - [c136]Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. ICASSP 2024: 1196-1200 - [c135]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. Odyssey 2024: 165-171 - [c134]Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. Odyssey 2024: 201-208 - [c133]Lucas Goncalves, Ali N. Salman, Abinay Reddy Naini, Laureano Moro-Velázquez, Thomas Thebaud, Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results. Odyssey 2024: 247-254 - [e1]Najim Dehak, Patrick Cardinal:
Odyssey 2024: The Speaker and Language Recognition Workshop, Quebec City, Canada, June 18-21, 2024. ISCA 2024 [contents] - [i55]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. CoRR abs/2402.19355 (2024) - [i54]Helin Wang, Meng Yu, Jiarui Hai, Chen Chen, Yuchen Hu, Rilin Chen, Najim Dehak, Dong Yu:
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis. CoRR abs/2409.07556 (2024) - [i53]Thomas Thebaud, Anna Favaro, Casey Chen, Gabriel Chávez, Laureano Moro-Velázquez, Ankur A. Butala, Najim Dehak:
Explainable Metrics for the Assessment of Neurodegenerative Diseases through Handwriting Analysis. CoRR abs/2409.08303 (2024) - [i52]Helin Wang, Jiarui Hai, Yen-Ju Lu, Karan Thakkar, Mounya Elhilali, Najim Dehak:
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. CoRR abs/2409.08425 (2024) - [i51]Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks against SLU Systems. CoRR abs/2409.08985 (2024) - 2023
- [j24]Anna Favaro, Yi-Ting Tsai, Ankur A. Butala, Thomas Thebaud, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput. Biol. Medicine 166: 107559 (2023) - [c132]Maliha Jahan, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak, Jesús Villalba:
Model-Based Fairness Metric for Speaker Verification. ASRU 2023: 1-7 - [c131]Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. ASRU 2023: 1-8 - [c130]Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. ASRU 2023: 1-8 - [c129]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning. INTERSPEECH 2023: 431-435 - [c128]Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525 - [c127]Helin Wang, Thomas Thebaud, Jesús Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velázquez:
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model. INTERSPEECH 2023: 1548-1552 - [c126]Anna Favaro, Tianyu Cao, Thomas Thebaud, Jesús Villalba, Ankur A. Butala, Najim Dehak, Laureano Moro-Velázquez:
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora? INTERSPEECH 2023: 2388-2392 - [c125]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition. INTERSPEECH 2023: 4688-4692 - [i50]Martin Sustek, Samik Sadhu, Lukás Burget, Hynek Hermansky, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Stabilized training of joint energy-based models and their practical applications. CoRR abs/2303.04187 (2023) - [i49]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning. CoRR abs/2309.04628 (2023) - [i48]Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. CoRR abs/2310.04567 (2023) - [i47]Trevor Meyer, Camden Shultz, Najim Dehak, Laureano Moro-Velázquez, Pedro P. Irazoqui:
Time Scale Network: A Shallow Neural Network For Time Series Data. CoRR abs/2311.06170 (2023) - 2022
- [j23]Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering phonetic inventories with crosslingual automatic speech recognition. Comput. Speech Lang. 74: 101358 (2022) - [j22]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech. IEEE J. Sel. Top. Signal Process. 16(6): 1284-1295 (2022) - [j21]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2002-2014 (2022) - [c124]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. INTERSPEECH 2022: 615-619 - [c123]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-contrastive self-supervised learning of utterance-level speech representations. INTERSPEECH 2022: 4028-4032 - [c122]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022: 5035-5039 - [c121]Yiwen Shao, Jesús Villalba, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak:
Chunking Defense for Adversarial Attacks on ASR. INTERSPEECH 2022: 5045-5049 - [c120]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. INTERSPEECH 2022: 5060-5064 - [c119]Magdalena Rybicka, Jesús Villalba, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. INTERSPEECH 2022: 5090-5094 - [c118]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21. Odyssey 2022: 213-220 - [c117]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge. Odyssey 2022: 338-345 - [c116]Tianyu Cao, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics. SLT 2022: 517-524 - [c115]Anna Favaro, Chelsie Motley, Tianyu Cao, Miguel Iglesias, Ankur A. Butala, Esther S. Oh, Robert D. Stevens, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders. SLT 2022: 532-539 - [c114]Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur:
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition. SLT 2022: 777-784 - [i46]Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali:
Code-Switching Text Augmentation for Multilingual Speech Processing. CoRR abs/2201.02550 (2022) - [i45]Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition. CoRR abs/2201.11207 (2022) - [i44]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. CoRR abs/2203.16614 (2022) - [i43]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. CoRR abs/2204.03848 (2022) - [i42]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser. CoRR abs/2204.03851 (2022) - [i41]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations. CoRR abs/2208.05413 (2022) - [i40]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech. CoRR abs/2208.05445 (2022) - 2021
- [j20]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Julián D. Arias-Londoño, Najim Dehak, Juan Ignacio Godino-Llorente:
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects. Biomed. Signal Process. Control. 66: 102418 (2021) - [j19]Nanxin Chen, Shinji Watanabe, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Non-Autoregressive Transformer for Speech Recognition. IEEE Signal Process. Lett. 28: 121-125 (2021) - [j18]Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. Trans. Assoc. Comput. Linguistics 9: 1163-1179 (2021) - [j17]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems. IEEE Trans. Inf. Forensics Secur. 16: 4811-4826 (2021) - [c113]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. ASRU 2021: 39-46 - [c112]Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint Prediction of Truecasing and Punctuation for Conversational Speech in Low-Resource Scenarios. ASRU 2021: 1185-1191 - [c111]Laureano Moro-Velázquez, Jorge Gómez-García, Najim Dehak, Juan Ignacio Godino-Llorente:
New tools for the differential evaluation of Parkinson's disease using voice and speech processing. IberSPEECH 2021 - [c110]Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. ICASSP 2021: 5994-5998 - [c109]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. ICASSP 2021: 6324-6328 - [c108]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios. ICASSP 2021: 6733-6737 - [c107]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models. ICASSP 2021: 7118-7122 - [c106]Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-Shot ASR Performance. ICASSP 2021: 7238-7242 - [c105]Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval. ICASSP 2021: 7603-7607 - [c104]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. Interspeech 2021: 366-370 - [c103]Magdalena Rybicka, Jesús Villalba, Piotr Zelasko, Najim Dehak, Konrad Kowalczyk:
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition. Interspeech 2021: 496-500 - [c102]Saurabh Kataria, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification. Interspeech 2021: 1079-1083 - [c101]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. Interspeech 2021: 3765-3769 - [c100]Nanxin Chen, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition. Interspeech 2021: 3770-3774 - [c99]Raghavendra Pappagari, Jaejin Cho, Sonal Joshi, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios. Interspeech 2021: 3825-3829 - [c98]Jesús Villalba, Sonal Joshi, Piotr Zelasko, Najim Dehak:
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems. Interspeech 2021: 4304-4308 - [c97]Aviad Shtrosberg, Jesús Villalba, Najim Dehak, Azaria Cohen, Bar Ben-Yair:
Invariant Representation Learning for Robust Far-Field Speaker Recognition. SLSP 2021: 97-110 - [i39]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Adversarial Attacks and Defenses for Speaker Identification Systems. CoRR abs/2101.08909 (2021) - [i38]Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021) - [i37]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. CoRR abs/2106.02170 (2021) - [i36]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. CoRR abs/2106.09660 (2021) - [i35]Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. CoRR abs/2107.02294 (2021) - [i34]Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios. CoRR abs/2109.06103 (2021) - [i33]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. CoRR abs/2109.06112 (2021) - [i32]Jaejin Cho, Jesús Villalba, Najim Dehak:
The JHU submission to VoxSRC-21: Track 3. CoRR abs/2109.13425 (2021) - [i31]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding. CoRR abs/2110.02345 (2021) - 2020
- [j16]Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020) - [j15]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Leibny Paola García-Perera, Fred Richardson, Réda Dehak, Pedro A. Torres-Carrasquillo, Najim Dehak:
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations. Comput. Speech Lang. 60 (2020) - [j14]Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. IEEE J. Sel. Top. Signal Process. 14(2): 234-239 (2020) - [j13]Laureano Moro-Velázquez, Estefanía Hernández-García, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Najim Dehak:
Analysis of the Effects of Supraglottal Tract Surgical Procedures in Automatic Speaker Recognition Performance. IEEE ACM Trans. Audio Speech Lang. Process. 28: 798-812 (2020) - [c96]Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Using X-Vectors to Automatically Detect Parkinson's Disease from Speech. ICASSP 2020: 1155-1159 - [c95]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition. ICASSP 2020: 7169-7173 - [c94]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, L. Paola García-Perera, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. ICASSP 2020: 7584-7588 - [c93]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for Speaker Verification. ICASSP 2020: 7599-7603 - [c92]Raghavendra Pappagari, Jaejin Cho, Laureano Moro-Velázquez, Najim Dehak:
Using State of the Art Speaker Recognition and Natural Language Processing Technologies to Detect Alzheimer's Disease and Assess its Severity. INTERSPEECH 2020: 2177-2181 - [c91]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. INTERSPEECH 2020: 3256-3260 - [c90]Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages. INTERSPEECH 2020: 3705-3709 - [c89]Jesús Villalba, Yuekai Zhang, Najim Dehak:
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. INTERSPEECH 2020: 4233-4237 - [c88]Yuekai Zhang, Ziyan Jiang, Jesús Villalba, Najim Dehak:
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. INTERSPEECH 2020: 4238-4242 - [c87]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. INTERSPEECH 2020: 4876-4880 - [c86]Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? INTERSPEECH 2020: 4906-4910 - [c85]Jesús Antonio Villalba López, Daniel Garcia-Romero, Nanxin Chen, Gregory Sell, Jonas Borgstrom, Alan McCree, Leibny Paola García-Perera, Saurabh Kataria, Phani Sankar Nidadavolu, Pedro Torres-Carrasquiilo, Najim Dehak:
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey 2020: 273-280 - [c84]Leibny Paola García-Perera, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422 - [c83]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss Based Enhancement for Speaker Verification. Odyssey 2020: 459-466 - [i30]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss based Enhancement for Speaker Verification. CoRR abs/2002.00139 (2020) - [i29]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition. CoRR abs/2002.05039 (2020) - [i28]Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? CoRR abs/2004.05985 (2020) - [i27]Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages. CoRR abs/2005.08118 (2020) - [i26]Phani Sankar Nidadavolu, Saurabh Kataria, L. Paola García-Perera, Jesús Villalba, Najim Dehak:
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild. CoRR abs/2005.08331 (2020) - [i25]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. CoRR abs/2007.13033 (2020) - [i24]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. CoRR abs/2010.11221 (2020) - [i23]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models. CoRR abs/2010.11860 (2020) - [i22]Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-shot ASR Performance. CoRR abs/2010.12104 (2020) - [i21]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. CoRR abs/2010.14602 (2020) - [i20]