Najim Dehak

Person information

affiliation: MIT, Cambridge, USA

2020 – today

2025
[j31]
persistent URL: https://dblp.org/rec/journals/cbm/MeyerFOBMIDM25
Trevor Meyer, Anna Favaro, Esther S. Oh, Ankur A. Butala, Chelsie Motley, Pedro P. Irazoqui, Najim Dehak, Laureano Moro-Velázquez:
Deep Stroop: Integrating eye tracking and speech processing to characterize people with neurodegenerative disorders while performing neuropsychological tests. Comput. Biol. Medicine 184: 109398 (2025)
[j30]
persistent URL: https://dblp.org/rec/journals/jstsp/MeyerSDMI25
Trevor Meyer, Camden Shultz, Najim Dehak, Laureano Moro-Velázquez, Pedro P. Irazoqui:
Time Scale Network: An Efficient Shallow Neural Network for Time Series Data in Biomedical Applications. IEEE J. Sel. Top. Signal Process. 19(1): 129-139 (2025)
[j29]
persistent URL: https://dblp.org/rec/journals/spl/RybickaKTDV25
Magdalena Rybicka, Konrad Kowalczyk, Thomas Thebaud, Najim Dehak, Jesús Villalba:
Joint Diarization and Separation Using SepFormer With Non-Autoregressive Attractors. IEEE Signal Process. Lett. 32: 2913-2917 (2025)
[c154]
persistent URL: https://dblp.org/rec/conf/ciss/YangTD25
Yuchen Yang, Thomas Thebaud, Najim Dehak:
Demographic Attributes Prediction from Speech Using WavLM Embeddings. CISS 2025: 1-6
[c153]
persistent URL: https://dblp.org/rec/conf/icassp/JahanMTH0DM25
Maliha Jahan, Priyam Mazumdar, Thomas Thebaud, Mark Hasegawa-Johnson, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Unveiling Performance Bias in ASR Systems: A Study on Gender, Age, Accent, and More. ICASSP 2025: 1-5
[c152]
persistent URL: https://dblp.org/rec/conf/icassp/LaouedjW0TMD25
Sarah Laouedj, Yuzhe Wang, Jesús Villalba, Thomas Thebaud, Laureano Moro-Velázquez, Najim Dehak:
Detecting Neurodegenerative Diseases using Frame-Level Handwriting Embeddings. ICASSP 2025: 1-5
[c151]
persistent URL: https://dblp.org/rec/conf/icassp/WangHLTED25
Helin Wang, Jiarui Hai, Yen-Ju Lu, Karan Thakkar, Mounya Elhilali, Najim Dehak:
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ICASSP 2025: 1-5
[c150]
persistent URL: https://dblp.org/rec/conf/icassp/WangYH0HCDY25
Helin Wang, Meng Yu, Jiarui Hai, Chen Chen, Yuchen Hu, Rilin Chen, Najim Dehak, Dong Yu:
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis. ICASSP 2025: 1-5
[c149]
persistent URL: https://dblp.org/rec/conf/interspeech/GaoFDM25
Kunxiao Gao, Anna Favaro, Najim Dehak, Laureano Moro-Velázquez:
ADCeleb: A Longitudinal Speech Dataset from Public Figures for Early Detection of Alzheimer's Disease. INTERSPEECH 2025
[c148]
persistent URL: https://dblp.org/rec/conf/interspeech/JahanSMFT0HDM25
Maliha Jahan, Yinglun Sun, Priyam Mazumdar, Zsuzsanna Fagyal, Thomas Thebaud, Jesús Villalba, Mark Hasegawa-Johnson, Najim Dehak, Laureano Moro-Velázquez:
FaiST: A Benchmark Dataset for Fairness in Speech Technology. INTERSPEECH 2025
[c147]
persistent URL: https://dblp.org/rec/conf/interspeech/NainiGSMUTMGDSB25
Abinay Reddy Naini, Lucas Goncalves, Ali N. Salman, Pravin Mote, Ismail Rasim Ulgen, Thomas Thebaud, Laureano Moro-Velázquez, Leibny Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
The Interspeech 2025 Challenge on Speech Emotion Recognition in Naturalistic Conditions. INTERSPEECH 2025
[c146]
persistent URL: https://dblp.org/rec/conf/interspeech/Singh0D25
Prabhav Singh, Jesús Villalba, Najim Dehak:
Count Your Speakers! Multitask Learning for Multimodal Speaker Diarization. INTERSPEECH 2025
[c145]
persistent URL: https://dblp.org/rec/conf/interspeech/TamirT0DK25
Ziv Tamir, Thomas Thebaud, Jesús Villalba, Najim Dehak, Oren Kurland:
Multimodal Emotion Diarization: Frame-Wise Integration of Text and Audio Representations. INTERSPEECH 2025
[i69]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-07025
Sarah Laouedj, Yuzhe Wang, Jesús Villalba, Thomas Thebaud, Laureano Moro-Velázquez, Najim Dehak:
Detecting Neurodegenerative Diseases using Frame-Level Handwriting Embeddings. CoRR abs/2502.07025 (2025)
[i68]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-12007
Yuchen Yang, Thomas Thebaud, Najim Dehak:
Demographic Attributes Prediction from Speech Using WavLM Embeddings. CoRR abs/2502.12007 (2025)
[i67]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-14648
Tiantian Feng, Jihwan Lee, Anfeng Xu, Yoonjeong Lee, Thanathai Lertpetchpun, Xuan Shi, Helin Wang, Thomas Thebaud, Laureano Moro-Velázquez, Dani Byrd, Najim Dehak, Shrikanth Narayanan:
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits. CoRR abs/2505.14648 (2025)
[i66]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-19314
Helin Wang, Jiarui Hai, Dongchao Yang, Chen Chen, Kai Li, Junyi Peng, Thomas Thebaud, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline. CoRR abs/2505.19314 (2025)
[i65]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-02863
Helin Wang, Jiarui Hai, Dading Chong, Karan Thakkar, Tiantian Feng, Dongchao Yang, Junhyeok Lee, Laureano Moro-Velázquez, Jesús Villalba, Zengyi Qin, Shrikanth Narayanan, Mounya Elhilali, Najim Dehak:
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech. CoRR abs/2506.02863 (2025)
[i64]
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-04795
Thomas Thebaud, Yen-Ju Lu, Matthew Wiesner, Peter Viechnicki, Najim Dehak:
Enhancing Dialogue Annotation with Speaker Characteristics Leveraging a Frozen LLM. CoRR abs/2508.04795 (2025)
[i63]
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-08559
Alexandrine Fortier, Sonal Joshi, Thomas Thebaud, Jesús Antonio Villalba López, Najim Dehak, Patrick Cardinal:
Multi-Target Backdoor Attacks Against Speaker Recognition. CoRR abs/2508.08559 (2025)
[i62]
persistent URL: https://dblp.org/rec/journals/corr/abs-2509-16474
Gabriel Chávez, Laureano Moro-Velázquez, Ankur A. Butala, Najim Dehak, Thomas Thebaud:
Cross-Corpus and Cross-domain Handwriting Assessment of NeuroDegenerative Diseases via Time-Series-to-Image Conversion. CoRR abs/2509.16474 (2025)
[i61]
persistent URL: https://dblp.org/rec/journals/corr/abs-2509-17143
Junhyeok Lee, Helin Wang, Yaohan Guan, Thomas Thebaud, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances. CoRR abs/2509.17143 (2025)
[i60]
persistent URL: https://dblp.org/rec/journals/corr/abs-2509-25144
Yen-Ju Lu, Thomas Thebaud, Laureano Moro-Velázquez, Najim Dehak, Jesús Villalba:
Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation. CoRR abs/2509.25144 (2025)
[i59]
persistent URL: https://dblp.org/rec/journals/corr/abs-2510-01157
Alexandrine Fortier, Thomas Thebaud, Jesús Villalba, Najim Dehak, Patrick Cardinal:
Backdoor Attacks Against Speech Language Models. CoRR abs/2510.01157 (2025)
[i58]
persistent URL: https://dblp.org/rec/journals/corr/abs-2510-06195
Yen-Ju Lu, Yashesh Gaur, Wei Zhou, Benjamin Muller, Jesús Villalba, Najim Dehak, Luke Zettlemoyer, Gargi Ghosh, Mike Lewis, Srinivasan Iyer, Duc Le:
Latent Speech-Text Transformer. CoRR abs/2510.06195 (2025)
[i57]
persistent URL: https://dblp.org/rec/journals/corr/abs-2510-21014
Ari Frummer, Helin Wang, Tianyu Cao, Adi Arbel, Yuval Sieradzki, Oren Gal, Jesús Villalba, Thomas Thebaud, Najim Dehak:
ReFESS-QI: Reference-Free Evaluation For Speech Separation With Joint Quality And Intelligibility Scoring. CoRR abs/2510.21014 (2025)
2024
[j28]
persistent URL: https://dblp.org/rec/journals/cbm/LiBMMOMVD24
Deming Li, Ankur A. Butala, Laureano Moro-Velázquez, Trevor Meyer, Esther S. Oh, Chelsey Motley, Jesús Villalba, Najim Dehak:
Automating the analysis of eye movement for different neurodegenerative disorders. Comput. Biol. Medicine 170: 107951 (2024)
[j27]
persistent URL: https://dblp.org/rec/journals/taslp/KatariaVMZD24
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Zelasko, Najim Dehak:
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1736-1749 (2024)
[j26]
persistent URL: https://dblp.org/rec/journals/taslp/RybickaVTDK24
Magdalena Rybicka, Jesús Villalba, Thomas Thebaud, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3960-3973 (2024)
[j25]
persistent URL: https://dblp.org/rec/journals/taslp/BhatiVZMD24
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Slowness Regularized Contrastive Predictive Coding for Acoustic Unit Discovery. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4277-4287 (2024)
[c144]
persistent URL: https://dblp.org/rec/conf/coling/JahanWTSLFSHMD24
Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro-Velázquez, Najim Dehak:
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline. LREC/COLING 2024: 7296-7306
[c143]
persistent URL: https://dblp.org/rec/conf/embc/WatkinsonACEGMD24
Sophia A. Watkinson, Anthony J. Anderson, Michael Caiola, David Eguren, Michael Gonzalez, Laureano Moro-Velázquez, Najim Dehak, Chelsey Motley, Emile Moukheiber, Kelly Mills, Brittney C. Muir, Ankur A. Butala, Kimberly Kontson:
Concurrent validity of instrumented insoles measuring gait and balance metrics in Parkinson's disease. EMBC 2024: 1-7
[c142]
persistent URL: https://dblp.org/rec/conf/icassp/HaiWYTDE24
Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. ICASSP 2024: 1196-1200
[c141]
persistent URL: https://dblp.org/rec/conf/icmi/ThebaudFGYS0MD24
Thomas Thebaud, Anna Favaro, Yaohan Guan, Yuchen Yang, Prabhav Singh, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Multimodal Emotion Recognition Harnessing the Complementarity of Speech, Language, and Vision. ICMI 2024: 684-689
[c140]
persistent URL: https://dblp.org/rec/conf/interspeech/Favaro0DM24
Anna Favaro, Tianyu Cao, Najim Dehak, Laureano Moro-Velázquez:
Leveraging Universal Speech Representations for Detecting and Assessing the Severity of Mild Cognitive Impairment Across Languages. INTERSPEECH 2024
[c139]
persistent URL: https://dblp.org/rec/conf/interspeech/Wang0MHTD24
Helin Wang, Jesús Villalba, Laureano Moro-Velázquez, Jiarui Hai, Thomas Thebaud, Najim Dehak:
Noise-robust Speech Separation with Fast Generative Correction. INTERSPEECH 2024
[c138]
persistent URL: https://dblp.org/rec/conf/interspeech/WangFT0DM24
Yuzhe Wang, Anna Favaro, Thomas Thebaud, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Exploring the Complementary Nature of Speech and Eye Movements for Profiling Neurological Disorders. INTERSPEECH 2024
[c137]
persistent URL: https://dblp.org/rec/conf/nips/LuLTMRD024
Yen-Ju Lu, Jing Liu, Thomas Thebaud, Laureano Moro-Velázquez, Ariya Rastrow, Najim Dehak, Jesús Villalba:
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing. NeurIPS 2024
[c136]
persistent URL: https://dblp.org/rec/conf/odyssey/JoshiT0D24
Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. Odyssey 2024: 165-171
[c135]
persistent URL: https://dblp.org/rec/conf/odyssey/FavaroDT0OM24
Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. Odyssey 2024: 201-208
[c134]
persistent URL: https://dblp.org/rec/conf/odyssey/GoncalvesSNMT0D24
Lucas Goncalves, Ali N. Salman, Abinay Reddy Naini, Laureano Moro-Velázquez, Thomas Thebaud, Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results. Odyssey 2024: 247-254
[c133]
persistent URL: https://dblp.org/rec/conf/slt/XinyuanJTVDK24
Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks Against SLU Systems. SLT 2024: 1107-1114
[e1]
persistent URL: https://dblp.org/rec/conf/odyssey/2024
Najim Dehak, Patrick Cardinal:
Odyssey 2024: The Speaker and Language Recognition Workshop, Quebec City, Canada, June 18-21, 2024. ISCA 2024 [contents]
[i56]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-19355
Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. CoRR abs/2402.19355 (2024)
[i55]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-07556
Helin Wang, Meng Yu, Jiarui Hai, Chen Chen, Yuchen Hu, Rilin Chen, Najim Dehak, Dong Yu:
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis. CoRR abs/2409.07556 (2024)
[i54]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-08303
Thomas Thebaud, Anna Favaro, Casey Chen, Gabriel Chávez, Laureano Moro-Velázquez, Ankur A. Butala, Najim Dehak:
Explainable Metrics for the Assessment of Neurodegenerative Diseases through Handwriting Analysis. CoRR abs/2409.08303 (2024)
[i53]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-08425
Helin Wang, Jiarui Hai, Yen-Ju Lu, Karan Thakkar, Mounya Elhilali, Najim Dehak:
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. CoRR abs/2409.08425 (2024)
[i52]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-08985
Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks against SLU Systems. CoRR abs/2409.08985 (2024)
[i51]
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-04425
Yen-Ju Lu, Jing Liu, Thomas Thebaud, Laureano Moro-Velázquez, Ariya Rastrow, Najim Dehak, Jesús Villalba:
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing. CoRR abs/2412.04425 (2024)
2023
[j24]
persistent URL: https://dblp.org/rec/journals/cbm/FavaroTBTVDM23
Anna Favaro, Yi-Ting Tsai, Ankur A. Butala, Thomas Thebaud, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput. Biol. Medicine 166: 107559 (2023)
[c132]
persistent URL: https://dblp.org/rec/conf/asru/JahanMTDV23
Maliha Jahan, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak, Jesús Villalba:
Model-Based Fairness Metric for Speaker Verification. ASRU 2023: 1-7
[c131]
persistent URL: https://dblp.org/rec/conf/asru/SustekJLTVKD23
Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. ASRU 2023: 1-8
[c130]
persistent URL: https://dblp.org/rec/conf/asru/ThebaudJLSVKD23
Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. ASRU 2023: 1-8
[c129]
persistent URL: https://dblp.org/rec/conf/interspeech/Bhati0MTD23
Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning. INTERSPEECH 2023: 431-435
[c128]
persistent URL: https://dblp.org/rec/conf/interspeech/0001BJKGTD23
Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525
[c127]
persistent URL: https://dblp.org/rec/conf/interspeech/WangT0SLDM23
Helin Wang, Thomas Thebaud, Jesús Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velázquez:
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model. INTERSPEECH 2023: 1548-1552
[c126]
persistent URL: https://dblp.org/rec/conf/interspeech/Favaro0T0BDM23
Anna Favaro, Tianyu Cao, Thomas Thebaud, Jesús Villalba, Ankur A. Butala, Najim Dehak, Laureano Moro-Velázquez:
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora? INTERSPEECH 2023: 2388-2392
[c125]
persistent URL: https://dblp.org/rec/conf/interspeech/Kataria0MTD23
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition. INTERSPEECH 2023: 4688-4692
[i50]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-04187
Martin Sustek, Samik Sadhu, Lukás Burget, Hynek Hermansky, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Stabilized training of joint energy-based models and their practical applications. CoRR abs/2303.04187 (2023)
[i49]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-04628
Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning. CoRR abs/2309.04628 (2023)
[i48]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-04567
Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. CoRR abs/2310.04567 (2023)
[i47]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-06170
Trevor Meyer, Camden Shultz, Najim Dehak, Laureano Moro-Velázquez, Pedro P. Irazoqui:
Time Scale Network: A Shallow Neural Network For Time Series Data. CoRR abs/2311.06170 (2023)
2022
[j23]
persistent URL: https://dblp.org/rec/journals/csl/ZelaskoFMABSHD22
Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering phonetic inventories with crosslingual automatic speech recognition. Comput. Speech Lang. 74: 101358 (2022)
[j22]
persistent URL: https://dblp.org/rec/journals/jstsp/ChoVMD22
Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech. IEEE J. Sel. Top. Signal Process. 16(6): 1284-1295 (2022)
[j21]
persistent URL: https://dblp.org/rec/journals/taslp/BhatiVZMD22
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2002-2014 (2022)
[c124]
persistent URL: https://dblp.org/rec/conf/interspeech/KatariaVMD22
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. INTERSPEECH 2022: 615-619
[c123]
persistent URL: https://dblp.org/rec/conf/interspeech/ChoPZMVD22
Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-contrastive self-supervised learning of utterance-level speech representations. INTERSPEECH 2022: 4028-4032
[c122]
persistent URL: https://dblp.org/rec/conf/interspeech/JoshiKSZVKD22
Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022: 5035-5039
[c121]
persistent URL: https://dblp.org/rec/conf/interspeech/ShaoVJKKD22
Yiwen Shao, Jesús Villalba, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak:
Chunking Defense for Adversarial Attacks on ASR. INTERSPEECH 2022: 5045-5049
[c120]
- dblp key: conf/interspeech/JoshiKVD22
- persistent URL: https://dblp.org/rec/conf/interspeech/JoshiKVD22
Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. INTERSPEECH 2022: 5060-5064
[c119]
- dblp key: conf/interspeech/RybickaVDK22
- persistent URL: https://dblp.org/rec/conf/interspeech/RybickaVDK22
Magdalena Rybicka, Jesús Villalba, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. INTERSPEECH 2022: 5090-5094
[c118]
- dblp key: conf/odyssey/VillalbaBKRCCGT22
- persistent URL: https://dblp.org/rec/conf/odyssey/VillalbaBKRCCGT22
Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21. Odyssey 2022: 213-220
[c117]
- dblp key: conf/odyssey/VillalbaBKCTD22
- persistent URL: https://dblp.org/rec/conf/odyssey/VillalbaBKCTD22
Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge. Odyssey 2022: 338-345
[c116]
- dblp key: conf/slt/CaoMZVD22
- persistent URL: https://dblp.org/rec/conf/slt/CaoMZVD22
Tianyu Cao, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics. SLT 2022: 517-524
[c115]
- dblp key: conf/slt/FavaroMCIBOSVDM22
- persistent URL: https://dblp.org/rec/conf/slt/FavaroMCIBOSVDM22
Anna Favaro, Chelsie Motley, Tianyu Cao, Miguel Iglesias, Ankur A. Butala, Esther S. Oh, Robert D. Stevens, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders. SLT 2022: 532-539
[c114]
- dblp key: conf/slt/HusseinCADAK22
- persistent URL: https://dblp.org/rec/conf/slt/HusseinCADAK22
Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur:
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition. SLT 2022: 777-784
[i46]
- dblp key: journals/corr/abs-2201-02550
- persistent URL: https://dblp.org/rec/journals/corr/abs-2201-02550
Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali:
Code-Switching Text Augmentation for Multilingual Speech Processing. CoRR abs/2201.02550 (2022)
[i45]
- dblp key: journals/corr/abs-2201-11207
- persistent URL: https://dblp.org/rec/journals/corr/abs-2201-11207
Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak:
Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition. CoRR abs/2201.11207 (2022)
[i44]
- dblp key: journals/corr/abs-2203-16614
- persistent URL: https://dblp.org/rec/journals/corr/abs-2203-16614
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. CoRR abs/2203.16614 (2022)
[i43]
- dblp key: journals/corr/abs-2204-03848
- persistent URL: https://dblp.org/rec/journals/corr/abs-2204-03848
Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. CoRR abs/2204.03848 (2022)
[i42]
- dblp key: journals/corr/abs-2204-03851
- persistent URL: https://dblp.org/rec/journals/corr/abs-2204-03851
Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser. CoRR abs/2204.03851 (2022)
[i41]
- dblp key: journals/corr/abs-2208-05413
- persistent URL: https://dblp.org/rec/journals/corr/abs-2208-05413
Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations. CoRR abs/2208.05413 (2022)
[i40]
- dblp key: journals/corr/abs-2208-05445
- persistent URL: https://dblp.org/rec/journals/corr/abs-2208-05445
Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech. CoRR abs/2208.05445 (2022)
2021
[j20]
- dblp key: journals/bspc/Moro-VelazquezG21
- persistent URL: https://dblp.org/rec/journals/bspc/Moro-VelazquezG21
Laureano Moro-Velázquez, Jorge Andrés Gómez García, Julián D. Arias-Londoño, Najim Dehak, Juan Ignacio Godino-Llorente:
Advances in Parkinson's Disease detection and assessment using voice and speech: A review of the articulatory and phonatory aspects. Biomed. Signal Process. Control. 66: 102418 (2021)
[j19]
- dblp key: journals/spl/ChenWVZD21
- persistent URL: https://dblp.org/rec/journals/spl/ChenWVZD21
Nanxin Chen, Shinji Watanabe, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Non-Autoregressive Transformer for Speech Recognition. IEEE Signal Process. Lett. 28: 121-125 (2021)
[j18]
- dblp key: journals/tacl/ZelaskoPD21
- persistent URL: https://dblp.org/rec/journals/tacl/ZelaskoPD21
Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. Trans. Assoc. Comput. Linguistics 9: 1163-1179 (2021)
[j17]
- dblp key: journals/tifs/JoshiVZMD21
- persistent URL: https://dblp.org/rec/journals/tifs/JoshiVZMD21
Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems. IEEE Trans. Inf. Forensics Secur. 16: 4811-4826 (2021)
[c113]
- dblp key: conf/asru/PappagariZVMD21
- persistent URL: https://dblp.org/rec/conf/asru/PappagariZVMD21
Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. ASRU 2021: 39-46
[c112]
- dblp key: conf/asru/PappagariZMPD21
- persistent URL: https://dblp.org/rec/conf/asru/PappagariZMPD21
Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint Prediction of Truecasing and Punctuation for Conversational Speech in Low-Resource Scenarios. ASRU 2021: 1185-1191
[c111]
- dblp key: conf/iberspeech/Moro-VelazquezG21
- persistent URL: https://dblp.org/rec/conf/iberspeech/Moro-VelazquezG21
Laureano Moro-Velázquez, Jorge Gómez-García, Najim Dehak, Juan Ignacio Godino-Llorente:
New tools for the differential evaluation of Parkinson's disease using voice and speech processing. IberSPEECH 2021
[c110]
- dblp key: conf/icassp/ChenZVD21
- persistent URL: https://dblp.org/rec/conf/icassp/ChenZVD21
Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. ICASSP 2021: 5994-5998
[c109]
- dblp key: conf/icassp/PappagariVZMD21
- persistent URL: https://dblp.org/rec/conf/icassp/PappagariVZMD21
Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. ICASSP 2021: 6324-6328
[c108]
- dblp key: conf/icassp/ChoZVD21
- persistent URL: https://dblp.org/rec/conf/icassp/ChoZVD21
Jaejin Cho, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios. ICASSP 2021: 6733-6737
[c107]
- dblp key: conf/icassp/KatariaVD21
- persistent URL: https://dblp.org/rec/conf/icassp/KatariaVD21
Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models. ICASSP 2021: 7118-7122
[c106]
- dblp key: conf/icassp/FengZMAHSD21
- persistent URL: https://dblp.org/rec/conf/icassp/FengZMAHSD21
Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-Shot ASR Performance. ICASSP 2021: 7238-7242
[c105]
- dblp key: conf/icassp/WangWHSD21
- persistent URL: https://dblp.org/rec/conf/icassp/WangWHSD21
Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval. ICASSP 2021: 7603-7607
[c104]
- dblp key: conf/interspeech/BhatiVZMD21
- persistent URL: https://dblp.org/rec/conf/interspeech/BhatiVZMD21
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. Interspeech 2021: 366-370
[c103]
- dblp key: conf/interspeech/RybickaVZDK21
- persistent URL: https://dblp.org/rec/conf/interspeech/RybickaVZDK21
Magdalena Rybicka, Jesús Villalba, Piotr Zelasko, Najim Dehak, Konrad Kowalczyk:
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition. Interspeech 2021: 496-500
[c102]
- dblp key: conf/interspeech/KatariaVZMD21
- persistent URL: https://dblp.org/rec/conf/interspeech/KatariaVZMD21
Saurabh Kataria, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification. Interspeech 2021: 1079-1083
[c101]
- dblp key: conf/interspeech/ChenZZW0DC21
- persistent URL: https://dblp.org/rec/conf/interspeech/ChenZZW0DC21
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. Interspeech 2021: 3765-3769
[c100]
- dblp key: conf/interspeech/ChenZMVD21
- persistent URL: https://dblp.org/rec/conf/interspeech/ChenZMVD21
Nanxin Chen, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition. Interspeech 2021: 3770-3774
[c99]
- dblp key: conf/interspeech/PappagariCJMZVD21
- persistent URL: https://dblp.org/rec/conf/interspeech/PappagariCJMZVD21
Raghavendra Pappagari, Jaejin Cho, Sonal Joshi, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios. Interspeech 2021: 3825-3829
[c98]
- dblp key: conf/interspeech/VillalbaJZD21
- persistent URL: https://dblp.org/rec/conf/interspeech/VillalbaJZD21
Jesús Villalba, Sonal Joshi, Piotr Zelasko, Najim Dehak:
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems. Interspeech 2021: 4304-4308
[c97]
- dblp key: conf/slsp/ShtrosbergVDCB21
- persistent URL: https://dblp.org/rec/conf/slsp/ShtrosbergVDCB21
Aviad Shtrosberg, Jesús Villalba, Najim Dehak, Azaria Cohen, Bar Ben-Yair:
Invariant Representation Learning for Robust Far-Field Speaker Recognition. SLSP 2021: 97-110
[i39]
- dblp key: journals/corr/abs-2101-08909
- persistent URL: https://dblp.org/rec/journals/corr/abs-2101-08909
Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Adversarial Attacks and Defenses for Speaker Identification Systems. CoRR abs/2101.08909 (2021)
[i38]
- dblp key: journals/corr/abs-2103-17122
- persistent URL: https://dblp.org/rec/journals/corr/abs-2103-17122
Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021)
[i37]
- dblp key: journals/corr/abs-2106-02170
- persistent URL: https://dblp.org/rec/journals/corr/abs-2106-02170
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. CoRR abs/2106.02170 (2021)
[i36]
- dblp key: journals/corr/abs-2106-09660
- persistent URL: https://dblp.org/rec/journals/corr/abs-2106-09660
Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. CoRR abs/2106.09660 (2021)
[i35]
- dblp key: journals/corr/abs-2107-02294
- persistent URL: https://dblp.org/rec/journals/corr/abs-2107-02294
Piotr Zelasko, Raghavendra Pappagari, Najim Dehak:
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition. CoRR abs/2107.02294 (2021)
[i34]
- dblp key: journals/corr/abs-2109-06103
- persistent URL: https://dblp.org/rec/journals/corr/abs-2109-06103
Raghavendra Pappagari, Piotr Zelasko, Agnieszka Mikolajczyk, Piotr Pezik, Najim Dehak:
Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios. CoRR abs/2109.06103 (2021)
[i33]
- dblp key: journals/corr/abs-2109-06112
- persistent URL: https://dblp.org/rec/journals/corr/abs-2109-06112
Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. CoRR abs/2109.06112 (2021)
[i32]
- dblp key: journals/corr/abs-2109-13425
- persistent URL: https://dblp.org/rec/journals/corr/abs-2109-13425
Jaejin Cho, Jesús Villalba, Najim Dehak:
The JHU submission to VoxSRC-21: Track 3. CoRR abs/2109.13425 (2021)
[i31]
- dblp key: journals/corr/abs-2110-02345
- persistent URL: https://dblp.org/rec/journals/corr/abs-2110-02345
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding. CoRR abs/2110.02345 (2021)
2020
[j16]
- dblp key: journals/csl/TanSD20
- persistent URL: https://dblp.org/rec/journals/csl/TanSD20
Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59: 1-21 (2020)
[j15]
- dblp key: journals/csl/VillalbaCSGMSBG20
- persistent URL: https://dblp.org/rec/journals/csl/VillalbaCSGMSBG20
Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Leibny Paola García-Perera, Fred Richardson, Réda Dehak, Pedro A. Torres-Carrasquillo, Najim Dehak:
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations. Comput. Speech Lang. 60 (2020)
[j14]
- dblp key: journals/jstsp/Godino-Llorente20
- persistent URL: https://dblp.org/rec/journals/jstsp/Godino-Llorente20
Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. IEEE J. Sel. Top. Signal Process. 14(2): 234-239 (2020)
[j13]
- dblp key: journals/taslp/Moro-VelazquezH20
- persistent URL: https://dblp.org/rec/journals/taslp/Moro-VelazquezH20
Laureano Moro-Velázquez, Estefanía Hernández-García, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Najim Dehak:
Analysis of the Effects of Supraglottal Tract Surgical Procedures in Automatic Speaker Recognition Performance. IEEE ACM Trans. Audio Speech Lang. Process. 28: 798-812 (2020)
[c96]
- dblp key: conf/icassp/Moro-VelazquezV20
- persistent URL: https://dblp.org/rec/conf/icassp/Moro-VelazquezV20
Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Using X-Vectors to Automatically Detect Parkinson's Disease from Speech. ICASSP 2020: 1155-1159
[c95]
- dblp key: conf/icassp/PappagariWVCD20
- persistent URL: https://dblp.org/rec/conf/icassp/PappagariWVCD20
Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition. ICASSP 2020: 7169-7173
[c94]
- dblp key: conf/icassp/KatariaNVCGD20
- persistent URL: https://dblp.org/rec/conf/icassp/KatariaNVCGD20
Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, L. Paola García-Perera, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. ICASSP 2020: 7584-7588
[c93]
- dblp key: conf/icassp/NidadavoluKVGD20
- persistent URL: https://dblp.org/rec/conf/icassp/NidadavoluKVGD20
Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for Speaker Verification. ICASSP 2020: 7599-7603
[c92]
- dblp key: conf/interspeech/PappagariCMD20
- persistent URL: https://dblp.org/rec/conf/interspeech/PappagariCMD20
Raghavendra Pappagari, Jaejin Cho, Laureano Moro-Velázquez, Najim Dehak:
Using State of the Art Speaker Recognition and Natural Language Processing Technologies to Detect Alzheimer's Disease and Assess its Severity. INTERSPEECH 2020: 2177-2181
[c91]
- dblp key: conf/interspeech/ChoZV0D20
- persistent URL: https://dblp.org/rec/conf/interspeech/ChoZV0D20
Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. INTERSPEECH 2020: 3256-3260
[c90]
- dblp key: conf/interspeech/ZelaskoMHSD20
- persistent URL: https://dblp.org/rec/conf/interspeech/ZelaskoMHSD20
Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages. INTERSPEECH 2020: 3705-3709
[c89]
- dblp key: conf/interspeech/VillalbaZD20
- persistent URL: https://dblp.org/rec/conf/interspeech/VillalbaZD20
Jesús Villalba, Yuekai Zhang, Najim Dehak:
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. INTERSPEECH 2020: 4233-4237
[c88]
- dblp key: conf/interspeech/ZhangJVD20
- persistent URL: https://dblp.org/rec/conf/interspeech/ZhangJVD20
Yuekai Zhang, Ziyan Jiang, Jesús Villalba, Najim Dehak:
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. INTERSPEECH 2020: 4238-4242
[c87]
- dblp key: conf/interspeech/BhatiVZD20
- persistent URL: https://dblp.org/rec/conf/interspeech/BhatiVZD20
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. INTERSPEECH 2020: 4876-4880
[c86]
- dblp key: conf/interspeech/AugustyniakSMZS20
- persistent URL: https://dblp.org/rec/conf/interspeech/AugustyniakSMZS20
Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? INTERSPEECH 2020: 4906-4910
[c85]
- dblp key: conf/odyssey/LopezGCSBMGKNTD20
- persistent URL: https://dblp.org/rec/conf/odyssey/LopezGCSBMGKNTD20
Jesús Antonio Villalba López, Daniel Garcia-Romero, Nanxin Chen, Gregory Sell, Jonas Borgstrom, Alan McCree, Leibny Paola García-Perera, Saurabh Kataria, Phani Sankar Nidadavolu, Pedro Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey 2020: 273-280
[c84]
- dblp key: conf/odyssey/Garcia-PereraVB20
- persistent URL: https://dblp.org/rec/conf/odyssey/Garcia-PereraVB20
Leibny Paola García-Perera, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422
[c83]
- dblp key: conf/odyssey/KatariaNVD20
- persistent URL: https://dblp.org/rec/conf/odyssey/KatariaNVD20
Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss Based Enhancement for Speaker Verification. Odyssey 2020: 459-466
[i30]
- dblp key: journals/corr/abs-2002-00139
- persistent URL: https://dblp.org/rec/journals/corr/abs-2002-00139
Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss based Enhancement for Speaker Verification. CoRR abs/2002.00139 (2020)
[i29]
- dblp key: journals/corr/abs-2002-05039
- persistent URL: https://dblp.org/rec/journals/corr/abs-2002-05039
Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition. CoRR abs/2002.05039 (2020)
[i28]
- dblp key: journals/corr/abs-2004-05985
- persistent URL: https://dblp.org/rec/journals/corr/abs-2004-05985
Lukasz Augustyniak, Piotr Szymanski, Mikolaj Morzy, Piotr Zelasko, Adrian Szymczak, Jan Mizgajski, Yishay Carmiel, Najim Dehak:
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings? CoRR abs/2004.05985 (2020)
[i27]
- dblp key: journals/corr/abs-2005-08118
- persistent URL: https://dblp.org/rec/journals/corr/abs-2005-08118
Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages. CoRR abs/2005.08118 (2020)
[i26]
- dblp key: journals/corr/abs-2005-08331
- persistent URL: https://dblp.org/rec/journals/corr/abs-2005-08331
Phani Sankar Nidadavolu, Saurabh Kataria, L. Paola García-Perera, Jesús Villalba, Najim Dehak:
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild. CoRR abs/2005.08331 (2020)
[i25]
- dblp key: journals/corr/abs-2007-13033
- persistent URL: https://dblp.org/rec/journals/corr/abs-2007-13033
Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. CoRR abs/2007.13033 (2020)
[i24]
- dblp key: journals/corr/abs-2010-11221
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-11221
Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. CoRR abs/2010.11221 (2020)
[i23]
- dblp key: journals/corr/abs-2010-11860
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-11860
Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models. CoRR abs/2010.11860 (2020)
[i22]
- dblp key: journals/corr/abs-2010-12104
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-12104
Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak:
How Phonotactics Affect Multilingual and Zero-shot ASR Performance. CoRR abs/2010.12104 (2020)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14602
Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. CoRR abs/2010.14602 (2020)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01210
Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the present: a regularization method for the ASR source-target attention layer. CoRR abs/2011.01210 (2020)

2010 – 2019

2019
[j12]
  persistent URL:
  - https://dblp.org/rec/journals/bspc/Moro-VelazquezG19
Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Jan Rusz, Stefanie Shattuck-Hufnagel, Najim Dehak:
A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing. Biomed. Signal Process. Control. 48: 205-220 (2019)
[c82]
  persistent URL:
  - https://dblp.org/rec/conf/asru/NidadavoluKVD19
Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-Gans. ASRU 2019: 710-717
[c81]
  persistent URL:
  - https://dblp.org/rec/conf/asru/PappagariZVCD19
Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. ASRU 2019: 838-844
[c80]
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/BhatiLVTKD19
Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak:
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP 2019: 1-5
[c79]
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/BhatiMVD19
Saurabhchand Bhati, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
LSTM Siamese Network for Parkinson's Disease Detection from Speech. GlobalSIP 2019: 1-5
[c78]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NidadavoluIVD19
Phani Sankar Nidadavolu, Vicente Iglesias, Jesús Villalba, Najim Dehak:
Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition. ICASSP 2019: 6111-6115
[c77]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChoWHBIVD19
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. ICASSP 2019: 6191-6195
[c76]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NidadavoluVD19
Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Cycle-GANs for Domain Adaptation of Acoustic Features for Speaker Recognition. ICASSP 2019: 6206-6210
[c75]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LaiARYDK19
Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. ICASSP 2019: 6316-6320
[c74]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShonDRG19
Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. INTERSPEECH 2019: 356-360
[c73]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaiCVD19
Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks. INTERSPEECH 2019: 1013-1017
[c72]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VillalbaCSGMSBR19
Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492
[c71]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SnyderVCPSDK19
David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472
[c70]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BhatiNMD19
Saurabhchand Bhati, Shekhar Nayak, K. Sri Rama Murty, Najim Dehak:
Unsupervised Acoustic Segmentation and Clustering Using Siamese Network Embeddings. INTERSPEECH 2019: 2668-2672
[c69]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenVD19
Nanxin Chen, Jesús Villalba, Najim Dehak:
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings. INTERSPEECH 2019: 2948-2952
[c68]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Moro-VelazquezC19
Laureano Moro-Velázquez, Jaejin Cho, Shinji Watanabe, Mark A. Hasegawa-Johnson, Odette Scharenborg, Heejin Kim, Najim Dehak:
Study of the Performance of Automatic Speech Recognition Systems in Speakers with Parkinson's Disease. INTERSPEECH 2019: 3875-3879
[c67]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SarmaGPGSD19
Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak:
Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN. INTERSPEECH 2019: 3925-3929
[c66]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WiesnerRWLDK19
Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings. INTERSPEECH 2019: 4375-4379
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-01120
Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks. CoRR abs/1904.01120 (2019)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04240
Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation. CoRR abs/1904.04240 (2019)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-11641
Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich:
Speaker Sincerity Detection based on Covariance Feature Vectors and Ensemble Methods. CoRR abs/1904.11641 (2019)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03588
Zheng-Hua Tan, Achintya Kumar Sarkar, Najim Dehak:
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method. CoRR abs/1906.03588 (2019)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10781
Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. CoRR abs/1910.10781 (2019)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-11905
Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, Paola García, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. CoRR abs/1910.11905 (2019)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-11909
Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs. CoRR abs/1910.11909 (2019)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-11915
Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for speaker verification. CoRR abs/1910.11915 (2019)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04908
Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak:
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition. CoRR abs/1911.04908 (2019)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-00938
Paola García, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker detection in the wild: Lessons learned from JSALT 2019. CoRR abs/1912.00938 (2019)
2018
[j11]
  persistent URL:
  - https://dblp.org/rec/journals/access/Zazo-CandilNCGD18
Rubén Zazo-Candil, Phani Sankar Nidadavolu, Nanxin Chen, Joaquin Gonzalez-Rodriguez, Najim Dehak:
Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks. IEEE Access 6: 22524-22530 (2018)
[j10]
  persistent URL:
  - https://dblp.org/rec/journals/asc/Moro-VelazquezG18
Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Juan Rafael Orozco-Arroyave, Najim Dehak:
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease. Appl. Soft Comput. 62: 649-666 (2018)
[j9]
  persistent URL:
  - https://dblp.org/rec/journals/dsp/Orozco-Arroyave18
Juan Rafael Orozco-Arroyave, Juan Camilo Vásquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani S. Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Hamid R. Chinaei, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink, Elmar Nöth:
NeuroSpeech: An open-source software for Parkinson's speech analysis. Digit. Signal Process. 77: 207-221 (2018)
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/softx/Orozco-Arroyave18
Juan Rafael Orozco-Arroyave, Juan Camilo Vásquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani S. Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Hamid R. Chinaei, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink, Elmar Nöth:
NeuroSpeech. SoftwareX 8: 69-70 (2018)
[c65]
  persistent URL:
  - https://dblp.org/rec/conf/embc/Moro-VelazquezG18
Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jan Rusz, Sabine Skodda, Francisco Grandas, José-Miguel Velazquez, Juan Rafael Orozco-Arroyave, Elmar Nöth, Najim Dehak:
Study of the Automatic Detection of Parkison's Disease Based on Speaker Recognition Technologies and Allophonic Distillation. EMBC 2018: 1404-1407
[c64]
  persistent URL:
  - https://dblp.org/rec/conf/iberspeech/HuangGVPD18
Zili Huang, L. Paola García-Perera, Jesús Villalba, Daniel Povey, Najim Dehak:
JHU Diarization System Description. IberSPEECH 2018: 236-239
[c63]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenVCD18
Nanxin Chen, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech. ICASSP 2018: 4939-4943
[c62]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaciejewskiSMDK18
Matthew Maciejewski, David Snyder, Vimal Manohar, Najim Dehak, Sanjeev Khudanpur:
Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods. ICASSP 2018: 5244-5248
[c61]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PappagariVD18
Raghavendra Pappagari, Jesús Villalba, Najim Dehak:
Joint Verification-Identification in end-to-end Multi-Scale CNN Framework for Topic Identification. ICASSP 2018: 6199-6203
[c60]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenVD18
Nanxin Chen, Jesús Villalba, Najim Dehak:
An Investigation of Non-linear i-vectors for Speaker Verification. INTERSPEECH 2018: 87-91
[c59]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChoPKVCD18
Jaejin Cho, Raghavendra Pappagari, Purva Kulkarni, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts. INTERSPEECH 2018: 247-251
[c58]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GhahremaniNCVPK18
Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak:
End-to-end Deep Neural Network Age Estimation. INTERSPEECH 2018: 277-281
[c57]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NidadavoluLVD18
Phani Sankar Nidadavolu, Cheng-I Lai, Jesús Villalba, Najim Dehak:
Investigation on Bandwidth Extension for Speaker Recognition. INTERSPEECH 2018: 1111-1115
[c56]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ScharenborgTHD18
Odette Scharenborg, Sebastian Tiesmeyer, Mark Hasegawa-Johnson, Najim Dehak:
Visualizing Phoneme Category Adaptation in Deep Neural Networks. INTERSPEECH 2018: 1482-1486
[c55]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FrederiksenVWTD18
Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. INTERSPEECH 2018: 1823-1827
[c54]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WiesnerLOHMTHDK18
Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056
[c53]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZelaskoSMSCD18
Piotr Zelasko, Piotr Szymanski, Jan Mizgajski, Adrian Szymczak, Yishay Carmiel, Najim Dehak:
Punctuation Prediction Model for Conversational Speech. INTERSPEECH 2018: 2633-2637
[c52]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SellSMGVMMDPWK18
Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812
[c51]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SarmaGPGSD18
Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak:
Emotion Identification from Raw Speech Signals Using DNNs. INTERSPEECH 2018: 3097-3101
[c50]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/RichardsonTBSGV18
Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey 2018: 54-59
[c49]
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LopezBD18
Jesús Antonio Villalba López, Niko Brümmer, Najim Dehak:
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions. Odyssey 2018: 112-119
[c48]
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiuWWHTDK18
Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663
[c47]
  persistent URL:
  - https://dblp.org/rec/conf/sltu/ScharenborgEHD18
Odette Scharenborg, Patrick Ebel, Mark Hasegawa-Johnson, Najim Dehak:
Building an ASR System for Mboshi Using A Cross-Language Definition of Acoustic Units Approach. SLTU 2018: 167-171
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-08731
Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-00543
Piotr Zelasko, Piotr Szymanski, Jan Mizgajski, Adrian Szymczak, Yishay Carmiel, Najim Dehak:
Punctuation Prediction Model for Conversational Speech. CoRR abs/1807.00543 (2018)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-06204
Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-06663
Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System. CoRR abs/1807.06663 (2018)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-13048
Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. CoRR abs/1810.13048 (2018)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02162
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language model integration based on memory control for sequence to sequence speech recognition. CoRR abs/1811.02162 (2018)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-03919
Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Low Resource Multi-modal Data Augmentation for End-to-end ASR. CoRR abs/1812.03919 (2018)
2017
[c46]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Vasquez-CorreaO17
Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Raman Arora, Elmar Nöth, Najim Dehak, Heidi Christensen, Frank Rudzicz, Tobias Bocklet, Milos Cernak, Hamid R. Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Maria Yancheva, Alyssa Vann, Nikolai Vogler:
Multi-view representation learning via gcca for multimodal analysis of Parkinson's disease. ICASSP 2017: 2966-2970
[c45]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuYSKROGDBK17
Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An empirical evaluation of zero resource acoustic unit discovery. ICASSP 2017: 5305-5309
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KesirajuPOBDKCG17
Santosh Kesiraju, Raghavendra Pappagari, Lucas Ondel, Lukás Burget, Najim Dehak, Sanjeev Khudanpur, Jan Cernocký, Suryakanth V. Gangashetty:
Topic identification of spoken documents using unsupervised acoustic unit discovery. ICASSP 2017: 5745-5749
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GarciaODDN17
Nicanor García, Juan Rafael Orozco-Arroyave, Luis Fernando D'Haro, Najim Dehak, Elmar Nöth:
Evaluation of the Neurological State of People with Parkinson's Disease Using i-Vectors. INTERSPEECH 2017: 299-303
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VillalbaBD17
Jesús Villalba, Niko Brümmer, Najim Dehak:
Tied Variational Autoencoder Backends for i-Vector Speaker Recognition. INTERSPEECH 2017: 1004-1008
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Torres-Carrasquillo17
Pedro A. Torres-Carrasquillo, Fred Richardson, Shahan C. Nercessian, Douglas E. Sturim, William M. Campbell, Youngjune Gwon, Swaroop Vattam, Najim Dehak, Sri Harish Reddy Mallidi, Phani Sankar Nidadavolu, Ruizhi Li, Réda Dehak:
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System. INTERSPEECH 2017: 1333-1337
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/tsd/GarciaVODN17
Nicanor García, Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Najim Dehak, Elmar Nöth:
Language Independent Assessment of Motor Impairments of Patients with Parkinson's Disease Using i-Vectors. TSD 2017: 147-155
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LiuYSKROGDBK17
Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukás Burget, Sanjeev Khudanpur:
An Empirical Evaluation of Zero Resource Acoustic Unit Discovery. CoRR abs/1702.01360 (2017)
2016
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ShumHDG16
Stephen H. Shum, David F. Harwath, Najim Dehak, James R. Glass:
On the Use of Acoustic Unit Discovery for Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(9): 1665-1676 (2016)
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SenoussaouiCDK16
Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro L. Koerich:
Native Language Detection Using the I-Vector Framework. INTERSPEECH 2016: 2398-2402
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AliDCKYG0R16
Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James R. Glass, Peter Bell, Steve Renals:
Automatic Dialect Detection in Arabic Broadcast Speech. INTERSPEECH 2016: 2934-2938
[c37]
- dblp key: conf/interspeech/LiMBPD16
- persistent URL: https://dblp.org/rec/conf/interspeech/LiMBPD16
Ruizhi Li, Sri Harish Reddy Mallidi, Lukás Burget, Oldrich Plchot, Najim Dehak:
Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. INTERSPEECH 2016: 3265-3269
[c36]
- dblp key: conf/odyssey/Dehak15
- persistent URL: https://dblp.org/rec/conf/odyssey/Dehak15
Najim Dehak:
I-Vector Representation Based on GMM and DNN for Audio Classification. Odyssey 2016
[c35]
- dblp key: conf/odyssey/Torres-Carrasquillo16
- persistent URL: https://dblp.org/rec/conf/odyssey/Torres-Carrasquillo16
Pedro A. Torres-Carrasquillo, Najim Dehak, Elizabeth Godoy, Douglas A. Reynolds, Fred Richardson, Stephen Shum, Elliot Singer, Douglas E. Sturim:
The MITLL NIST LRE 2015 Language Recognition System. Odyssey 2016: 196-203
2015
[j6]
- dblp key: journals/spl/RichardsonRD15
- persistent URL: https://dblp.org/rec/journals/spl/RichardsonRD15
Fred Richardson, Douglas A. Reynolds, Najim Dehak:
Deep Neural Network Approaches to Speaker and Language Recognition. IEEE Signal Process. Lett. 22(10): 1671-1675 (2015)
[c34]
- dblp key: conf/interspeech/RichardsonRD15
- persistent URL: https://dblp.org/rec/conf/interspeech/RichardsonRD15
Fred Richardson, Douglas A. Reynolds, Najim Dehak:
A unified deep neural network for speaker and language recognition. INTERSPEECH 2015: 1146-1150
[c33]
- dblp key: conf/interspeech/CardinalDZG15
- persistent URL: https://dblp.org/rec/conf/interspeech/CardinalDZG15
Patrick Cardinal, Najim Dehak, Yu Zhang, James R. Glass:
Speaker adaptation using the i-vector technique for bottleneck features. INTERSPEECH 2015: 2867-2871
[c32]
- dblp key: conf/mm/CardinalDKAB15
- persistent URL: https://dblp.org/rec/conf/mm/CardinalDKAB15
Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich, Jahangir Alam, Patrice Boucher:
ETS System for AV+EC 2015 Challenge. AVEC@ACM Multimedia 2015: 17-23
[i1]
- dblp key: journals/corr/RichardsonRD15
- persistent URL: https://dblp.org/rec/journals/corr/RichardsonRD15
Fred Richardson, Douglas A. Reynolds, Najim Dehak:
A Unified Deep Neural Network for Speaker and Language Recognition. CoRR abs/1504.00923 (2015)
2014
[j5]
- dblp key: journals/taslp/BahariDhBAG14
- persistent URL: https://dblp.org/rec/journals/taslp/BahariDhBAG14
Mohamad Hasan Bahari, Najim Dehak, Hugo Van hamme, Lukás Burget, Ahmed Ali, Jim Glass:
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(7): 1117-1129 (2014)
[c31]
- dblp key: conf/interspeech/ShumDG14
- persistent URL: https://dblp.org/rec/conf/interspeech/ShumDG14
Stephen H. Shum, Najim Dehak, James R. Glass:
Limited labels for unlimited data: active learning for speaker recognition. INTERSPEECH 2014: 383-387
[c30]
- dblp key: conf/interspeech/CardinalADZHZGV14
- persistent URL: https://dblp.org/rec/conf/interspeech/CardinalADZHZGV14
Patrick Cardinal, Ahmed Ali, Najim Dehak, Yu Zhang, Tuka Al Hanai, Yifan Zhang, James R. Glass, Stephan Vogel:
Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera. INTERSPEECH 2014: 2088-2092
[c29]
- dblp key: conf/odyssey/DehakPBBhD14
- persistent URL: https://dblp.org/rec/conf/odyssey/DehakPBBhD14
Najim Dehak, Oldrich Plchot, Mohamad Hasan Bahari, Lukás Burget, Hugo Van hamme, Réda Dehak:
GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification. Odyssey 2014: 48-53
[c28]
- dblp key: conf/slt/AliZCDVG14
- persistent URL: https://dblp.org/rec/conf/slt/AliZCDVG14
Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James R. Glass:
A complete KALDI recipe for building Arabic speech recognition systems. SLT 2014: 525-529
2013
[j4]
- dblp key: journals/taslp/ShumDDG13
- persistent URL: https://dblp.org/rec/journals/taslp/ShumDDG13
Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach. IEEE Trans. Speech Audio Process. 21(10): 2015-2028 (2013)
[c27]
- dblp key: conf/icassp/PlchotMMDMCGHMMSSTTZZ13
- persistent URL: https://dblp.org/rec/conf/icassp/PlchotMMDMCGHMMSSTTZZ13
Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772
[c26]
- dblp key: conf/interspeech/FangDG13
- persistent URL: https://dblp.org/rec/conf/interspeech/FangDG13
Xiao Fang, Najim Dehak, James R. Glass:
Bayesian distance metric learning on i-vector for speaker verification. INTERSPEECH 2013: 2514-2518
[c25]
- dblp key: conf/interspeech/SenoussaouiKDD13
- persistent URL: https://dblp.org/rec/conf/interspeech/SenoussaouiKDD13
Mohammed Senoussaoui, Patrick Kenny, Pierre Dumouchel, Najim Dehak:
New cosine similarity scorings to implement gender-independent speaker verification. INTERSPEECH 2013: 2773-2777
2012
[c24]
- dblp key: conf/interspeech/MatejkaPSGDVGMMD12
- persistent URL: https://dblp.org/rec/conf/interspeech/MatejkaPSGDVGMMD12
Pavel Matejka, Oldrich Plchot, Mehdi Soufifar, Ondrej Glembek, Luis Fernando D'Haro, Karel Veselý, Frantisek Grézl, Jeff Z. Ma, Spyros Matsoukas, Najim Dehak:
Patrol Team Language Identification System for DARPA RATS P1 Evaluation. INTERSPEECH 2012: 50-53
[c23]
- dblp key: conf/interspeech/ShumDG12
- persistent URL: https://dblp.org/rec/conf/interspeech/ShumDG12
Stephen Shum, Najim Dehak, Jim Glass:
On the Use of Spectral and Iterative Methods for Speaker Diarization. INTERSPEECH 2012: 482-485
[c22]
- dblp key: conf/odyssey/SenoussaouiDKDD12
- persistent URL: https://dblp.org/rec/conf/odyssey/SenoussaouiDKDD12
Mohammed Senoussaoui, Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel:
First attempt of Boltzmann machines for speaker verification. Odyssey 2012: 117-121
[c21]
- dblp key: conf/odyssey/SingerTRMRDS12
- persistent URL: https://dblp.org/rec/conf/odyssey/SingerTRMRDS12
Elliot Singer, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Alan McCree, Fred Richardson, Najim Dehak, Douglas E. Sturim:
The MITLL NIST LRE 2011 language recognition system. Odyssey 2012: 209-215
2011
[j3]
- dblp key: journals/taslp/DehakKDDO11
- persistent URL: https://dblp.org/rec/journals/taslp/DehakKDDO11
Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel, Pierre Ouellet:
Front-End Factor Analysis for Speaker Verification. IEEE Trans. Speech Audio Process. 19(4): 788-798 (2011)
[c20]
- dblp key: conf/icassp/KaramCD11
- persistent URL: https://dblp.org/rec/conf/icassp/KaramCD11
Zahi N. Karam, William M. Campbell, Najim Dehak:
Towards reduced false-alarms using cohorts. ICASSP 2011: 4512-4515
[c19]
- dblp key: conf/icassp/DehakKRDCG11
- persistent URL: https://dblp.org/rec/conf/icassp/DehakKRDCG11
Najim Dehak, Zahi N. Karam, Douglas A. Reynolds, Réda Dehak, William M. Campbell, James R. Glass:
A channel-blind system for speaker verification. ICASSP 2011: 4536-4539
[c18]
- dblp key: conf/icassp/SturimCDKMRRTS11
- persistent URL: https://dblp.org/rec/conf/icassp/SturimCDKMRRTS11
Douglas E. Sturim, William M. Campbell, Najim Dehak, Zahi N. Karam, Alan McCree, Douglas A. Reynolds, Fred Richardson, Pedro A. Torres-Carrasquillo, Stephen Shum:
The MIT LL 2010 speaker recognition evaluation system: Scalable language-independent speaker recognition. ICASSP 2011: 5272-5275
[c17]
- dblp key: conf/interspeech/DehakTRD11
- persistent URL: https://dblp.org/rec/conf/interspeech/DehakTRD11
Najim Dehak, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Réda Dehak:
Language Recognition via i-vectors and Dimensionality Reduction. INTERSPEECH 2011: 857-860
[c16]
- dblp key: conf/interspeech/ShumDCRG11
- persistent URL: https://dblp.org/rec/conf/interspeech/ShumDCRG11
Stephen Shum, Najim Dehak, Ekapol Chuangsuwanich, Douglas A. Reynolds, James R. Glass:
Exploiting Intra-Conversation Variability for Speaker Diarization. INTERSPEECH 2011: 945-948
2010
[c15]
- dblp key: conf/odyssey/SenoussaouiKDD10
- persistent URL: https://dblp.org/rec/conf/odyssey/SenoussaouiKDD10
Mohammed Senoussaoui, Patrick Kenny, Najim Dehak, Pierre Dumouchel:
An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech. Odyssey 2010: 6
[c14]
- dblp key: conf/odyssey/DehakDGRK10
- persistent URL: https://dblp.org/rec/conf/odyssey/DehakDGRK10
Najim Dehak, Réda Dehak, James R. Glass, Douglas A. Reynolds, Patrick Kenny:
Cosine Similarity Scoring without Score Normalization Techniques. Odyssey 2010: 15
[c13]
- dblp key: conf/odyssey/ShumDDG10
- persistent URL: https://dblp.org/rec/conf/odyssey/ShumDDG10
Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass:
Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification. Odyssey 2010: 16

2000 – 2009

2009
[c12]
- dblp key: conf/icassp/GlembekBDBK09
- persistent URL: https://dblp.org/rec/conf/icassp/GlembekBDBK09
Ondrej Glembek, Lukás Burget, Najim Dehak, Niko Brümmer, Patrick Kenny:
Comparison of scoring methods used in speaker recognition with Joint Factor Analysis. ICASSP 2009: 4057-4060
[c11]
- dblp key: conf/icassp/DehakKDGDBHC09
- persistent URL: https://dblp.org/rec/conf/icassp/DehakKDGDBHC09
Najim Dehak, Patrick Kenny, Réda Dehak, Ondrej Glembek, Pierre Dumouchel, Lukás Burget, Valiantsina Hubeika, Fabio Castaldo:
Support vector machines and Joint Factor Analysis for speaker verification. ICASSP 2009: 4237-4240
[c10]
- dblp key: conf/interspeech/DumouchelDADB09
- persistent URL: https://dblp.org/rec/conf/interspeech/DumouchelDADB09
Pierre Dumouchel, Najim Dehak, Yazid Attabi, Réda Dehak, Narjès Boufaden:
Cepstral and long-term features for emotion recognition. INTERSPEECH 2009: 344-347
[c9]
- dblp key: conf/interspeech/DehakDKBOD09
- persistent URL: https://dblp.org/rec/conf/interspeech/DehakDKBOD09
Najim Dehak, Réda Dehak, Patrick Kenny, Niko Brümmer, Pierre Ouellet, Pierre Dumouchel:
Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification. INTERSPEECH 2009: 1559-1562
2008
[j2]
- dblp key: journals/taslp/KennyODGD08
- persistent URL: https://dblp.org/rec/journals/taslp/KennyODGD08
Patrick Kenny, Pierre Ouellet, Najim Dehak, Vishwa Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification. IEEE Trans. Speech Audio Process. 16(5): 980-988 (2008)
[c8]
- dblp key: conf/interspeech/KennyDOGD08
- persistent URL: https://dblp.org/rec/conf/interspeech/KennyDOGD08
Patrick Kenny, Najim Dehak, Pierre Ouellet, Vishwa Gupta, Pierre Dumouchel:
Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation. INTERSPEECH 2008: 1401-1404
[c7]
- dblp key: conf/odyssey/DehakDKD08
- persistent URL: https://dblp.org/rec/conf/odyssey/DehakDKD08
Najim Dehak, Réda Dehak, Patrick Kenny, Pierre Dumouchel:
Comparison between factor analysis and GMM support vector machines for speaker verification. Odyssey 2008: 9
[c6]
- dblp key: conf/odyssey/KennyDDGD08
- persistent URL: https://dblp.org/rec/conf/odyssey/KennyDDGD08
Patrick Kenny, Najim Dehak, Réda Dehak, Vishwa Gupta, Pierre Dumouchel:
The role of speaker factors in the NIST extended data task. Odyssey 2008: 11
[c5]
- dblp key: conf/odyssey/DehakDKD08a
- persistent URL: https://dblp.org/rec/conf/odyssey/DehakDKD08a
Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Kernel combination for SVM speaker verification. Odyssey 2008: 21
2007
[j1]
- dblp key: journals/taslp/DehakDK07
- persistent URL: https://dblp.org/rec/journals/taslp/DehakDK07
Najim Dehak, Pierre Dumouchel, Patrick Kenny:
Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification. IEEE Trans. Speech Audio Process. 15(7): 2095-2103 (2007)
[c4]
- dblp key: conf/interspeech/DehakDKD07
- persistent URL: https://dblp.org/rec/conf/interspeech/DehakDKD07
Réda Dehak, Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Linear and non linear kernel GMM supervector machines for speaker verification. INTERSPEECH 2007: 302-305
[c3]
- dblp key: conf/interspeech/DehakKD07
- persistent URL: https://dblp.org/rec/conf/interspeech/DehakKD07
Najim Dehak, Patrick Kenny, Pierre Dumouchel:
Continuous prosodic features and formant modeling with joint factor analysis for speaker verification. INTERSPEECH 2007: 1234-1237
2006
[c2]
- dblp key: conf/icpr/BredinDC06
- persistent URL: https://dblp.org/rec/conf/icpr/BredinDC06
Hervé Bredin, Najim Dehak, Gérard Chollet:
GMM-based SVM for face recognition. ICPR (3) 2006: 1111-1114
[c1]
- dblp key: conf/odyssey/DehakC06
- persistent URL: https://dblp.org/rec/conf/odyssey/DehakC06
Najim Dehak, Gérard Chollet:
Support Vector GMMs for Speaker Verification. Odyssey 2006: 1-4
