default search action

combined dblp search
author search
venue search
publication search

ask others

Rohan Kumar Das

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2023
[j17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/TaoLDHL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TaoLDHL23
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023)
2022
[j16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/spl/LiuDLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LiuDLL22
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022)
2021
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XuHZRD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XuHZRD21
Longting Xu, Daiyu Huang, Syed Faham Ali Zaidi, Abdul Rauf, Rohan Kumar Das:
Graph Fourier Transform Based Audio Zero-Watermarking. IEEE Signal Process. Lett. 28: 1943-1947 (2021)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangWDQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangWDQ21
Jichen Yang, Hongji Wang, Rohan Kumar Das, Yanmin Qian:
Modified Magnitude-Phase Spectrum Information for Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1065-1078 (2021)
2020
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/YangD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/YangD20
Jichen Yang, Rohan Kumar Das:
Long-term high frequency features for synthetic speech detection. Digit. Signal Process. 97 (2020)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/tifs/YangDL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tifs/YangDL20
Jichen Yang, Rohan Kumar Das, Haizhou Li:
Significance of Subband Features for Synthetic Speech Detection. IEEE Trans. Inf. Forensics Secur. 15: 2160-2170 (2020)
2019
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/cssp/DasJP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cssp/DasJP19
Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification. Circuits Syst. Signal Process. 38(4): 1775-1792 (2019)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/cssp/DasP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cssp/DasP19
Rohan Kumar Das, S. R. Mahadeva Prasanna:
Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions. Circuits Syst. Signal Process. 38(8): 3778-3801 (2019)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/dsp/YangD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/dsp/YangD19
Jichen Yang, Rohan Kumar Das:
Low frequency frame-wise normalization over constant-Q transform for playback speech detection. Digit. Signal Process. 89: 30-39 (2019)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangDZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangDZ19
Jichen Yang, Rohan Kumar Das, Nina Zhou:
Extraction of Octave Spectra Information for Spoofing Attack Detection. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2373-2384 (2019)
2018
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/DasSP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/DasSP18
Rohan Kumar Das, Bidisha Sharma, S. R. Mahadeva Prasanna:
Significance of duration modification for speaker verification under mismatch speech tempo condition. Int. J. Speech Technol. 21(3): 401-408 (2018)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/DasJP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/DasJP18
Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Multi-style speaker recognition database in practical conditions. Int. J. Speech Technol. 21(3): 409-419 (2018)
2017
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/prl/DasMP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/prl/DasMP17
Rohan Kumar Das, Akhil Babu Manam, S. R. Mahadeva Prasanna:
Exploring kernel discriminant analysis for speaker verification with limited test data. Pattern Recognit. Lett. 98: 26-31 (2017)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/SharmaPBD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/SharmaPBD17
Rajib Sharma, S. R. M. Prasanna, Ramesh K. Bhukya, Rohan Kumar Das:
Analysis of the Intrinsic Mode Functions for Speaker Information. Speech Commun. 91: 1-16 (2017)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/DasJP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/DasJP17
Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Development of Multi-Level Speech based Person Authentication System. J. Signal Process. Syst. 88(3): 259-271 (2017)
2013
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/ChakrabartyPD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/ChakrabartyPD13
Debmalya Chakrabarty, S. R. Mahadeva Prasanna, Rohan Kumar Das:
Development and evaluation of online text-independent speaker verification system for remote person authentication. Int. J. Speech Technol. 16(1): 75-88 (2013)
2012
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/CPMPD012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/CPMPD012
Haris B. C., Gayadhar Pradhan, A. Misra, S. R. M. Prasanna, Rohan Kumar Das, Rohit Sinha:
Multivariability speaker recognition database in Indian scenario. Int. J. Speech Technol. 15(4): 441-453 (2012)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoD24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoD24a
Yang Xiao, Rohan Kumar Das:
Dual Knowledge Distillation for Efficient Sound Event Detection. ICASSP Workshops 2024: 690-694
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangCDZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangCDZZ24
Jichen Yang, Fangfan Chen, Rohan Kumar Das, Zhengyu Zhu, Shunsi Zhang:
Adaptive-Avg-Pooling Based Attention Vision Transformer for Face Anti-Spoofing. ICASSP 2024: 3875-3879
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SaeedNMDTZLKNYS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SaeedNMDTZLKNYS24
Muhammad Saad Saeed, Shah Nawaz, Marta Moscati, Rohan Kumar Das, Muhammad Salman Tahir, Muhammad Zaigham Zaheer, Muhammad Irzam Liaqat, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf, Markus Schedl:
A Synopsis of FAME 2024 Challenge: Associating Faces with Voices in Multilingual Environments. ACM Multimedia 2024: 11333-11334
[c62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/HeXWZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/HeXWZD24
Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. Odyssey 2024: 137-144
2023
[c61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhandelwalD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhandelwalD23
Tanmay Khandelwal, Rohan Kumar Das:
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds. INTERSPEECH 2023: 1214-1218
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/ssp/KhandelwalDKC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssp/KhandelwalDKC23
Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng:
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions. SSP 2023: 329-333
2022
[c59]
- view
  - electronic edition @ dcase.community (open access)
  - details & citations
- export record
  dblp key:
  - conf/dcase/MarsD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MarsD22
Rohith Mars, Rohan Kumar Das:
A Device Classification-Aided Multi-Task Framework for Low-Complexity Acoustic Scene Classification. DCASE 2022
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TaoLDHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TaoLDHL22
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuDLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuDLL22
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/KhandelwalD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/KhandelwalD22
Tanmay Khandelwal, Rohan Kumar Das:
Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection. ISCSLP 2022: 428-432
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/MarsD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/MarsD22
Rohith Mars, Rohan Kumar Das:
On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement. ISCSLP 2022: 458-462
[c54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/XuTGSJPYD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/XuTGSJPYD22
Longting Xu, Mianxin Tian, Xing Guo, Zhiyong Shan, Jie Jia, Yiyuan Peng, Jichen Yang, Rohan Kumar Das:
A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks. Odyssey 2022: 107-111
2021
[c53]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/SudroDSP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SudroDSP21
Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition. APSIPA ASC 2021: 484-490
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasY021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasY021
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Data Augmentation with Signal Companding for Detection of Logical Access Attacks. ICASSP 2021: 6349-6353
[c51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasM021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasM021
Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Diagnosis of COVID-19 Using Auditory Acoustic Cues. Interspeech 2021: 921-925
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/OuyangDY021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/OuyangDY021
Meidan Ouyang, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Capsule Network based End-to-end System for Detection of Replay Attacks. ISCSLP 2021: 1-5
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/TaoPDQS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/TaoPDQS021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. ACM Multimedia 2021: 3927-3935
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SudroD0P21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SudroD0P21
Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks. SLT 2021: 720-727
2020
[c47]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/DasTYRY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasTYRY020
Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation. APSIPA 2020: 605-609
[c46]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/SarmaD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/SarmaD20
Biswajit Dev Sarma, Rohan Kumar Das:
Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech. APSIPA 2020: 610-615
[c45]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/Das020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Das020
Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. APSIPA 2020: 747-752
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0006HTYDKLT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0006HTYDKLT20
Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --. Blizzard Challenge / Voice Conversion Challenge 2020
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/DasKHLY0TT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/DasKHLY0TT20
Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. Blizzard Challenge / Voice Conversion Challenge 2020
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/LinMD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/LinMD020
Wanqiu Lin, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
Transformer-based Arabic Dialect Identification. IALP 2020: 192-196
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasY020
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Assessing the Scope of Generalized Countermeasures for Anti-Spoofing. ICASSP 2020: 6589-6593
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Das020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Das020
Rohan Kumar Das, Haizhou Li:
On the Importance of Vocal Tract Constriction for Speaker Characterization: The Whispered Speech Study. ICASSP 2020: 7119-7123
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouTLD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouTLD020
Xuehao Zhou, Xiaohai Tian, Grandee Lee, Rohan Kumar Das, Haizhou Li:
End-to-End Code-Switching TTS with Cross-Lingual Language Model. ICASSP 2020: 7614-7618
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuDY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuDY020
Zhenzong Wu, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Light Convolutional Neural Network with Feature Genuinization for Detection of Synthetic Speech Attacks. INTERSPEECH 2020: 1101-1105
[c37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TaoD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TaoD020
Ruijie Tao, Rohan Kumar Das, Haizhou Li:
Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network. INTERSPEECH 2020: 2242-2246
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QinLBRDN020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QinLBRDN020
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3456-3460
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasTK020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasTK020
Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. INTERSPEECH 2020: 4213-4217
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004DMS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004DMS020
Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. INTERSPEECH 2020: 4293-4297
[c33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/TianD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/TianD020
Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion. Odyssey 2020: 159-164
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/GaoTZD020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/GaoTZD020
Xiaoxue Gao, Xiaohai Tian, Yi Zhou, Rohan Kumar Das, Haizhou Li:
Personalized Singing Voice Generation Using WaveRNN. Odyssey 2020: 252-258
2019
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/GaoTDZ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GaoTDZ019
Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. APSIPA 2019: 159-164
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LiuD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiuD019
Yitong Liu, Rohan Kumar Das, Haizhou Li:
Multi-band Spectral Entropy Information for Detection of Replay Attacks. APSIPA 2019: 838-843
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhouTD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhouTD019
Yi Zhou, Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. APSIPA 2019: 1282-1287
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DasY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasY019
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech. APSIPA 2019: 1630-1635
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhouTYDL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhouTYDL19
Yi Zhou, Xiaohai Tian, Emre Yilmaz, Rohan Kumar Das, Haizhou Li:
A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion. ASRU 2019: 160-167
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DasYL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DasYL19
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic and Deep Features Perspective on ASVspoof 2019. ASRU 2019: 1018-1025
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouTXD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouTXD019
Yi Zhou, Xiaohai Tian, Haihua Xu, Rohan Kumar Das, Haizhou Li:
Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling. ICASSP 2019: 6790-6794
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasY019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasY019
Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic Features for Spoofed Speech Detection. INTERSPEECH 2019: 1058-1062
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHKYOV0DSLD0R19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHKYOV0DSLD0R19
Kong Aik Lee, Ville Hautamäki, Tomi H. Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Massimiliano Todisco:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019: 1497-1501
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SharmaD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SharmaD019
Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
Multi-Level Adaptive Speech Activity Detector for Speech in Naturalistic Environments. INTERSPEECH 2019: 2015-2019
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SharmaD019a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SharmaD019a
Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
On the Importance of Audio-Source Separation for Singer Identification in Polyphonic Music. INTERSPEECH 2019: 2020-2024
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Das019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Das019
Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Long-Term Acoustic Cues for Orca Activity Detection. INTERSPEECH 2019: 2418-2422
[c19]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/JelilSDP019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JelilSDP019
Sarfaraz Jelil, Abhishek Shrivastava, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
SpeechMarker: A Voice Based Multi-Level Attendance Application. INTERSPEECH 2019: 3665-3666
[c18]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/WuPZDC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuPZDC019
Jibin Wu, Zihan Pan, Malu Zhang, Rohan Kumar Das, Yansong Chua, Haizhou Li:
Robust Sound Recognition: A Neuromorphic Approach. INTERSPEECH 2019: 3667-3668
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuMD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuMD019
Tianchi Liu, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
A Unified Framework for Speaker and Utterance Verification. INTERSPEECH 2019: 4320-4324
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/ococosda/SheelvantSMDP019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ococosda/SheelvantSMDP019
Rohan Sheelvant, Bidisha Sharma, Maulik C. Madhavi, Rohan Kumar Das, S. R. M. Prasanna, Haizhou Li:
RSL2019: A Realistic Speech Localization Corpus. O-COCOSDA 2019: 1-6
2018
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YangD018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YangD018
Jichen Yang, Rohan Kumar Das, Haizhou Li:
Extended Constant-Q Cepstral Coefficients for Detection of Spoofing Attacks. APSIPA 2018: 1024-1029
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/Das018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/Das018
Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Excitation Source Features for Detection of Replay Attacks. APSIPA 2018: 1030-1037
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DasP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasP18
Rohan Kumar Das, S. R. Mahadeva Prasanna:
Investigating Text-independent Speaker Verification from Practically Realizable System Perspective. APSIPA 2018: 1483-1487
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DasM018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DasM018
Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Compensating Utterance Information in Fixed Phrase Speaker Verification. APSIPA 2018: 1708-1712
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/KanthetiDP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/KanthetiDP18
Srinivas Kantheti, Rohan Kumar Das, Hemant A. Patil:
Combining Phase-based Features for Replay Spoof Detection System. ISCSLP 2018: 151-155
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/XuDYY018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/XuDYY018
Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative X-Vectors for Text-Independent Speaker Verification. SLT 2018: 1014-1020
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/smm/SarmaDDH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/smm/SarmaDDH18
Biswajit Dev Sarma, Rohan Kumar Das, Abhishek Dey, Risto Haukioja:
Analysis of Speech Emotions in Realistic Environments. SMM 2018
2017
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JelilDP017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JelilDP017
Sarfaraz Jelil, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features. INTERSPEECH 2017: 22-26
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarDJKKMGSP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarDJKKMGSP17
Nagendra Kumar, Rohan Kumar Das, Sarfaraz Jelil, Dhanush B. K, H. Kashyap, K. Sri Rama Murty, Sriram Ganapathy, Rohit Sinha, S. R. Mahadeva Prasanna:
IITG-Indigo System for NIST 2016 SRE Challenge. INTERSPEECH 2017: 2859-2863
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/ncc/JelilDP017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ncc/JelilDP017
Sarfaraz Jelil, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
Role of voice activity detection methods for the speakers in the wild challenge. NCC 2017: 1-6
2016
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasJP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasJP16
Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances. INTERSPEECH 2016: 445-449
2015
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JelilD0P15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JelilD0P15
Sarfaraz Jelil, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances. INTERSPEECH 2015: 1042-1046
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/ncc/DasPP15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ncc/DasPP15
Rohan Kumar Das, Debadatta Pati, S. R. Mahadeva Prasanna:
Different aspects of source information for limited data speaker verification. NCC 2015: 1-6
2014
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DasAPR14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DasAPR14
Rohan Kumar Das, S. Abhiram, S. R. M. Prasanna, A. G. Ramakrishnan:
Combining source and system information for limited data speaker verification. INTERSPEECH 2014: 1836-1840
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ncc/DeyBBDCPS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ncc/DeyBBDCPS14
Subhadeep Dey, Sujit Barman, Ramesh Kumar Bhukya, Rohan Kumar Das, Haris B. C., S. R. M. Prasanna, Rohit Sinha:
Speech biometric based attendance system. NCC 2014: 1-6

Editorship

see FAQ

What is the meaning of the colors in the publication lists?

2020
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/blizzard/2020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/2020
Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 [contents]

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02781
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02781
Yang Xiao, Rohan Kumar Das:
Dual Knowledge Distillation for Efficient Sound Event Detection. CoRR abs/2402.02781 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09342
Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf:
Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan. CoRR abs/2404.09342 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17280
Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. CoRR abs/2404.17280 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02483
Tianchi Liu, Lin Zhang, Rohan Kumar Das, Yi Ma, Ruijie Tao, Haizhou Li:
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? CoRR abs/2406.02483 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-00291
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-00291
Yang Xiao, Han Yin, Jisheng Bai, Rohan Kumar Das:
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels. CoRR abs/2407.00291 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03656
Yang Xiao, Rohan Kumar Das:
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System. CoRR abs/2407.03656 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03657
Yang Xiao, Rohan Kumar Das:
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection. CoRR abs/2407.03657 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03661
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03661
Yang Xiao, Rohan Kumar Das:
Configurable DOA Estimation using Incremental Learning. CoRR abs/2407.03661 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-05034
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-05034
Yang Xiao, Rohan Kumar Das:
TF-Mamba: A Time-Frequency Network for Sound Source Localization. CoRR abs/2409.05034 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-13292
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13292
Han Yin, Jisheng Bai, Yang Xiao, Hui Wang, Siqi Zheng, Yafeng Chen, Rohan Kumar Das, Chong Deng, Jianfeng Chen:
Exploring Text-Queried Sound Event Detection with Audio Source Separation. CoRR abs/2409.13292 (2024)
2022
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-01624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-01624
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15385
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15385
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01091
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01091
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022)
2021
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-06592
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-06592
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. CoRR abs/2107.06592 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-08007
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-08007
Longting Xu, Daiyu Huang, Syed Faham Ali Zaidi, Abdul Rauf, Rohan Kumar Das:
Graph Fourier Transform based Audio Zero-watermarking. CoRR abs/2109.08007 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-00797
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-00797
Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition. CoRR abs/2110.00797 (2021)
2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00387
Xiaoyi Qin, Ming Li, Hui Bu, Rohan Kumar Das, Wei Rao, Shrikanth Narayanan, Haizhou Li:
The FFSVC 2020 Evaluation Plan. CoRR abs/2002.00387 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-08849
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-08849
Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. CoRR abs/2004.08849 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08046
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08046
Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2005.08046 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-08901
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-08901
Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. CoRR abs/2008.08901 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-12527
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-12527
Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion. CoRR abs/2008.12527 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-03554
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-03554
Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. CoRR abs/2009.03554 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03905
Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation. CoRR abs/2010.03905 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03907
Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. CoRR abs/2010.03907 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-03909
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03909
Biswajit Dev Sarma, Rohan Kumar Das:
Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech. CoRR abs/2010.03909 (2020)
2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-07386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-07386
Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019)
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-06798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-06798
Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative x-vectors for text-independent speaker verification. CoRR abs/1809.06798 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.