default search action
Rohan Kumar Das
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2023
- [j17]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1706-1719 (2023) - 2022
- [j16]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask. IEEE Signal Process. Lett. 29: 782-786 (2022) - 2021
- [j15]Longting Xu, Daiyu Huang, Syed Faham Ali Zaidi, Abdul Rauf, Rohan Kumar Das:
Graph Fourier Transform Based Audio Zero-Watermarking. IEEE Signal Process. Lett. 28: 1943-1947 (2021) - [j14]Jichen Yang, Hongji Wang, Rohan Kumar Das, Yanmin Qian:
Modified Magnitude-Phase Spectrum Information for Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1065-1078 (2021) - 2020
- [j13]Jichen Yang, Rohan Kumar Das:
Long-term high frequency features for synthetic speech detection. Digit. Signal Process. 97 (2020) - [j12]Jichen Yang, Rohan Kumar Das, Haizhou Li:
Significance of Subband Features for Synthetic Speech Detection. IEEE Trans. Inf. Forensics Secur. 15: 2160-2170 (2020) - 2019
- [j11]Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification. Circuits Syst. Signal Process. 38(4): 1775-1792 (2019) - [j10]Rohan Kumar Das, S. R. Mahadeva Prasanna:
Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions. Circuits Syst. Signal Process. 38(8): 3778-3801 (2019) - [j9]Jichen Yang, Rohan Kumar Das:
Low frequency frame-wise normalization over constant-Q transform for playback speech detection. Digit. Signal Process. 89: 30-39 (2019) - [j8]Jichen Yang, Rohan Kumar Das, Nina Zhou:
Extraction of Octave Spectra Information for Spoofing Attack Detection. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2373-2384 (2019) - 2018
- [j7]Rohan Kumar Das, Bidisha Sharma, S. R. Mahadeva Prasanna:
Significance of duration modification for speaker verification under mismatch speech tempo condition. Int. J. Speech Technol. 21(3): 401-408 (2018) - [j6]Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Multi-style speaker recognition database in practical conditions. Int. J. Speech Technol. 21(3): 409-419 (2018) - 2017
- [j5]Rohan Kumar Das, Akhil Babu Manam, S. R. Mahadeva Prasanna:
Exploring kernel discriminant analysis for speaker verification with limited test data. Pattern Recognit. Lett. 98: 26-31 (2017) - [j4]Rajib Sharma, S. R. M. Prasanna, Ramesh K. Bhukya, Rohan Kumar Das:
Analysis of the Intrinsic Mode Functions for Speaker Information. Speech Commun. 91: 1-16 (2017) - [j3]Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Development of Multi-Level Speech based Person Authentication System. J. Signal Process. Syst. 88(3): 259-271 (2017) - 2013
- [j2]Debmalya Chakrabarty, S. R. Mahadeva Prasanna, Rohan Kumar Das:
Development and evaluation of online text-independent speaker verification system for remote person authentication. Int. J. Speech Technol. 16(1): 75-88 (2013) - 2012
- [j1]Haris B. C., Gayadhar Pradhan, A. Misra, S. R. M. Prasanna, Rohan Kumar Das, Rohit Sinha:
Multivariability speaker recognition database in Indian scenario. Int. J. Speech Technol. 15(4): 441-453 (2012)
Conference and Workshop Papers
- 2024
- [c65]Yang Xiao, Rohan Kumar Das:
Dual Knowledge Distillation for Efficient Sound Event Detection. ICASSP Workshops 2024: 690-694 - [c64]Jichen Yang, Fangfan Chen, Rohan Kumar Das, Zhengyu Zhu, Shunsi Zhang:
Adaptive-Avg-Pooling Based Attention Vision Transformer for Face Anti-Spoofing. ICASSP 2024: 3875-3879 - [c63]Muhammad Saad Saeed, Shah Nawaz, Marta Moscati, Rohan Kumar Das, Muhammad Salman Tahir, Muhammad Zaigham Zaheer, Muhammad Irzam Liaqat, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf, Markus Schedl:
A Synopsis of FAME 2024 Challenge: Associating Faces with Voices in Multilingual Environments. ACM Multimedia 2024: 11333-11334 - [c62]Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. Odyssey 2024: 137-144 - 2023
- [c61]Tanmay Khandelwal, Rohan Kumar Das:
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds. INTERSPEECH 2023: 1214-1218 - [c60]Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng:
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions. SSP 2023: 329-333 - 2022
- [c59]Rohith Mars, Rohan Kumar Das:
A Device Classification-Aided Multi-Task Framework for Low-Complexity Acoustic Scene Classification. DCASE 2022 - [c58]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Speaker Recognition with Loss-Gated Learning. ICASSP 2022: 6142-6146 - [c57]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances. ICASSP 2022: 7517-7521 - [c56]Tanmay Khandelwal, Rohan Kumar Das:
Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection. ISCSLP 2022: 428-432 - [c55]Rohith Mars, Rohan Kumar Das:
On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement. ISCSLP 2022: 458-462 - [c54]Longting Xu, Mianxin Tian, Xing Guo, Zhiyong Shan, Jie Jia, Yiyuan Peng, Jichen Yang, Rohan Kumar Das:
A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks. Odyssey 2022: 107-111 - 2021
- [c53]Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition. APSIPA ASC 2021: 484-490 - [c52]Rohan Kumar Das, Jichen Yang, Haizhou Li:
Data Augmentation with Signal Companding for Detection of Logical Access Attacks. ICASSP 2021: 6349-6353 - [c51]Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Diagnosis of COVID-19 Using Auditory Acoustic Cues. Interspeech 2021: 921-925 - [c50]Meidan Ouyang, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Capsule Network based End-to-end System for Detection of Replay Attacks. ISCSLP 2021: 1-5 - [c49]Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. ACM Multimedia 2021: 3927-3935 - [c48]Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks. SLT 2021: 720-727 - 2020
- [c47]Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation. APSIPA 2020: 605-609 - [c46]Biswajit Dev Sarma, Rohan Kumar Das:
Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech. APSIPA 2020: 610-615 - [c45]Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. APSIPA 2020: 747-752 - [c44]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --. Blizzard Challenge / Voice Conversion Challenge 2020 - [c43]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. Blizzard Challenge / Voice Conversion Challenge 2020 - [c42]Wanqiu Lin, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
Transformer-based Arabic Dialect Identification. IALP 2020: 192-196 - [c41]Rohan Kumar Das, Jichen Yang, Haizhou Li:
Assessing the Scope of Generalized Countermeasures for Anti-Spoofing. ICASSP 2020: 6589-6593 - [c40]Rohan Kumar Das, Haizhou Li:
On the Importance of Vocal Tract Constriction for Speaker Characterization: The Whispered Speech Study. ICASSP 2020: 7119-7123 - [c39]Xuehao Zhou, Xiaohai Tian, Grandee Lee, Rohan Kumar Das, Haizhou Li:
End-to-End Code-Switching TTS with Cross-Lingual Language Model. ICASSP 2020: 7614-7618 - [c38]Zhenzong Wu, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Light Convolutional Neural Network with Feature Genuinization for Detection of Synthetic Speech Attacks. INTERSPEECH 2020: 1101-1105 - [c37]Ruijie Tao, Rohan Kumar Das, Haizhou Li:
Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network. INTERSPEECH 2020: 2242-2246 - [c36]Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3456-3460 - [c35]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. INTERSPEECH 2020: 4213-4217 - [c34]Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. INTERSPEECH 2020: 4293-4297 - [c33]Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion. Odyssey 2020: 159-164 - [c32]Xiaoxue Gao, Xiaohai Tian, Yi Zhou, Rohan Kumar Das, Haizhou Li:
Personalized Singing Voice Generation Using WaveRNN. Odyssey 2020: 252-258 - 2019
- [c31]Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. APSIPA 2019: 159-164 - [c30]Yitong Liu, Rohan Kumar Das, Haizhou Li:
Multi-band Spectral Entropy Information for Detection of Replay Attacks. APSIPA 2019: 838-843 - [c29]Yi Zhou, Xiaohai Tian, Rohan Kumar Das, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. APSIPA 2019: 1282-1287 - [c28]Rohan Kumar Das, Jichen Yang, Haizhou Li:
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech. APSIPA 2019: 1630-1635 - [c27]Yi Zhou, Xiaohai Tian, Emre Yilmaz, Rohan Kumar Das, Haizhou Li:
A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion. ASRU 2019: 160-167 - [c26]Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic and Deep Features Perspective on ASVspoof 2019. ASRU 2019: 1018-1025 - [c25]Yi Zhou, Xiaohai Tian, Haihua Xu, Rohan Kumar Das, Haizhou Li:
Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling. ICASSP 2019: 6790-6794 - [c24]Rohan Kumar Das, Jichen Yang, Haizhou Li:
Long Range Acoustic Features for Spoofed Speech Detection. INTERSPEECH 2019: 1058-1062 - [c23]Kong Aik Lee, Ville Hautamäki, Tomi H. Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Massimiliano Todisco:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019: 1497-1501 - [c22]Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
Multi-Level Adaptive Speech Activity Detector for Speech in Naturalistic Environments. INTERSPEECH 2019: 2015-2019 - [c21]Bidisha Sharma, Rohan Kumar Das, Haizhou Li:
On the Importance of Audio-Source Separation for Singer Identification in Polyphonic Music. INTERSPEECH 2019: 2020-2024 - [c20]Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Long-Term Acoustic Cues for Orca Activity Detection. INTERSPEECH 2019: 2418-2422 - [c19]Sarfaraz Jelil, Abhishek Shrivastava, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
SpeechMarker: A Voice Based Multi-Level Attendance Application. INTERSPEECH 2019: 3665-3666 - [c18]Jibin Wu, Zihan Pan, Malu Zhang, Rohan Kumar Das, Yansong Chua, Haizhou Li:
Robust Sound Recognition: A Neuromorphic Approach. INTERSPEECH 2019: 3667-3668 - [c17]Tianchi Liu, Maulik C. Madhavi, Rohan Kumar Das, Haizhou Li:
A Unified Framework for Speaker and Utterance Verification. INTERSPEECH 2019: 4320-4324 - [c16]Rohan Sheelvant, Bidisha Sharma, Maulik C. Madhavi, Rohan Kumar Das, S. R. M. Prasanna, Haizhou Li:
RSL2019: A Realistic Speech Localization Corpus. O-COCOSDA 2019: 1-6 - 2018
- [c15]Jichen Yang, Rohan Kumar Das, Haizhou Li:
Extended Constant-Q Cepstral Coefficients for Detection of Spoofing Attacks. APSIPA 2018: 1024-1029 - [c14]Rohan Kumar Das, Haizhou Li:
Instantaneous Phase and Excitation Source Features for Detection of Replay Attacks. APSIPA 2018: 1030-1037 - [c13]Rohan Kumar Das, S. R. Mahadeva Prasanna:
Investigating Text-independent Speaker Verification from Practically Realizable System Perspective. APSIPA 2018: 1483-1487 - [c12]Rohan Kumar Das, Maulik C. Madhavi, Haizhou Li:
Compensating Utterance Information in Fixed Phrase Speaker Verification. APSIPA 2018: 1708-1712 - [c11]Srinivas Kantheti, Rohan Kumar Das, Hemant A. Patil:
Combining Phase-based Features for Replay Spoof Detection System. ISCSLP 2018: 151-155 - [c10]Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative X-Vectors for Text-Independent Speaker Verification. SLT 2018: 1014-1020 - [c9]Biswajit Dev Sarma, Rohan Kumar Das, Abhishek Dey, Risto Haukioja:
Analysis of Speech Emotions in Realistic Environments. SMM 2018 - 2017
- [c8]Sarfaraz Jelil, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features. INTERSPEECH 2017: 22-26 - [c7]Nagendra Kumar, Rohan Kumar Das, Sarfaraz Jelil, Dhanush B. K, H. Kashyap, K. Sri Rama Murty, Sriram Ganapathy, Rohit Sinha, S. R. Mahadeva Prasanna:
IITG-Indigo System for NIST 2016 SRE Challenge. INTERSPEECH 2017: 2859-2863 - [c6]Sarfaraz Jelil, Rohan Kumar Das, S. R. Mahadeva Prasanna, Rohit Sinha:
Role of voice activity detection methods for the speakers in the wild challenge. NCC 2017: 1-6 - 2016
- [c5]Rohan Kumar Das, Sarfaraz Jelil, S. R. Mahadeva Prasanna:
Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances. INTERSPEECH 2016: 445-449 - 2015
- [c4]Sarfaraz Jelil, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances. INTERSPEECH 2015: 1042-1046 - [c3]Rohan Kumar Das, Debadatta Pati, S. R. Mahadeva Prasanna:
Different aspects of source information for limited data speaker verification. NCC 2015: 1-6 - 2014
- [c2]Rohan Kumar Das, S. Abhiram, S. R. M. Prasanna, A. G. Ramakrishnan:
Combining source and system information for limited data speaker verification. INTERSPEECH 2014: 1836-1840 - [c1]Subhadeep Dey, Sujit Barman, Ramesh Kumar Bhukya, Rohan Kumar Das, Haris B. C., S. R. M. Prasanna, Rohit Sinha:
Speech biometric based attendance system. NCC 2014: 1-6
Editorship
- 2020
- [e1]Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 [contents]
Informal and Other Publications
- 2024
- [i27]Yang Xiao, Rohan Kumar Das:
Dual Knowledge Distillation for Efficient Sound Event Detection. CoRR abs/2402.02781 (2024) - [i26]Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf:
Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan. CoRR abs/2404.09342 (2024) - [i25]Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. CoRR abs/2404.17280 (2024) - [i24]Tianchi Liu, Lin Zhang, Rohan Kumar Das, Yi Ma, Ruijie Tao, Haizhou Li:
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio? CoRR abs/2406.02483 (2024) - [i23]Yang Xiao, Han Yin, Jisheng Bai, Rohan Kumar Das:
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels. CoRR abs/2407.00291 (2024) - [i22]Yang Xiao, Rohan Kumar Das:
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System. CoRR abs/2407.03656 (2024) - [i21]Yang Xiao, Rohan Kumar Das:
UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection. CoRR abs/2407.03657 (2024) - [i20]Yang Xiao, Rohan Kumar Das:
Configurable DOA Estimation using Incremental Learning. CoRR abs/2407.03661 (2024) - [i19]Yang Xiao, Rohan Kumar Das:
TF-Mamba: A Time-Frequency Network for Sound Source Localization. CoRR abs/2409.05034 (2024) - [i18]Han Yin, Jisheng Bai, Yang Xiao, Hui Wang, Siqi Zheng, Yafeng Chen, Rohan Kumar Das, Chong Deng, Jianfeng Chen:
Exploring Text-Queried Sound Event Detection with Audio Source Separation. CoRR abs/2409.13292 (2024) - 2022
- [i17]Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li:
MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances. CoRR abs/2202.01624 (2022) - [i16]Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li:
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs. CoRR abs/2210.15385 (2022) - [i15]Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md. Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera:
I4U System Description for NIST SRE'20 CTS Challenge. CoRR abs/2211.01091 (2022) - 2021
- [i14]Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li:
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. CoRR abs/2107.06592 (2021) - [i13]Longting Xu, Daiyu Huang, Syed Faham Ali Zaidi, Abdul Rauf, Rohan Kumar Das:
Graph Fourier Transform based Audio Zero-watermarking. CoRR abs/2109.08007 (2021) - [i12]Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna:
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition. CoRR abs/2110.00797 (2021) - 2020
- [i11]Xiaoyi Qin, Ming Li, Hui Bu, Rohan Kumar Das, Wei Rao, Shrikanth Narayanan, Haizhou Li:
The FFSVC 2020 Evaluation Plan. CoRR abs/2002.00387 (2020) - [i10]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. CoRR abs/2004.08849 (2020) - [i9]Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li:
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2005.08046 (2020) - [i8]Tianchi Liu, Rohan Kumar Das, Maulik C. Madhavi, Shengmei Shen, Haizhou Li:
Speaker-Utterance Dual Attention for Speaker and Utterance Verification. CoRR abs/2008.08901 (2020) - [i7]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion. CoRR abs/2008.12527 (2020) - [i6]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. CoRR abs/2009.03554 (2020) - [i5]Rohan Kumar Das, Ruijie Tao, Jichen Yang, Wei Rao, Cheng Yu, Haizhou Li:
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation. CoRR abs/2010.03905 (2020) - [i4]Rohan Kumar Das, Haizhou Li:
Classification of Speech with and without Face Mask using Acoustic Features. CoRR abs/2010.03907 (2020) - [i3]Biswajit Dev Sarma, Rohan Kumar Das:
Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech. CoRR abs/2010.03909 (2020) - 2019
- [i2]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - 2018
- [i1]Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li:
Generative x-vectors for text-independent speaker verification. CoRR abs/1809.06798 (2018)
Coauthor Index
aka: S. R. M. Prasanna
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-06 21:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint