


default search action
Sri Harish Reddy Mallidi
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c34]Rupak Vignesh Swaminathan, Grant P. Strimel, Ariya Rastrow, Sri Harish Mallidi, Kai Zhen, Hieu Duy Nguyen, Nathan Susanj, Athanasios Mouchtaris:
Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy. ICASSP 2024: 12226-12230 - 2022
- [c33]Gokce Keskin, Minhua Wu, Brian John King, Sri Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas:
Do You Listen with one or two Microphones? A Unified ASR Model for Single and Multi-Channel Audio. IWAENC 2022: 1-5 - 2021
- [c32]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
wav2vec-C: A Self-Supervised Model for Speech Representation Learning. Interspeech 2021: 711-715 - [c31]Xiaosu Tong, Che-Wei Huang, Sri Harish Mallidi, Shaun Joseph, Sonal Pareek, Chander Chandak, Ariya Rastrow, Roland Maas:
Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection. SLT 2021: 659-664 - [i11]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
Wav2vec-C: A Self-supervised Model for Speech Representation Learning. CoRR abs/2103.08393 (2021) - [i10]Bhargav Pulugundla, Yang Gao, Brian John King, Gokce Keskin, Sri Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas:
Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition. CoRR abs/2105.05920 (2021) - [i9]Gokce Keskin, Minhua Wu, Brian John King, Sri Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas:
Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio. CoRR abs/2106.02750 (2021) - 2020
- [j4]Ruizhi Li
, Xiaofei Wang
, Sri Harish Mallidi, Shinji Watanabe
, Takaaki Hori
, Hynek Hermansky
:
Multi-Stream End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 646-655 (2020) - [i8]Maarten Van Segbroeck, Sri Harish Mallidi, Brian John King, I-Fan Chen, Gurpreet Chadha, Roland Maas:
Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition. CoRR abs/2007.00131 (2020) - [i7]Xiaosu Tong, Che-Wei Huang, Sri Harish Mallidi, Shaun Joseph, Sonal Pareek, Chander Chandak, Ariya Rastrow, Roland Maas:
Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection. CoRR abs/2007.09245 (2020)
2010 – 2019
- 2019
- [c30]Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe
, Hynek Hermansky
:
Stream Attention-based Multi-array End-to-end Speech Recognition. ICASSP 2019: 7105-7109 - [c29]Prakhar Swarup, Roland Maas, Sri Garimella, Sri Harish Mallidi, Björn Hoffmeister:
Improving ASR Confidence Scores for Alexa Using Acoustic and Hypothesis Embeddings. INTERSPEECH 2019: 2175-2179 - [c28]Che-Wei Huang, Roland Maas, Sri Harish Mallidi, Björn Hoffmeister:
A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction. INTERSPEECH 2019: 3342-3346 - [i6]Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky:
Multi-Stream End-to-End Speech Recognition. CoRR abs/1906.08041 (2019) - 2018
- [c27]Sri Harish Reddy Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister:
Device-directed Utterance Detection. INTERSPEECH 2018: 1225-1228 - [c26]Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Mallidi, Nelson Yalta
, Martin Karafiát
, Shinji Watanabe
, Takaaki Hori:
Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. SLT 2018: 521-527 - [i5]Sri Harish Reddy Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister:
Device-directed Utterance Detection. CoRR abs/1808.02504 (2018) - [i4]Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Reddy Mallidi, Nelson Yalta, Martin Karafiát, Shinji Watanabe, Takaaki Hori:
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling. CoRR abs/1810.03459 (2018) - [i3]Ruizhi Li, Xiaofei Wang, Sri Harish Reddy Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Multi-encoder multi-resolution framework for end-to-end speech recognition. CoRR abs/1811.04897 (2018) - [i2]Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Stream attention-based multi-array end-to-end speech recognition. CoRR abs/1811.04903 (2018) - 2017
- [j3]Angel Mario Castro Martinez, Sri Harish Reddy Mallidi, Bernd T. Meyer:
On the relevance of auditory-based Gabor features for deep learning in robust speech recognition. Comput. Speech Lang. 45: 21-38 (2017) - [c25]Bernd T. Meyer, Sri Harish Reddy Mallidi, Hendrik Kayser
, Hynek Hermansky
:
Predicting error rates for unknown data in automatic speech recognition. ICASSP 2017: 5330-5334 - [c24]Pedro A. Torres-Carrasquillo, Fred Richardson, Shahan C. Nercessian, Douglas E. Sturim, William M. Campbell, Youngjune Gwon, Swaroop Vattam, Najim Dehak
, Sri Harish Reddy Mallidi, Phani Sankar Nidadavolu, Ruizhi Li, Réda Dehak:
The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System. INTERSPEECH 2017: 1333-1337 - [i1]Angel Mario Castro Martinez, Sri Harish Reddy Mallidi, Bernd T. Meyer:
On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition. CoRR abs/1702.04333 (2017) - 2016
- [c23]Sri Harish Reddy Mallidi, Hynek Hermansky
:
Novel neural network based fusion for multistream ASR. ICASSP 2016: 5680-5684 - [c22]Tetsuji Ogawa
, Sri Harish Reddy Mallidi, Emmanuel Dupoux, Jordan Cohen, Naomi H. Feldman
, Hynek Hermansky
:
A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation. ICPR 2016: 2222-2227 - [c21]Ruizhi Li, Sri Harish Reddy Mallidi, Lukás Burget
, Oldrich Plchot, Najim Dehak
:
Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. INTERSPEECH 2016: 3265-3269 - [c20]Sri Harish Reddy Mallidi, Hynek Hermansky
:
A Framework for Practical Multistream ASR. INTERSPEECH 2016: 3474-3478 - [c19]Bernd T. Meyer, Sri Harish Reddy Mallidi, Angel Mario Castro Martinez, Guillermo Payá-Vayá, Hendrik Kayser
, Hynek Hermansky
:
Performance monitoring for automatic speech recognition in noisy multi-channel environments. SLT 2016: 50-56 - 2015
- [c18]Sri Harish Reddy Mallidi, Tetsuji Ogawa
, Hynek Hermansky
:
Uncertainty estimation of DNN classifiers. ASRU 2015: 283-288 - [c17]Roger Hsiao, Jeff Z. Ma, William Hartmann, Martin Karafiát
, Frantisek Grézl, Lukás Burget
, Igor Szöke
, Jan Cernocký
, Shinji Watanabe
, Zhuo Chen, Sri Harish Reddy Mallidi, Hynek Hermansky
, Stavros Tsakalidis, Richard M. Schwartz:
Robust speech recognition in unknown reverberant and noisy conditions. ASRU 2015: 533-538 - [c16]Hynek Hermansky, Lukás Burget
, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman
, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013 - [c15]Sri Harish Reddy Mallidi, Tetsuji Ogawa, Karel Veselý, Phani S. Nidadavolu, Hynek Hermansky:
Autoencoder based multi-stream combination for noise robust speech recognition. INTERSPEECH 2015: 3551-3555 - 2014
- [j2]Sriram Ganapathy, Sri Harish Reddy Mallidi, Hynek Hermansky
:
Robust Feature Extraction Using Modulation Filtering of Autoregressive Models. IEEE ACM Trans. Audio Speech Lang. Process. 22(8): 1285-1295 (2014) - [c14]Tim Ng, Roger Hsiao, Le Zhang, Damianos G. Karakos, Sri Harish Reddy Mallidi, Martin Karafiát, Karel Veselý, Igor Szöke, Bing Zhang, Long Nguyen, Richard M. Schwartz:
Progress in the BBN keyword search system for the DARPA RATS program. INTERSPEECH 2014: 959-963 - [c13]Pavel Matejka, Le Zhang, Tim Ng, Ondrej Glembek, Jeff Z. Ma, Bing Zhang, Sri Harish Mallidi:
Neural Network Bottleneck Features for Language Identification. Odyssey 2014: 299-304 - 2013
- [c12]Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak
, Jeff Z. Ma, Sandro Cumani, Ondrej Glembek, Hynek Hermansky
, Sri Harish Reddy Mallidi, Nima Mesgarani, Richard M. Schwartz, Mehdi Soufifar, Zheng-Hua Tan
, Samuel Thomas, Bing Zhang, Xinhui Zhou:
Developing a speaker identification system for the DARPA RATS project. ICASSP 2013: 6768-6772 - [c11]Pascal Clark, Sri Harish Reddy Mallidi, Aren Jansen, Hynek Hermansky
:
Frequency offset correction in speech without detecting pitch. ICASSP 2013: 7020-7024 - [c10]Jeff Z. Ma, Bing Zhang, Spyros Matsoukas, Sri Harish Reddy Mallidi, Feipeng Li, Hynek Hermansky:
Improvements in language identification on the RATS noisy speech corpus. INTERSPEECH 2013: 69-73 - [c9]Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Robust speaker recognition using spectro-temporal autoregressive models. INTERSPEECH 2013: 3689-3693 - 2012
- [j1]Sri Garimella, Sri Harish Reddy Mallidi, Hynek Hermansky
:
Regularized Auto-Associative Neural Networks for Speaker Verification. IEEE Signal Process. Lett. 19(12): 841-844 (2012) - [c8]Daniel Garcia-Romero, Xinhui Zhou, Dmitry N. Zotkin, Balaji Vasan Srinivasan, Yuancheng Luo, Sriram Ganapathy, Samuel Thomas, Sridhar Krishna Nemala, Garimella S. V. S. Sivaram, Majid Mirbagheri, Sri Harish Reddy Mallidi, Thomas Janu, Padmanabhan Rajan, Nima Mesgarani, Mounya Elhilali
, Hynek Hermansky
, Shihab A. Shamma, Ramani Duraiswami
:
The UMD-JHU 2011 speaker recognition system. ICASSP 2012: 4229-4232 - [c7]Feipeng Li, Sri Harish Reddy Mallidi, Hynek Hermansky:
Phone recognition in critical bands using sub-band temporal modulations. INTERSPEECH 2012: 1816-1819 - [c6]Samuel Thomas, Sri Harish Reddy Mallidi, Thomas Janu, Hynek Hermansky, Nima Mesgarani, Xinhui Zhou, Shihab A. Shamma, Tim Ng, Bing Zhang, Long Nguyen, Spyros Matsoukas:
Acoustic and Data-driven Features for Robust Speech Activity Detection. INTERSPEECH 2012: 1985-1988 - [c5]Samuel Thomas, Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Adaptation transforms of auto-associative neural networks as features for speaker verification. Odyssey 2012: 98-104 - 2011
- [c4]Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Modulation Spectrum Analysis for Recognition of Reverberant Speech. INTERSPEECH 2011: 189-192 - 2010
- [c3]Sri Harish Reddy Mallidi, Kishore Prahallad, Suryakanth V. Gangashetty, B. Yegnanarayana:
Significance of pitch synchronous analysis for speaker recognition using AANN models. INTERSPEECH 2010: 669-672 - [c2]Anand Joseph Xavier Medabalimi, Sri Harish Reddy Mallidi, B. Yegnanarayana:
Speaker-dependent mapping of source and system features for enhancement of throat microphone speech. INTERSPEECH 2010: 985-988
2000 – 2009
- 2009
- [c1]K. Sudheer Kumar, Sri Harish Reddy Mallidi, K. Sri Rama Murty, B. Yegnanarayana:
Analysis of laugh signals for detecting in continuous speech. INTERSPEECH 2009: 1591-1594
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-22 03:38 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint