Roland Maas

Books and Theses

2016
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/dnb/Maas16
Roland Maas:
Uncertainty Decoding for Reverberation-Robust Automatic Speech Recognition. University of Erlangen-Nuremberg, Germany, 2016, ISBN 978-3-944057-61-3, pp. 1-191

Journal Articles

2018
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HuemmerHMK18
Christian Huemmer, Christian Hofmann, Roland Maas, Walter Kellermann:
Estimating Parameters of Nonlinear Systems Using the Elitist Particle Filter Based on Evolutionary Strategies. IEEE ACM Trans. Audio Speech Lang. Process. 26(3): 595-608 (2018)
2016
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/KinoshitaDGHHKL16
Keisuke Kinoshita, Marc Delcroix, Sharon Gannot, Emanuël A. P. Habets, Reinhold Haeb-Umbach, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP J. Adv. Signal Process. 2016: 7 (2016)
2015
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/HuemmerMHK15
Christian Huemmer, Roland Maas, Christian Hofmann, Walter Kellermann:
A Bayesian network approach to linear and nonlinear acoustic echo cancellation. EURASIP J. Adv. Signal Process. 2015: 98 (2015)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/MaasHSK15
Roland Maas, Christian Huemmer, Armin Sehr, Walter Kellermann:
A Bayesian view on acoustic model-based techniques for robust speech recognition. EURASIP J. Adv. Signal Process. 2015: 103 (2015)
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/spl/HuemmerMK15
Christian Huemmer, Roland Maas, Walter Kellermann:
The NLMS Algorithm with Time-Variant Optimum Stepsize Derived from a Bayesian Network Perspective. IEEE Signal Process. Lett. 22(11): 1874-1878 (2015)
2013
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/csl/ReindlZSMMSK13
Klaus Reindl, Yuanhang Zheng, Andreas Schwarz, Stefan Meier, Roland Maas, Armin Sehr, Walter Kellermann:
A stereophonic acoustic signal extraction scheme for noisy and reverberant environments. Comput. Speech Lang. 27(3): 726-745 (2013)
2012
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/spm/YoshiokaSDKMNK12
Takuya Yoshioka, Armin Sehr, Marc Delcroix, Keisuke Kinoshita, Roland Maas, Tomohiro Nakatani, Walter Kellermann:
Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition. IEEE Signal Process. Mag. 29(6): 114-126 (2012)
2010
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SehrMK10
Armin Sehr, Roland Maas, Walter Kellermann:
Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition. IEEE Trans. Speech Audio Process. 18(7): 1676-1691 (2010)

Conference and Workshop Papers

2023
[c44]
  persistent URL:
  - https://dblp.org/rec/conf/asru/RajuKHSCATZVRMR23
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow:
Two-Pass Endpoint Detection for Speech Recognition. ASRU 2023: 1-8
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TankasalaCSRDCKMR23
Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran:
Cross-Utterance ASR Rescoring with Graph-Based Label Propagation. ICASSP 2023: 1-5
2022
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangTGHM22
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas:
VADOI: Voice-Activity-Detection Overlapping Inference for End-To-End Long-Form Speech Recognition. ICASSP 2022: 6977-6981
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TrinhGKDSM22
Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. INTERSPEECH 2022: 1298-1302
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NidadavoluXJGDS22
Phani Sankar Nidadavolu, Na Xu, Nick Jutila, Ravi Teja Gadde, Aswarth Abhilash Dara, Joseph Savold, Sapan Patel, Aaron Hoff, Veerdhawal Pande, Kevin Crews, Ankur Gandhe, Ariya Rastrow, Roland Maas:
RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation. INTERSPEECH 2022: 4347-4351
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/iwaenc/KeskinWKMGDRM22
Gokce Keskin, Minhua Wu, Brian John King, Sri Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas:
Do You Listen with one or two Microphones? A Unified ASR Model for Single and Multi-Channel Audio. IWAENC 2022: 1-5
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/slt/KhareWBDM22
Aparna Khare, Minhua Wu, Saurabhchand Bhati, Jasha Droppo, Roland Maas:
Guided Contrastive Self-Supervised Pre-Training for Automatic Speech Recognition. SLT 2022: 174-181
2021
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuYRGKARSM21
Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-To-End ASR by Domain Adversarial Training with Relabeling. ICASSP 2021: 6408-6412
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PunjabiARCBBMMR21
Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching. ICASSP 2021: 7218-7222
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SadhuHHMWRSDM21
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
wav2vec-C: A Self-Supervised Model for Speech Representation Learning. Interspeech 2021: 711-715
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FazelYLBMMD21
Amin Fazel, Wei Yang, Yulan Liu, Roberto Barra-Chicote, Yixiong Meng, Roland Maas, Jasha Droppo:
SynthASR: Unlocking Synthetic Data for Speech Recognition. Interspeech 2021: 896-900
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/slt/TongHMJPCRM21
Xiaosu Tong, Che-Wei Huang, Sri Harish Mallidi, Shaun Joseph, Sonal Pareek, Chander Chandak, Ariya Rastrow, Roland Maas:
Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection. SLT 2021: 659-664
2020
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SegbroeckZKHNLH20
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas:
DiPCo - Dinner Party Corpus. INTERSPEECH 2020: 434-436
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoTDSHSM20
Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient Minimum Word Error Rate Training of RNN-Transducer for End-to-End Speech Recognition. INTERSPEECH 2020: 2807-2811
2019
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MosnerWRPKSMH19
Ladislav Mosner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Ken'ichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister:
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. ICASSP 2019: 6475-6479
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SwarupMGMH19
Prakhar Swarup, Roland Maas, Sri Garimella, Sri Harish Mallidi, Björn Hoffmeister:
Improving ASR Confidence Scores for Alexa Using Acoustic and Hypothesis Embeddings. INTERSPEECH 2019: 2175-2179
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangMMH19
Che-Wei Huang, Roland Maas, Sri Harish Mallidi, Björn Hoffmeister:
A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction. INTERSPEECH 2019: 3342-3346
2018
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaasRMLGTJH18
Roland Maas, Ariya Rastrow, Chengyuan Ma, Guitang Lan, Kyle Goehner, Gautam Tiwari, Shaun Joseph, Björn Hoffmeister:
Combining Acoustic Embeddings and Decoding Features for End-of-Utterance Detection in Real-Time Far-Field Speech Recognition Systems. ICASSP 2018: 5544-5548
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MallidiMGRMH18
Sri Harish Reddy Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister:
Device-directed Utterance Detection. INTERSPEECH 2018: 1225-1228
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/slt/RaeesyGMDGMRH18
Zeynab Raeesy, Kellen Gillespie, Chengyuan Ma, Thomas Drugman, Jiacheng Gu, Roland Maas, Ariya Rastrow, Björn Hoffmeister:
LSTM-Based Whisper Detection. SLT 2018: 139-144
2017
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaasRGTJH17
Roland Maas, Ariya Rastrow, Kyle Goehner, Gautam Tiwari, Shaun Joseph, Björn Hoffmeister:
Domain-Specific Utterance End-Point Detection for Speech Recognition. INTERSPEECH 2017: 1943-1947
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KingCVLMPH17
Brian John King, I-Fan Chen, Yonatan Vaizman, Yuzong Liu, Roland Maas, Sree Hari Krishnan Parthasarathi, Björn Hoffmeister:
Robust Speech Recognition via Anchor Word Representations. INTERSPEECH 2017: 2471-2475
2016
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuemmerSMBAK16
Christian Huemmer, Andreas Schwarz, Roland Maas, Hendrik Barfuss, Ramón Fernandez Astudillo, Walter Kellermann:
A new uncertainty decoding scheme for DNN-HMM hybrid systems with multichannel speech enhancement. ICASSP 2016: 5760-5764
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaasPKHH16
Roland Maas, Sree Hari Krishnan Parthasarathi, Brian John King, Ruitong Huang, Björn Hoffmeister:
Anchored Speech Detection. INTERSPEECH 2016: 2963-2967
2015
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SchwarzHMK15
Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann:
Spatial diffuseness features for DNN-based speech recognition in noisy and reverberant environments. ICASSP 2015: 4380-4384
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuemmerMSAK15
Christian Huemmer, Roland Maas, Andreas Schwarz, Ramón Fernandez Astudillo, Walter Kellermann:
Uncertainty decoding for DNN-HMM hybrid systems based on numerical sampling. INTERSPEECH 2015: 3556-3560
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/BuergerMLK15
Michael Buerger, Roland Maas, Heinrich W. Löllmann, Walter Kellermann:
Multizone sound field synthesis based on the joint optimization of the sound pressure and particle velocity vector on closed contours. WASPAA 2015: 1-5
2014
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/ITGspeech/MaasHHK14
Roland Maas, Christian Huemmer, Christian Hofmann, Walter Kellermann:
On Bayesian Networks in Speech Signal Processing. ITG Symposium on Speech Communication 2014: 1-4
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/MaasHSHK14
Roland Maas, Christian Huemmer, Andreas Schwarz, Christian Hofmann, Walter Kellermann:
A Bayesian network view on linear and nonlinear acoustic echo cancellation. ChinaSIP 2014: 495-499
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/HuemmerHMK14
Christian Huemmer, Christian Hofmann, Roland Maas, Walter Kellermann:
The significance-aware EPFES to estimate a memoryless preprocessor for nonlinear acoustic echo cancellation. GlobalSIP 2014: 557-561
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/hscma/SehrBHMK14
Armin Sehr, Hendrik Barfuss, Christian Hofmann, Roland Maas, Walter Kellermann:
Efficient training of acoustic models for reverberation-robust medium-vocabulary automatic speech recognition. HSCMA 2014: 177-181
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuemmerHMSK14
Christian Huemmer, Christian Hofmann, Roland Maas, Andreas Schwarz, Walter Kellermann:
The elitist particle filter based on evolutionary strategies as novel approach for nonlinear acoustic echo cancellation. ICASSP 2014: 1315-1319
2013
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaasTSK13
Roland Maas, Akshaya Thippur, Armin Sehr, Walter Kellermann:
An uncertainty decoding approach to noise- and reverberation-robust speech recognition. ICASSP 2013: 7388-7392
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/icdsp/MaasKSYDKN13
Roland Maas, Walter Kellermann, Armin Sehr, Takuya Yoshioka, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani:
Formulation of the REMOS concept from an uncertainty decoding perspective. DSP 2013: 1-6
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SehrYDKNMK13
Armin Sehr, Takuya Yoshioka, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Roland Maas, Walter Kellermann:
Conditional emission densities for combining speech enhancement and recognition systems. INTERSPEECH 2013: 3502-3506
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/KinoshitaDYNSKM13
Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Armin Sehr, Walter Kellermann, Roland Maas:
The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech. WASPAA 2013: 1-4
2012
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YoshiokaSDKMNK12
Takuya Yoshioka, Armin Sehr, Marc Delcroix, Keisuke Kinoshita, Roland Maas, Tomohiro Nakatani, Walter Kellermann:
Survey on approaches to speech recognition in reverberant environments. APSIPA 2012: 1-4
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/cogip/MaasKSK12
Roland Maas, Sujan R. Kotha, Armin Sehr, Walter Kellermann:
Combined-order hidden Markov models for reverberation-robust speech recognition. CIP 2012: 1-5
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaasHSK12
Roland Maas, Emanuël A. P. Habets, Armin Sehr, Walter Kellermann:
On the application of reverberation suppression to robust speech recognition. ICASSP 2012: 297-300
2011
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SehrMK11
Armin Sehr, Roland Maas, Walter Kellermann:
Frame-wise HMM adaptation using state-dependent reverberation estimates. ICASSP 2011: 5484-5487
2010
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/ITGspeech/MaasSK10
Roland Maas, Armin Sehr, Walter Kellermann:
Multi-Style Reverberation Models and Efficient Model Adaptation for Robust Distant-Talking Speech Recognition with REMOS. Sprachkommunikation 2010: 1-4
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MaasSGK10
Roland Maas, Armin Sehr, Martin Gugat, Walter Kellermann:
A highly efficient optimization scheme for REMOS-based distant-talking speech recognition. EUSIPCO 2010: 1983-1987
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SehrMK10
Armin Sehr, Roland Maas, Walter Kellermann:
Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition. ICASSP 2010: 4298-4301
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SehrHMK10
Armin Sehr, Christian Hofmann, Roland Maas, Walter Kellermann:
A novel approach for matched reverberant training of HMMs using data pairs. INTERSPEECH 2010: 566-569

Parts in Books or Collections

2017
[p1]
  persistent URL:
  - https://dblp.org/rec/books/sp/17/KinoshitaDGHHKLMNRSY17
Keisuke Kinoshita, Marc Delcroix, Sharon Gannot, Emanuël A. P. Habets, Reinhold Haeb-Umbach, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 345-354

Informal and Other Publications

2024
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08916
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow:
Two-pass Endpoint Detection for Speech Recognition. CoRR abs/2401.08916 (2024)
2023
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-15132
Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran:
Cross-utterance ASR Rescoring with Graph-based Label Propagation. CoRR abs/2303.15132 (2023)
2022
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-10593
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas:
VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition. CoRR abs/2202.10593 (2022)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07850
Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. CoRR abs/2207.07850 (2022)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12335
Aparna Khare, Minhua Wu, Saurabhchand Bhati, Jasha Droppo, Roland Maas:
Guided contrastive self-supervised pre-training for automatic speech recognition. CoRR abs/2210.12335 (2022)
2021
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-08393
Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
Wav2vec-C: A Self-supervised Model for Speech Representation Learning. CoRR abs/2103.08393 (2021)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-05920
Bhargav Pulugundla, Yang Gao, Brian John King, Gokce Keskin, Sri Harish Mallidi, Minhua Wu, Jasha Droppo, Roland Maas:
Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition. CoRR abs/2105.05920 (2021)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02750
Gokce Keskin, Minhua Wu, Brian John King, Sri Harish Mallidi, Yang Gao, Jasha Droppo, Ariya Rastrow, Roland Maas:
Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio. CoRR abs/2106.02750 (2021)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07803
Amin Fazel, Wei Yang, Yulan Liu, Roberto Barra-Chicote, Yixiong Meng, Roland Maas, Jasha Droppo:
SynthASR: Unlocking Synthetic Data for Speech Recognition. CoRR abs/2106.07803 (2021)
2020
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-00703
Chander Chandak, Zeynab Raeesy, Ariya Rastrow, Yuzong Liu, Xiangyang Huang, Siyu Wang, Dong Kwon Joo, Roland Maas:
Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses. CoRR abs/2006.00703 (2020)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-00131
Maarten Van Segbroeck, Sri Harish Mallidi, Brian John King, I-Fan Chen, Gurpreet Chadha, Roland Maas:
Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition. CoRR abs/2007.00131 (2020)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-03900
Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Streaming End-to-End Bilingual ASR Systems with Joint Language Identification. CoRR abs/2007.03900 (2020)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-09245
Xiaosu Tong, Che-Wei Huang, Sri Harish Mallidi, Shaun Joseph, Sonal Pareek, Chander Chandak, Ariya Rastrow, Roland Maas:
Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection. CoRR abs/2007.09245 (2020)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13802
Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition. CoRR abs/2007.13802 (2020)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07353
Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gökçe Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling. CoRR abs/2012.07353 (2020)
2019
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-02348
Ladislav Mosner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Ken'ichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister:
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning. CoRR abs/1901.02348 (2019)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-13447
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas:
DiPCo - Dinner Party Corpus. CoRR abs/1909.13447 (2019)
2018
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02504
Sri Harish Reddy Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister:
Device-directed Utterance Detection. CoRR abs/1808.02504 (2018)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-07832
Zeynab Raeesy, Kellen Gillespie, Chengyuan Ma, Thomas Drugman, Jiacheng Gu, Roland Maas, Ariya Rastrow, Björn Hoffmeister:
LSTM-based Whisper Detection. CoRR abs/1809.07832 (2018)
2014
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SchwarzHMK14
Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann:
Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments. CoRR abs/1410.2479 (2014)
2013
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MaasHSK13
Roland Maas, Christian Huemmer, Armin Sehr, Walter Kellermann:
A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition. CoRR abs/1310.3099 (2013)
