


Odyssey 2012: Singapore
- Haizhou Li, Bin Ma, Kong-Aik Lee: Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, June 25-28, 2012. ISCA 2012
Plenary Session
- Niko Brümmer: The role of proper scoring rules in training and evaluating probabilistic speaker and language recognizers.
- Li Deng: Being deep and being dynamic - new-generation models and methodology for advancing speech technology.
- Alvin F. Martin: The NIST speaker recognition evaluations.
Speaker Recognition - Compact Representation
- Patrick Kenny: A small footprint i-vector extractor. 1-6
- Sandro Cumani, Pietro Laface, Vasileios Vasilakakis: Memory and computation effective approaches for i-vector extraction. 7-13
- Srikanth R. Madikeri: A hybrid factor analysis and probabilistic PCA-based system for dictionary learning and encoding for robust speaker recognition. 14-20
- Haris B. C., Rohit Sinha: On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems. 21-27
Speaker Recognition - Generative Modeling
- Ahilan Kanagasundaram, Robbie Vogt, David Dean, Sridha Sridharan: PLDA based speaker recognition on short utterances. 28-33
- Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Robbie Vogt: PLDA based speaker verification with weighted LDA techniques. 34-38
- Carlos Vaquero: Dataset shift in PLDA based speaker verification. 39-46
- Jesús Antonio Villalba López, Eduardo Lleida: Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data. 47-54
- Mitchell McLaren, Miranti Indar Mandasari, David A. van Leeuwen: Source normalization for language-independent speaker recognition using i-vectors. 55-61
Forensic Speaker Recognition
- Geoffrey Stewart Morrison, Felipe Ochoa, Tharmarajah Thiruvaran: Database selection for forensic voice comparison. 62-77
- Ewald Enzinger, Cuiling Zhang, Geoffrey Stewart Morrison: Voice source features for forensic voice comparison - an evaluation of the GLOTTEX software package. 78-85
- Yosef A. Solewicz, Timo Becker, Gaëlle Jardine, Stefan G. Gfrörer: Comparison of speaker recognition systems on a real forensic benchmark. 86-91
Neural Network for Speaker Recognition
- Sri Garimella, Hynek Hermansky: Factor analysis of mixture of auto-associative neural networks for speaker verification. 92-97
- Samuel Thomas, Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky: Adaptation transforms of auto-associative neural networks as features for speaker verification. 98-104
- Sibel Yaman, Jason W. Pelecanos, Ruhi Sarikaya: Bottleneck features for speaker recognition. 105-108
- Themos Stafylakis, Patrick Kenny, Mohammed Senoussaoui, Pierre Dumouchel: Preliminary investigation of Boltzmann machine classifiers for speaker recognition. 109-116
- Mohammed Senoussaoui, Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel: First attempt of Boltzmann machines for speaker verification. 117-121
Speaker Diarization
- Hagai Aronowitz, Yosef A. Solewicz, Orith Toledo-Ronen: Online two speaker diarization. 122-129
- Jordi Luque, Javier Hernando: On the use of agglomerative and spectral clustering in speaker diarization of meetings. 130-137
- Itshak Lapidot, Jean-François Bonastre: Generalized Viterbi-based models for time-series segmentation applied to speaker diarization. 138-145
- Mickael Rouvier, Sylvain Meignier: A global optimization framework for speaker diarization. 146-150
- Sashin Kajarekar, Aparna Khare, Matthias Paulik, Neha Agrawal, Panchi Panchapagesan, Ananth Sankar, Satish Gannu: Cisco's speaker segmentation and recognition system. 151-156
Speaker Recognition - Channel Robustness
- Pierre-Michel Bousquet, Anthony Larcher, Driss Matrouf, Jean-François Bonastre, Oldrich Plchot: Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis. 157-164
- Wei Rao, Man-Wai Mak: Utterance partitioning with acoustic vector resampling for i-vector based speaker verification. 165-171
- Sheng Chen, Mingxing Xu, Emlyn Pratt: Study on the effects of intrinsic variation using i-vectors in text-independent speaker verification. 172-179
- William M. Campbell, Douglas E. Sturim, Bengt J. Borgström, Robert B. Dunn, Alan McCree, Thomas F. Quatieri, Douglas A. Reynolds: Exploring the impact of advanced front-end processing on NIST speaker recognition microphone tasks. 180-186
- Bengt J. Borgström, Alan McCree: Linear prediction modulation filtering for speaker recognition of reverberant speech. 187-193
Language Recognition Evaluation
- Luis Javier Rodríguez-Fuentes, Amparo Varona, Mireia Díez, Mikel Peñagarikano, Germán Bordel: Evaluation of spoken language recognition technology using broadcast speech: performance and challenges. 194-201
- Stephanie M. Strassel, Kevin Walker, Karen Jones, David Graff, Christopher Cieri: New resources for recognition of confusable linguistic varieties: the LRE11 corpus. 202-208
- Elliot Singer, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Alan McCree, Fred Richardson, Najim Dehak, Douglas E. Sturim: The MITLL NIST LRE 2011 language recognition system. 209-215
- Niko Brümmer, Sandro Cumani, Ondrej Glembek, Martin Karafiát, Pavel Matejka, Jan Pesán, Oldrich Plchot, Mehdi Soufifar, Edward de Villiers, Jan Cernocký: Description and analysis of the Brno276 system for LRE2011. 216-223
- Gang Liu, Chi Zhang, John H. L. Hansen: A linguistic data acquisition front-end for language recognition evaluation. 224-228
Features for Speaker Recognition
- Sriram Ganapathy, Samuel Thomas, Hynek Hermansky: Feature extraction using 2-d autoregressive models for speaker recognition. 229-235
- Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas: Regularization of all-pole models for speaker verification under additive noise. 236-242
- Taufiq Hasan, John H. L. Hansen: Factor analysis of acoustic features using a mixture of probabilistic principal component analyzers for robust speaker verification. 243-247
- Rahim Saeidi, Antti Hurmalainen, Tuomas Virtanen, David A. van Leeuwen: Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification. 248-255
- Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy: On the use of asymmetric-shaped tapers for speaker verification using i-vectors. 256-262
Speaker Recognition Evaluation
- George R. Doddington: The effect of target/non-target age difference on speaker recognition performance. 263-267
- Ville Hautamäki, Kong-Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li: Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. 268-274
- Craig S. Greenberg, Alvin F. Martin, Mark A. Przybocki: The 2011 BEST speaker recognition interim assessment. 275-282
- Juliette Kahn, Olivier Galibert, Matthieu Carré, Aude Giraudel, Philippe Joly, Ludovic Quintard: The REPERE challenge: finding people in a multimodal context. 283-290
- Kevin Walker, Stephanie M. Strassel: The RATS radio traffic collection system. 291-297
Speaker Recognition - Application
- Andreas Stolcke, Martin Graciarena, Luciana Ferrer: Effects of audio and ASR quality on cepstral and high-level speaker verification systems. 298-303
- Tomi Kinnunen, Rahim Saeidi, Jussi Leppänen, Jukka Saarinen: Audio context recognition in variable mobile environments from short segments using speaker and language recognizers. 304-311
- Hagai Aronowitz: Text dependent speaker verification using a small development set. 312-316
- Luciana Ferrer, Lukás Burget, Oldrich Plchot, Nicolas Scheffer: A unified approach for audio characterization and its application to speaker recognition. 317-323
- Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel: Mean shift algorithm for exponential families with applications to speaker clustering. 324-329
Language Recognition - Feature, Classifier and Fusion
- Oldrich Plchot, Martin Karafiát, Niko Brümmer, Ondrej Glembek, Pavel Matejka, Edward de Villiers, Jan Cernocký: Speaker vectors from subspace Gaussian mixture model as complementary features for language identification. 330-333
- Zhiyi Li, Wei-Qiang Zhang, Liang He, Jia Liu: Complementary combination in i-vector level for language recognition. 334-337
- Chang Huai You, Haizhou Li, Eliathamby Ambikairajah, Kong-Aik Lee, Bin Ma: Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition. 338-345
- Mohamed Faouzi BenZeghiba, Jean-Luc Gauvain, Lori Lamel: Fusing language information from diverse data sources for phonotactic language recognition. 346-352















