 | 2012 |
| 46 |  | Zhen-Hua Ling,
Li-Rong Dai:
Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis.
IEEE Transactions on Audio, Speech & Language Processing 20(5): 1492-1502 (2012) |
| 2011 |
| 45 |  | Yanhua Long,
Zhi-Jie Yan,
Frank K. Soong,
Li-Rong Dai,
Wu Guo:
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model.
ICASSP 2011: 4520-4523 |
| 44 |  | Ming Lei,
Zhen-Hua Ling,
Li-Rong Dai:
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis.
ICASSP 2011: 4712-4715 |
| 43 |  | Eryu Wang,
Kong-Aik Lee,
Bin Ma,
Haizhou Li,
Wu Guo,
Li-Rong Dai:
Factored covariance modeling for text-independent speaker verification.
ICASSP 2011: 4856-4859 |
| 42 |  | Ling-Hui Chen,
Zhen-Hua Ling,
Li-Rong Dai:
Non-parallel training for voice conversion based on FT-GMM.
ICASSP 2011: 5116-5119 |
| 41 |  | Heng Lu,
Zhen-Hua Ling,
Li-Rong Dai,
Ren-Hua Wang:
Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score.
ICASSP 2011: 5352-5355 |
| 40 |  | Ling-Hui Chen,
Yoshihiko Nankaku,
Heiga Zen,
Keiichi Tokuda,
Zhen-Hua Ling,
Li-Rong Dai:
Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.
INTERSPEECH 2011: 1801-1804 |
| 39 |  | Ming Lei,
Junichi Yamagishi,
Korin Richmond,
Zhen-Hua Ling,
Simon King,
Li-Rong Dai:
Formant-Controlled HMM-Based Speech Synthesis.
INTERSPEECH 2011: 2777-2780 |
| 38 |  | Yanhua Long,
Zhi-Jie Yan,
Frank K. Soong,
Li-Rong Dai,
Wu Guo:
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model.
INTERSPEECH 2011: 373-376 |
| 2010 |
| 37 |  | Ming Lei,
Zhen-Hua Ling,
Li-Rong Dai:
Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis.
ICASSP 2010: 4230-4233 |
| 36 |  | Wu Guo,
Zhao Zhang,
Yanhua Long,
Li-Rong Dai:
N-gram nearest neighbor algorithm for voice password system.
ICASSP 2010: 4438-4441 |
| 35 |  | Jun Du,
Yu Hu,
Li-Rong Dai,
Ren-Hua Wang:
HMM-based pseudo-clean speech synthesis for splice algorithm.
ICASSP 2010: 4570-4573 |
| 34 |  | Cong Liu,
Yu Hu,
Hui Jiang,
Li-Rong Dai:
A bounded trust region optimization for discriminative training of HMMS in speech recognition.
ICASSP 2010: 4914-4917 |
| 33 |  | Yan Song,
Qi Tian,
Mengyue Wang,
Heng Liu,
Li-Rong Dai:
Multiple instance learning using visual phrases for object classification.
ICME 2010: 649-654 |
| 32 |  | Eryu Wang,
Kong-Aik Lee,
Bin Ma,
Haizhou Li,
Wu Guo,
Li-Rong Dai:
The estimation and kernel metric of spectral correlation for text-independent speaker verification.
INTERSPEECH 2010: 1065-1068 |
| 31 |  | Heng Lu,
Zhen-Hua Ling,
Si Wei,
Li-Rong Dai,
Ren-Hua Wang:
Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier.
INTERSPEECH 2010: 162-165 |
| 30 |  | Yanhua Long,
Li-Rong Dai,
Bin Ma,
Wu Guo:
Effects of the phonological relevance in speaker verification.
INTERSPEECH 2010: 2130-2133 |
| 29 |  | Ming Lei,
Yi-Jian Wu,
Frank K. Soong,
Zhen-Hua Ling,
Li-Rong Dai:
A hierarchical F0 modeling method for HMM-based speech synthesis.
INTERSPEECH 2010: 2170-2173 |
| 28 |  | Zhiwei Shuang,
Shiyin Kang,
Yong Qin,
Li-Rong Dai,
Lianhong Cai:
HMM based TTS for mixed language text.
INTERSPEECH 2010: 618-621 |
| 27 |  | Zhen-Hua Ling,
Yu Hu,
Li-Rong Dai:
Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis.
INTERSPEECH 2010: 825-828 |
| 2009 |
| 26 |  | Heng Lu,
Yi-Jian Wu,
Keiichi Tokuda,
Li-Rong Dai,
Ren-Hua Wang:
Full covariance state duration modeling for HMM-based speech synthesis.
ICASSP 2009: 4033-4036 |
| 25 |  | Haizhou Li,
Bin Ma,
Kong-Aik Lee,
Hanwu Sun,
Donglai Zhu,
Khe Chai Sim,
Changhuai You,
Rong Tong,
Ismo Kärkkäinen,
Chien-Lin Huang,
Vladimir Pervouchine,
Wu Guo,
Yijie Li,
Li-Rong Dai,
Mohaddeseh Nosratighods,
Tharmarajah Thiruvaran,
Julien Epps,
Eliathamby Ambikairajah,
Chng Eng Siong,
Tanja Schultz,
Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation.
ICASSP 2009: 4201-4204 |
| 24 |  | Wu Guo,
Yanhua Long,
Yijie Li,
Lei Pan,
Eryu Wang,
Li-Rong Dai:
iFLY system for the NIST 2008 speaker recognition evaluation.
ICASSP 2009: 4209-4212 |
| 23 |  | Yanhua Long,
Bin Ma,
Haizhou Li,
Wu Guo,
Chng Eng Siong,
Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition.
ICASSP 2009: 4225-4228 |
| 22 |  | Yan Song,
Li-Rong Dai,
Ren-Hua Wang:
An automatic language identification method based on subspace analysis.
ICME 2009: 598-601 |
| 21 |  | Cheng-Cheng Wang,
Zhen-Hua Ling,
Li-Rong Dai:
Asynchronous F0 and spectrum modeling for HMM-based speech synthesis.
INTERSPEECH 2009: 404-407 |
| 20 |  | Meng Wang,
Xian-Sheng Hua,
Tao Mei,
Richang Hong,
Guo-Jun Qi,
Yan Song,
Li-Rong Dai:
Semi-supervised kernel density estimation for video annotation.
Computer Vision and Image Understanding 113(3): 384-396 (2009) |
| 2008 |
| 19 |  | Long Qin,
Yi-Jian Wu,
Zhen-Hua Ling,
Ren-Hua Wang,
Li-Rong Dai:
Minumum generation error linear regression based model adaptation for HMM-based speech synthesis.
ICASSP 2008: 3953-3956 |
| 18 |  | Long Qin,
Yi-Jian Wu,
Zhen-Hua Ling,
Ren-Hua Wang,
Li-Rong Dai:
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis.
ICASSP 2008: 4621-4624 |
| 2007 |
| 17 |  | Meng Wang,
Tao Mei,
Xun Yuan,
Yan Song,
Li-Rong Dai:
Video annotation by graph-based learning with neighborhood similarity.
ACM Multimedia 2007: 325-328 |
| 16 |  | Meng Wang,
Xian-Sheng Hua,
Xun Yuan,
Yan Song,
Li-Rong Dai:
Optimizing multi-graph learning: towards a unified video annotation scheme.
ACM Multimedia 2007: 862-871 |
| 15 |  | Wu Guo,
Lei Pan,
Ren-Hua Wang,
Li-Rong Dai:
Angle of Models Distance as Test Algorithm in Speaker Verification.
FSKD (4) 2007: 231-234 |
| 14 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Li-Rong Dai,
Ren-Hua Wang:
An Interactive Video Annotation Frameowrk with Multiple Modalities.
ICASSP (1) 2007: 957-960 |
| 13 |  | Meng Wang,
Xian-Sheng Hua,
Xun Yuan,
Yan Song,
Li-Rong Dai:
Multi-Graph Semi-Supervised Learning for Video Semantic Feature Extraction.
ICME 2007: 1978-1981 |
| 12 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Richang Hong,
Li-Rong Dai:
Lazy Learning Based Efficient Video Annotation.
ICME 2007: 607-610 |
| 11 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Jinhui Tang,
Li-Rong Dai:
RMulti-Concept Multi-Modality Active Learning for Interactive Video Annotation.
ICSC 2007: 321-328 |
| 10 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Wei Lai,
Li-Rong Dai,
Ren-Hua Wang:
An Efficient Automatic Video Shot Size Annotation Scheme.
MMM (1) 2007: 649-658 |
| 9 |  | Meng Wang,
Xian-Sheng Hua,
Tao Mei,
Jinhui Tang,
Guo-Jun Qi,
Yan Song,
Li-Rong Dai:
Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning.
Int. J. Semantic Computing 1(4): 459-477 (2007) |
| 2006 |
| 8 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Li-Rong Dai,
HongJiang Zhang:
Semi-Supervised Kernel Regression.
ICDM 2006: 1130-1135 |
| 7 |  | Meng Wang,
Xian-Sheng Hua,
Li-Rong Dai,
Yan Song:
Enhanced Semi-Supervised Learning for Automatic Video Annotation.
ICME 2006: 1485-1488 |
| 6 |  | Yan Song,
Guo-Jun Qi,
Xian-Sheng Hua,
Li-Rong Dai,
Ren-Hua Wang:
Video Annotation by Active Learning and Semi-Supervised Ensembling.
ICME 2006: 933-936 |
| 5 |  | Meng Wang,
Xian-Sheng Hua,
Yan Song,
Li-Rong Dai,
Shipeng Li:
Automatic video annotation based on co-adaptation and label correction.
ISCAS 2006 |
| 4 |  | Yan Song,
Xian-Sheng Hua,
Guo-Jun Qi,
Li-Rong Dai,
Meng Wang,
HongJiang Zhang:
Efficient semantic annotation method for indexing large personal video database.
Multimedia Information Retrieval 2006: 289-296 |
| 2005 |
| 3 |  | Yan Song,
Xian-Sheng Hua,
Li-Rong Dai,
Meng Wang:
Semi-automatic video annotation based on active learning with multiple complementary predictors.
Multimedia Information Retrieval 2005: 97-104 |
| 2004 |
| 2 |  | Wei Lai,
Xiaodong Gu,
Ren-Hua Wang,
Li-Rong Dai,
HongJiang Zhang:
A region based multiple frame-rate tradeoff of video streaming.
ICIP 2004: 2067-2070 |
| 1 |  | Wei Lai,
Xiaodong Gu,
Ren-Hua Wang,
Li-Rong Dai,
HongJiang Zhang:
Perceptual Video Streaming by Adaptive Spatial-temporal Scalability.
PCM (2) 2004: 431-438 |