


APSIPA 2017: Kuala Lumpur, Malaysia
- 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017, Kuala Lumpur, Malaysia, December 12-15, 2017. IEEE 2017, ISBN 978-1-5386-1542-3
- Chin-Hui Lee:
Keynote speech 1: An integrated deep learning approach to acoustic signal pre-processing and acoustic modeling with applications to robust automatic speech recognition. v-viii
- Yan Chen, Chih-Yu Wang:
Tutorial 1: Sequential decision making: Theories and applications. ix-xii
- Binh Trans:
Online learning in the Asia Pacific region.
- Jie Yan, Lei Xie, Guangsen Wang, Zhong-Hua Fu:
A segmental DNN/i-vector approach for digit-prompted speaker verification. 1-5
- Szu-Wei Fu, Yu Tsao, Xugang Lu, Hisashi Kawai:
Raw waveform-based speech enhancement by fully convolutional networks. 6-12
- Jian-Jiun Ding, Shiang-Chih Hua, Ronald Y. Chang, Yih-Cherng Lee:
Generalized atom and dictionary design and compressive sensing for vocal signal expansion. 13-18
- Chien-Yao Wang, Andri Santoso, Jia-Ching Wang:
Acoustic scene classification using self-determination convolutional neural network. 19-22
- I-Hsiang Wang, Jian-Jiun Ding, Hung-Wei Hsu:
Prediction techniques for wavelet based 1-D signal compression. 23-26
- Xiaoming Zhang, Hidetaka Aoki, Akiko Sato, Mohd Amin Abd Majid:
An empirical study on performance optimization at district cooling plant of Universiti Teknologi PETRONAS. 27-32
- Tomoya Sakai, Shun Ogawa, Hiroki Kuhara:
Sequential decomposition of 2D apparent motion fields based on low-rank and sparse approximation. 33-38
- Ettikan Kandasamy Karuppiah:
Internet of Things: Trend, technologies, and evolution. 37-38
- Lounell B. Gueta, Akiko Sato:
Classifying road surface conditions using vibration signals. 39-43
- Ryosuke Kawami, Hidetomo Kataoka, Daichi Kitahara, Akira Hirabayashi, Takashi Ijiri, Shigeharu Shimamura, Hiroshi Kikuchi, Tomoo Ushio:
Fast high-quality three-dimensional reconstruction from compressive observation of phased array weather radar. 44-49
- Akie Sakiyama, Yuichi Tanaka:
Graph reduction method using localization operator and its application to pyramid transform. 50-55
- Vui Ann Shim, Miaolong Yuan, Boon Hwa Tan:
Automatic object searching by a mobile robot with single RGB-D camera. 56-62
- Yan Wu, Ruohan Wang, Yong Ling Tay, Clarice Jiaying Wong:
Investigation on the roles of human and robot in collaborative storytelling. 63-68
- Gayane Shalunts, Gerhard Backfried, Helmy Syakh Alam:
Sentiment analysis in Indonesian and French by SentiSAIL. 69-75
- Luis Fernando D'Haro, Andreea I. Niculescu, Caixia Cai, Suraj Nair, Rafael E. Banchs, Alois C. Knoll, Haizhou Li:
An integrated framework for multimodal human-robot interaction. 76-82
- Andreea I. Niculescu, Luis Fernando D'Haro, Rafael E. Banchs:
When industrial robots become social: On the design and evaluation of a multimodal interface for welding robots. 83-89
- Xiao-Zhi Zhang, Ya Li, Bingo Wing-Kuen Ling, Chao Song, Kok Lay Teo:
Spread spectrum compressed sensing magnetic resonance imaging via fractional Fourier transform. 90-93
- Yi-Ping Bao, Yan-Na Zhang, Yu-E. Song, Bing-Zhao Li, Pei Dang:
Nonuniform sampling theorems for random signals in the offset linear canonical transform domain. 94-99
- Yi-Qian Wang, Bing-Zhao Li, Qi-Yuan Cheng:
The fractional Fourier transform on graphs. 105-110
- Aykut Koç, Haldun M. Özaktas, Burak Bartan, Erhan Gundogdu, Tolga Çukur:
Digital computation of fractional Fourier and linear canonical transforms and sparse image representation. 111-117
- Iman Tabatabaei Ardekani, Xiao Zhang, Hamid R. Sharifzadeh, Jari P. Kaipio:
Maximum a posteriori adjustment of adaptive transversal filters in active noise control. 118-123
- Masato Nakayama, Takanobu Nishiura:
Synchronized amplitude-and-frequency modulation for a parametric loudspeaker. 130-135
- Tomoki Murata, Yoshinobu Kajikawa, Seiji Miyoshi:
Statistical-mechanical analysis of the FXLMS algorithm for multiple-channel active noise control. 136-139
- Michael Anthony, Cheng-Yuan Chang, Sen M. Kuo:
Active noise control for muffler. 140-144
- Nan Chen, Changchun Bao, Xianyun Wang:
Speech enhancement based on binaural cues. 145-148
- Yan Yang, Changchun Bao, Xianyun Wang:
Codebook-driven speech enhancement using DNN and harmonic emphasis. 149-154
- Xin Wang, Jun Du, Yannan Wang:
A maximum likelihood approach to deep neural network based speech dereverberation. 155-158
- Tohari Ahmad, Burhanudin Rasyid:
SCFT: Sector-based cancelable fingerprint template. 156-160
- Xiao-Lei Zhang:
Speech separation by cost-sensitive deep learning. 159-162
- Shasha Xia, Hao Li, Xueliang Zhang:
Using optimal ratio mask as training target for supervised speech separation. 163-166
- Minghui Dong, Zhengchen Zhang, Huaiping Ming:
Representing raw linguistic information in Chinese text-to-speech system. 167-170
- Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng:
An end-to-end neural network approach to story segmentation. 171-176
- Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng:
Deep speaker verification: Do we need end to end? 177-181
- Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Hiroyasu Ando, Kaoru Hiramatsu, Kunio Kashino:
Non-native speech conversion with consistency-aware recursive network and generative adversarial network. 182-188
- Sivanagaraja Tatinati, Mun Kit Ho, Andy W. H. Khong, Yubo Wang:
End-to-end speech emotion recognition using multi-scale convolution networks. 189-192
- Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki:
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification. 193-202
- Anu Aryal, Shoko Imaizumi, Takahiko Horiuchi, Hitoshi Kiya:
Integrated algorithm for block-permutation-based encryption with reversible data hiding. 203-208
- Simying Ong, KokSheik Wong, Kiyoshi Tanaka:
Redesigning data hiding: Interpolation-based scrambling-embedding method. 209-213
- KuanYew Tan, KokSheik Wong, Simying Ong, Kiyoshi Tanaka:
Rewritable data insertion in encrypted JPEG using coefficient prediction method. 214-219
- Koichi Ito, Takehisa Okano, Takafumi Aoki:
Recent advances in biometric security: A case study of liveness detection in face recognition. 220-227
- Meng Yang, Nanning Zheng, Fei Wang, Ce Zhu:
A new bilateral filter for post-removing the noise of synthesis view in 3D video. 228-231
- Hongsheng Liu, Baozhu Guo, Zhizhong Fu, Xiaofeng Li:
A new active contour model based on complexity of textures for segmentation of natural image. 232-236
- Yifan Zhang, Ting Wang, Renjie He, Mingyi He:
Subpixel mapping of hyperspectral images with hybrid endmember library and optimized abundances. 237-241
- Yifan Zhang, Tuo Zhao, Mingyi He:
Hyperspectral and multispectral image fusion using local spatial-spectral dictionary pair. 242-246
- Cho-Ying Wu, Jian-Jiun Ding:
A fast non-convex regularizer for low rank matrix completion. 247-250
- Chia-Wei Wang, Tzu-Chieh Yang, Sheng-Ho Chiang, Tsaipei Wang:
Identifying and filling occlusion holes on planar surfaces for 3-D scene editing. 251-254
- Wisarut Chantara, Yo-Sung Ho:
Initial depth estimation using EPIs and structure tensor. 255-258
- Guiqing He, Siyuan Xing, Dandan Dong, Ximei Zhao:
Panchromatic and multi-spectral image fusion method based on two-step sparse representation and wavelet transform. 259-262
- Yuma Kinoshita, Taichi Yoshida, Sayaka Shiota, Hitoshi Kiya:
Pseudo multi-exposure fusion using a single image. 263-269
- Wen-Nung Lie, Chih-Hao Hu, Yi-Kai Chen, Jui-Chiu Chiang:
Multi-layer background sprite model for 2D-to-3D video conversion. 270-274
- Chao Zhang, Ce Zhu, Yipeng Liu, Hongdiao Wen, Zhengtao Wang:
Image ordinal estimation: Classification and regression benefit each other. 275-278
- Yusaku Akiyoshi, Taichi Sumi, Yoshimitsu Kuroki:
Dictionary design and disparity interpolation on distributed compressed sensing for light field image. 279-282
- Kwan-Jung Oh, Minsik Park, Jinwoong Kim:
Digital hologram data representation method. 283-286
- Yufei Zhao, Zhizhong Fu, Jin Xu, Linghua Mao:
Image fusion algorithm based on gradient similarity filter. 287-291
- Manoj Ramanathan, Wei-Yun Yau, Eam Khwang Teoh, Nadia Magnenat-Thalmann:
Pose-invariant kinematic features for action recognition. 292-299
- Tingtian Li, Daniel Pak-Kong Lun:
Salient object detection using array images. 300-303
- Jia Du, Wei Xiong, Wenyu Chen, Jierong Cheng, Ying Gu:
Accurate subset selection for pose estimation from uncertain points and lines. 304-308
- Xin Rong Soh, Vishnu Monn Baskaran, Adamu Muhammad Buhari, Raphael C.-W. Phan:
A real time micro-expression detection system with LBP-TOP on a many-core processor. 309-315
- Ryo Miyagi, Masaki Aono:
Sliced voxel representations with LSTM and CNN for 3D shape recognition. 320-323
- Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan:
Localization of harmonic source using a single moving sensor of known trajectory. 324-328
- Yi Yang Ang, Nam Nguyen, Joni Polili Lie, Woon-Seng Gan:
Grid-free compressive beamforming using a single moving sensor of known trajectory. 329-332
- Suraj Kumar Nayak, Karan Pande, Pratyush Kumar Patnaik, Shikshya Nayak, Shankar J. Patel, Arfat Anis, Anilesh Dey, Kunal Pal:
Understanding the effect of cannabis abuse on the ANS and cardiac physiology of the Indian women paddy-field workers using RR interval and ECG signal analyses. 333-341
- Phuttapong Sertsi, Surasak Boonkla, Vataya Chunwijitra, Nattapong Kurpukdee, Chai Wutiwiwatchai:
Robust voice activity detection based on LSTM recurrent neural networks and modulation spectrum. 342-346
- Yu-Siang Huang, Szu-Yu Chou, Yi-Hsuan Yang:
Music thumbnailing via neural attention modeling of music emotion. 347-350
- Shohei Mori, Hideo Saito:
Augmented visualization: Observing as desired. 351-356
- Kazuhisa Yamagishi:
QoE-estimation models for video streaming services. 357-363
- Kazuo Sugimoto, Robert A. Cohen, Dong Tian, Anthony Vetro:
Trends in efficient representation of 3D point clouds. 364-369
- Yohei Kawaguchi, Ryoichi Takashima, Takashi Endo, Masahito Togami:
Time-domain subsampling and reconstruction for microphone array. 370-374
- Yiqi Tew, Tiong Yew Tang, Yoon-Ket Lee:
A study on enhanced educational platform with adaptive sensing devices using IoT features. 375-379
- Yoon-Ket Lee, Jay Ming Lim, Kok Seng Eu, Yeh Huann Goh, Yiqi Tew:
Real time image processing based obstacle avoidance and navigation system for autonomous wheelchair application. 380-385
- Jian Han Lim, Eng Yeow Teh, Ming Han Geh, Chern Hong Lim:
Automated classroom monitoring with connected visioning system. 386-393
- Xin Li, Xueting Wei, Wei Zhou, Zhemin Duan:
Techniques for overheating detection and sensor allocation in a real dual-core processor. 394-400
- Jun Li, Keng Peng Tee, Lawrence Chen, Kong-Wah Wan, Wei-Yun Yau:
A perception system for robot arms to convey objects to in-car passengers. 401-408
- Yi Feng, Zhifeng Huang, Yun Zhang:
Motion planning of a 6-Dofs robot arm for bandaging nursing task. 409-413
- Jiadong Wang, Wenjuan Ouyang, Wenchao Gao, Qinyuan Ren:
Locomotion control of a serpentine crawling robot inspired by central pattern generators. 414-419
- Nicola Catenacci Volpi, Yan Wu, Dimitri Ognibene:
Towards event-based MCTS for autonomous cars. 420-427
- Yuya Chiba, Takashi Nose, Akinori Ito:
Analysis of efficient multimodal features for estimating user's willingness to talk: Comparison of human-machine and human-human dialog. 428-431
- Xia Bai, Jiatong Han, Juan Zhao:
Sparse-based disturbance cancellation approach for passive radar. 432-436
- Juan Zhao, Xia Bai:
An improved orthogonal matching pursuit based on randomly enhanced adaptive subspace pursuit. 437-441
- Shiori Mikami, Arata Kawamura, Youji Iiguni:
Residual drum sound estimation for RPCA singing voice extraction. 442-446
- Hyeonggwon Kim, Yoonsik Choe:
Background subtraction via truncated nuclear norm minimization. 447-451
- Yohei Kawaguchi, Sandra Ramaswami, Ryoichi Takashima, Takashi Endo, Rintaro Ikeshita:
Sub-Nyquist non-uniform sampling for low-cost sound monitoring. 452-456
- Valiantsin Belyi, Woon-Seng Gan:
Psychoacoustic subband active noise control algorithm. 457-463
- Shun Hirose, Yoshinobu Kajikawa:
Effectiveness of headrest ANC system with virtual sensing technique for factory noise. 464-468
- Dong-Yuan Shi, Chuang Shi, Woon-Seng Gan:
Effect of the audio amplifier's distortion on feedforward active noise control. 469-473
- Caixia Lu, Feiran Yang, Jun Yang:
A frequency-domain adaptive feedback cancellation algorithm based on convex combination. 474-477
- Kouei Yamaoka, Nobutaka Ono, Shoji Makino, Takeshi Yamada:
Abnormal sound detection by two microphones using virtual microphone technique. 478-482
- Feng Bao, Waleed H. Abdulla:
Signal power estimation based on convex optimization for speech enhancement. 483-487
- Yanhui Tu, Jun Du, Lei Sun, Chin-Hui Lee:
LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement. 488-491
- Zexin Liu, Heather T. Ma, Fei Chen:
A new data-driven band-weighting function for predicting the intelligibility of noise-suppressed speech. 492-496
- Miao Zhang, Yixiang Chen, Lantian Li, Dong Wang:
Speaker recognition with cough, laugh and "Wei". 497-501
- Hosana Kamiyama, Atsushi Ando, Satoshi Kobashikawa, Yushi Aono:
Robust children and adults speech identification and confidence measure based on DNN posteriorgram. 502-505
- Feng Li, Huihui Bai, Yao Zhao:
Visual attention guided eye movements for 360 degree images. 506-511
- Cairong Xing, Anhong Wang, Suyue Li, Peihao Li, Jing Zhang:
Random aliasing modulation with decision-directed demodulation. 512-515
- Chang Duan, Yuhuan Shen, Yingying Zhang, Shuai Wang, Ce Zhu, Meng Yang:
Enhancing wedgelet-based depth modeling in 3D-HEVC. 516-519
- Xiaoqiang Cao, Ce Zhu, Minjie Yang, Yongbing Lin, Jianhua Zheng:
A new intra prediction method based on consistent luminance changes. 520-523
- Szu-Wei Fu, Jian-Jiun Ding, Ying-Wun Huang, Ching-Wen Hsiao, Hsin-Hui Chen:
Collagen image compression using the JPEG-based predictive lossless coding scheme. 524-533
- Sze-Teng Liong, KokSheik Wong:
Micro-expression recognition using apex frame with phase information. 534-537
- Jierong Cheng, Wei Xiong, Jia Du, Wenyu Chen, Ying Gu:
Detection of meaningful line segment configurations. 538-541
- Jinyoung Jang, Dong-Won Shin, Yo-Sung Ho:
Disparity map refinement method using coarse-to-fine image segmentation. 542-545
- Dong-Won Shin, Yo-Sung Ho:
Local patch descriptor using deep convolutional generative adversarial network for loop closure detection in SLAM. 546-549
- Chen Chen, Shangwen Li, Xiang Fu, Yuzhuo Ren, Yueru Chen, C.-C. Jay Kuo:
Exploring confusing scene classes for the places dataset: Insights and solutions. 550-558
- Nirmesh J. Shah, Hemant A. Patil:
On the convergence of INCA algorithm. 559-562
- Maulik C. Madhavi, Hemant A. Patil:
Combining evidences from detection sources for query-by-example spoken term detection. 563-568
- Yuanjun Zhao, Roberto Togneri, Victor Sreeram:
Compressed high dimensional features for speaker spoofing detection. 569-572
- Vishnu Vidyadhara Raju Vegesna, Hari Krishna Vydana, Suryakanth V. Gangashetty, Anil Kumar Vuppala:
Importance of non-uniform prosody modification for speech recognition in emotion conditions. 573-576
- Chitralekha Gupta, Haizhou Li, Ye Wang:
Perceptual evaluation of singing quality. 577-586
- Kishin Migimatsu, Takuya Wakazono, Isao T. Tokuda:
Experimental study on source-filter interaction using physical model of the vocal folds. 587-590
- Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao, Hsin-Min Wang:
Fast locally linear embedding algorithm for exemplar-based voice conversion. 591-595
- Shengke Lin, Takashi Tsunakawa, Masafumi Nishida, Masafumi Nishimura:
DNN-based feature transformation for speech recognition using throat microphone. 596-599
- Hitoshi Yamamoto, Koji Okabe, Takafumi Koshinaka:
Robust i-vector extraction tightly coupled with voice activity detection using deep neural networks. 600-604
- Chen-Yen Lai, Yu-Wen Lo, Yih-Liang Shen, Tai-Shih Chi:
Plastic multi-resolution auditory model based neural network for speech enhancement. 605-609
- Kazuho Morikawa, Tomoki Toda:
Electrolaryngeal speech modification towards singing aid system for laryngectomees. 610-613
- Peixin Chen, Wu Guo, Qingnan Wang, Yan Song:
Topic classification based on distributed document representation and latent topic information. 614-617
- Michael Hentschel, Atsunori Ogawa, Marc Delcroix, Tomohiro Nakatani, Yuji Matsumoto:
Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs. 618-621
- Junfeng Hou, Shiliang Zhang, Li-Rong Dai, Hui Jiang:
Feedforward sequential memory networks based encoder-decoder model for machine translation. 622-625
- Yu Chen, Yanting Chen, Hua Lin, Jie Hou, Yutong Xing, Jianwu Dang:
A study of high level tone in Standard Chinese produced by prelingually deaf adults. 626-629
- Hao Zhang, Nan Yan, Lan Wang, Manwa L. Ng:
Energy distribution analysis and nonlinear dynamical analysis of phonation in patients with Parkinson's disease. 630-635