


default search action
IEEE Transactions on Multimedia, Volume 22
Volume 22, Number 1, January 2020
- Wenwu Zhu:
Message From the Outgoing Editor-in-Chief. 1 - Jiebo Luo
:
Editorial. 2 - S. Chandrakala
, S. L. Jayalakshmi
:
Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition. 3-14 - Kuo-Wei Chen, Ying-Sheng Luo
, Yu-Chi Lai
, Yan-Lin Chen, Chih-Yuan Yao
, Hung-Kuo Chu, Tong-Yee Lee
:
Image Vectorization With Real-Time Thin-Plate Spline. 15-29 - Joongchol Shin
, Minseo Kim, Joonki Paik
, Sangkeun Lee
:
Radiance-Reflectance Combined Optimization and Structure-Guided ℓ0-Norm for Single Image Dehazing. 30-44 - Linwei Zhu
, Sam Kwong
, Yun Zhang
, Shiqi Wang
, Xu Wang
:
Generative Adversarial Network-Based Intra Prediction for Video Coding. 45-58 - Wei Xiao
, Xiaolin Huang
, Fan He
, Jorge Silva
, Saba Emrani, Arin Chaudhuri:
Online Robust Principal Component Analysis With Change Point Detection. 59-68 - Javier Cubelos
, Pablo Carballeira
, Jesús Gutiérrez
, Narciso García
:
QoE Analysis of Dense Multiview Video With Head-Mounted Devices. 69-81 - Lixiang Li
, Guoqian Wen
, Zeming Wang, Yixian Yang:
Efficient and Secure Image Communication System Based on Compressed Sensing for IoT Monitoring Applications. 82-95 - Yufei Zha
, Tao Ku
, Yunqiang Li, Peng Zhang
:
Deep Position-Sensitive Tracking. 96-107 - Sarala Ghimire
, Jae Young Choi
, Bumshik Lee
:
Using Blockchain for Improved Video Integrity Verification. 108-121 - Sijie Mai
, Songlong Xing
, Haifeng Hu
:
Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing. 122-137 - Laura Cabrera Quiros
, David M. J. Tax, Hayley Hung:
Gestures In-The-Wild: Detecting Conversational Hand Gestures in Crowded Scenes Using a Multimodal Fusion of Bags of Video Trajectories and Body Worn Acceleration. 138-147 - Guoyun Tu
, Yanwei Fu
, Boyang Li
, Jiarui Gao
, Yu-Gang Jiang
, Xiangyang Xue
:
A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization. 148-159 - Zhengzheng Tu, Tian Xia, Chenglong Li
, Xiaoxiao Wang, Yan Ma, Jin Tang
:
RGB-T Image Saliency Detection via Collaborative Graph Learning. 160-173 - Jian Zhang
, Yuxin Peng
:
Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval. 174-187 - Yanbin Hao
, Chong-Wah Ngo, Benoit Huet
:
Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking. 188-200 - Xin-Lin Huang
, Xiao-Wei Tang
, Fei Hu
:
Dynamic Spectrum Access for Multimedia Transmission Over Multi-User, Multi-Channel Cognitive Radio Networks. 201-214 - Jiale Bai
, Zefan Li, Bingbing Ni
, Minsi Wang
, Xiaokang Yang, Chuanping Hu, Wen Gao:
Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval. 215-228 - Chenggang Yan, Yunbin Tu
, Xingzheng Wang, Yongbing Zhang, Xinhong Hao
, Yongdong Zhang, Qionghai Dai:
STAT: Spatial-Temporal Attention Mechanism for Video Captioning. 229-241 - Shafin Rahman
, Salman H. Khan
, Nick Barnes
:
Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging. 242-255 - Songtao Wu
, Sheng-hua Zhong
, Yan Liu:
A Novel Convolutional Neural Network for Image Steganalysis With Shared Normalization. 256-270 - Silvia Cascianelli
, Gabriele Costante
, Alessandro Devo
, Thomas A. Ciarfuglia
, Paolo Valigi
, Mario Luca Fravolini
:
The Role of the Input in Natural Language Video Description. 271-283
Volume 22, Number 2, February 2020
- Bin Xiao
, Ge Ou, Han Tang, Xiuli Bi
, Weisheng Li
:
Multi-Focus Image Fusion by Hessian Matrix Based Decomposition. 285-297 - Bo-Kyeong Kim
, Geon-min Kim, Soo-Young Lee
:
Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation. 298-310 - Ke Gu
, Zhifang Xia
, Junfei Qiao
, Weisi Lin
:
Deep Dual-Channel Neural Network for Image-Based Smoke Detection. 311-323 - Guangxiao Ma
, Chenglizhao Chen
, Shuai Li
, Chong Peng
, Aimin Hao, Hong Qin:
Salient Object Detection via Multiple Instance Joint Re-Learning. 324-336 - Haijun Liu
, Shiguang Wang
, Wen Wang
, Jian Cheng
:
Multi-Scale Based Context-Aware Net for Action Detection. 337-348 - Congxuan Zhang
, Liyue Ge, Zhen Chen
, Ming Li, Wen Liu
, Hao Chen:
Refined TV-L1 Optical Flow Estimation Using Joint Filtering. 349-364 - Youtian Du
, Xue Wang
, Yunbo Cui, Hang Wang
, Chang Su:
Kernel-Based Mixture Mapping for Image and Text Association. 365-379 - Shifeng Zhang
, Yiliang Xie, Jun Wan
, Hansheng Xia
, Stan Z. Li, Guodong Guo
:
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild. 380-393 - Ke Xu
, Tanfeng Sun
, Xinghao Jiang
:
Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network. 394-406 - Ming Cheung
, James She
:
Detecting Social Signals in User-Shared Images for Connection Discovery Using Deep Learning. 407-420 - Yeqiang Qian
, Ming Yang
, Xu Zhao
, Chunxiang Wang, Bing Wang:
Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera. 421-431 - Lixing Chen
, Linqi Song
, Jacob Chakareski
, Jie Xu
:
Collaborative Content Placement Among Wireless Edge Caching Stations With Time-to-Live Cache. 432-444 - Zeyu Xu
, Yang Cao
, Wei Wang
, Tao Jiang
, Qian Zhang
:
Incentive Mechanism for Cooperative Scalable Video Coding (SVC) Multicast Based on Contract Theory. 445-458 - Hao Chen
, Xu Zhang
, Yiling Xu
, Zhan Ma
, Wenjun Zhang
:
Efficient Mobile Video Streaming via Context-Aware RaptorQ-Based Unequal Error Protection. 459-473 - Kefan Xiao, Shiwen Mao
, Jitendra K. Tugnait
:
Robust QoE-Driven DASH Over OFDMA Networks. 474-486 - Cheng Shi, Chi-Man Pun
:
Multiscale Superpixel-Based Hyperspectral Image Classification Using Recurrent Neural Networks With Stacked Autoencoders. 487-501 - Pau Rodríguez
, Diego Velazquez Dorta
, Guillem Cucurull, Josep M. Gonfaus, F. Xavier Roca
, Jordi Gonzàlez
:
Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition. 502-514 - Wei Zhang
, Xuanyu He
, Weizhi Lu:
Exploring Discriminative Representations for Image Emotion Recognition With CNNs. 515-523 - Lihua Lu
, Yao Lu, Ruizhe Yu, Huijun Di, Lin Zhang
, Shunzhou Wang:
GAIM: Graph Attention Interaction Model for Collective Activity Recognition. 524-539 - Zheng Zhang, Qin Zou
, Yuewei Lin
, Long Chen
, Song Wang
:
Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval. 540-553 - Junnan Li
, Yongkang Wong
, Qi Zhao, Mohan S. Kankanhalli
:
Video Storytelling: Textual Summaries for Events. 554-565
Volume 22, Number 3, March 2020
- Xianjun Xia
, Roberto Togneri
, Ferdous Sohel
, Yuanjun Zhao
, David Huang
:
Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information. 569-578 - Ruben Verhack
, Thomas Sikora, Glenn Van Wallendael
, Peter Lambert
:
Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding. 579-593 - Qianru Jiang
, Sheng Li
, Zhihui Zhu
, Huang Bai
, Xiongxiong He
, Rodrigo C. de Lamare
:
Design of Compressed Sensing System With Probability-Based Prior Information. 594-609 - Pan Gao
, Manoranjan Paul
:
Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming. 610-625 - Zhaoqiang Xia
, Xiaopeng Hong
, Xingyu Gao
, Xiaoyi Feng
, Guoying Zhao
:
Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions. 626-640 - Fan Tang
, Weiming Dong
, Yiping Meng, Chongyang Ma, Fuzhang Wu, Xinrui Li
, Tong-Yee Lee
:
Image Retargetability. 641-654 - Xiaoting Fan
, Jianjun Lei
, Yuming Fang
, Qingming Huang
, Nam Ling, Chunping Hou:
Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending. 655-665 - Qiao Liu
, Zhenyu He
, Xin Li, Yuan Zheng:
PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark. 666-675 - Bo Yan, Xuejing Niu
, Bahetiyaer Bare
, Weimin Tan:
Semantic Segmentation Guided Pixel Fusion for Image Retargeting. 676-687 - Konstantina Fotiadou
, Grigorios Tsagkatakis
, Panagiotis Tsakalides
:
Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning. 688-703 - Chongyi Li
, Chunle Guo
, Jichang Guo, Ping Han, Huazhu Fu
, Runmin Cong
:
PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement. 704-716 - Wooyoung Jang
:
MLC STT-MRAM-Aware Memory Subsystem for Smart Image Applications. 717-729 - Jianwen Lou
, Yiming Wang
, Charles Nduka
, Mahyar Hamedi, Ifigeneia Mavridou
, Fei-Yue Wang
, Hui Yu
:
Realistic Facial Expression Reconstruction for VR HMD Users. 730-743 - Ching-Ling Fan
, Shou-Cheng Yen, Chun-Ying Huang, Cheng-Hsin Hsu
:
Optimizing Fixation Prediction Using Recurrent Neural Networks for 360$^{\circ }$ Video Streaming in Head-Mounted Virtual Reality. 744-759 - Chao Ma
, Chen Gong
, Xiang Li, Xiaolin Huang
, Wei Liu
, Jie Yang
:
Toward Making Unsupervised Graph Hashing Discriminative. 760-774 - Lingling Zhang
, Minnan Luo
, Jun Liu, Xiaojun Chang
, Yi Yang, Alexander G. Hauptmann:
Deep Top-$k$ Ranking for Image-Sentence Matching. 775-785 - Liping Zhao
, Tao Lin
, Dongyu Zhang, Kailun Zhou
, Shuhui Wang:
An Ultra-Low Complexity and High Efficiency Approach for Lossless Alpha Channel Coding. 786-794 - Cheng Zhan
, Han Hu
, Zhi Wang
, Rongfei Fan
, Dusit Niyato
:
Unmanned Aircraft System Aided Adaptive Video Streaming: A Joint Optimization Approach. 795-807 - Lingxiang Wu
, Min Xu
, Jinqiao Wang
, Stuart W. Perry
:
Recall What You See Continually Using GridLSTM in Image Captioning. 808-818 - Sebastian Agethen
, Winston H. Hsu
:
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos. 819-829 - Chenggang Yan, Yunbin Tu
, Xingzheng Wang, Yongbing Zhang, Xinhong Hao
, Yongdong Zhang, Qionghai Dai:
Corrections to "STAT: Spatial-Temporal Attention Mechanism for Video Captioning". 830
Volume 22, Number 4, April 2020
- Dayong Wang
, Yu Sun, Ce Zhu
, Weisheng Li
, Frédéric Dufaux
:
Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding. 833-845 - Deyang Liu
, Ping An
, Ran Ma, Wenfa Zhan, Xinpeng Huang
, Ali Abdullah Yahya:
Content-Based Light Field Image Compression Method With Gaussian Process Regression. 846-859 - Zhengxue Cheng
, Heming Sun
, Masaru Takeuchi
, Jiro Katto
:
Energy Compaction-Based Image Compression Using Convolutional AutoEncoder. 860-873 - Zhaoxia Yin
, Youzhi Xiang
, Xinpeng Zhang
:
Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding. 874-884 - Cheng Deng
, Xu Yang
, Feiping Nie
, Dapeng Tao
:
Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking. 885-896 - Chandramani Chaudhary, Poonam Goyal, Dhanashree Nellayi Prasad, Yi-Ping Phoebe Chen
:
Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base. 897-911 - Badri Narayan Subudhi
, Thangaraj Veerakumar
, Esakkirajan Sankaralingam, Ashish Ghosh
:
Kernelized Fuzzy Modal Variation for Local Change Detection From Video Scenes. 912-920 - Xun Liu
, Mischa Dohler
, Yansha Deng
:
Vibrotactile Quality Assessment: Hybrid Metric Design Based on SNR and SSIM. 921-933 - Yang Liu
, Volkan Kiliç
, Jian Guan
, Wenwu Wang
:
Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. 934-948 - Xiwen Liu
, Xiaoming Tao
, Mai Xu
, Yafeng Zhan
, Jianhua Lu:
An EEG-Based Study on Perception of Video Distortion Under Various Content Motion Conditions. 949-960 - Lukas Krasula
, Yoann Baveye, Patrick Le Callet:
Training Objective Image and Video Quality Estimators Using Multiple Databases. 961-969 - Muwei Jian
, Junyu Dong
, Maoguo Gong
, Hui Yu
, Liqiang Nie, Yilong Yin
, Kin-Man Lam:
Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment. 970-979 - Hyunmin Jung
, Hyuk-Jae Lee
, Chae-Eun Rhee
:
Flexibly Connectable Light Field System For Free View Exploration. 980-991 - Thanh-Toan Do
, Tuan Hoang
, Dang-Khoa Le Tan, Anh-Dzung Doan
, Ngai-Man Cheung
:
Compact Hash Code Learning With Binary Deep Neural Network. 992-1004 - Riza Arda Kirmizioglu
, A. Murat Tekalp
:
Multi-Party WebRTC Services Using Delay and Bandwidth Aware SDN-Assisted IP Multicasting of Scalable Video Over 5G Networks. 1005-1015 - Chung-Chi Tsai
, Kuang-Jui Hsu
, Yen-Yu Lin
, Xiaoning Qian
, Yung-Yu Chuang
:
Deep Co-Saliency Detection via Stacked Autoencoder-Enabled Fusion and Self-Trained CNNs. 1016-1031 - Wenqiao Zhang, Siliang Tang
, Yanpeng Cao
, Shiliang Pu, Fei Wu, Yueting Zhuang:
Frame Augmented Alternating Attention Network for Video Question Answering. 1032-1041 - Zewei He
, Yanpeng Cao, Lei Du, Baobei Xu, Jiangxin Yang, Yanlong Cao, Siliang Tang
, Yueting Zhuang:
MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution. 1042-1054 - Zhi Jin
, Muhammad Zafar Iqbal
, Dmytro Bobkov
, Wenbin Zou
, Xia Li, Eckehard G. Steinbach
:
A Flexible Deep CNN Framework for Image Restoration. 1055-1068 - Zhe Zhang
, Chung-Horng Lung
, Marc St-Hilaire
, Ioannis Lambadaris
:
An SDN-Based Caching Decision Policy for Video Caching in Information-Centric Networking. 1069-1083 - Shangfei Wang
, Longfei Hao
, Qiang Ji
:
Knowledge-Augmented Multimodal Deep Regression Bayesian Networks for Emotion Video Tagging. 1084-1097 - Tianliang Liu
, Junwei Wan
, Xiubin Dai
, Feng Liu
, Quanzeng You, Jiebo Luo
:
Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion. 1098-1110 - Zhaoqiang Xia
, Xiaopeng Hong
, Xingyu Gao
, Xiaoyi Feng
, Guoying Zhao
:
Corrections to "Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions". 1111
Volume 22, Number 5, May 2020
- Minxiang Ye
, Cheng Yang, Vladimir Stankovic
, Lina Stankovic
, Samuel Cheng
:
Distinct Feature Extraction for Video-Based Gait Phase Classification. 1113-1125 - Tilo Strutz
, Phillip Möller
:
Screen Content Compression Based on Enhanced Soft Context Formation. 1126-1138 - Chieh-Chi Kao
, Yu-Xiang Wang, Jonathan Waltman, Pradeep Sen
:
Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images. 1139-1152 - Yunxiao Li, Shuai Li
, Chenglizhao Chen
, Aimin Hao, Hong Qin:
Accurate and Robust Video Saliency Detection via Self-Paced Diffusion. 1153-1167 - Yongqing Liang
, Xin Li
:
Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver. 1168-1181 - Lin Xie
, Feifei Lee, Li Liu
, Zhong Yin, Qiu Chen
:
Hierarchical Coding of Convolutional Features for Scene Recognition. 1182-1192 - Ying Wang, Yifan Dong, Songtao Guo
, Yuanyuan Yang
, Xiaofeng Liao
:
Latency-Aware Adaptive Video Summarization for Mobile Edge Clouds. 1193-1207 - Xiongli Chai
, Feng Shao
, Qiuping Jiang, Yo-Sung Ho
:
MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution. 1208-1219 - Wenfeng Song
, Shuai Li
, Ji Liu, Aimin Hao, Qinping Zhao, Hong Qin
:
Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image. 1220-1233 - Weipeng Hu
, Haifeng Hu
:
Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition. 1234-1248 - Alexandra Covaci
, Estêvão Bissoli Saleme
, Gebremariam Mesfin, Nadia Hussain, Elahe Kani-Zabihi
, Gheorghita Ghinea
:
How Do We Experience Crossmodal Correspondent Mulsemedia Content? 1249-1258 - Tao Xiang
, Ying Yang, Shangwei Guo
:
Blind Night-Time Image Quality Assessment: Subjective and Objective Approaches. 1259-1272 - Luming Zhang
, Jianwei Yin, Ping Li
, Yongheng Shang, Roger Zimmermann, Ling Shao
:
Flickr Image Community Analytics by Deep Noise-Refined Matrix Factorization. 1273-1284 - Yehao Li
, Ting Yao
, Yingwei Pan
, Hongyang Chao
, Tao Mei
:
Deep Metric Learning With Density Adaptivity. 1285-1297 - Yujuan Ding
, Wai Keung Wong
, Zhihui Lai
, Yudong Chen
:
Study on 2D Feature-Based Hash Learning. 1298-1309 - Yiling Wu, Shuhui Wang
, Qingming Huang
:
Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval. 1310-1322 - Zhiyang Xia
, Ping Yi
, Yunyu Liu
, Bo Jiang, Wei Wang
, Ting Zhu:
GENPass: A Multi-Source Deep Learning Model for Password Guessing. 1323-1332 - Shuang Qiu
, Yao Zhao
, Jianbo Jiao
, Yunchao Wei
, Shikui Wei
:
Referring Image Segmentation by Generative Adversarial Learning. 1333-1344 - Yabin Zhang
, Kui Jia
, Zhixin Wang
:
Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network. 1345-1357 - Dongyu She
, Jufeng Yang
, Ming-Ming Cheng
, Yu-Kun Lai
, Paul L. Rosin
, Liang Wang:
WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection. 1358-1371 - Ning Xu
, Hanwang Zhang
, An-An Liu
, Weizhi Nie
, Yuting Su
, Jie Nie
, Yongdong Zhang
:
Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning. 1372-1383
Volume 22, Number 6, June 2020
- Yanxiong Li
, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He:
Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration. 1385-1394 - Miaohui Wang
, Jian Xiong
, Long Xu
, Wuyuan Xie
, King Ngi Ngan, Jing Qin
:
Rate Constrained Multiple-QP Optimization for HEVC. 1395-1406 - Yunfeng Zhang
, Ping Wang
, Fangxun Bao
, Xunxiang Yao, Caiming Zhang
, Hongwei Lin
:
A Single-Image Super-Resolution Method Based on Progressive-Iterative Approximation. 1407-1422 - Bingjie Xu
, Junnan Li
, Yongkang Wong
, Qi Zhao, Mohan S. Kankanhalli
:
Interact as You Intend: Intention-Driven Human-Object Interaction Detection. 1423-1432 - Federico Angelini
, Zeyu Fu
, Yang Long, Ling Shao
, Syed Mohsen Naqvi
:
2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling. 1433-1446