default search action
IEEE Transactions on Multimedia, Volume 23
Volume 23, 2021
- Fei Tao, Carlos Busso:
End-to-End Audiovisual Speech Recognition System With Multitask Learning. 1-11 - Hadi Hadizadeh, Ivan V. Bajic:
Soft Video Multicasting Using Adaptive Compressed Sensing. 12-25 - Angeliki V. Katsenou, Goce Dimitrov, Di Ma, David R. Bull:
BVI-SynTex: A Synthetic Video Texture Dataset for Video Compression and Quality Assessment. 26-38 - Chuanmin Jia, Falei Luo, Xinfeng Zhang, Shiqi Wang, Shanshe Wang, Siwei Ma:
Fast Non-Local Adaptive In-Loop Filter Optimization on GPU. 39-51 - Wenguang He, Zhanchuan Cai, Yaomin Wang:
High-Fidelity Reversible Image Watermarking Based on Effective Prediction Error-Pairs Modification. 52-63 - Kai Liu, Lei Gao, Naimul Mefraz Khan, Lin Qi, Ling Guan:
A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition. 64-76 - André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira:
Constant Size Point Cloud Clustering: A Compact, Non-Overlapping Solution. 77-91 - Ji Zhang, Kuizhi Mei, Yu Zheng, Jianping Fan:
Integrating Part of Speech Guidance for Image Captioning. 92-104 - Meihui Li, Lingbing Peng, Tianfu Wu, Zhenming Peng:
A Bottom-Up and Top-Down Integration Framework for Online Object Tracking. 105-119 - Shengjing Tian, Xiuping Liu, Meng Liu, Shuhua Li, Baocai Yin:
Siamese Tracking Network With Informative Enhanced Loss. 120-132 - Huijing Zhan, Chenyu Yi, Boxin Shi, Jie Lin, Ling-Yu Duan, Alex C. Kot:
Pose-Normalized and Appearance-Preserved Street-to-Shop Clothing Image Generation and Feature Learning. 133-144 - Weipeng Hu, Haifeng Hu:
Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition. 145-160 - Qian Bao, Wu Liu, Yuhao Cheng, Boyan Zhou, Tao Mei:
Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking. 161-175 - Yifei Huang, Sheng Qiu, Changbo Wang, Chenhui Li:
Learning Representations for High-Dynamic-Range Image Color Transfer in a Self-Supervised Way. 176-188 - Qing Zhang, Yongwei Nie, Lei Zhu, Chunxia Xiao, Wei-Shi Zheng:
Enhancing Underexposed Photos Using Perceptually Bidirectional Similarity. 189-202 - Nanjun Li, Faliang Chang, Chunsheng Liu:
Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes. 203-215 - Boyue Wang, Yongli Hu, Junbin Gao, Yanfeng Sun, Fujiao Ju, Baocai Yin:
Learning Adaptive Neighborhood Graph on Grassmann Manifolds for Video/Image-Set Subspace Clustering. 216-227 - Rui Wang, Xiao-Jun Wu, Josef Kittler:
Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features. 228-242 - Luca Rossetto, Ralph Gasser, Jakub Lokoc, Werner Bailer, Klaus Schoeffmann, Bernd Münzer, Tomás Soucek, Phuong Anh Nguyen, Paolo Bolettieri, Andreas Leibetseder, Stefanos Vrochidis:
Interactive Video Retrieval in the Age of Deep Learning - Detailed Evaluation of VBS 2019. 243-256 - Ke Li, Yuxia Wu, Yao Xue, Xueming Qian:
Viewpoint Recommendation Based on Object-Oriented 3D Scene Reconstruction. 257-267 - Haoran An, Hai-Miao Hu, Yuanfang Guo, Qianli Zhou, Bo Li:
Hierarchical Reasoning Network for Pedestrian Attribute Recognition. 268-280 - Shizhou Zhang, Qi Zhang, Yifei Yang, Xing Wei, Peng Wang, Bingliang Jiao, Yanning Zhang:
Person Re-Identification in Aerial Imagery. 281-291 - Li Liu, Gang Feng, Denis Beautemps, Xiao-Ping Zhang:
Re-Synchronization Using the Hand Preceding Model for Multi-Modal Fusion in Automatic Continuous Cued Speech Recognition. 292-305 - Zhuo Li, Hai-Miao Hu, Wei Zhang, Shiliang Pu, Bo Li:
Spectrum Characteristics Preserved Visible and Near-Infrared Image Fusion Algorithm. 306-319 - Leida Li, Yu Zhou, Jinjian Wu, Fu Li, Guangming Shi:
Quality Index for View Synthesis by Measuring Instance Degradation and Global Appearance. 320-332 - Conor Keighrey, Ronan Flynn, Siobhan Murray, Niall Murray:
A Physiology-Based QoE Comparison of Interactive Augmented Reality, Virtual Reality and Tablet-Based Applications. 333-341 - Yi Xu, Xianglong Liu, Binshuai Wang, Renshuai Tao, Ke Xia, Xianbin Cao:
Fast Nearest Subspace Search via Random Angular Hashing. 342-352 - Yan Wu, Xianglong Liu, Haotong Qin, Ke Xia, Sheng Hu, Yuqing Ma, Meng Wang:
Boosting Temporal Binary Coding for Large-Scale Video Search. 353-364 - Shintami Chusnul Hidayati, Ting Wei Goh, Ji-Sheng Gary Chan, Cheng-Chun Hsu, John See, Lai-Kuan Wong, Kai-Lung Hua, Yu Tsao, Wen-Huang Cheng:
Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes. 365-377 - Xueming Qian, Yuxia Wu, Mingdi Li, Yayun Ren, Shuhui Jiang, Zhetao Li:
LAST: Location-Appearance-Semantic-Temporal Clustering Based POI Summarization. 378-390 - Hajar Emami, Majid Moradi Aliabadi, Ming Dong, Ratna Babu Chinnam:
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation. 391-401 - Diego Valsesia, Giulia Fracastoro, Enrico Magli:
Learning Localized Representations of Point Clouds With Graph-Convolutional Generative Adversarial Networks. 402-414 - Inwoong Lee, Doyoung Kim, Sanghoon Lee:
3-D Human Behavior Understanding Using Generalized TS-LSTM Networks. 415-428 - Qiang Wang, Huijie Fan, Gan Sun, Weihong Ren, Yandong Tang:
Recurrent Generative Adversarial Network for Face Completion. 429-442 - Xiaoheng Jiang, Li Zhang, Tianzhu Zhang, Pei Lv, Bing Zhou, Yanwei Pang, Mingliang Xu, Changsheng Xu:
Density-Aware Multi-Task Learning for Crowd Counting. 443-453 - Sheng Zhang, Yuliang Liu, Lianwen Jin, Zhongrong Wei, Chunhua Shen:
OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection. 454-467 - Mengyan Li, Zhaoyu Zhang, Jun Yu, Chang Wen Chen:
Learning Face Image Super-Resolution Through Facial Semantic Attribute Transformation and Self-Attentive Structure Enhancement. 468-483 - Xusong Chen, Dong Liu, Zhiwei Xiong, Zheng-Jun Zha:
Learning and Fusing Multiple User Interest Representations for Micro-Video and Movie Recommendations. 484-496 - Guofei Sun, Yongkang Wong, Zhiyong Cheng, Mohan S. Kankanhalli, Weidong Geng, Xiangdong Li:
DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning. 497-509 - Yuqi Gao, Jitao Sang, Chengpeng Fu, Zhengjia Wang, Tongwei Ren, Changsheng Xu:
Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event Search. 510-523 - Jingcai Guo, Song Guo:
A Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion. 524-537 - Guangyu Li, Lina Qiu, Chenguang Yu, Houwei Cao, Yong Liu, Can Yang:
IPTV Channel Zapping Recommendation With Attention Mechanism. 538-549 - Qiubin Lin, Wenming Cao, Zhiquan He, Zhihai He:
Mask Cross-Modal Hashing Networks. 550-558 - Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang:
Augmented Adversarial Training for Cross-Modal Retrieval. 559-571 - Hao Yang, Li Liu, Weidong Min, Xiaosong Yang, Xin Xiong:
Driver Yawning Detection Based on Subtle Facial Action Recognition. 572-583 - Hao Chen, Ming Lu, Zhan Ma, Xu Zhang, Yiling Xu, Qiu Shen, Wenjun Zhang:
Learned Resolution Scaling Powered Gaming-as-a-Service at Scale. 584-596 - Qiaokang Xie, Wengang Zhou, Guo-Jun Qi, Qi Tian, Houqiang Li:
Progressive Unsupervised Person Re-Identification by Tracklet Association With Spatio-Temporal Regularization. 597-610 - Xiaodan Zhang, Xinbo Gao, Wen Lu, Lihuo He, Jie Li:
Beyond Vision: A Multimodal Recurrent Attention Convolutional Neural Network for Unified Image Aesthetic Prediction Tasks. 611-623 - Chang Tang, Xinwang Liu, Shan An, Pichao Wang:
BR$^2$Net: Defocus Blur Detection Via a Bidirectional Channel Attention Residual Refining Network. 624-635 - Pauline Puteaux, William Puech:
A Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload. 636-650 - Laizhong Cui, Dongyuan Su, Shu Yang, Zhi Wang, Zhong Ming:
TCLiVi: Transmission Control in Live Video Streaming Based on Deep Reinforcement Learning. 651-663 - Xiao Lin, Lizhuang Ma, Bin Sheng, Zhi-Jie Wang, Wansheng Chen:
Utilizing Two-Phase Processing With FBLS for Single Image Deraining. 664-676 - Bogdan Ionescu, Maia Rohm, Bogdan Boteanu, Alexandru-Lucian Gînsca, Mihai Lupu, Henning Müller:
Benchmarking Image Retrieval Diversification Techniques for Social Media. 677-691 - Xuejin Wang, Qiuping Jiang, Feng Shao, Ke Gu, Guangtao Zhai, Xiaokang Yang:
Exploiting Local Degradation Characteristics and Global Statistical Properties for Blind Quality Assessment of Tone-Mapped HDR Images. 692-705 - Ohini Kafui Toffa, Max Mignotte:
A Hierarchical Visual Feature-Based Approach For Image Sonification. 706-715 - Xueshi Hou, Sujit Dey, Jianzhong Zhang, Madhukar Budagavi:
Predictive Adaptive Streaming to Enable Mobile 360-Degree and VR Experiences. 716-731 - Shaohui Mei, Mingyang Ma, Shuai Wan, Junhui Hou, Zhiyong Wang, David Dagan Feng:
Patch Based Video Summarization With Block Sparse Representation. 732-747 - Minglang Qiao, Mai Xu, Zulin Wang, Ali Borji:
Viewport-Dependent Saliency Prediction in 360° Video. 748-760 - Yijun Cao, Chuan Lin, Yong-Jie Li:
Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss. 761-771 - Yifan Zuo, Yuming Fang, Ping An, Xiwu Shang, Junnan Yang:
Frequency-Dependent Depth Map Enhancement via Iterative Depth-Guided Affine Transformation and Intensity-Guided Refinement. 772-783 - Yuan Gao, Maoguo Gong, Yu Xie, Alex Kai Qin:
An Attention-Based Unsupervised Adversarial Model for Movie Review Spam Detection. 784-796 - Jiachen Yang, Tianlin Liu, Bin Jiang, Wen Lu, Qinggang Meng:
Panoramic Video Quality Assessment Based on Non-Local Spherical CNN. 797-809 - Yiming Li, Changhong Fu, Ziyuan Huang, Yinqiang Zhang, Jia Pan:
Intermittent Contextual Learning for Keyfilter-Aware UAV Object Tracking Using Deep Convolutional Feature. 810-822 - Longyu Yang, Hanli Wang, Pengjie Tang, Qinyu Li:
CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions. 835-845 - Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei:
Single Shot Video Object Detector. 846-858 - Shiquan Zhang, Xu Zhao, Liangji Fang:
CAT: Corner Aided Tracking With Deep Regression Network. 859-870 - Zhengguang Zhou, Wengang Zhou, Xutao Lv, Xuan Huang, Xiaoyu Wang, Houqiang Li:
Progressive Learning of Low-Precision Networks for Image Classification. 871-882 - Jianyu Yang, Wu Liu, Junsong Yuan, Tao Mei:
Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition. 883-898 - Shaobo Min, Xuejin Chen, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition. 899-910 - Philipp Schulz, Henrik Klessig, Meryem Simsek, Gerhard P. Fettweis:
Modeling QoE for Buffered Video Streaming in Interference-Limited Cellular Networks. 911-925 - Pengcheng Gao, Ke Lu, Jian Xue, Ling Shao, Jiayi Lyu:
A Coarse-to-Fine Facial Landmark Detection Method Based on Self-attention Mechanism. 926-938 - Lingchen Gu, Ju Liu, Xiaoxi Liu, Jiande Sun:
Deep Loss Driven Multi-Scale Hashing Based on Pyramid Connected Network. 939-954 - Yuming Fang, Jiebin Yan, Rengang Du, Yifan Zuo, Wenying Wen, Yan Zeng, Leida Li:
Blind Quality Assessment for Tone-Mapped Images by Analysis of Gradient and Chromatic Statistics. 955-966 - Di Liu, Kao Zhang, Zhenzhong Chen:
Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection. 967-981 - Hong Zhong, Fei Wu, Yan Xu, Jie Cui:
QoS-Aware Multicast for Scalable Video Streaming in Software-Defined Networks. 982-994 - Hengcan Shi, Hongliang Li, Qingbo Wu, King Ngi Ngan:
Query Reconstruction Network for Referring Expression Image Segmentation. 995-1007 - Weiling Chen, Ke Gu, Tiesong Zhao, Gangyi Jiang, Patrick Le Callet:
Semi-Reference Sonar Image Quality Assessment Based on Task and Visual Perception. 1008-1020 - Weizhi Nie, Wen-Wu Jia, Wenhui Li, An-An Liu, Sicheng Zhao:
3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval. 1021-1034 - Lei Zhou, Chen Gong, Zhi Liu, Keren Fu:
SAL: Selection and Attention Losses for Weakly Supervised Semantic Segmentation. 1035-1048 - Jia-Li Yin, Bo-Hao Chen, Yan-Tsung Peng, Chung-Chi Tsai:
Deep Battery Saver: End-to-End Learning for Power Constrained Contrast Enhancement. 1049-1059 - Lei Liu, Jie Jiang, Wenjing Jia, Saeed Amirgholipour, Yi Wang, Michelle Zeibots, Xiangjian He:
DENet: A Universal Network for Counting Crowd With Varying Densities and Scales. 1060-1068 - Yumo Zhang, Zhanchuan Cai, Gangqiang Xiong:
A New Image Compression Algorithm Based on Non-Uniform Partition and U-System. 1069-1082 - Erik Quintanilla, Yogesh S. Rawat, Andrey Sakryukin, Mubarak Shah, Mohan S. Kankanhalli:
Adversarial Learning for Personalized Tag Recommendation. 1083-1094 - Gebremariam Mesfin, Estêvão Bissoli Saleme, Oluwakemi Adewunmi Ademoye, Elahe Kani-Zabihi, Celso A. S. Santos, Gheorghita Ghinea:
Less is (Just as Good as) More - an Investigation of Odor Intensity and Hedonic Valence in Mulsemedia QoE using Heart Rate and Eye Tracking. 1095-1105 - Mingliang Zhou, Xuekai Wei, Sam Kwong, Weijia Jia, Bin Fang:
Rate Control Method Based on Deep Reinforcement Learning for Dynamic Video Sequences in HEVC. 1106-1121 - Huiyu Mo, Leibo Liu, Wenping Zhu, Qiang Li, Shouyi Yin, Shaojun Wei:
A 460 GOPS/W Improved Mnemonic Descent Method-Based Hardwired Accelerator for Face Alignment. 1122-1135 - Reza Ghazalian, Ali Aghagolzadeh, Seyed Mehdi Hosseini Andargoli:
Energy Optimization and QoE Satisfaction for Wireless Visual Sensor Networks in Multi Target Tracking Scenario. 823-834 - Ya Lu, Thomai Stathopoulou, Maria F. Vasiloglou, Stergios Christodoulidis, Zeno Stanga, Stavroula G. Mougiakakou:
An Artificial Intelligence-Based System to Assess Nutrient Intake for Hospitalised Patients. 1136-1147 - Jinjian Wu, Chuanwei Ma, Leida Li, Weisheng Dong, Guangming Shi:
Probabilistic Undirected Graph Based Denoising Method for Dynamic Vision Sensor. 1148-1159 - Xiaoguang Tu, Jian Zhao, Mei Xie, Zihang Jiang, Akshaya Balamurugan, Yao Luo, Yang Zhao, Lingxiao He, Zheng Ma, Jiashi Feng:
3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild. 1160-1172 - Xuejin Wang, Feng Shao, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho:
Measuring Coarse-to-Fine Texture and Geometric Distortions for Quality Assessment of DIBR-Synthesized Images. 1173-1186 - Xiangtao Zheng, Lei Qi, Yutao Ren, Xiaoqiang Lu:
Fine-Grained Visual Categorization by Localizing Object Parts With Single Image. 1187-1199 - Yaohui Zhu, Weiqing Min, Shuqiang Jiang:
Attribute-Guided Feature Learning for Few-Shot Image Recognition. 1200-1209 - Ruotao Xu, Yong Xu, Yuhui Quan:
Factorized Tensor Dictionary Learning for Visual Tensor Data Completion. 1225-1238 - Min Cao, Chen Chen, Hao Dou, Xiyuan Hu, Silong Peng, Arjan Kuijper:
Progressive Bilateral-Context Driven Model for Post-Processing Person Re-Identification. 1239-1251 - Xin Fan, Shichao Cheng, Kang Huyan, Minjun Hou, Risheng Liu, Zhongxuan Luo:
Dual Neural Networks Coupling Data Regression With Explicit Priors for Monocular 3D Face Reconstruction. 1252-1263 - Huasong Zhong, Jingyuan Chen, Chen Shen, Hanwang Zhang, Jianqiang Huang, Xian-Sheng Hua:
Self-Adaptive Neural Module Transformer for Visual Question Answering. 1264-1273 - Zijian Wang, Zheng Zhang, Yadan Luo, Zi Huang, Heng Tao Shen:
Deep Collaborative Discrete Hashing With Semantic-Invariant Structure Construction. 1274-1286 - Le Wang, Xin Lv, Qilin Zhang, Zhenxing Niu, Nanning Zheng, Gang Hua:
Object Cosegmentation in Noisy Videos With Multilevel Hypergraph. 1287-1300 - Ting Lan, Zhanchuan Cai:
A Novel Image Representation Method Under a Non-Standard Positional Numeral System. 1301-1315 - Yuxin Wang, Hongtao Xie, Zhengjun Zha, Youliang Tian, Zilong Fu, Yongdong Zhang:
R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection. 1316-1329 - Aouaidjia Kamel, Bin Sheng, Ping Li, Jinman Kim, David Dagan Feng:
Hybrid Refinement-Correction Heatmaps for Human Pose Estimation. 1330-1342 - Bo Jiang, Zitai Zhou, Xiao Wang, Jin Tang, Bin Luo:
cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks. 1343-1353 - Yang Li, Zhiqun Zhao, Hao Sun, Yigang Cen, Zhihai He:
Snowball: Iterative Model Evolution and Confident Sample Discovery for Semi-Supervised Learning on Very Small Labeled Datasets. 1354-1366 - Thanh Tuan Nguyen, Thanh Phuong Nguyen, Frédéric Bouchara:
Prominent Local Representation for Dynamic Textures Based on High-Order Gaussian-Gradients. 1367-1382 - Jing Li, Hongtao Huo, Chang Li, Renhua Wang, Qi Feng:
AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks. 1383-1396 - Junxia Li, Zefeng Pan, Qingshan Liu, Ziyang Wang:
Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection. 1397-1409 - Fangbing Zhang, Tao Yang, Linfeng Liu, Bang Liang, Yi Bai, Jing Li:
Image-Only Real-Time Incremental UAV Image Mosaic for Multi-Strip Flight. 1410-1425 - Xunxiang Yao, Qiang Wu, Peng Zhang, Fangxun Bao:
Weighted Adaptive Image Super-Resolution Scheme Based on Local Fractal Feature and Image Roughness. 1426-1441 - Qinghua Ren, Shijian Lu, Jinxia Zhang, Renjie Hu:
Salient Object Detection by Fusing Local and Global Contexts. 1442-1453