![](https://dblp.uni-trier.de/img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
default search action
IEEE Transactions on Multimedia, Volume 25
Volume 25, 2023
- Zan-Xia Jin
, Heran Wu, Chun Yang, Fang Zhou
, Jingyan Qin, Lei Xiao, Xu-Cheng Yin
:
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering. 1-12 - Yu Wang
, Shiwei Chen:
Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion. 13-23 - Jiayi Xie, Yaochen Zhu
, Zhenzhong Chen
:
Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck. 24-37 - Zhicheng Guo
, Jiaxuan Zhao
, Licheng Jiao
, Xu Liu
, Fang Liu
:
A Universal Quaternion Hypergraph Network for Multimodal Video Question Answering. 38-49 - Xiao Lin
, Shuzhou Sun, Wei Huang, Bin Sheng
, Ping Li
, David Dagan Feng
:
EAPT: Efficient Attention Pyramid Transformer for Image Processing. 50-61 - Zhi Li
, Haoliang Li
, Xin Luo, Yongjian Hu
, Kwok-Yan Lam
, Alex C. Kot
:
Asymmetric Modality Translation for Face Presentation Attack Detection. 62-76 - Wei Lu
, Desheng Li, Liqiang Nie
, Peiguang Jing
, Yuting Su
:
Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification. 77-89 - Yun Wang
, Tong Zhang
, Chuanwei Zhou
, Zhen Cui
, Jian Yang
:
Instance-Aware Deep Graph Learning for Multi-Label Classification. 90-99 - Jae Young Choi
, Bumshik Lee
:
Combining Deep Convolutional Neural Networks With Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild. 100-111 - Zerui Shao
, Yifei Pu, Jiliu Zhou, Bihan Wen
, Yi Zhang
:
Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling on-the-Fly for Moving Object Detection. 112-125 - Yajing Liu, Zhiwei Xiong
, Ya Li, Xinmei Tian
, Zheng-Jun Zha
:
Domain Generalization Via Encoding and Resampling in a Unified Latent Space. 126-139 - Hangwei Chen
, Xiongli Chai
, Feng Shao
, Xuejin Wang, Qiuping Jiang
, Xiangchao Meng
, Yo-Sung Ho
:
Perceptual Quality Assessment of Cartoon Images. 140-153 - Yang Li
, Shengbin Meng, Xinfeng Zhang
, Meng Wang
, Shiqi Wang
, Yue Wang, Siwei Ma
:
User-Generated Video Quality Assessment: A Subjective and Objective Study. 154-166 - Yan Yang
, Jun Yu
, Jian Zhang
, Weidong Han
, Hanliang Jiang, Qingming Huang
:
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation. 167-178 - Hancheng Zhu
, Yong Zhou
, Leida Li
, Yaqian Li
, Yandong Guo:
Learning Personalized Image Aesthetics From Subjective and Objective Attributes. 179-190 - Jun Cheng
, Fusheng Hao
, Fengxiang He
, Liu Liu
, Qieshi Zhang
:
Mixer-Based Semantic Spread for Few-Shot Learning. 191-202 - Haojie Yuan
, Qi Chu
, Feng Zhu
, Rui Zhao, Bin Liu
, Nenghai Yu
:
AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks. 203-213 - Zefan Li
, Bingbing Ni
, Xiaokang Yang
, Wenjun Zhang
, Wen Gao:
Residual Quantization for Low Bit-Width Neural Networks. 214-227 - Zhaoliang Chen
, Jie Yao, Guobao Xiao
, Shiping Wang
:
Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation. 228-242 - Tong Xue
, Abdallah El Ali
, Tianyi Zhang
, Gangyi Ding, Pablo César
:
CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360$^\circ$ VR Videos. 243-255 - Gaosheng Liu
, Huanjing Yue
, Jiamin Wu
, Jing-Yu Yang
:
Intra-Inter View Interaction Network for Light Field Image Super-Resolution. 256-266 - Zhihao Wu
, Jie Wen
, Yong Xu
, Jian Yang
, David Zhang
:
Multiple Instance Detection Networks With Adaptive Instance Refinement. 267-279 - Yanhua Yang, Xiaozhe Zhang, Muli Yang
, Cheng Deng
:
Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning. 280-290 - Tung-I Chen, Yueh-Cheng Liu
, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh
, Wen-Chin Chen, Winston H. Hsu
:
Dual-Awareness Attention for Few-Shot Object Detection. 291-301 - Laizhong Cui
, Erchao Ni, Yipeng Zhou
, Zhi Wang
, Lei Zhang
, Jiangchuan Liu
, Yuedong Xu
:
Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution. 302-314 - Sutong Wang
, Jiacheng Zhu, Yunqiang Yin, Dujuan Wang
, T. C. Edwin Cheng
, Yanzhang Wang:
Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal. 315-328 - Zhihao Zhang
, Xianqiang Yang
, Chao Xu
:
Natural Image Stitching With Layered Warping Constraint. 329-338 - Hao Tang
, Guoshuai Zhao
, Yuxia Wu
, Xueming Qian
:
Multisample-Based Contrastive Loss for Top-K Recommendation. 339-351 - Ke Zhang
, Chun Yuan
, Yiming Zhu, Yong Jiang
, Lishu Luo:
Weakly Supervised Instance Segmentation by Exploring Entire Object Regions. 352-363 - Astha Verma
, A. Venkata Subramanyam
, Zheng Wang
, Shin'ichi Satoh
, Rajiv Ratn Shah:
Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation. 364-377 - Carlos M. Lentisco
, Luis Bellido
, Andrés Cárdenas
, Ricardo Flores Moyano
, David Fernández
:
Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery. 378-388 - Huicong Wu
, Liang Xiao, Le Sun
, Byeungwoo Jeon
:
A Novel Video Stabilization Model With Motion Morphological Component Priors. 389-404 - Xuehao Gao
, Yang Yang
, Yimeng Zhang
, Maosen Li
, Jin-Gang Yu
, Shaoyi Du
:
Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition. 405-417 - Cheng Xue, Xionghu Zhong
, Minjie Cai
, Hao Chen
, Wenwu Wang
:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. 418-429 - Guang Han
, Jinpeng Su, Yaoming Liu, Yuqiu Zhao, Sam Kwong
:
Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network. 430-442 - Lei Yu
, Bishan Wang
, Jingwei He, Gui-Song Xia
, Wen Yang
:
Single Image Deraining With Continuous Rain Density Estimation. 443-456 - Jianjun Xiang
, Gangyi Jiang
, Mei Yu
, Zhidi Jiang
, Yo-Sung Ho
:
No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform. 457-472 - Mehdi Rahmati
, Zhuoran Qi
, Dario Pompili:
Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems. 473-485 - Nan Jiang
, Kuiran Wang, Xiaoke Peng
, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li
, Guodong Guo, Qixiang Ye
, Jianbin Jiao
, Jian Zhao
, Zhenjun Han
:
Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. 486-500 - Yujie Huang
, Ming-e Jing, Jinjia Zhou
, Yuhao Liu
, Yibo Fan
:
LCCStyle: Arbitrary Style Transfer With Low Computational Complexity. 501-514 - Jing Yi, Yaochen Zhu
, Jiayi Xie, Zhenzhong Chen
:
Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation. 515-528 - Luntian Mou
, Chao Zhou, Pengtao Xie, Pengfei Zhao
, Ramesh C. Jain
, Wen Gao, Baocai Yin
:
Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion. 529-542 - Wenhui Li
, Yan Wang, Yuting Su
, Xuanya Li
, An-An Liu
, Yongdong Zhang
:
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching. 543-556 - Yongqiang Kong
, Yunhong Wang
, Annan Li
, Qiuyu Huang:
Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection. 557-571 - Qinchuan Zhang
, Yi Jiang, Qin Zhou, Yiru Zhao, Yao Liu, Hongtao Lu
, Xian-Sheng Hua
:
Single Person Dense Pose Estimation via Geometric Equivariance Consistency. 572-583 - Kailun Zhou
, Liping Zhao
, Zigao Ye
, Huihui Wang, Tao Lin
, Sheng Feng
, Yufen Yang
:
Equal Value String and Copy Above String Based String Prediction for SCC in AVS3. 584-592 - Maja Krivokuca
, Ehsan Miandji
, Christine Guillemot
, Philip A. Chou
:
Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms. 593-607 - Xiaoqing Luo
, Yuanhao Gao, Anqi Wang
, Zhancheng Zhang
, Xiaojun Wu
:
IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning. 608-623 - Shihao Xu
, Haocong Rao
, Xiping Hu
, Jun Cheng
, Bin Hu
:
Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition. 624-634 - Huabing Zhou
, Wei Wu
, Yanduo Zhang
, Jiayi Ma
, Haibin Ling
:
Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network. 635-648 - Ming Li, Bin Fu
, Zhengfu Zhang, Yu Qiao
:
Character-Aware Sampling and Rectification for Scene Text Recognition. 649-661 - Mingyue Su
, Guanghua Gu
, Xianlong Ren, Hao Fu, Yao Zhao
:
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing. 662-675 - Lei Zhu
, Xiaoqiang Wang
, Ping Li
, Xin Yang, Qing Zhang, Weiming Wang
, Carola-Bibiane Schönlieb
, C. L. Philip Chen
:
S $^3$ Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. 676-689 - Xinjue Hu
, Yuxuan Pan
, Yumei Wang
, Lin Zhang, Shervin Shirmohammadi
:
Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression. 690-705 - Le Wang
, Qing Li, Sanping Zhou, Nanning Zheng
:
Multi-Panda Tracking. 706-720 - Changsheng Gao, Dong Liu
, Li Li
, Feng Wu:
Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics. 721-735 - Pei Lv
, Jianqi Fan
, Xixi Nie
, Weiming Dong
, Xiaoheng Jiang
, Bing Zhou
, Mingliang Xu
, Changsheng Xu
:
User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning. 736-749 - Xiao Tan
, Huaian Chen
, Kai Xu
, Yi Jin
, Changan Zhu
:
Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes. 750-763 - Zhen Bai
, Zhi Liu
, Gongyang Li
, Yang Wang
:
Adaptive Group-Wise Consistency Network for Co-Saliency Detection. 764-776 - Chenghu Du
, Feng Yu
, Minghua Jiang
, Ailing Hua, Xiong Wei, Tao Peng
, Xinrong Hu:
VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment. 777-791 - Shiji Zhou
, Zhi Wang
, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang
, Chuan Wu
, Wenwu Zhu
:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. 792-804 - Shuyi Li
, Bob Zhang
, Lunke Fei
, Shuping Zhao
, Yicong Zhou
:
Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition. 805-815 - Wenxue Cui
, Shaohui Liu
, Feng Jiang
, Debin Zhao
:
Image Compressed Sensing Using Non-Local Neural Network. 816-830 - Nastaran Nourbakhsh Kaashki
, Pengpeng Hu
, Adrian Munteanu
:
Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction. 831-844 - Xiaoyan Cai
, Sen Liu, Junwei Han
, Libin Yang
, Zhenguo Liu, Tianming Liu
:
ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization. 845-855 - Xuemeng Song
, Shi-Ting Fang
, Xiaolin Chen
, Yinwei Wei
, Zhongzhou Zhao, Liqiang Nie
:
Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling. 856-867 - Jie Nie
, Zian Zhao
, Lei Huang
, Weizhi Nie
, Zhiqiang Wei:
Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion. 868-880 - Haimin Zhang
, Min Xu
:
Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning. 881-891 - Fei Peng
, Bo Long, Min Long
:
A Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy. 892-906 - Karam Park
, Jae Woong Soh
, Nam Ik Cho
:
A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution. 907-918 - Ming Li
, Jun Liu
, Ce Zheng
, Xinming Huang
, Ziming Zhang:
Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. 919-929 - Liyuan Ma
, Kejie Huang
, Dongxu Wei
, Zhaoyan Ming
, Haibin Shen
:
FDA-GAN: Flow-Based Dual Attention GAN for Human Pose Transfer. 930-941 - Chongyang Bai
, Haipeng Chen
, Srijan Kumar, Jure Leskovec
, V. S. Subrahmanian
:
M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion. 942-952 - Prasen Kumar Sharma
, Arun Abraham
, Vikram Nelvoy Rajendiran
:
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics. 953-965 - Fan Zhao
, Wenda Zhao
, Huimin Lu
, Yong Liu
, Libo Yao, Yu Liu
:
Depth-Distilled Multi-Focus Image Fusion. 966-978 - Xuanhan Wang
, Yuyu Guo
, Jingkuan Song
, Lianli Gao
, Heng Tao Shen
:
AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences. 979-992 - Tiejian Zhang
, Xinwang Liu
, Lei Gong
, Siwei Wang
, Xin Niu
, Li Shen:
Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization. 993-1007 - Yiming Wang
, Dongxia Chang
, Zhiqiang Fu, Yao Zhao
:
Consistent Multiple Graph Embedding for Multi-View Clustering. 1008-1018 - Jingjing Xiong
, Lai-Man Po
, Wing Yin Yu
, Yuzhi Zhao
, Kwok-Wai Cheung:
Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation. 1019-1032 - Wei Qin
, Hanwang Zhang
, Richang Hong
, Ee-Peng Lim
, Qianru Sun
:
Causal Interventional Training for Image Recognition. 1033-1044 - Shikun Li
, Tongliang Liu
, Jiyong Tan
, Dan Zeng
, Shiming Ge
:
Trustable Co-Label Learning From Multiple Noisy Annotators. 1045-1057 - Jiebo Luo
:
Editorial. 1058-1059 - Yonggang Wen
:
Editorial. 1060 - Wenqian Wang
, Faliang Chang
, Chunsheng Liu
, Guangxin Li
, Bin Wang:
GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition. 1061-1073 - Qifan Wang
, Yinwei Wei
, Jianhua Yin
, Jianlong Wu
, Xuemeng Song
, Liqiang Nie
:
DualGNN: Dual Graph Neural Network for Multimedia Recommendation. 1074-1084 - Xiaoping Liang
, Zhenjun Tang
, Jingli Wu, Zhixin Li
, Xinpeng Zhang
:
Robust Image Hashing With Isomap and Saliency Map for Copy Detection. 1085-1097 - Shuping Zhao
, Lunke Fei
, Jie Wen
, Jigang Wu
, Bob Zhang
:
Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering. 1098-1110 - Shixiang Wu, Chao Dong
, Yu Qiao
:
Blind Image Restoration Based on Cycle-Consistent Network. 1111-1124 - Jose Jaena Mari Ople, Tai-Ming Huang
, Ming-Chih Chiu
, Yi-Ling Chen
, Kai-Lung Hua
:
Adjustable Model Compression Using Multiple Genetic Algorithm. 1125-1132 - Le Wang
, Mo Zhou
, Zhenxing Niu, Qilin Zhang
, Nanning Zheng
:
Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding. 1133-1147 - Weide Liu
, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin
:
Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation. 1148-1160 - Ziqiang Wang
, Zhi Liu
, Gongyang Li
, Yang Wang
, Tianhong Zhang, Lihua Xu, Jijun Wang:
Spatio-Temporal Self-Attention Network for Video Saliency Prediction. 1161-1174 - Rui Wang
, Jun Liu
, Qiuhong Ke
, Duo Peng
, Yinjie Lei
:
Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition. 1175-1189 - Cheng Wang
, Bingpeng Ma
, Hong Chang
, Shiguang Shan
, Xilin Chen
:
Person Search by a Bi-Directional Task-Consistent Learning Model. 1190-1203 - Jipeng Wu
, Rongrong Ji
, Qiang Wang, Shengchuan Zhang
, Xiaoshuai Sun
, Yan Wang
, Mingliang Xu
, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. 1204-1216 - Di Wang
, Caiping Zhang, Quan Wang
, Yumin Tian, Lihuo He
, Lin Zhao
:
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval. 1217-1229 - Min Cao
, Cong Ding
, Chen Chen
, Hao Dou, Xiyuan Hu
, Junchi Yan
:
Progressive Context-Aware Graph Feature Learning for Target Re-Identification. 1230-1242 - Yuting Su
, Wei Zhao, Peiguang Jing
, Liqiang Nie
:
Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions. 1243-1255 - Gaoang Wang
, Yizhou Wang
, Renshu Gu
, Weijie Hu
, Jenq-Neng Hwang
:
Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking. 1256-1268 - Qiao Liu
, Di Yuan
, Nana Fan, Peng Gao
, Xin Li
, Zhenyu He
:
Learning Dual-Level Deep Representation for Thermal Infrared Tracking. 1269-1281 - Wenhao Li
, Hong Liu
, Runwei Ding
, Mengyuan Liu
, Pichao Wang
, Wenming Yang
:
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation. 1282-1293 - Mengxi Jia
, Xinhua Cheng, Shijian Lu
, Jian Zhang
:
Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification. 1294-1305 - Zhe Tang
, Yi Yang
, Wen Li
, Defu Lian
, Lixin Duan:
Deep Cross-Attention Network for Crowdfunding Success Prediction. 1306-1319 - Kun Zhang
, Zhendong Mao
, An-An Liu
, Yongdong Zhang
:
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching. 1320-1332 - Dongnan Liu
, Chaoyi Zhang
, Yang Song
, Heng Huang
, Chenyu Wang
, Michael Barnett
, Tom Weidong Cai
:
Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement. 1333-1344 - Bin Chen
, Kunhong Liu
, Yong Xu, Qingqiang Wu, Junfeng Yao
:
Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition. 1345-1358 - Yingjian Li
, Zheng Zhang
, Bingzhi Chen
, Guangming Lu
, David Zhang
:
Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition. 1359-1373 - Jianjun Sun
, Yan Zhao
, Shigang Wang
, Jian Wei:
3D Holoscopic Image Compression Based on Gaussian Mixture Model. 1374-1389 - Huan Liu
, Wentao Liu, Zhixiang Chi
, Yang Wang
, Yuanhao Yu
, Jun Chen
, Jin Tang:
Fast Human Pose Estimation in Compressed Videos. 1390-1400