default search action
ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 20
Volume 20, Number 1, January 2024
- Zhenbo Xu, Hai-Miao Hu, Liu Liu, Dongping Zhang, Shifeng Zhang, Wenming Tan:
Instance-Based Continual Learning: A Real-World Dataset and Baseline for Fresh Recognition. 1:1-1:23 - Xiaoping Liang, Zhenjun Tang, Zhixin Li, Mengzhu Yu, Hanyun Zhang, Xianquan Zhang:
Robust Hashing via Global and Local Invariant Features for Image Copy Detection. 2:1-2:22 - Sandipan Sarma, Arijit Sur:
DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning. 3:1-3:23 - Chengyu Zheng, Ning Song, Ruoyu Zhang, Lei Huang, Zhiqiang Wei, Jie Nie:
Scale-Semantic Joint Decoupling Network for Image-Text Retrieval in Remote Sensing. 4:1-4:20 - Jiankai Li, Yunhong Wang, Weixin Li:
Zero-shot Scene Graph Generation via Triplet Calibration and Reduction. 5:1-5:21 - Abid Yaqoob, Gabriel-Miro Muntean:
Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR Streaming. 6:1-6:28 - Jia Wang, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng:
Language-guided Residual Graph Attention Network and Data Augmentation for Visual Grounding. 7:1-7:23 - Haoran Wang, Yajie Wang, Baosheng Yu, Yibing Zhan, Chunfeng Yuan, Wankou Yang:
Attentional Composition Networks for Long-Tailed Human Action Recognition. 8:1-8:18 - Zi-Chao Zhang, Zhen-Duo Chen, Zhen-Yu Xie, Xin Luo, Xin-Shun Xu:
S3Mix: Same Category Same Semantics Mixing for Augmenting Fine-grained Images. 9:1-9:16 - Mingkui Tan, Zhiquan Wen, Leyuan Fang, Qi Wu:
Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. 10:1-10:23 - Yiming Yang, Weipeng Hu, Haifeng Hu:
Syncretic Space Learning Network for NIR-VIS Face Recognition. 11:1-11:25 - Chenghua Li, Zongze Li, Jing Sun, Yun Zhang, Xiaoping Jiang, Fan Zhang:
Dynamic Weighted Gradient Reversal Network for Visible-infrared Person Re-identification. 12:1-12:23 - Jiajun Song, Zhuo Li, Weiqing Min, Shuqiang Jiang:
Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design. 13:1-13:19 - Yiting Jin, Jie Wu, Wanliang Wang, Yidong Yan, Jiawei Jiang, Jianwei Zheng:
Cascading Blend Network for Image Inpainting. 14:1-14:21 - Kehua Guo, Liang Chen, Xiangyuan Zhu, Xiaoyan Kui, Jian Zhang, Heyuan Shi:
Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution. 15:1-15:23 - Jing Zhao, Bin Li, Jiahao Li, Ruiqin Xiong, Yan Lu:
A Universal Optimization Framework for Learning-based Image Codec. 16:1-16:19 - Liping Zhang, Shukai Chen, Fei Lin, Wei Ren, Kim-Kwang Raymond Choo, Geyong Min:
1DIEN: Cross-session Electrocardiogram Authentication Using 1D Integrated EfficientNet. 17:1-17:17 - Baian Chen, Zhilei Chen, Xiaowei Hu, Jun Xu, Haoran Xie, Jing Qin, Mingqiang Wei:
Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection. 18:1-18:21 - Xiang Gao, Wei Hu, Guo-Jun Qi:
Self-supervised Multi-view Learning via Auto-encoding 3D Transformations. 19:1-19:23 - Dewang Wang, Gaobo Yang, Zhiqing Guo, Jiyou Chen:
Enhancing Adversarial Embedding based Image Steganography via Clustering Modification Directions. 20:1-20:20 - Xiaojia Zhao, Tingting Xu, Qiangqiang Shen, Youfa Liu, Yongyong Chen, Jingyong Su:
Double High-Order Correlation Preserved Robust Multi-View Ensemble Clustering. 21:1-21:21 - Shuji Tasaka:
Usefulness of QoS in Multidimensional QoE Prediction for Haptic-Audiovisual Communications. 22:1-22:24 - Ching-Nung Yang, Xiaotian Wu, Min-Jung Chung:
Enhancement of Information Carrying and Decoding for Visual Cryptography with Error Correction. 23:1-23:24 - Yuqing Zhang, Yong Zhang, Shaofan Wang, Yun Liang, Baocai Yin:
Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network. 24:1-24:23 - Wenying Wen, Minghui Huang, Yushu Zhang, Yuming Fang, Yifan Zuo:
Visual Security Index Combining CNN and Filter for Perceptually Encrypted Light Field Images. 25:1-25:15 - Linlin Liu, Haijun Zhang, Qun Li, Jianghong Ma, Zhao Zhang:
Collocated Clothing Synthesis with GANs Aided by Textual Information: A Multi-Modal Framework. 26:1-26:25 - Xulei Lou, Tinghui Wu, Haifeng Hu, Dihu Chen:
Self-Supervised Consistency Based on Joint Learning for Unsupervised Person Re-identification. 27:1-27:20 - Yichi Zhang, Gongchun Ding, Dandan Ding, Zhan Ma, Zhu Li:
On Content-Aware Post-Processing: Adapting Statistically Learned Models to Dynamic Content. 28:1-28:23 - Jing Xu, Bing Liu, Yong Zhou, Mingming Liu, Rui Yao, Zhiwen Shao:
Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning. 29:1-29:16 - Cong Zou, Rui Wang, Cheng Jin, Sanyi Zhang, Xin Wang:
S2CL-Leaf Net: Recognizing Leaf Images Like Human Botanists. 30:1-30:20
Volume 20, Number 2, February 2024
- Suyel Namasudra, Pascal Lorenz, Seifedine Kadry, Syed Ahmad Chan Bukhari:
Introduction to the Special Issue on DNA-centric Modeling and Practice for Next-generation Computing and Communication Systems. 31:1-31:2
- Shaohua Wan, Yi Jin, Guangdong Xu, Michele Nappi:
Editorial to Special Issue on Multimedia Cognitive Computing for Intelligent Transportation System. 32:1-32:2 - Ruonan Zhao, Laurence T. Yang, Debin Liu, Wanli Lu, Chenlu Zhu, Yiheng Ruan:
Tensor-Empowered LSTM for Communication-Efficient and Privacy-Enhanced Cognitive Federated Learning in Intelligent Transportation Systems. 33:1-33:21 - Hongjian Shi, Hao Wang, Ruhui Ma, Yang Hua, Tao Song, Honghao Gao, Haibing Guan:
Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation System. 34:1-34:23 - Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang:
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. 35:1-35:18 - Shixiong Zhang, Wenmin Wang, Honglei Li, Shenyong Zhang:
E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System. 36:1-36:20 - Ram Prasad Padhy, Pankaj Kumar Sa, Fabio Narducci, Carmen Bisogni, Sambit Bakshi:
Monocular Vision-aided Depth Measurement from RGB Images for Autonomous UAV Navigation. 37:1-37:22
- Zhihan Lv, Fabio Poiesi, Qi Dong, Jaime Lloret, Houbing Song:
Special Issue on Deep Learning for Intelligent Human Computer Interaction. 38:1-38:5 - Wenjuan Gong, Yue Zhang, Wei Wang, Peng Cheng, Jordi Gonzàlez:
Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition. 39:1-39:20 - Youcef Djenouri, Asma Belhadi, Gautam Srivastava, Jerry Chun-Wei Lin:
An Efficient and Accurate GPU-based Deep Learning Model for Multimedia Recommendation. 40:1-40:18 - Loveleen Gaur, Mohan Bhandari, Bhadwal Singh Shikhar, NZ Jhanjhi, Mohammad Shorfuzzaman, Mehedi Masud:
Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer's Disease. 41:1-41:16 - Mi Li, Wei Zhang, Bin Hu, Jiaming Kang, Yuqi Wang, Shengfu Lu:
Automatic Assessment of Depression and Anxiety through Encoding Pupil-wave from HCI in VR Scenes. 42:1-42:22 - Abdul Qayyum, Imran Razzak, Muhammad Tanveer, Moona Mazher:
Spontaneous Facial Behavior Analysis Using Deep Transformer-based Framework for Child-computer Interaction. 43:1-43:17 - Xiaowei Chen, Xiao Jiang, Lishuang Zhan, Shihui Guo, Qunsheng Ruan, Guoliang Luo, Minghong Liao, Yipeng Qin:
Full-body Human Motion Reconstruction with Sparse Joint Tracking Using Flexible Sensors. 44:1-44:19 - Shanbao Qiao, Neal N. Xiong, Yongbin Gao, Zhijun Fang, Wenjun Yu, Juan Zhang, Xiaoyan Jiang:
Self-Supervised Learning of Depth and Ego-Motion for 3D Perception in Human Computer Interaction. 45:1-45:21 - Yan Kang, Bin Pu, Yongqi Kou, Yun Yang, Jianguo Chen, Khan Muhammad, Po Yang, Lida Xu, Mohammad Hijji:
A Deep Graph Network with Multiple Similarity for User Clustering in Human-Computer Interaction. 46:1-46:20 - Bahar Uddin Mahmud, Guan Y. Hong, Bernard Fong:
A Study of Human-AI Symbiosis for Creative Work: Recent Developments and Future Directions in Deep Learning. 47:1-47:21 - Xiaoling Gu, Jie Huang, Yongkang Wong, Jun Yu, Jianping Fan, Pai Peng, Mohan S. Kankanhalli:
PAINT: Photo-realistic Fashion Design Synthesis. 48:1-48:23 - Qingfeng Dai, Yongkang Wong, Guofei Sun, Yanwei Wang, Zhou Zhou, Mohan S. Kankanhalli, Xiangdong Li, Weidong Geng:
Unsupervised Domain Adaptation by Causal Learning for Biometric Signal-based HCI. 49:1-49:18 - Yi Xiao, Tong Liu, Yu Han, Yue Liu, Yongtian Wang:
Realtime Recognition of Dynamic Hand Gestures in Practical Applications. 50:1-50:17 - Jianping Gou, Liyuan Sun, Baosheng Yu, Shaohua Wan, Dacheng Tao:
Hierarchical Multi-Attention Transfer for Knowledge Distillation. 51:1-51:20
- Subhrajyoti Deb, Abhilash Kumar Das, Nirmalya Kar:
An Applied Image Cryptosystem on Moore's Automaton Operating on δ (qk)/𝔽2. 52:1-52:20 - Sisi You, Yukun Zuo, Hantao Yao, Changsheng Xu:
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene. 53:1-53:19 - Shiqi Sun, Danlan Huang, Xiaoming Tao, Chengkang Pan, Guangyi Liu, Changwen Chen:
Boosting Scene Graph Generation with Contextual Information. 54:1-54:24 - Jianwei Zheng, Yu Liu, Yuchao Feng, Honghui Xu, Meiyu Zhang:
Contrastive Attention-guided Multi-level Feature Registration for Reference-based Super-resolution. 55:1-55:21 - Shangxi Wu, Jitao Sang, Kaiyan Xu, Guanhua Zheng, Changsheng Xu:
Adaptive Adversarial Logits Pairing. 56:1-56:16 - Ying Chen, Rui Yao, Yong Zhou, Jiaqi Zhao, Bing Liu, Abdulmotaleb El-Saddik:
Black-box Attack against Self-supervised Video Object Segmentation Models with Contrastive Loss. 57:1-57:21 - Shuang Liang, Wentao Ma, Chi Xie:
Relation with Free Objects for Action Recognition. 58:1-58:19 - Qiaolin He, Zhijie Zheng, Haifeng Hu:
A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identification. 59:1-59:20 - Wuliang Huang, Yiqiang Chen, Xinlong Jiang, Teng Zhang, Qian Chen:
GJFusion: A Channel-Level Correlation Construction Method for Multimodal Physiological Signal Fusion. 60:1-60:23
Volume 20, Number 3, March 2024
- Chengji Shen, Zhenjiang Liu, Xin Gao, Zunlei Feng, Mingli Song:
Self-Adaptive Clothing Mapping Based Virtual Try-on. 61:1-61:26 - Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features. 62:1-62:24 - Yan Wang, Peize Li, Qingyi Si, Hanwen Zhang, Wenyu Zang, Zheng Lin, Peng Fu:
Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering. 63:1-63:22 - Qiang Guo, Zhi Zhang, Mingliang Zhou, Hong Yue, Huayan Pu, Jun Luo:
Image Defogging Based on Regional Gradient Constrained Prior. 64:1-64:17 - Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao:
PLACE Dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization. 65:1-65:23 - Yuan Xiong, Jingru Wang, Zhong Zhou:
VirtualLoc: Large-scale Visual Localization Using Virtual Images. 66:1-66:19 - Yiheng Zhang, Ting Yao, Zhaofan Qiu, Tao Mei:
Explaining Cross-domain Recognition with Interpretable Deep Classifier. 67:1-67:21 - Ruimin Wang, Fasheng Wang, Yiming Su, Jing Sun, Fuming Sun, Haojie Li:
Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection. 68:1-68:22 - Jemily Rime, Alan Archer-Boyd, Tom Collins:
How Will You Pod? Implications of Creators' Perspectives for Designing Innovative Podcasting Tools. 69:1-69:25 - Ming Cheung:
Learning from the Past: Fast NAS for Tasks and Datasets. 70:1-70:18 - Xinyue Li, Haiyong Xu, Gangyi Jiang, Mei Yu, Ting Luo, Xuebo Zhang, Hongwei Ying:
Underwater Image Quality Assessment from Synthetic to Real-world: Dataset and Objective Method. 71:1-71:23 - Sujuan Hou, Jiacheng Li, Weiqing Min, Qiang Hou, Yanna Zhao, Yuanjie Zheng, Shuqiang Jiang:
Deep Learning for Logo Detection: A Survey. 72:1-72:23 - Yunjie Peng, Jinlin Wu, Boqiang Xu, Chunshui Cao, Xu Liu, Zhenan Sun, Zhiqiang He:
Deep Learning Based Occluded Person Re-Identification: A Survey. 73:1-73:27 - Muhammad Arslan Manzoor, Sarah Albarri, Ziting Xian, Zaiqiao Meng, Preslav Nakov, Shangsong Liang:
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications. 74:1-74:34 - Yanyan Shi, Shaowu Yang, Wenjing Yang, Dianxi Shi, Xuehui Li:
Boosting Few-shot Object Detection with Discriminative Representation and Class Margin. 75:1-75:19 - Harry Cheng, Yangyang Guo, Tianyi Wang, Qi Li, Xiaojun Chang, Liqiang Nie:
Voice-Face Homogeneity Tells Deepfake. 76:1-76:22 - Jin Ye, Meng Dan, Wenchao Jiang:
A Visual Sensitivity Aware ABR Algorithm for DASH via Deep Reinforcement Learning. 77:1-77:22 - Jian Wang, Xiao Wang, Guosheng Zhao:
Task Recommendation via Heterogeneous Multi-modal Features and Decision Fusion in Mobile Crowdsensing. 78:1-78:20 - Si-chao Lei, Yue-Jiao Gong, Xiaolin Xiao, Yicong Zhou, Jun Zhang:
Boosting Diversity in Visual Search with Pareto Non-Dominated Re-Ranking. 79:1-79:23 - Huijie Zhang, Pu Li, Xiaobai Liu, Xianfeng Terry Yang, Li An:
An Iterative Semi-supervised Approach with Pixel-wise Contrastive Loss for Road Extraction in Aerial Images. 80:1-80:21 - Jing Fang, Yinbo Yu, Zhongyuan Wang, Xin Ding, Ruimin Hu:
An Image Arbitrary-Scale Super-Resolution Network Using Frequency-domain Information. 81:1-81:23 - Xiao Luo, Wei Ju, Yiyang Gu, Yifang Qin, Siyu Yi, Daqing Wu, Luchen Liu, Ming Zhang:
Toward Effective Semi-supervised Node Classification with Hybrid Curriculum Pseudo-labeling. 82:1-82:19 - Wen Guo, Wuzhou Quan, Junyu Gao, Tianzhu Zhang, Changsheng Xu:
Feature Disentanglement Network: Multi-Object Tracking Needs More Differentiated Features. 83:1-83:22 - Mohammed Khaleel, Azeez Idris, Wallapak Tavanapong, Jacob Pratt, Jung-Hwan Oh, Piet C. de Groen:
VisActive: Visual-concept-based Active Learning for Image Classification under Class Imbalance. 84:1-84:21 - Honghua Chen, Zhiqi Li, Mingqiang Wei, Jun Wang:
Geometric and Learning-Based Mesh Denoising: A Comprehensive Survey. 85:1-85:28 - Ning Han, Yawen Zeng, Chuhao Shi, Guangyi Xiao, Hao Chen, Jingjing Chen:
BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval. 86:1-86:21 - Yuan Feng, Yaojun Hu, Pengfei Fang, Sheng Liu, Yanhong Yang, Shengyong Chen:
Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal. 87:1-87:23 - Yurui Xie, Ling Guan:
Sparsity-guided Discriminative Feature Encoding for Robust Keypoint Detection. 88:1-88:22 - Nicolas Beuve, Wassim Hamidouche, Olivier Déforges:
Hierarchical Learning and Dummy Triplet Loss for Efficient Deepfake Detection. 89:1-89:18 - Suncheng Xiang, Dahong Qian, Jingsheng Gao, Zirui Zhang, Ting Liu, Yuzhuo Fu:
Rethinking Person Re-Identification via Semantic-based Pretraining. 90:1-90:17
Volume 20, Number 4, April 2024
- Min Peng, Xiaohu Shao, Yu Shi, Xiangdong Zhou:
Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering. 91:1-91:22 - Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Philip Torr, Nicu Sebe:
Cloth Interactive Transformer for Virtual Try-On. 92:1-92:20 - Xiushan Nie, Yang Shi, Ziyu Meng, Jin Huang, Weili Guan, Yilong Yin:
Complex Scenario Image Retrieval via Deep Similarity-aware Hashing. 93:1-93:24 - Jiawei Tan, Hongxing Wang, Junsong Yuan:
Characters Link Shots: Character Attention Network for Movie Scene Segmentation. 94:1-94:23 - Mingliang Zhou, Xinwen Zhao, Futing Luo, Jun Luo, Huayan Pu, Tao Xiang:
Robust RGB-T Tracking via Adaptive Modality Weight Correlation Filters and Cross-modality Learning. 95:1-95:20 - Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai:
Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images. 96:1-96:22 - Shuvendu Roy, Ali Etemad:
Contrastive Learning of View-invariant Representations for Facial Expressions Recognition. 97:1-97:22 - Jun Liu, Jiantao Zhou, Haiwei Wu, Weiwei Sun, Jinyu Tian:
Generating Robust Adversarial Examples against Online Social Networks (OSNs). 98:1-98:26 - Tao Yao, Yiru Li, Ying Li, Yingying Zhu, Gang Wang, Jun Yue:
Cross-modal Semantically Augmented Network for Image-text Matching. 99:1-99:18 - Ahmed Telili, Sid Ahmed Fezza, Wassim Hamidouche, Hanene Brachemi Meftah:
2BiVQA: Double Bi-LSTM-based Video Quality Assessment of UGC Videos. 100:1-100:22 - Hongzhou Chen, Haihan Duan, Maha Abdallah, Yufeng Zhu, Yonggang Wen, Abdulmotaleb El-Saddik, Wei Cai:
Web3 Metaverse: State-of-the-Art and Vision. 101:1-101:42 - Lilong Wang, Yunhui Shi, Jin Wang, Shujun Chen, Baocai Yin, Nam Ling:
Graph Based Cross-Channel Transform for Color Image Compression. 102:1-102:25 - Kai Han, Yu Liu, Rukai Wei, Ke Zhou, Jinhui Xu, Kun Long:
Supervised Hierarchical Online Hashing for Cross-modal Retrieval. 103:1-103:23 - Fengyi Fu, Shancheng Fang, Weidong Chen, Zhendong Mao:
Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting. 104:1-104:24 - Yuxiang Peng, Chong Fu, Guixing Cao, Wei Song, Junxin Chen, Chiu-Wing Sham:
JPEG-compatible Joint Image Compression and Encryption Algorithm with File Size Preservation. 105:1-105:20 - Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Zichuan Xu, Haozhao Wang, Xing Di, Weining Lu, Yu Cheng:
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding. 106:1-106:19 - Yijie Hu, Bin Dong, Kaizhu Huang, Lei Ding, Wei Wang, Xiaowei Huang, Qiu-Feng Wang:
Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment. 107:1-107:20