


ICME 2023: Brisbane, Australia
- IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023. IEEE 2023, ISBN 978-1-6654-6891-6

- Prashant Pandey, Mustafa Chasmai, Monish Natarajan, Brejesh Lall:

Weakly Supervised Few-Shot and Zero-Shot Semantic Segmentation with Mean Instance Aware Prompt Learning. 1-6 - Qianwen Cao, Heyan Huang, Minpeng Liao, Xianling Mao:

Ada-SwinBERT: Adaptive Token Selection for Efficient Video Captioning with Online Self-Distillation. 7-12 - Jiuxiang You, Zhenguo Yang, Qing Li, Wenyin Liu:

A Retriever-Reader Framework with Visual Entity Linking for Knowledge-Based Visual Question Answering. 13-18 - Pufen Zhang, Peng Shi, Song Zhang:

2S-DFN: Dual-semantic Decoding Fusion Networks for Fine-grained Image Recognition. 19-24 - Yongzhu Miao, Shasha Li, Jintao Tang, Ting Wang:

MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models. 25-30 - Sai Shashank Kalakonda, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla:

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation. 31-36 - Tianhua Xu, Sheng-hua Zhong, Zhijiao Xiao:

Protecting Intellectual Property of EEG-based Model with Watermarking. 37-42 - Hanxiu Zhang, Guitao Cao, Xinyue Zhang, Jing Xiang, Chunwei Wu:

Making Adversarial Attack Imperceptible in Frequency Domain: A Watermark-based Framework. 43-48 - Jie Luo, Peisong He, Jiayong Liu, Hongxia Wang, Chunwang Wu, Yijing Chen, Wanjie Li, Jiangchuan Li:

Content-adaptive Adversarial Embedding for Image Steganography Using Deep Reinforcement Learning. 49-54 - Youqiang Sun, Jianyi Liu, Ru Zhang:

A Robust Generative Image Steganography Method based on Guidance Features in Image Synthesis. 55-60 - Shiqiang Wu, Jie Liu, Ying Huang, Hu Guan, Shuwu Zhang:

Adversarial Audio Watermarking: Embedding Watermark into Deep Feature. 61-66 - Tengjun Liu, Ying Chen, Wanxuan Gu:

Deniable Diffusion Generative Steganography. 67-71 - Songbin Li, Xiangzhi Yang, Jingang Wang:

Sea Surface Object Detection Based on Background Dynamic Perception and Cross-Layer Semantic Interaction. 72-77 - Guikun Chen, Lin Li, Yawei Luo, Jun Xiao:

Addressing Predicate Overlap in Scene Graph Generation with Semantic Granularity Controller. 78-83 - Shiqi Ren, Chao Zhu, Mengyin Liu, Xu-Cheng Yin:

Towards Discriminative Semantic Relationship for Fine-grained Crowd Counting. 84-89 - Jun Xie, Yixuan Zhou, Xing Xu, Guoqing Wang, Fumin Shen, Yang Yang:

Region-Aware Semantic Consistency for Unsupervised Domain-Adaptive Semantic Segmentation. 90-95 - Chuang Zhao, Hefei Ling, Yuxuan Shi, Chengxin Zhao, Jiazhong Chen, Qiang Cao:

Deep Unsupervised Hashing with Selective Semantic Mining. 96-101 - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:

Boosting Interactive Image Segmentation by Exploiting Semantic Clues. 102-107 - Dafeng Li, Yingying Zhu:

Visual-Linguistic Alignment and Composition for Image Retrieval with Text Feedback. 108-113 - Xinyu Zhou, Anna Zhu, Huen Chen, Wei Pan:

Scene Text Involved "Text"-to-Image Retrieval through Logically Hierarchical Matching. 114-119 - Yi Li, Meihua Yu, Xin Xie, Haiyan Fu, Hao He, Yanqing Guo:

Federating Hashing Networks Adaptively for Privacy-Preserving Retrieval. 120-125 - Kangkang Lu, Yanhua Yu, Meiyu Liang, Min Zhang, Xiaowen Cao, Zehua Zhao, Mengran Yin, Zhe Xue:

Deep Unsupervised Momentum Contrastive Hashing for Cross-modal Retrieval. 126-131 - Yiyang Cai, Jiaming Lu, Jiewen Wang, Shuang Liang:

Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval. 132-137 - Guoliang Wang, Yanlei Shang, Yong Chen, Chaoqi Zhen, Dequan Cheng:

Scene Graph based Fusion Network for Image-Text Retrieval. 138-143 - Yuchao Feng, Honghui Xu, Jiawei Jiang, Jianwei Zheng:

Compact Intertemporal Coupling Network for Remote Sensing Change Detection. 144-149 - Jueyu Chen, Guanyu Xing, Jingwei Liao, Housheng Wei, Yanli Liu:

Boundary-aware Shadow Detection via Mask Decoupling and Feature Correction. 150-155 - Yuzhong Zhao, Yuanqiang Cai, Weijia Wu, Weiqiang Wang:

Explore Faster Localization Learning For Scene Text Detection. 156-161 - Xiaofeng Ji, Jin Chen, Xinxiao Wu:

Counterfactual Inference for Visual Relationship Detection in Videos. 162-167 - Huayi Zhou, Fei Jiang, Hongtao Lu:

Body-Part Joint Detection and Association via Extended Object Representation. 168-173 - Jian Cui, Lin Li, Xiaohui Tao:

Be-or-Not Prompt Enhanced Hard Negatives Generating For Memes Category Detection. 174-179 - Yanni Wang, Gang Yang, Dayong Ding, Jianchun Zhao:

Automatic Retinal Nerve Fiber Trajectory Simulation and Quasi-polar Transformation for Detecting Retinal Nerve Fiber Layer Defect in Fundus Images. 180-185 - Jiawei Jiang, Jiacheng Chen, Honghui Xu, Yuchao Feng, Jianwei Zheng:

GA-HQS: MRI reconstruction via a generically accelerated unfolding approach. 186-191 - Yi Li, Baoyao Yang, Dan Pan, An Zeng, Long Wu, Yang Yang:

Early Diagnosis of Alzheimer's Disease Based on Multimodal Hypergraph Attention Network. 192-197 - Shanshan Huang, Qingsong Li, Lei Wang, Yuanhao Wang, Li Liu:

Score-based causal feature selection for cancer risk prediction. 198-203 - Wentian Cai, Yulin Cheng, Ying Gao, Weixiao Liu, Xinyan Xie, Xiongwen Luo, Weixian Yang, Zaiyi Liu, Changhong Liang:

A Dual-Path Supplemental Information Learning Architecture for Breast Cancer Ki-67 Status Prediction in T2w MRI. 210-215 - Hui Zhang, Shiqi Shen, Jinhua Xu:

Expression-Guided Attention GAN for Fine-Grained Facial Expression Editing. 216-221 - Yini Fang, Didan Deng, Liang Wu, Frederic Jumelle, Bertram E. Shi:

RMES: Real-Time Micro-Expression Spotting Using Phase From Riesz Pyramid. 222-227 - Shukang Yin, Shiwei Wu, Tong Xu, Shifeng Liu, Sirui Zhao, Enhong Chen:

AU-aware graph convolutional network for Macro- and Micro-expression spotting. 228-233 - Hao Sun, Chenchen Pi, Wei Xie:

Semi-Supervised Facial Expression Recognition by Exploring False Pseudo-Labels. 234-239 - Jingning Xu, Benlai Tang, Mingjie Wang, Minghao Li, Meirong Ma:

CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation. 240-245 - David Anghelone, Sarah Lannes, Antitza Dantcheva:

ANYRES: Generating High-Resolution visible-face images from Low-Resolution thermal-face images. 246-251 - Yutong Li, Zhenyu Liu, Gang Li, Qiongqiong Chen, Zhijie Ding, Xiping Hu, Bin Hu:

A Visually Interpretable Convolutional-Transformer Model for Assessing Depression from Facial Images. 252-257 - Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxing Gao, Yuting Yang, Ming Tang, Jinqiao Wang:

FreConv: Frequency Branch-and-Integration Convolutional Networks. 258-263 - Ruofan Wang, Jiayu Guo, Rui-Wei Zhao, Ling Su, Yingzi Ye, Xiaobo Zhang, Yuejie Zhang, Rui Feng:

Class-aware Variational Auto-encoder for Open Set Recognition. 264-269 - Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou:

Repnas: Searching for Efficient Re-Parameterizing Blocks. 270-275 - Bowen Zhao, Weidong Chen, Bo Hu, Hongtao Xie, Zhendong Mao:

Difference-Aware Iterative Reasoning Network for Key Relation Detection. 276-281 - Luying Li, Lizhuang Ma:

Injecting-Diffusion: Inject Domain-Independent Contents into Diffusion Models for Unpaired Image-to-Image Translation. 282-287 - Lei Xu, Rong Wang, Feiping Nie, Jun Wu, Xuelong Li:

Semi-Supervised Top-k Feature Selection with a General Optimization Framework. 288-293 - Yukun Zhang, Shengming Yuan, Jingkuan Song, Yixuan Zhou, Lin Zhang, Yulan He:

Towards Boosting Black-Box Attack Via Sharpness-Aware. 294-299 - Xiaolin Zhai, Zhengxi Hu, Dingye Yang, Shichao Wu, Jingtai Liu:

Learning Group Residual Representation for Group Activity Prediction*. 300-305 - Xuesong Guo, Shuo Wang, Jiahao Chang, Zehui Chen, Feng Zhao:

SAFE: Simultaneous Alignment of Features and Predictions for Dense Object Detectors. 306-311 - Xiaohong Xiang, Fuyuan Zhang, Xin Deng, Ke Hu:

MSG-CAM:Multi-scale inputs make a better visual interpretation of CNN networks. 312-317 - Peng Yan, Guodong Long:

Personalization Disentanglement for Federated Learning. 318-323 - Yuxin Shi, Zelei Liu, Zhuan Shi, Han Yu:

Fairness-Aware Client Selection for Federated Learning. 324-329 - Xiaoli Tang, Han Yu:

Utility-Maximizing Bidding Strategy for Data Consumers in Auction-Based Federated Learning. 330-335 - Zhiwei Xiong, Han Yu, Zhiqi Shen:

Federated Learning for Personalized Image Aesthetics Assessment. 336-341 - Yue Huang, Lanju Kong, Qingzhong Li, Baochen Zhang:

Decentralized Federated Learning Via Mutual Knowledge Distillation. 342-347 - Zekai Chen, Fuyi Wang, Zhiwei Zheng, Ximeng Liu, Yujie Lin:

Fedward: Flexible Federated Backdoor Defense Framework with Non-IID Data. 348-353 - Jialing He, Zhen Qin, Hangcheng Liu, Shangwei Guo, Biwen Chen, Ning Wang, Tao Xiang:

Contrastive Fusion Representation: Mitigating Adversarial Attacks on VQA Models. 354-359 - Zhengyu Wang, Yujie Zhang, Qi Yang, Yiling Xu, Yifei Zhou, Jun Sun, Shan Liu:

Improving Point Cloud Quality Metrics with Noticeable Possibility Maps. 360-365 - Haoning Wu, Liang Liao, Jingwen Hou, Chaofeng Chen, Erli Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin:

Exploring Opinion-Unaware Video Quality Assessment with Semantic Affinity Criterion. 366-371 - Lirong Huang, Rong Zhang, Miaohui Wang:

Just Noticeable Difference Estimation for Screen Content Images: A Content Uncertainty-guided Approach. 372-377 - Hui Wang, Xiguang Zheng, Yong Qin:

Intermediate-Task Learning with Pretrained Model for Synthesized Speech MOS Prediction. 378-383 - Zenan Xu, Wanjun Zhong, Qinliang Su, Fuwei Zhang:

Cross-Modal-Aware Representation Learning with Syntactic Hypergraph Convolutional Network for VideoQA. 384-389 - Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song:

SASFormer: Transformers for Sparsely Annotated Semantic Segmentation. 390-395 - Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen:

Holistic Weighted Distillation for Semantic Segmentation. 396-401 - Feng Jiang, Heng Gao, Shoumeng Qiu, Haiqiang Zhang, Ru Wan, Jian Pu:

Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation. 402-407 - Huazheng Hao, Hui Xiao, Li Dong, Diqun Yan, Dongtai Liang, Jiayan Zhuang, Chengbin Peng:

A Pseudo-Dual Self-Rectification Framework for Semantic Segmentation. 408-413 - Feifei Ding, Jianjun Li, Wanyong Tian:

Dual-level Consistency Learning for Unsupervised Domain Adaptive Night-time Semantic Segmentation. 420-425 - Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Hengyu Man, Xiaopeng Fan:

Modality-Fusion Spiking Transformer Network for Audio-Visual Zero-Shot Learning. 426-431 - Rui Gao, Fan Wan, Daniel Organisciak, Jiyao Pu, Haoran Duan, Peng Zhang, Xingsong Hou, Yang Long:

Privacy-Enhanced Zero-Shot Learning via Data-Free Knowledge Transfer. 432-437 - Ting Guo, Jiye Liang, Guo-Sen Xie:

Swap-Reconstruction Autoencoder for Compositional Zero-Shot Learning. 438-443 - Xinmiao Dai, Chong Wang, Haohe Li, Sunqi Lin, Li Dong, Jiafei Wu, Jun Wang:

Synthetic Feature Assessment for Zero-Shot Object Detection. 444-449 - Yapeng Li, Yong Luo, Bo Du:

Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck. 450-455 - Han Jiang, Xiaoshan Yang, Chaofan Chen, Changsheng Xu:

Fine-grained Primitive Representation Learning for Compositional Zero-shot Classification. 456-461 - Jingwei Wang, Peng Zhou, Xianjun Han, Yanming Chen:

Medical Image Super-Resolution via Diagnosis-Guided Attention. 462-467 - Hong Zhang, Shenglun Chen, Zhihui Wang, Haojie Li, Wanli Ouyang:

Denser is Better:cost distribution super-resolution network for more accurate sub-pixel disparity. 468-473 - Lin Sun, Chao Yang, Bin Jiang:

DSP-Net: Diverse Structure Prior Network for Image Inpainting. 474-479 - Zekun Ai, Xiaotong Luo, Yanyun Qu:

Joint Feature Aggregation for Stereo Image Super-resolution. 480-485 - Zijian Yuan, Kan Chang, Zhiquan Liu, Xinjie Wei, Boning Chen:

Joint Super-Resolution and Classification Based on Bidirectional Mapping and Multiple Constraints. 486-491 - Qichen Wei, Zijie Zuo, Jie Nie, Jiahao Du, Yaning Diao, Min Ye, Xinyue Liang:

Inpainting of Remote Sensing Sea Surface Temperature image with Multi-scale Physical Constraints. 492-497 - Lei Chen, Huhe Dai, Yuan Zheng:

ICANet: A Lightweight Increasing Context Aided Network for Real-Time Image Semantic Segmentation. 492-497 - Zhijie Huang, Tianyi Sun, Xiaopeng Guo, Yanze Wang, Jun Sun:

Generalized Compressed Video Restoration by Multi-Scale Temporal Fusion and Hierarchical Quality Score Estimation. 498-503 - Yuan Zou, Yinyao Ma:

Edgeformer: Edge-Enhanced Transformer for High-Quality Image Deblurring. 504-509 - Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He:

Generative Iris Prior Embedded Transformer for Iris Restoration. 510-515 - Zhongbao Yang, Jinshan Pan:

MBDFNet: Multi-scale Bidirectional Dynamic Feature Fusion Network for Efficient Image Deblurring. 522-527 - Minhua Liu, Yuanman Li, Rongqin Liang, Jiaxiang You, Xia Li:

Multiple degraded image restoration via degradation history estimation. 528-533 - Jintao Zhang, Guangyi Xiao:

Gradual Migration and Style Consistency for Unsupervised Domain Adaptation. 534-539 - Han Xie, Zhifeng Shen, Shicai Yang, Weijie Chen, Luojun Lin:

Adapt then Generalize: A Simple Two-Stage Framework for Semi-Supervised Domain Generalization. 540-545 - Hongjian Song, Jie Tang, Hongzhao Xiao, Juncheng Hu:

Rethinking Overfitting of Multiple Instance Learning for Whole Slide Image Classification. 546-551 - Qiang Chen, Dong Zhang, Shoushan Li, Guodong Zhou:

A Unified MRC Framework with Multi-Query for Multi-modal Relation Triplets Extraction. 552-557 - Jiaxin Yang, Xiaofei Li, Jun Zhang, Shuohao Li:

Feature Bias Correction: A Feature Augmentation Method for Long-tailed Recognition. 558-563 - Yuling Jiang, Yingyuan Zhao, Bing-Kun Bao:

Recombination Samples Training for Robust Natural Language Visual Reasoning. 564-569 - Yansong Qu, Yuze Wang, Yue Qi:

SG-NeRF: Semantic-guided Point-based Neural Radiance Fields. 570-575 - Hai Zhou, Zhe Xue, Ying Liu, Boang Li, Junping Du, Meiyu Liang:

RTMC: A Rubost Trusted Multi-View Classification Framework. 576-581 - Xinjiao Zhou, Bin Jiang, Chao Yang, Haotian Hu, Xiaofei Huo:

DF-CLIP: Towards Disentangled and Fine-grained Image Editing from Text. 582-587 - Changshuo Wang, Lei Wu, Xu Chen, Xiang Li, Lei Meng, Xiangxu Meng:

Letter Embedding Guidance Diffusion Model for Scene Text Editing. 588-593 - Rongyu Zhang, Yun Chen, Chenrui Wu, Fangxin Wang:

Cluster-driven GNN-based Federated Recommendation with Biased Message Dropout. 594-599 - Tianyu Huai, Shuwen Yang, Junhang Zhang, Guoan Wang, Xinru Yu, Tianlong Ma, Liang He:

SQT: Debiased Visual Question Answering via Shuffling Question Types. 600-605 - Shizhuo Deng, Chuangui Yang, Zhubao Guo, Boqian Lin, Dongyue Chen, Tong Jia, Botao Wang:

Fast Personalized Human Activity Recognition on Heuristic Parameter Estimation. 606-611 - Yaolong Ju, Chunyang Xu, Yichen Guo, Jinhu Li, Simon Lui:

Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice Separation. 612-617 - Han Guo, Yuanlong Yu, Yujie Wang, Xuelin Chen, Yixin Zhuang:

Learning High Frequency Surface Functions In Shells. 618-623 - Eli Lei, Jia Shao, Youfa Liu, Bo Du:

Multi-template Tracker Driven by Cache Manager Algorithm, Towards Multi-distractor Scenarios. 624-629 - Aoran Liu, Kun Hu, Wenxi Yue, Qiuxia Wu, Zhiyong Wang:

Material-Aware Self-Supervised Network for Dynamic 3D Garment Simulation. 630-635 - Yulin Wu, Ruimin Hu, Xiaochen Wang:

Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network. 636-641 - Jinxin Wang, Zhongwen Guo, Chao Yang, Xiaomei Li, Ziyuan Cui:

Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition. 642-647 - Tianhan Liu, Zhuang Qi, Zitan Chen, Xiangxu Meng, Lei Meng:

Cross-Training with Prototypical Distillation for improving the generalization of Federated Learning. 648-653 - Mehdi Setayesh, Vincent W. S. Wong:

A Content-based Viewport Prediction Framework for 360° Video Using Personalized Federated Learning and Fusion Techniques. 654-659 - Chenrui Wu, Zexi Li, Fangxin Wang, Chao Wu:

Learning Cautiously in Federated Learning with Noisy and Heterogeneous Clients. 660-665 - Yulan Gao, Yansong Zhao, Han Yu:

Multi-Tier Client Selection for Mobile Federated Learning Networks. 666-671 - Chengyi Yang, Zhaoxiang Hou, Sheng Guo, Hui Chen, Zengxiang Li:

SWATM: Contribution-Aware Adaptive Federated Learning Framework Based on Augmented Shapley Values. 672-677 - Yiqiang Chen, Xiaodong Yang, Yuting He, Chunyan Miao, Piu Chan:

FedDBM: Federated Digital Biomarker for Detecting Parkinson's Disease Progress. 678-683 - Haihang Ruan, Feng Wang, Tongda Xu, Zhiyong Tan, Yan Wang:

MIXLIC: Mixing Global and Local Context Model for learned Image Compression. 684-689 - Ruoke Yan, Qian Yin, Xinfeng Zhang, Siwei Ma:

Model-Driven Compression for Digital Human Using Multi-Granularity Representations. 690-695 - Hengyu Man, Xingtao Wang, Riyu Lu, Xiaopeng Fan:

Meta-ILF: In-Loop Filter with Customized Weights For VVC Intra Coding. 696-701 - Yunhui Shi, Pengquan Wang, Jin Wang, Baocai Yin, Nam Ling:

Variable-Rate Neural Image Compression with Joint Content-Channel Features and Accurate R-λ Model. 702-707 - Wenyi Wang, Yingzhan Xu, Kai Zhang, Li Zhang:

Peer Upsampled Transform Domain Prediction for G-PCC. 708-713 - Qiuyue Fang, Tao Xu, Lai Jiang, Shengxi Li, Mai Xu, Yunjin Chen, Leonid Sigal:

Optimizing DNN based quality assessment metric for image compression: A novel rate control method. 714-719 - Junhang Zhang, Zisong Zhuang, Luwei Xiao, Xingjiao Wu, Tianlong Ma, Liang He:

Dual-Expert Distillation Network for Few-Shot Segmentation. 720-725 - Linglan Zhao, Jing Lu, Zhanzhan Cheng, Duo Liu, Xiangzhong Fang:

Rethinking Self-Supervision for Few-Shot Class-Incremental Learning. 726-731 - Yongliang Su, Xu Chen, Lei Wu, Xiangxu Meng:

Learning Component-Level and Inter-Class Glyph Representation for few-shot Font Generation. 738-743 - Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang:

Masked Cross-image Encoding for Few-shot Segmentation. 744-749 - Xueyang Zhang, Shuxian Wang, Jun Du, Genwei Yan, Jigang Tang, Tian Gao, Xin Fang, Jia Pan, Jianqing Gao:

Frame-Level Embedding Learning for Few-shot Bioacoustic Event Detection. 750-755 - Xiaojia Chen, Xuanhan Wang, Beitao Chen, Lianli Gao:

End-To-End Part-Level Action Parsing With Transformer. 756-761 - Kaixiang Yang, Junyu Gao, Yangbo Feng, Changsheng Xu:

Leveraging Attribute Knowledge for Open-set Action Recognition. 762-767 - Hailun Zhang, Ziyun Zeng, Qijun Zhao, Zhen Zhai:

ConCAP: Contrastive Context-Aware Prompt for Resource-hungry Action Recognition. 768-773 - Wentian Xin, Hongkai Lin, Ruyi Liu, Yi Liu, Qiguang Miao:

Is Really Correlation Information Represented Well in Self-Attention for Skeleton-based Action Recognition? 780-785 - Chang Li, Qian Huang, Yingchi Mao:

DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition. 786-791 - Shilian Wu, Yongrui Li, Zengfu Wang:

Improving CTC-based Handwritten Chinese Text Recognition with Cross-Modality Knowledge Distillation and Feature Aggregation. 792-797 - Gao-Dong Liu, Wan-Lei Zhao, Jie Zhao:

Decoupled Mutual Distillation for Incremental Object Detection. 798-803 - Wujie Sun, Defang Chen, Can Wang, Deshi Ye, Yan Feng, Chun Chen:

Accelerating Diffusion Sampling with Classifier-based Feature Distillation. 810-815 - Dongqin Liu, Wentao Li, Wei Zhou, Zhaoxing Li, Jiao Dai, Jizhong Han, Ruixuan Li, Songlin Hu:

Semantic Stage-Wise Learning for Knowledge Distillation. 816-821 - Hao Zhang, Yanxu Hu, Jiawen Peng, Andy J. Ma:

Discriminative Gradient Adjustment with Coupled Knowledge Distillation for Class Incremental Learning. 822-827 - Xiaowen Ma, Rui Che, Tingfeng Hong, Mengting Ma, Ziyan Zhao, Tian Feng, Wei Zhang:

SACANet: scene-aware class attention network for semantic segmentation of remote sensing images. 828-833 - Hongyu Gu, Yunzhi Zhuge, Lu Zhang, Jinqing Qi, Huchuan Lu:

Few-shot Semantic Segmentation by Exploiting Dynamic and Regional Contexts. 834-839 - Guoxing Yang, Feifei Fu, Nanyi Fei, Haoran Wu, Ruitao Ma, Zhiwu Lu:

DiST-GAN: Distillation-based Semantic Transfer for Text-Guided Face Generation. 840-845 - Guoying Sun, Meng Yang:

Self-Attention Prediction Correction with Channel Suppression for Weakly-Supervised Semantic Segmentation. 846-851 - Rui Chen, Tao Chen, Qiong Wang, Yazhou Yao:

Semi-Supervised Semantic Segmentation With Region Relevance. 852-857 - Jiahao Guo, Chao Liang, Zhongyuan Wang:

Who, What and Where: Composite-semantic Instance Search for Story Videos. 858-863 - Yan Wang, Yu-Ting Su, Wenhui Li, Chenggang Yan, Bolun Zheng, Xuanya Li, An-An Liu:

Semantic Embedding Uncertainty Learning for Image and Text Matching. 864-869 - Yifan Shang, Xiucai Ye, Tetsuya Sakurai:

Multi-view Network Embedding with Structure and Semantic Contrastive Learning. 870-875 - Guoqing Yang, Chuang Zhu, Yu Zhang:

A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation. 876-881 - Yuxin Jin, Ming Qian, Jincheng Xiong, Nan Xue, Gui-Song Xia:

Depth and DOF Cues Make A Better Defocus Blur Detector. 882-887 - Jialong Zhang, Lijun Zhao, Jinjing Zhang, Ke Wang, Anhong Wang:

Explainable Unfolding Network For Joint Edge-Preserving Depth Map Super-Resolution. 888-893 - Xianhe Jiao, Junli Zhao, Chenlei Lv, Fuqing Duan, Zhenkuan Pan, Xin Li:

Robust 3D Craniofacial Landmarks Localization by An End-to-End Regression Network. 900-905 - Xueyang Li, Minyang Xu, Xiangdong Zhou:

Twins-Mix: Self Mixing in Latent Space for Reasonable Data Augmentation of 3D Computer-Aided Design Generative Modeling. 906-911 - Shaoxu Li, Ye Pan:

Rendering and Reconstruction Based 3D Portrait Stylization. 912-917 - Zhenjiang Du, Yi Lu, Guan Wang, Ning Xie, Yang Yang:

GT-Net: Variational Autoencoder Networks based on Graph Transformer for 3D Shape Learning. 918-923 - Jing Hu, Xincheng Wang, Ziheng Liao, Tingsong Xiao:

M-GCN: Multi-scale Graph Convolutional Network for 3D Point Cloud Classification. 924-929 - Boyang Zhang, Suping Wu, Leyang Yang, Bin Wang, Wenlong Lu:

A Lightweight Grouped Low-rank Tensor Approximation Network for 3D Mesh Reconstruction From Videos. 930-935 - Xin Zou, Chang Tang, Wei Zhang, Kun Sun, Liangxiao Jiang:

Hierarchical Attention Learning for Multimodal Classification. 936-941 - Zeman Shao, Gautham Vinod, Jiangpeng He, Fengqing Zhu:

An End-to-End Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image. 942-947 - Yuxiang An, Dongnan Liu, Weidong Cai:

Unsupervised Domain Adaptation for Neuron Membrane Segmentation based on Structural Features. 948-953 - Nan Wang, Chengwei Chen, Lizhuang Ma, Shaohui Lin:

Latent Feature Regularization based Adversarial Network for Brain Tumor Anomaly Detection. 954-959 - Zhenda Xu, Jiahao Hu, Qiang Gao, Donghua Hang, Qihua Zhou, Song Guo, Aiqian Gan:

Development of Deep Learning Algorithms for Automated Scoliosis and Abnormal Posture Screening Using 2D Back Image. 960-965 - Yu Tang, Gang Yang, Jianchun Zhao, Dayong Ding, Jun Wu:

LACL: Lesion-Aware Contrastive Learning Framework for Medical Image Classification. 966-971 - Yinan Mao, Bowei He, Shiji Zhou, Chen Ma, Zhi Wang:

Collaborative Edge Caching: a Meta Reinforcement Learning Approach with Edge Sampling. 972-977 - Feng Peng, Bingcong Lu, Li Song, Rong Xie, Yanmei Liu, Ying Chen:

PACC: Perception Aware Congestion Control for Real-time Communication. 978-983 - Xueting Jiang, Xin Liu, Yiu-Ming Cheung, Xing Xu, Shu-Kai Zheng, Taihao Li:

Label-Semantic-Enhanced Online Hashing for Efficient Cross-modal Retrieval. 984-989 - Cheng Zhan, Huan Yan, Han Hu, Liyue Zhu, Shubin Xu:

QoE Maximization for Aerial Video Streaming with Multiple Cellular Connected UAVs. 990-995 - Dieli Hu, Wen Ji, Zhi Wang:

Multi-stream Adaptive Offloading of Joint Compressed Video Streams, Feature Streams, and Semantic Streams in Edge Computing Systems. 996-1001 - Jangwoo Son, Yago Sanchez, Christian Hampe, Dominik Schnieders, Thomas Schierl, Cornelius Hellge:

L4S Congestion Control Algorithm for Interactive Low Latency Applications over 5G. 1002-1007 - Hao Ren, Wu Ran, Xingson Liu, Haoran Ren, Hong Lu, Rui Zhang, Cheng Jin:

Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network. 1008-1013 - Dazhao Du, Bing Su, Yu Li, Zhongang Qi, Lingyu Si, Ying Shan:

Do We Really Need Temporal Convolutions in Action Segmentation? 1014-1019 - Guo Chen, Yin-Dong Zheng, Zhe Chen, Jiahao Wang, Tong Lu:

ELAN: Enhancing Temporal Action Detection with Location Awareness. 1020-1025 - Yin-Dong Zheng, Guo Chen, Minglei Yuan, Tong Lu:

MRSN: Multi-Relation Support Network for Video Action Detection. 1026-1031 - Qinying Liu, Zilei Wang, Ruoxi Chen, Zhilin Li:

Unleashing the Potential of Adjacent Snippets for Weakly-supervised Temporal Action Localization. 1032-1037 - Zikun Zhuang, Ruihao Qian, Chi Xie, Shuang Liang:

Compositional Learning in Transformer-Based Human-Object Interaction Detection. 1038-1043 - Junkai Yan, Lingxiao Yang, Yipeng Gao, Wei-Shi Zheng:

Self-supervised Cross-stage Regional Contrastive Learning for Object Detection. 1044-1049 - Bingchao Wu, Yangyuxuan Kang, Daoguang Zan, Bei Guan, Yongji Wang:

Hierarchical and Contrastive Representation Learning for Knowledge-Aware Recommendation. 1050-1055 - Qingzhong Chen, Shilun Cai, Crystal Cai, Zefang Yu, Dahong Qian, Suncheng Xiang:

Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval. 1056-1061 - Wenye Lin, Yifeng Ding, Zhixiong Cao, Hai-Tao Zheng:

Establishing a Stronger Baseline for Lightweight Contrastive Models. 1062-1067 - Jinyong Wen, Yuhu Wang, Chunxia Zhang, Shiming Xiang, Chunhong Pan:

Graph Information Interaction on Feature and Structure via Cross-modal Contrastive Learning. 1068-1073 - Yidan Fan, Wenhuan Lu, Yahong Han:

Discriminative and Contrastive Consistency for Semi-supervised Domain Adaptive Image Classification. 1074-1079 - Feng Liu, Deyi Tuo, Yinan Xu, Xintong Han:

CoverHunter: Cover Song Identification with Refined Attention and Alignments. 1080-1085 - Iacopo Ghinassi, Matthew Purver, Huy Phan, Chris Newell:

Exploring Pre-Trained Neural Audio Representations for Audio Topic Segmentation. 1086-1091 - Xun Zhou, Wujin Sun, Xiaodong Shi:

A High-Quality Melody-Aware Peking Opera Synthesizer Using Data Augmentation. 1092-1097 - Xinlu Liu, Jiale Qian, Qiqi He, Yi Yu, Wei Li:

LC-Beating: An Online System for Beat and Downbeat Tracking using Latency-Controlled Mechanism. 1098-1103 - Honglin Mu, Wentian Xia, Wanxiang Che:

Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer. 1104-1108 - Yulun Wu, Jiahao Zhao, Yi Yu, Wei Li:

MFAE: Masked frame-level autoencoder with hybrid-supervision for low-resource music transcription. 1109-1114 - Hongji Yang, Jiao Liu, Shao-Ping Lu, Bo Ren:

Self-Supervised Implicit 3D Reconstruction via RGB-D Scans. 1115-1120 - Yang Wu, Lingyan Liang, Yaqian Zhao, Kaihua Zhang:

Object-Aware Calibrated Depth-Guided Transformer for RGB-D Co-Salient Object Detection. 1121-1126 - Yufan Deng, Xin Deng, Mai Xu:

A Two-stage hybrid CNN-Transformer Network for RGB Guided Indoor Depth Completion. 1127-1132 - Peiyuan Zhi, Kaiyue Zhou, Yali Li, Shengjin Wang:

Feature Decoupling and Uncertainty Estimation for 3D Object Detection. 1133-1138 - Lianggangxu Chen, Jiale Lu, Changbo Wang, Gaoqi He:

Scene Graph Generation using Depth-based Multimodal Network. 1139-1144 - Linlong Fan, Yanqi Ge, Wen Li, Lixin Duan:

Multi-View Token Clustering and Fusion for 3D Object Recognition and Retrieval. 1145-1150 - Gang Wang, Yufei Chen:

Local Consensus Transformer for Correspondence Learning. 1151-1156 - Bowen Zheng, Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan:

Preserving Locality in Vision Transformers for Class Incremental Learning. 1157-1162 - Ruichao Hou, Boyue Xu, Tongwei Ren, Gangshan Wu:

MTNet: Learning Modality-aware Representation with Transformer for RGBT Tracking. 1163-1168 - Zixuan Su, Jingjing Chen, Lei Pang, Chong-Wah Ngo, Yu-Gang Jiang:

Adaptive Split-Fusion Transformer. 1169-1174 - Yijun Long, Zhaoyu Chen, Hong Lu, Wenqiang Zhang:

GSFormer: Geometric-Spatial Transformer on Point Cloud Completion. 1175-1180 - Chaohao Wen, Xun Gong:

SDGFormer: An Efficient Convolution Network Structurally Similar to Transformer. 1181-1186 - Huaming Wang, Jianwei Fei, Yunshu Dai, Lingyun Leng, Zhihua Xia:

General GAN-generated Image Detection by Data Augmentation in Fingerprint Domain. 1187-1192 - Qichao Ying, Hang Zhou, Xiaoxiao Hu, Zhenxing Qian, Sheng Li, Xinpeng Zhang:

Image Protection for Robust Cropping Localization and Recovery. 1193-1198 - Pei-Kai Huang, Jun-Xiong Chong, Hui-Yu Ni, Tzu-Hsien Chen, Chiou-Ting Hsu:

Towards Diverse Liveness Feature Representation and Domain Expansion for Cross-Domain Face Anti-Spoofing. 1199-1204 - Xin Dong, Tao Wang, Zhendong Li, Hao Liu:

Joint Statistical and Causal Feature Modulated Face Anti-Spoofing. 1205-1210 - Yuwei Zeng, Jingxuan Tan, Zhengxin You, Zhenxing Qian, Xinpeng Zhang:

Watermarks for Generative Adversarial Network Based on Steganographic Invisible Backdoor. 1211-1216 - Yan Fang, Zhongyuan Wang, Jikang Cheng, Ruoxi Wang, Chao Liang:

Promoting adversarial transferability with enhanced loss flatness. 1217-1222 - Yuezun Li, Jiaran Zhou, Siwei Lyu:

Face Poison: Obstructing DeepFakes by Disrupting Face Detection. 1223-1228 - Wen Liu, Degang Sun, Yan Wang, Zhongyuan Chen, Xinbo Han, Haitian Yang:

ABTD-Net: Autonomous Baggage Threat Detection Networks for X-ray Images. 1229-1234 - Zhi Zeng, Mingmin Wu, Guodong Li, Xiang Li, Zhongqiang Huang, Ying Sha:

An Explainable Multi-view Semantic Fusion Model for Multimodal Fake News Detection. 1235-1240 - Hao Li, Xiangyang Luo, Yi Zhang:

Improving CoatNet for Spatial and JPEG Domain Steganalysis. 1241-1246 - Shuai Hao, Jialin Yang, Xu Jia, You He, Huchuan Lu:

Image Super-Resolution with Implicit Texture Pattern Modulation. 1247-1252 - Feihong Qin, Liyan Zhang:

Towards Efficient Large Mask Inpainting via Knowledge Transfer. 1253-1258 - Shuyi Qu, Zhenxing Niu, Jianke Zhu, Bin Dong, Kaizhu Huang:

Structure First Detail Next: Image Inpainting with Pyramid Generator. 1265-1270 - Deyang Liu, Yifan Mao, Xiaofei Zhou, Ping An, Yuming Fang:

Learning a Multilevel Cooperative View Reconstruction Network for Light Field Angular Super-Resolution. 1271-1276 - Jiancong Feng, Yuan-Gen Wang, Fengchuang Xing:

NLCUnet: Single-Image Super-Resolution Network with Hairline Details. 1277-1282 - Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yiqing Rong, Shuai Cui:

An Order-Complexity Model for Aesthetic Quality Assessment of Symbolic Homophony Music Scores. 1289-1294 - Zehong Zhou, Fei Zhou, Guoping Qiu:

Collaborative Auto-encoding for Blind Image Quality Assessment. 1295-1300 - Jiaming Xie, Yu Luo, Jie Ling, Guanghui Yue:

No Reference Image Quality Assessment Via Quality Difference Learning. 1301-1306 - Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng:

Low-Light Image Enhancement by Learning Contrastive Representations in Spatial and Frequency Domains. 1307-1312 - Lanxin Zhao, Dengshi Li, Jing Xiao, Chenyi Zhu:

Noise adaptive speech intelligibility enhancement based on improved StarGAN*. 1313-1318 - Bo Li, Lin Yuanbo Wu, Deyin Liu, Hongyang Chen, Yuanxin Ye, Xianghua Xie:

Image Template Matching via Dense and Consistent Contrastive Learning. 1319-1324 - Andreas Sochopoulos, Ioannis Mademlis, Evangelos Charalampakis, Sotirios Papadopoulos, Ioannis Pitas:

Deep Reinforcement Learning with semi-expert distillation for autonomous UAV cinematography. 1325-1330 - Xucheng Wang, Xiangyang Yang, Hengzhou Ye, Shuiwang Li:

Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking. 1331-1336 - Pan Mu, Jing Fang, Haotian Qian, Cong Bai:

Transmission and Color-guided Network for Underwater Image Enhancement. 1337-1342 - Dan Zeng, Mingliang Zou, Xucheng Wang, Shuiwang Li:

Towards Discriminative Representations with Contrastive Instances for Real-Time UAV Tracking. 1349-1354 - Rizwan Khan, Atif Mehmood, Saeed Akbar, Zhonglong Zheng:

Underwater Image Enhancement with an Adaptive Self Supervised Network. 1355-1360 - Cong Liang, Shangfei Wang, Xiaoping Chen:

Privacy-Protected Facial Expression Recognition Augmented by High-Resolution Facial Images. 1361-1366 - Feipeng Ma, Yueyi Zhang, Xiaoyan Sun:

Multimodal Sentiment Analysis with Preferential Fusion and Distance-aware Contrastive Learning. 1367-1372 - Wenxiu Geng, Yulong Bian, Xiangxian Li:

A Multi-View Co-Learning Method for Multimodal Sentiment Analysis. 1373-1378 - Zenan Xu, Qinliang Su, Junxi Xiao:

Multimodal Aspect-Based Sentiment Classification with Knowledge-Injected Transformer. 1379-1384 - Chuang Chen, Xiao Sun:

STA-GCN:Spatial Temporal Adaptive Graph Convolutional Network for Gait Emotion Recognition. 1385-1390 - Yiming Zhang, Hao Wang, Yifan Xu, Xinglong Mao, Tong Xu, Sirui Zhao, Enhong Chen:

Adaptive Graph Attention Network with Temporal Fusion for Micro-Expressions Recognition. 1391-1396 - Haoyu Zhou, Wei Hu, Ying Li, Chu He, Xi Chen:

Deep Homography Estimation With Feature Correlation Transformer. 1397-1402 - Zepeng Huang, Qi Wan, Junliang Chen, Xiaodong Zhao, Kai Ye, Linlin Shen:

ADATS: Adaptive RoI-Align based Transformer for End-to-End Text Spotting. 1403-1408 - Zao Zhang, Dong Yuan, Yu Zhang, Wei Bao:

Trajectory Alignment based Multi-Scaled Temporal Attention for Efficient Video Transformer. 1409-1414 - Qunchao Jin, Hongyu Hou, Guixu Zhang, Haoan Wang, Zhi Li:

Swin-ASNet: An Adaptive RGB-selection Network with Swin Transformer for Retinal Vessel Segmentation. 1415-1420 - Xin Yang, Hengliang Zhu, Guojun Mao, Shuli Xing:

OAFormer: Occlusion Aware Transformer for Camouflaged Object Detection. 1421-1426 - Zhuojun Zou, Xuexin Liu, Yuanpei Zhang, Lin Shu, Jie Hao:

Know Who You Are: Learning Target-Aware Transformer for Object Tracking. 1427-1432 - Wei Lu, Yang Jiang, Peiguang Jing, Jinghui Chu, Fugui Fan:

A Novel Channel Pruning Approach based on Local Attention and Global Ranking for CNN Model Compression. 1433-1438 - Yiding Liu, Yinglei Teng, Tao Niu:

Splittable Pattern-Specific Weight Pruning for Deep Neural Networks. 1439-1444 - Minyu Sun, Bin Jiang, Chao Yang:

Dynamic Dense-Sparse Representations for Real-Time Question Answering. 1445-1446 - Da Shi, Jingsheng Gao, Ting Liu, Yuzhuo Fu:

DynaSlim: Dynamic Slimming for Vision Transformers. 1451-1456 - Kai Feng, Zhuo Chen, Fei Gao, Zhe Wang, Long Xu, Weisi Lin:

Post-Training Quantization for Vision Transformer in Transformed Domain. 1457-1462 - Chaoran Chen, Mai Xu, Shengxi Li, Tie Liu, Minglang Qiao, Zhuoyi Lv:

Residual based hierarchical feature compression for multi-task machine vision. 1463-1468 - Shangchao Su, Bin Li, Chengzhi Zhang, Mingzhao Yang, Xiangyang Xue:

Cross-domain Federated Object Detection. 1469-1474 - Mei Ma, Ling Lin, Heng Wang, Zhendong Li, Hao Liu:

Cross-Modality Fourier Feature for Medical Image Synthesis. 1475-1480 - Ziwei Wang, Reza Arablouei, Jiajun Liu, Paulo Borges, Greg Bishop-Hurley, Nicholas Heaney:

Point-Syn2Real: Semi-Supervised Synthetic-to-Real Cross-Domain Learning for Object Classification in 3D Point Clouds. 1481-1486 - Zezhong Lv, Bing Su:

Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding. 1487-1492 - Sujuan Hou, Xingzhuo Li, Weiqing Min, Jiacheng Li, Jing Wang, Yuanjie Zheng, Shuqiang Jiang:

A Cross-direction Task Decoupling Network for Small Logo Detection. 1493-1498 - Wen Wang, Ling Zhong, Guang Gao, Minhong Wan, Jason Gu:

CHAN: Cross-Modal Hybrid Attention Network for Temporal Language Grounding in Videos. 1499-1504 - Zihan Fang, Shide Du, Yaqing Chen, Shiping Wang:

DMRL-Net: Differentiable Multi-view Representation Learning Network. 1505-1510 - Jueqi Wei, Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Shang Gao:

Conditional Video-Text Reconstruction Network with Cauchy Mask for Weakly Supervised Temporal Sentence Grounding. 1511-1516 - Yuzhong Zhao, Weijia Wu, Zhuang Li, Jiahong Li, Weiqiang Wang:

FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation. 1517-1522 - Hongzhou Wu, Yifan Lyu, Xingyu Shen, Xuechen Zhao, Mengzhu Wang, Xiang Zhang, Zhigang Luo:

Atomic-action-based Contrastive Network for Weakly Supervised Temporal Language Grounding. 1523-1528 - Xiaoqian Liu, Xiuyun Li, Yuan Cao, Fan Zhang, Xiongnan Jin, Jinpeng Chen:

Mandari: Multi-Modal Temporal Knowledge Graph-aware Sub-graph Embedding for Next-POI Recommendation. 1529-1534 - Qin Chao, Eunsoo Kim, Boyang Li:

Movie Box Office Prediction With Self-Supervised and Visually Grounded Pretraining. 1535-1540 - Wenbin Zou, Guoguang Hua, Guangxu Chen, Zaiyue He, Guangli Liu, Pengfei Chen, Yuyang Li, Huakun Li, Lei Zheng, Shishun Tian:

Need a dog for seeing eye? A Walk Viewpoint Dataset for Freespace Detection in Unstructured Environments. 1541-1546 - Linfan Zha, Yanming Chen, Peng Zhou, Yiwen Zhang:

Intensifying The Consistency of Pseudo Label Refinement for Unsupervised Domain Adaptation Person Re-Identification. 1547-1552 - Zihao Bu, Xiaoxiao Wang, Chengjian Qiu, Zhixuan Wang, Kai Han, Xiuhong Shan, Zhe Liu:

Noisy-to-Clean Label Learning for Medical Image Segmentation. 1553-1558 - Wenhao Hu, Yingying Liu, Jiazhen Xu, Xuanyu Chen, Gaoang Wang:

Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection. 1559-1564 - Bin Zheng, Miaohui Wang:

Rethinking Video Error Concealment: A Benchmark Dataset. 1565-1570 - Zemian Guo, Yingying Zhu:

Visual Place Recognition Datasets for Indoor Spaces. 1571-1576 - Dan You, Pengcheng Xia, Qiuzhu Chen, Minghui Wu, Suncheng Xiang, Jun Wang:

AutoKary2022: A Large-Scale Densely Annotated Dataset for Chromosome Instance Segmentation. 1577-1582 - Salman Siddique Khan, Vivek Boominathan, Ashok Veeraraghavan, Kaushik Mitra:

Designing Optics and Algorithm for Ultra-Thin, High-Speed Lensless Cameras. 1583-1588 - Yangke Ying, Jin Wang, Yunhui Shi, Baocai Yin:

Dual-Domain Feature Learning and Memory-Enhanced Unfolding Network for Spectral Compressive Imaging. 1589-1594 - Shumian Yang, Xinxin Xiang, Fenghua Tong, Dawei Zhao, Xin Li:

Image Compressed Sensing Using Multi-Scale Characteristic Residual Learning. 1595-1600 - Pinjun Luo, Guoqiang Xiao, Xinbo Gao, Song Wu:

LKD-Net: Large Kernel Convolution Network for Single Image Dehazing. 1601-1606 - Haoran Huang, Yuhui Quan, Zhenghua Lei, Jinlong Hu, Yan Huang:

Video Noise Removal Using Progressive Decomposition With Conditional Invertibility. 1607-1612 - Shaokai Liu, Hao Feng, Wengang Zhou, Houqiang Li, Cong Liu, Feng Wu:

DocMAE: Document Image Rectification via Self-supervised Representation Learning. 1613-1618 - He Zhu, Yang Chen, Guyue Hu, Shan Yu:

Information-density Masking Strategy for Masked Image Modeling. 1619-1624 - Zheyuan Liu, Pan Mu, Hanning Xu, Cong Bai:

Histogram-guided Video Colorization Structure with Spatial-Temporal Connection. 1625-1630 - Xinye Yang, Dongbao Yang, Yu Zhou, Youhui Guo, Weiping Wang:

Mask-Guided Stamp Erasure for Real Document Image. 1631-1636 - Yu Cao, Hao Tian, P. Y. Mok:

Attention-Aware Anime Line Drawing Colorization. 1637-1642 - Xinghui Li, Yikang Ding, Jia Guo, Xiansong Lai, Shihao Ren, Wensen Feng, Long Zeng:

Edge-aware Neural Implicit Surface Reconstruction. 1643-1648 - Yinhe Lin, Fei Chen, Hang Cheng, Meiqing Wang:

Handwriting Curve Interpolation Using Gradient Graph Laplacian Regularizer. 1649-1654 - Vibhoothi, François Pitié, Angeliki Katsenou, Yeping Su, Balu Adsumilli, Anil C. Kokaram:

Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1. 1655-1660 - Chunyi Li, May Lim, Abdelhak Bentaleb, Roger Zimmermann:

A Real-Time Blind Quality-of-Experience Assessment Metric for HTTP Adaptive Streaming. 1661-1666 - Andréas Pastor, Patrick Le Callet:

Towards Guidelines for Subjective Haptic Quality Assessment: A Case Study on Quality Assessment of Compressed Haptic Signals. 1667-1672 - Vignesh V. Menon, Jingwen Zhu, Prajit T. Rajendran, Hadi Amirpour, Patrick Le Callet, Christian Timmerer:

Just Noticeable Difference-Aware Per-Scene Bitrate-Laddering for Adaptive Video Streaming. 1673-1678 - Hadi Amirpour, Vignesh V. Menon, Samira Afzal, Radu Prodan, Christian Timmerer:

Optimizing Video Streaming for Sustainability and Quality: The Role of Preset Selection in Per-Title Encoding. 1679-1684 - Zicheng Zhang, Hao Chen, Xun Cao, Zhan Ma:

Anableps: Adapting Bitrate for Real-Time Communication Using VBR-encoded Video. 1685-1690 - Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:

Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation-based Voice Conversion. 1691-1696 - Hegen Yan, Zhihua Lu:

A Disentangled Recurrent Variational Autoencoder for Speech Enhancement. 1697-1702 - Sipan Li, Songxiang Liu, Luwen Zhang, Xiang Li, Yanyao Bian, Chao Weng, Zhiyong Wu, Helen Meng:

SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias. 1703-1708 - Zhibin Qiu, Yachao Guo, Mengfan Fu, Hao Huang, Ying Hu, Liang He, Fuchun Sun:

CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising. 1709-1714 - Ying Hu, Shijing Hou, Huamin Yang, Hao Huang, Liang He:

A Joint Network Based on Interactive Attention for Speech Emotion Recognition. 1715-1720 - Fangjing Niu, Tengfei Cao, Ying Hu, Hao Huang, Liang He:

Speech Topic Classification Based on Pre-trained and Graph Networks. 1721-1726 - Zhuoming Dong, Huajun Zhou, Jianhuang Lai:

Unsupervised 3D Face Reconstruction with Reprogramming Skip Connections. 1727-1732 - Pengfei Hu, Yingfan Tao, Qiqi Bao, Guijin Wang, Wenming Yang:

EvenFace: Deep Face Recognition with Uniform Distribution of Identities. 1733-1738 - Xiaomeng Fu, Xi Wang, Jin Liu, Jiao Dai, Jizhong Han:

Large Pose Friendly Face Reenactment using subtle motions. 1739-1744 - Wei Xu, Kangkang Wang, Ziliang Chen, Bin He, Bi Li, Haocheng Feng, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding:

MSAbox: A spatially stable face detector. 1745-1750 - Xianliang Huang, Yining Lang, Ying Guo, Yuan He, Hui Xue, Li Zhao, Shuigeng Zhou:

DR-Net: A Multi-View Face Synthesis Network Driven by Dual Representation. 1751-1756 - Weichen Zhang, Xiang Zhou, YuKang Cao, WenSen Feng, Chun Yuan:

MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images. 1757-1762 - Yaoru Luo, Ge Yang:

Enhancing Robustness of Deep Networks Against Noisy Labels Based on A Two-Phase Formulation of Their Learning Behavior. 1763-1768 - Yadang Chen, Dingwei Zhang, Zhi-Xin Yang, Enhua Wu:

Robust and Efficient Memory Network for Video Object Segmentation. 1769-1774 - Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou:

Weight-based Regularization for Improving Robustness in Image Classification. 1775-1780 - Wenyi Feng, Wei Guo, Ting Xiao, Zhe Wang:

Robust Structured Sparse Subspace Clustering with Neighborhood Preserving Projection. 1781-1786 - Jiawei Lin, Shuoyao Wang:

Improving robustness of learning-based adaptive video streaming in wildly fluctuating networks. 1787-1792 - Dong Xi, Wengang Zhou, Houqiang Li:

Robust Person Re-Identification with Wireless Signals. 1793-1798 - Tao Hong, Ya Wang, Xingwu Sun, Fengzong Lian, Zhanhui Kang, Jinwen Ma:

GradSalMix: Gradient Saliency-Based Mix for Image Data Augmentation. 1799-1804 - Hui Zhu, Yongchun Lü, Qin Ma, Xunyi Zhou, Fen Xia, Guoqing Zhao, Ning Jiang, Xiaofang Zhao:

Get a Head Start: Targeted Labeling at Source with Limited Annotation Overhead for Semi-Supervised Learning. 1805-1810 - Yan Hu, Xiaozhao Fang, Weijun Lv, Peipei Kang:

Partial multi-label learning: exploration of binary ground-truth labels. 1811-1816 - Shiya Luo, Defang Chen, Can Wang:

Customizing Synthetic Data for Data-Free Student Learning. 1817-1822 - Zhen Liang, Changyuan Zhao, Wanwei Liu, Bai Xue, Wenjing Yang:

A Geometrical Characterization on Feature Density of Image Datasets. 1823-1828 - Gang Li, Qifei Zhang, Peizheng Wang, Jie Zhang, Chao Wu:

Federated Domain Adaptation via Pseudo-label Refinement. 1829-1834 - Xinchen Gao, Yawei Li, Wen Li, Lixin Duan, Luc Van Gool, Luca Benini, Michele Magno:

Learning continuous piecewise non-linear activation functions for deep neural networks. 1835-1840 - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:

Discriminative Spatiotemporal Alignment for Self-Supervised Video Correspondence Learning. 1841-1846 - Jia Chen, Haidongqing Yuan, Fei Fang, Tao Peng, Xinrong Hu:

Unsupervised Fashion Style Learning by Solving Fashion Jigsaw Puzzles. 1847-1852 - Selen Pehlivan, Jorma Laaksonen:

Anchor-Free Action Proposal Network with Uncertainty Estimation. 1853-1858 - Shalayiding Sirejiding, Yuxiang Lu, Hongtao Lu, Yue Ding:

Scale-Aware Task Message Transferring for Multi-Task Learning. 1859-1864 - Yuhu Wang, Shiming Xiang, Chunhong Pan:

Improving the Homophily of Heterophilic Graphs for Semi-Supervised Node Classification. 1865-1870 - Kai Leng, Cong Yang, Wei Sui, Jie Liu, Zhijun Li:

Sitpose: A Siamese Convolutional Transformer for Relative Camera Pose Estimation. 1871-1876 - Xiaocong Wang, Chaoyue Wu, Haiyang Yu, Bin Li, Xiangyang Xue:

TextFormer: Component-aware Text Segmentation with Transformer. 1877-1882 - Hui Lu, Ronald Poppe, Albert Ali Salah:

SCFormer: Integrating hybrid Features in Vision Transformers. 1883-1888 - Tianyu Song, Pengpeng Li, Guiyue Jin, Jiyu Jin, Shumin Fan, Xiang Chen:

Image Deraining Transformer with Sparsity and Frequency Guidance. 1889-1894 - Beiying Yang, Guibo Zhu, Guojing Ge, Jinzhao Luo, Jinqiao Wang:

ShiftFormer: Spatial-Temporal Shift Operation in Video Transformer. 1895-1900 - Tianxiang Chen, Qi Chu, Zhentao Tan, Bin Liu, Nenghai Yu:

ABMNet: Coupling Transformer with CNN Based on Adams-Bashforth-Moulton Method for Infrared Small Target Detection. 1901-1906 - Yue He, Yufan Wang, Linlong He, Guangyao Pan, He Ma:

ART: An Efficient Transformer with Atrous Residual Learning for Medical Images. 1907-1912 - Shiao Xie, Huimin Huang, Ziwei Niu, Lanfen Lin, Yen-Wei Chen:

MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation. 1913-1918 - Jia Chen, Zhenpeng Fu, Fei Fang, Mingfu Xiong, Xinrong Hu, Tao Peng:

Cross-cycle Transformer-based Stitching Method for Low-resolution Borehole Images. 1919-1924 - Jiquan Peng, Chaozhuo Li, Yi Zhao, Yuting Lin, Xiaohan Fang, Jibing Gong:

Improving Vision Transformers with Nested Multi-head Attentions. 1925-1930 - Yuzhang Hu, Minghao Liu, Wenhan Yang, Jiaying Liu, Zongming Guo:

Collaborative Spatial-Temporal Distillation for Efficient Video Deraining. 1937-1942 - Hailin Zhang, Defang Chen, Can Wang:

Adaptive Multi-Teacher Knowledge Distillation with Meta-Learning. 1943-1948 - Defang Cai, Pan Mu, Sixian Chan, Zhanpeng Shao, Cong Bai:

Towards General and Fast Video Derain via Knowledge Distillation. 1949-1954 - Jian Zhu, Xiaohu Ruan, Yongli Cheng, Zhangmin Huang, Yu Cui, Lingfang Zeng:

Deep Metric Multi-View Hashing for Multimedia Retrieval. 1955-1960 - Jinyu Li, Fuwei Zhang, Shujin Lin, Fan Zhou, Ruomei Wang:

MIM: Lightweight Multi-Modal Interaction Model for Joint Video Moment Retrieval and Highlight Detection. 1961-1966 - Xu Zhang, Xinzheng Niu, Philippe Fournier-Viger, Xudong Dai:

Image-text Retrieval via Preserving Main Semantics of Vision. 1967-1972 - Xun Jiang, Zhiguo Chen, Xing Xu, Fumin Shen, Zuo Cao, Xunliang Cai:

Progressive Event Alignment Network for Partial Relevant Video Retrieval. 1973-1978 - Mengyu Yang, Di Wu, Zelong Wang, Miao Hu, Yipeng Zhou:

Understanding and Improving Perceptual Quality of Volumetric Video Streaming. 1979-1984 - Lei Wei, Shuai Wan, Xiaobin Ding, FuZheng Yang, Zhecheng Wang:

Adaptive Geometry Reconstruction for Geometry-based Point Cloud Compression. 1985-1990 - Chen Chen, Hui Yuan, Hao Liu, Junhui Hou, Raouf Hamzaoui:

CAS-Net: Cascade Attention-Based Sampling Neural Network for Point Cloud Simplification. 1991-1996 - Lei Liu, Zhihao Hu, Jing Zhang:

PCHM-Net: A New Point Cloud Compression Framework for Both Human Vision and Machine Vision. 1997-2002 - Rui Song, Chunyang Fu, Shan Liu, Ge Li:

Large-Scale Spatio-Temporal Attention Based Entropy Model for Point Cloud Compression. 2003-2008 - Haipeng Zhang, Jie Zhang, Weimiao Feng, Kaigui Bian, Hu Tuo:

Edge-FVV: Free Viewpoint Video Streaming by Learning at the Edge. 2009-2014 - Weijia Wang, Xuequan Lu, Di Shao, Xiao Liu, Richard Dazeley, Antonio Robles-Kelly, Wei Pan:

Weighted Point Cloud Normal Estimation. 2015-2020 - Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du:

HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching. 2021-2026 - Yakun Ju, Cong Zhang, Songsong Huang, Yuan Rao, Kin-Man Lam:

Learning Deep Photometric Stereo Network with Reflectance Priors. 2027-2032 - Chenyangguang Zhang, Zhiqiang Lou, Yan Di, Federico Tombari, Xiangyang Ji:

SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance. 2033-2038 - Ke Liu, Ning Ma, Zhihua Wang, Jingjun Gu, Jiajun Bu, Haishuai Wang:

Implicit Neural Distance Optimization for Mesh Neural Subdivision. 2039-2044 - Ke Ren, Zhenjiang Du, Qifeng He, Ning Xie, Guan Wang:

MRRA-GAN: Multi-Resolution Relation-Aware GAN for Point Cloud Completion. 2045-2050 - Shanshan Zhong, Wushao Wen, Jinghui Qin, Qiangpu Chen, Zhongzhan Huang:

LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem. 2051-2056 - Hui Lu, Ronald Poppe, Albert Ali Salah:

LA-layer: General local attention layer for full attention networks. 2057-2062 - Qiangxi Zhu, Zhixin Li:

A Progressive Gated Attention Model for Fine-Grained Visual Classification. 2063-2068 - Yubo Wu, Yurui Ren, Yuanqi Chen, Ge Li:

Flow-Guided Attention Deformation for Person Image Generation. 2069-2074 - Jinyi Fang, Bingke Zhu, Yingying Chen, Jinqiao Wang, Ming Tang:

Explicit Attention Modeling for Pedestrian Attribute Recognition. 2075-2080 - Yaxi Chen, Ruimin Hu, Danni Xu, Zheng Wang, Linbo Luo, Dengshi Li:

Hidden Follower Detection via Refined Gaze and Walking State Estimation. 2081-2086 - Zhenbei Wu, Haoge Deng, Qiang Wang, Di Kong, Jie Yang, Yonggang Qi:

SketchScene: Scene Sketch To Image Generation With Diffusion Models. 2087-2092 - Yanjie Pan, Yaru Du, Shandong Wang, Yun Ye, Yong Jiang, Zhen Zhou, Li Xu, Ming Lu, Yunbiao Lin, Jiehui Lu:

DanceU: motion-and-music-based automatic effect generation for dance videos. 2093-2098 - Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han:

FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions. 2099-2104 - Wanqing Wu, Aihua Mao, Wenwei Yan, Qing Liu:

UFS-Net: Unsupervised Network For Fashion Style Editing And Generation. 2105-2110 - Yuxin Hou, Hongxun Yao, Haoran Li:

Graph Convolutional GRU for Music-Oriented Dance Choreography Generation. 2111-2116 - Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan:

CCLAP: Controllable Chinese Landscape Painting Generation Via Latent Diffusion Model. 2117-2122 - Zhongan Wang, Shuai Shi, Yingna Wu, Rui Yang:

Prototype calibration for long tailed recognition. 2123-2128 - Son Duy Dao, Dat Huynh, He Zhao, Dinh Phung, Jianfei Cai:

Open-Vocabulary Multi-label Image Classification with Pretrained Vision-Language Model. 2135-2140 - Ruotong Hu, Xianzhi Wang, Xiaojun Chang, Yeqi Hu, Xiaowei Xin, Xiangqian Ding, Baoqi Guo:

RASNet: A Reinforcement Assistant Network for Frame Selection in Video-based Posture Recognition. 2141-2146 - Shengqin Wang, Yongji Zhang, Hong Qi, Minghao Zhao, Yu Jiang:

Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition. 2147-2152 - Zhengxuan Zhang, Weixing Mai, Haoliang Xiong, Chuhan Wu, Yun Xue:

A Token-wise Graph-based Framework for Multimodal Named Entity Recognition. 2153-2158 - Zhao Duan, Xiaoliu Luo, Taiping Zhang:

Multi-focus image fusion via gradient guidance progressive network. 2159-2164 - Chao-Liang Yu, I-Chen Lin:

Efficient Video Matting on Human Video Clips for Real-Time Application. 2165-2170 - Shen Yan, Xiaoya Cheng, Yuxiang Liu, Juelin Zhu, Rouwan Wu, Yu Liu, Maojun Zhang:

Render-and-Compare: Cross-view 6-DoF Localization from Noisy Prior. 2171-2176 - Zan Chen, Ran Li, Yongqiang Li, Yuanjing Feng:

Video Snapshot Compressive Imaging via Optical Flow. 2177-2182 - Wenpeng Xing, Jie Chen:

CasTensoRF: Cascaded Tensorial Radiance Fields for Novel View Synthesis. 2183-2188 - Lingzhi Li, Zhongshu Wang, Zhen Shen, Li Shen, Ping Tan:

Compact Real-Time Radiance Fields with Neural Codebook. 2189-2194 - Xiaowen Ma, Jiawei Yang, Tingfeng Hong, Mengting Ma, Ziyan Zhao, Tian Feng, Wei Zhang:

STNet: Spatial and Temporal feature fusion network for change detection in remote sensing images. 2195-2200 - Boyu Qiao, Kun Li, Wei Zhou, Zhou Yan, Shilong Li, Songlin Hu:

Social Bot Detection Based on Window Strategy. 2201-2206 - Wei Ma, Shiyong Lan, Weikang Huang, Wenwu Wang, Hongyu Yang, Yitong Ma, Yongjie Ma:

A Semantics-Aware Normalizing Flow Model for Anomaly Detection. 2207-2212 - Haitao Leng, Xiaoming Shi, Wei Zhou, Kuncai Zhang, Qiankun Shi, Pengcheng Zhu:

Online Action Detection with Learning Future Representations by Contrastive Learning. 2213-2218 - Hantao Zhang, Shouhong Wan, Weidong Guo, Peiquan Jin, Mingguang Zheng:

HOD: Human-Object Decoupling Network for HOI Detection. 2219-2224 - Yuzhe Mao, Weike You, Linna Zhou, Zhigao Lu:

Fixing Domain Bias for Generalized Deepfake Detection. 2225-2230 - Jiangming Chen, Wanxia Deng, Bo Peng, Tianpeng Liu, Yingmei Wei, Li Liu:

Variational Information Bottleneck for Cross Domain Object Detection. 2231-2236 - Peiwen Li, Lijun Zhang, Xiang-Dong Zhou, Yu Shi, Xiaohu Shao:

Attention Based Network with DA-Loss for X-ray Contraband Automatic Detection. 2237-2242 - Haiyan Zhang, Sumei Li:

Cross-Level Attention Based Adaptive Feature Alignment Network for Arbitrary-Shaped Text Detection. 2243-2248 - Yang Wu, Zhibin Liu, Hefeng Wu, Liang Lin:

Multi-object Video Generation from Single Frame Layouts. 2249-2254 - Hongshuo Tian, Ning Xu, Yanhui Wang, Chenggang Yan, Bolun Zheng, Xuanya Li, An-An Liu:

Towards Confidence-Aware Commonsense Knowledge Integration for Scene Graph Generation. 2255-2260 - Tianlong Ma, Xingjiao Wu, Xiangcheng Du, Yanlong Wang, Cheng Jin:

Image Layer Modeling for Complex Document Layout Generation. 2261-2266 - Jieting Chen, Junkai Ding, Wenping Chen, Qin Jin:

Knowledge Enhanced Model for Live Video Comment Generation. 2267-2272 - Yun Guo, Wei Feng, Zheng Zhang, Xiancong Ren, Yaoyu Li, Jingjing Lv, Xin Zhu, Zhangang Lin, Jingping Shao:

Mutual Query Network for Multi-Modal Product Image Segmentation. 2273-2278 - Xiaogang Du, Yinghao Wu, Tao Lei, Dongxin Gu, Yinyin Nie, Asoke K. Nandi:

ATENet: Adaptive Tiny-Object Enhanced Network for Polyp Segmentation. 2279-2284 - Gang Xu, Shengxin Wang, Thomas Lukasiewicz, Zhenghua Xu:

Adaptive-Masking Policy with Deep Reinforcement Learning for Self-Supervised Medical Image Segmentation. 2285-2290 - Hao Zeng, Xinxin Shan, Yu Feng, Ying Wen:

MSAANet: Multi-scale Axial Attention Network for medical image segmentation. 2291-2296 - Hao Yang, Min Wang, Zhengfei Yu, Yun Zhou:

A Simple Stochastic Neural Network for Improving Adversarial Robustness. 2297-2302 - Bo Zou, Chao Yang, Jiazhi Guan, Chengbin Quan, Youjian Zhao:

DFCP: Few-Shot DeepFake Detection via Contrastive Pretraining. 2303-2308 - Jiucui Lu, Yuezun Li, Jiaran Zhou, Bin Li, Siwei Lyu:

Forensics Forest: Multi-scale Hierarchical Cascade Forest for Detecting GAN-generated Faces. 2309-2314 - Bingyuan Huang, Sanshuai Cui, Xiangui Kang, Enping Li:

Transferable Waveform-level Adversarial Attack against Speech Anti-spoofing Models. 2315-2320 - Jian Zhang, Jiangqun Ni:

Domain-Invariant Feature Learning for General Face Forgery Detection. 2321-2326 - Yingjie He, Yuanman Li, Changsheng Chen, Xia Li:

Image Copy-Move Forgery Detection via Deep Cross-Scale PatchMatch. 2327-2332 - Yuxuan Zhang, Wei Yang, Rong Hu:

BAProto: Boundary-Aware Prototype for High-quality Instance Segmentation. 2333-2338 - Weiwei Li, Yuanyuan Ren, Junzhuo Liu, Chenyang Wang, Yuchen Zheng:

PMDA: Domain Alignment with Prototype Matching for Cross-Domain Adaptive Segmentation. 2339-2344 - Yongchao Wang, Bin Xiao, Xiuli Bi, Weisheng Li, Xinbo Gao:

Cross-slice Context Consistency for Semi-supervised 3D Left Atrium Segmentation. 2343-2350 - Chenbin Zhang, Qingyuan He, Kun Yan, Meng Ma, Defeng Liu, Ping Wang:

CTSSeg: Consistent Teacher-Student model for magnetic resonance image Segmentation. 2351-2356 - Xin Lv, Zhenming Su, Taiyi Zhang, Wenxiang Cheng, Xiaoqiong Qi:

Adaptive Non-local Affinity Graph for Unsupervised Image Segmentation. 2357-2362 - Yongtuo Liu, Dan Xu, Sucheng Ren, Hanjie Wu, Hongmin Cai, Shengfeng He:

Fine-grained Domain Adaptive Crowd Counting via Point-derived Segmentation. 2363-2368 - Zhengyi Liu, Xiaoshen Huang, Guanghui Zhang, Xianyong Fang, Linbo Wang, Bin Tang:

Scribble-Supervised RGB-T Salient Object Detection. 2369-2374 - Yibin Wang, Yuchao Feng, Jie Wu, Honghui Xu, Jianwei Zheng:

CA-GAN: Object Placement via Coalescing Attention based Generative Adversarial Network. 2375-2380 - Peiwen Pan, Huan Wang, Chenyi Wang, Chang Nie:

ABC: Attention with Bilinear Correlation for Infrared Small Target Detection. 2381-2386 - Bo Yuan, Yao Jiang, Keren Fu, Qijun Zhao:

Guided Focal Stack Refinement Network for Light Field Salient Object Detection. 2387-2392 - Zhenshan Tan, Cheng Chen, Xiaodong Gu:

Triplet Spatiotemporal Aggregation Network for Video Saliency Detection. 2393-2398 - Daosong Hu, Kai Huang:

GFNet: Gaze Focus Network using Attention for Gaze Estimation. 2399-2404 - Zepeng Wang, Ke Xu, Yuting Mou, Xinghao Jiang:

Feature Mixing and Disentangling for Occluded Person Re-Identification. 2405-2410 - Kaixiang Chen, Tiantian Gong, Liyan Zhang:

Multi-Scale Query-Adaptive Convolution for Generalizable Person Re-Identification. 2411-2416 - Mengzan Qi, Sixian Chan, Chen Hang, Guixu Zhang, Zhi Li:

Fine-grained Learning for Visible-Infrared Person Re-identification. 2417-2422 - Yimin Liu, Meibin Qi, Qiang Wu, Yanfang Yang, Xiaohong Li, Jian Zhang:

Camera Proxy based Contrastive Learning with Hard Sampling for Unsupervised Person Re-identification. 2423-2428 - Guoqing Zhang, Zhiyuan Luo, Weisi Lin, Xuan Jing:

Inter-Intra Camera Identity Learning for Person Re-Identification with Training in Single Camera. 2429-2434 - Tiantian Gong, Kaixiang Chen, Junsheng Wang, Liyan Zhang:

Dynamically Adaptive Instance Normalization and Attention-Aware Incremental Meta-Learning for Generalizable Person Re-identification. 2435-2440 - Qing Zhang, Weiqi Yan:

CFANet: A Cross-layer Feature Aggregation Network for Camouflaged Object Detection. 2441-2446 - Jiaxiang Dong, Li Zhang:

Multibox Sample Selection for Active Object Detection. 2447-2452 - Luojun Lin, Zhifeng Yang, Qipeng Liu, Yuanlong Yu, Qifeng Lin:

Run and Chase: Towards Accurate Source-Free Domain Adaptive Object Detection. 2453-2458 - Yuxuan Song, Xinyue Li, Lin Qi:

Camouflaged Object Detection with Feature Grafting and Distractor Aware. 2459-2464 - Dongyue Sun, Shiyao Jiang, Lin Qi:

Edge-Aware Mirror Network for Camouflaged Object Detection. 2465-2470 - Zhibin Zhang, Wanli Xue, Kaihua Zhang, Shengyong Chen:

'Skimming-Perusal' Detection: A Simple Object Detection Baseline in GigaPixel-level Images. 2471-2476 - Tong Zhu, Leida Li, Pengfei Chen, Jinjian Wu, Yuzhe Yang, Yaqian Li, Yandong Guo:

Attribute-assisted Multimodal Network for Image Aesthetics Assessment. 2477-2482 - Zicheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai:

EEP-3DQA: Efficient and Effective Projection-Based 3D Model Quality Assessment. 2483-2488 - Kaifa Yang, Qi Yang, Joel Jung, Yiling Xu, Xiaozhong Xu, Shan Liu:

Exploring the Influence of View and Camera Path Selection for Dynamic Mesh Quality Assessment. 2489-2494 - Shuaibing Wang, Shunli Wang, Dingkang Yang, Mingcheng Li, Ziyun Qian, Liuzhen Su, Lihua Zhang:

HandGCAT: Occlusion-Robust 3D Hand Mesh Reconstruction from Monocular Images. 2495-2500 - Wei Lu, Wei Sun, Zicheng Zhang, Danyang Tu, Xiongkuo Min, Guangtao Zhai:

BH-VQA: Blind High Frame Rate Video Quality Assessment. 2501-2506 - Yuan Chen, Sumei Li:

Multi-Level Feature-Guided Stereoscopic Video Quality Assessment Based on Transformer and Convolutional Neural Network. 2513-2518 - Zicheng Zhang, Yingjie Zhou, Wei Sun, Wei Lu, Xiongkuo Min, Yu Wang, Guangtao Zhai:

DDH-QA: A Dynamic Digital Humans Quality Assessment Database. 2519-2524 - Litian Li, Zheng Yang, Yongqi Zhai, Jiayu Yang, Ronggang Wang:

Improving Multi-generation Robustness of Learned Image Compression. 2525-2530 - Yinqi Chen, Zhiyi Lu, Ya Lu, Yangting Zheng, Peiwen Li, Shuo Kang:

Code Verification Hashing for Image Retrieval. 2531-2536 - Xinjie Zhang, Jiawei Shao, Jun Zhang:

Low-complexity Deep Video Compression with A Distributed Coding Architecture. 2537-2542 - Yulin Wu, Ruimin Hu, Xiaochen Wang:

Perceptual Audio Object Coding Using Adaptive Subband Grouping with CNN and Residual Block. 2543-2548 - Kai Wang, Yuanchao Bai, Deming Zhai, Daxin Li, Junjun Jiang, Xianming Liu:

Learning Lossless Compression for High Bit-Depth Medical Imaging. 2549-2554 - Pengpeng Yu, Dian Zuo, Yueer Huang, Ruishan Huang, Hanyun Wang, Yulan Guo, Fan Liang:

Sparse Representation based Deep Residual Geometry Compression Network for Large-scale Point Clouds. 2555-2560 - Shaokang Wang, Xiaofeng Huang, Guoqing Xiang, Xizhong Zhu, Jiaojiao Yang, Peng Zhang, Huizhu Jia, Xiaodong Xie:

An Efficient Real-Time Hardware Architecture for Deblocking Filter in AVS3. 2561-2566 - Yuqing Yang, Xin Jin, Kedeng Tong, Chen Wang, Haitian Huang:

Microimage-based Two-step Search For Plenoptic 2.0 Video Coding. 2567-2572 - Xi Xie, Kai Zhang, Li Zhang, Meng Wang, Junru Li, Shiqi Wang:

Low Complexity Transcoding from HEVC to VVC. 2573-2578 - Sixian Chan, Jiaao Cui, Yonggan Wu, Hongqiang Wang, Cong Bai:

Visible-Xray Cross-Modality Package Re-Identification. 2579-2584 - Huy Nguyen, Kien Nguyen, Sridha Sridharan, Clinton Fookes:

Aerial-Ground Person Re-ID. 2585-2590 - Astha Verma, A. Venkata Subramanyam, Mohammad Ali Jauhar, Divij Gera, Rajiv Ratn Shah:

Meta Perturbed Re-Id Defense. 2597-2602 - Guangyu Chen, Deyuan Zhang, Tao Liu, Xiaoyong Du:

EFT: Expert Fusion Transformer for Voice-Face Association Learning. 2603-2608 - Wenjun Peng, Weidong He, Derong Xu, Tong Xu, Chen Zhu, Enhong Chen:

Social Context-aware GCN for Video Character Search via Scene-prior Enhancement. 2609-2614 - Wei Chen, Jianwei Niu, Xuefeng Liu:

MRCap: Multi-modal and Multi-level Relationship-based Dense Video Captioning. 2615-2620 - Yibo Cui, Ruqiang Huang, Yakun Zhang, Yingjie Cen, Liang Xie, Ye Yan, Erwei Yin:

Auxiliary Fine-grained Alignment Constraints for Vision-and-Language Navigation. 2621-2626 - Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee:

Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. 2627-2632 - Wei Song, Bin Wu, Chunping Zheng, Huayang Zhang:

Detection Of Public Speaking Anxiety: A New Dataset And Algorithm. 2633-2638 - Beitao Chen, Xuanhan Wang, Xiaojia Chen, Yulan He, Jingkuan Song:

EANet: Towards Lightweight Human Pose Estimation With Effective Aggregation Network. 2639-2644 - Zhihao Li, Huaxiang Zhang, Lei Zhu, Jiande Sun, Li Liu:

Effective Occlusion Suppression Network via Grouped Pose Estimation for Occluded Person Re-Identification. 2645-2650 - Yaoxing Wang, Heng Zhou, Zhendong Li, Xian Mo, Hao Liu:

Structural Equivariance Self-Supervised Learning for Facial Pose Estimation. 2651-2656 - Hongwei Zheng, Han Li, Bowen Shi, Wenrui Dai, Botao Wang, Yu Sun, Min Guo, Hongkai Xiong:

ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting. 2657-2662 - Guanghua Zheng, Zhongqiu Zhao, Zhao Zhang, Yi Yang:

Hierarchical Graph Neural Network for Human Pose Estimation. 2663-2668 - Chunyang Xie, Dongheng Zhang, Zhi Wu, Cong Yu, Yang Hu, Qibin Sun, Yan Chen:

RF-based Multi-view Pose Machine for Multi-Person 3D Pose Estimation. 2669-2674 - Jing Li, Liu Yang, Qilong Wang, Qinghua Hu:

Coarse Helps Fine: A Multi-Granularity Discriminative Adversarial Network for Fine-Grained Open-Set Domain Adaptation. 2675-2680 - Yao Xiao, Pengxu Wei, Cong Liu, Liang Lin:

Adversarially Robust Source-free Domain Adaptation with Relaxed Adversarial Training. 2681-2686 - Yi Li, Xin Xie, Haiyan Fu, Xiangyang Luo, Yanqing Guo:

A Compact Transformer for Adaptive Style Transfer. 2687-2692 - Jianglin Wei, Guangyi Xiao, Shun Peng, Hao Chen, Jingzhi Guo, Zhiguo Gong:

Fine-Grained Alignment for Boundary Samples under Open Set Domain Adaptation. 2693-2698 - Kai Wang, Xing Xu, Jialin Tian, Zuo Cao, Gong Zhang:

Information Selection-based Domain Adaptation from Black-box Predictors. 2699-2704 - Meng Shen, Andy J. Ma, Pong C. Yuen:

E2: Entropy Discrimination and Energy Optimization for Source-free Universal Domain Adaptation. 2705-2710 - Shengyang Sun, Xiaojin Gong:

Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection. 2711-2716 - Xiangyu Huang, Caidan Zhao, Jinhui Yu, Chenxing Gao, Zhiqiang Wu:

Multi-Level Memory-Augmented Appearance-Motion Correspondence Framework for Video Anomaly Detection. 2717-2722 - Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang:

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance. 2723-2728 - Weilin Wan, Weizhong Zhang, Cheng Jin:

Pose-Motion Video Anomaly Detection via Memory-Augmented Reconstruction and Conditional Variational Prediction. 2729-2734 - Junyi Yan, Enguang Zuo, Chen Chen, Cheng Chen, Jie Zhong, Tianle Li, Xiaoyi Lv:

Rethinking graph anomaly detection: A self-supervised Group Discrimination paradigm with Structure-Aware. 2735-2740 - Jie Zhong, Enguang Zuo, Chen Chen, Cheng Chen, Junyi Yan, Tianle Li, Xiaoyi Lv:

A Masked Attention Network with Query Sparsity Measurement for Time Series Anomaly Detection. 2741-2746 - Qiong Wang, Kui Jiang, Jinyi Lai, Zheng Wang, Jianhui Zhang:

HPCNet: A Hybrid Progressive Coupled Network for Image Deraining. 2747-2752 - Fengchao Xiong, Jun Zhou, Zhuang Zhao, Yuntao Qian:

Iterative Refinement Network for Hyperspectral Image Denoising. 2753-2758 - Yuqi Jiang, Chune Zhang, Jiao Liu:

CS-PCN: Context-Space Progressive Collaborative Network for Image Denoising. 2759-2764 - Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin:

DDT: Dual-branch Deformable Transformer for Image Denoising. 2765-2770 - Fengyi Zhang, Lin Zhang, Tianjun Zhang, Dongqing Wang:

Adaptively Hashing 3DLUTs for Lightweight Real-time Image Enhancement. 2771-2776 - Yingxue Pang, Shijie Zhao, Haiqiang Wang, Gen Zhan, Junlin Li, Li Zhang:

Frequency-Assisted Adaptive Sharpening Scheme Considering Bitrate and Quality Tradeoff. 2777-2782 - Zerun Liu, Fan Zhang, Jingxuan He, Jin Wang, Zhangye Wang, Lechao Cheng:

Text-Guided Mask-Free Local Image Retouching. 2783-2788 - Guoliang You, Xiaomeng Chu, Yifan Duan, Jie Peng, Jianmin Ji, Yu Zhang, Yanyong Zhang:

P3O: Transferring Visual Representations for Reinforcement Learning via Prompting. 2789-2794 - Yehuan Wang, Jian Hu, Lin Shang:

Accurate and Complete Captions for Question-controlled Text-aware Image Captioning. 2795-2800 - Yuhao Chen, Guoqing Zhang, Hongwei Zhang, Yuhui Zheng, Weisi Lin:

Multi-level Part-aware Feature Disentangling for Text-based Person Search. 2801-2806 - Yiren Zhang, Yuanwu Xu, Mohan Chen, Yuejie Zhang, Rui Feng, Shang Gao:

SPTNET: Span-based Prompt Tuning for Video Grounding. 2807-2812 - Xingyu Zhu, Feifei Dai, Xiaoyan Gu, Haihui Fan, Bo Li, Weiping Wang:

ERPG: Enhancing Entity Representations with Prompt Guidance for Complex Named Entity Recognition. 2813-2818 - Peizhuo Lv, Hualong Ma, Jiachen Zhou, Ruigang Liang, Kai Chen, Shengzhi Zhang, Yunfei Yang:

DBIA: Data-Free Backdoor Attack Against Transformer Networks. 2819-2824 - Yangming Zhou, Yuzhou Yang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang:

Multimodal Fake News Detection via CLIP-Guided Learning. 2825-2830 - Yiqiang Lv, Jingjing Chen, Zhipeng Wei, Kai Chen, Zuxuan Wu, Yu-Gang Jiang:

Downstream Task-agnostic Transferable Attacks on Language-Image Pre-training Models. 2831-2836 - Zhongqiang Huang, Yuxue Hu, Zhi Zeng, Xiang Li, Ying Sha:

Multimodal Stacked Cross Attention Network for Fine-Grained Fake News Detection. 2837-2842 - Jinghong Xia, Hongxia Wang, Sani M. Abdullahi, Heng Wang, Fei Zhang, Bingling Luo:

Adaptive and Robust Fourier-Mellin-Based Image Watermarking for Social Networking Platforms. 2843-2848 - Pengcheng Su, Rongxin Tu, Hongmei Liu, Yue Qing, Xiangui Kang:

Adversarial Attacks on Generated Text Detectors. 2849-2854 - Qianjin Du, Wei Kun, Xiaohui Kuang, Xiang Li, Gang Zhao:

Automated Software Vulnerability Detection via Curriculum Learning. 2855-2860 - Zhi Zeng, Mingmin Wu, Guodong Li, Xiang Li, Zhongqiang Huang, Ying Sha:

Correcting the Bias: Mitigating Multimodal Inconsistency Contrastive Learning for Multimodal Fake News Detection. 2861-2866 - Shiwei Jing, Jianjun Li, Wanyong Tian:

Meaningful ciphertext image encryption based on histogram shift and ND-ICM hyperchaos. 2867-2872 - Shansong Wang, Qingtian Zeng, Weijian Ni, Xue Zhang, Cheng Cheng:

Hierarchical Class Level Attribute Guided Generative Meta Learning for Pest Image Zero-shot Learning. 2873-2878 - Wenhao Qiu, Sichao Fu, Jingyi Zhang, Chengxiang Lei, Qinmu Peng:

Semantic-visual Guided Transformer for Few-shot Class-incremental Learning. 2885-2890 - Jiaxin Chen, Yanxu Hu, Meng Shen, Andy J. Ma:

Dual Episodic Sampling and Momentum Consistency Regularization for Unsupervised Few-shot Learning. 2891-2896 - Yaqian Zhou, Yu Liu, Dan Song, Jiayu Li, Xuanya Li, An-An Liu:

Cross-domain Prototype Contrastive loss for Few-shot 2D Image-Based 3D Model Retrieval. 2897-2902 - Dianlong You, Peng Wang, Yi Zhang, Ling Wang, Shunfu Jin:

Few-Shot Object Detection via Back Propagation and Dynamic Learning. 2903-2908 - Yunkai Dang, Meijun Sun, Min Zhang, Zhengyu Chen, Xinliang Zhang, Zheng Wang, Donglin Wang:

Multi-Level Correlation Network For Few-Shot Image Classification. 2909-2914 - Xixiang Lin, Zhenghao Li, Liangchen Liu, Jun Wu, Lijun Zhang, Xiang-Dong Zhou:

Irecut+MM: Data Generalization and Metric Improvement for Few-shot Learning. 2915-2920 - Yiwen Zhang, Hailun Zhang, Qijun Zhao:

Counting and Locating Anything: Class-agnostic Few-shot Object Counting and Localization. 2921-2926 - Yanhui Wang, Ning Xu, Hongshuo Tian, Bo Lv, Yulong Duan, Xuanya Li, An-An Liu:

Knowledge Prompt Makes Composed Pre-Trained Models Zero-Shot News Captioner. 2879-2884
