default search action
International Journal of Computer Vision, Volume 132
Volume 132, Number 1, January 2024
- Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji:
Towards Language-Guided Visual Recognition via Dynamic Convolutions. 1-19 - Xin Luo, Wei Chen, Zhengfa Liang, Longqi Yang, Siwei Wang, Chen Li:
Crots: Cross-Domain Teacher-Student Learning for Source-Free Domain Adaptive Semantic Segmentation. 20-39 - Pia Bideau, Erik G. Learned-Miller, Cordelia Schmid, Karteek Alahari:
The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields. 40-55 - Junda Cheng, Gangwei Xu, Peng Guo, Xin Yang:
Coatrsnet: Fully Exploiting Convolution and Attention for Stereo Matching by Region Separation. 56-73 - Hyeongmin Lee, Taeoh Kim, Hanbin Son, Sangwook Baek, Minsu Cheon, Sangyoun Lee:
A Nonlinear, Regularized, and Data-independent Modulation for Continuously Interactive Image Processing Network. 74-94 - Gang Fu, Qing Zhang, Lei Zhu, Qifeng Lin, Yihao Wang, Siyuan Fan, Chunxia Xiao:
Towards High-Resolution Specular Highlight Detection. 95-117 - Ruize Han, Wei Feng, Feifan Wang, Zekun Qian, Haomin Yan, Song Wang:
Benchmarking the Complementary-View Multi-human Association and Tracking. 118-136 - Shijie Wang, Zhihui Wang, Haojie Li, Jianlong Chang, Wanli Ouyang, Qi Tian:
Accurate Fine-Grained Object Recognition with Structure-Driven Relation Graph Networks. 137-160 - Lingkun Luo, Shiqiang Hu, Liming Chen:
Discriminative Noise Robust Sparse Orthogonal Label Regression-Based Domain Adaptation. 161-184 - Jingjing Jiang, Ziyi Liu, Nanning Zheng:
Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering. 185-207 - Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang:
Context Autoencoder for Self-supervised Representation Learning. 208-223 - Yidong Wang, Zhuohao Yu, Jindong Wang, Qiang Heng, Hao Chen, Wei Ye, Rui Xie, Xing Xie, Shikun Zhang:
Exploring Vision-Language Models for Imbalanced Learning. 224-237 - Haocong Rao, Cyril Leung, Chunyan Miao:
Hierarchical Skeleton Meta-Prototype Contrastive Learning with Hard Skeleton Mining for Unsupervised Person Re-identification. 238-260 - Chunbo Lang, Gong Cheng, Binfei Tu, Junwei Han:
Few-Shot Segmentation via Divide-and-Conquer Proxies. 261-283 - Libo Zhang, Lutao Jiang, Ruyi Ji, Heng Fan:
Correction: PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection. 284 - Wenfeng Song, Xinyu Zhang, Yuting Guo, Shuai Li, Aimin Hao, Hong Qin:
Correction: Automatic Generation of 3D Scene Animation Based on Dynamic Knowledge Graphs and Contextual Encoding. 285
Volume 132, Number 2, February 2024
- Samu Koskinen, Erman Acar, Joni-Kristian Kämäräinen:
Single Pixel Spectral Color Constancy. 287-299 - Tianlun Zheng, Zhineng Chen, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang:
CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition. 300-318 - Zhong Zhuang, Taihui Li, Hengkang Wang, Ju Sun:
Blind Image Deblurring with Unknown Kernel Size and Substantial Noise. 319-348 - Da Chen, Jean-Marie Mirebeau, Huazhong Shu, Laurent D. Cohen:
A Region-Based Randers Geodesic Approach for Image Segmentation. 349-391 - Wenhao Wu, Zhun Sun, Yuxin Song, Jingdong Wang, Wanli Ouyang:
Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective. 392-409 - Chaoyu Zhao, Jianjun Qian, Shumin Zhu, Jin Xie, Jian Yang:
Learning Robust Facial Representation From the View of Diversity and Closeness. 410-427 - Liang Chen, Jiawei Zhang, Zhenhua Li, Yunxuan Wei, Faming Fang, Jimmy S. J. Ren, Jinshan Pan:
Deep Richardson-Lucy Deconvolution for Low-Light Image Deblurring. 428-445 - Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva:
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization. 446-465 - Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Dacheng Tao:
Sfnet: Faster and Accurate Semantic Segmentation Via Semantic Flow. 466-489 - Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, Jian Yang, Joost van de Weijer:
MineGAN++: Mining Generative Models for Efficient Knowledge Transfer to Limited Data Domains. 490-514 - Lujia Jin, Qing Guo, Shi Zhao, Lei Zhu, Qian Chen, Qiushi Ren, Yanye Lu:
One-Pot Multi-frame Denoising. 515-536 - Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi:
The Curious Layperson: Fine-Grained Image Recognition Without Expert Labels. 537-554 - Skylar Sutherland, Bernhard Egger, Joshua B. Tenenbaum:
Building 3D Generative Models from Minimal Data. 555-580 - Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. 581-595 - Yifei Ming, Yixuan Li:
How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models? 596-609
Volume 132, Number 3, March 2024
- Yushi Lan, Chen Change Loy, Bo Dai:
Correspondence Distillation from NeRF-Based GAN. 611-631 - Editor's Note: Special Issue on Physics-Based Vision Meets Deep Learning. 632
- Jun Tu, Gangshan Wu, Limin Wang:
Dual Graph Networks for Pose Estimation in Crowded Scenes. 633-653 - Song Tang, An Chang, Fabian Zhang, Xiatian Zhu, Mao Ye, Changshui Zhang:
Source-Free Domain Adaptation via Target Prediction Distribution Searching. 654-672 - Heyu Zhou, An-An Liu, Chenyu Zhang, Ping Zhu, Qianyi Zhang, Mohan S. Kankanhalli:
Multi-Modal Meta-Transfer Fusion Network for Few-Shot 3D Model Classification. 673-688 - Soumya Suvra Ghosal, Yixuan Li:
Are Vision Transformers Robust to Spurious Correlations? 689-709 - Yanan Sun, Chi-Keung Tang, Yu-Wing Tai:
Semantic Image Matting: General and Specific Semantics. 710-730 - Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou:
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels. 731-749 - Wei Zhai, Pingyu Wu, Kai Zhu, Yang Cao, Feng Wu, Zheng-Jun Zha:
Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation. 750-775 - Gani Rahmon, Kannappan Palaniappan, Imad Eddine Toubal, Filiz Bunyak, Raghuveer Rao, Guna Seetharaman:
DeepFTSG: Multi-stream Asymmetric USE-Net Trellis Encoders with Shared Decoder Feature Fusion Architecture for Video Motion Segmentation. 776-804 - Yunfei Guo, Wei Feng, Fei Yin, Cheng-Lin Liu:
SignParser: An End-to-End Framework for Traffic Sign Understanding. 805-821 - Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang:
MixStyle Neural Networks for Domain Generalization and Adaptation. 822-836 - Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee:
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization. 837-853 - Bolin Lai, Miao Liu, Fiona Ryan, James M. Rehg:
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation and Beyond. 854-871 - Shiyu Hu, Xin Zhao, Kaiqi Huang:
SOTVerse: A User-Defined Task Space of Single Object Tracking. 872-930 - Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari:
3D Adversarial Augmentations for Robust Out-of-Domain Predictions. 931-963 - Avishek Siris, Jianbo Jiao, Gary K. L. Tam, Xianghua Xie, Rynson W. H. Lau:
Inferring Attention Shifts for Salient Instance Ranking. 964-986 - Feng Xue, Yicong Chang, Tianxi Wang, Yu Zhou, Anlong Ming:
Indoor Obstacle Discovery on Reflective Ground via Monocular Camera. 987-1007
Volume 132, Number 4, April 2024
- Kaiyang Zhou, Ziwei Liu, Xiaohua Zhai, Chunyuan Li, Kate Saenko:
Guest Editorial: Special Issue on the Promises and Dangers of Large Vision Models. 1009-1011 - Mochu Xiang, Yuchao Dai, Feiyu Zhang, Jiawei Shi, Xinyu Tian, Zhensong Zhang:
Towards a Unified Network for Robust Monocular Depth Estimation: Network Architecture, Training Strategy and Dataset. 1012-1028 - Wu Wang, Liang-Jian Deng, Ran Ran, Gemine Vivone:
A General Paradigm with Detail-Preserving Conditional Invertible Network for Image Fusion. 1029-1054 - Zhiwei Lin, Tingting Liang, Taihong Xiao, Yongtao Wang, Ming-Hsuan Yang:
FlowNAS: Neural Architecture Search for Optical Flow Estimation. 1055-1074 - Shengyu Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang:
DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes. 1075-1090 - Shuang Liu, Masanori Suganuma, Takayuki Okatani:
Symmetry-aware Neural Architecture for Embodied Visual Navigation. 1091-1107 - Adrian Bulat, Georgios Tzimiropoulos:
Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V &L Models. 1108-1125 - Bowen Zhang, Liyang Liu, Minh Hieu Phan, Zhi Tian, Chunhua Shen, Yifan Liu:
SegViT v2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers. 1126-1147 - Pramod Rao, Mallikarjun B. R., Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Fangneng Zhan, Ayush Tewari, Christian Theobalt, Mohamed Elgharib:
A Deeper Analysis of Volumetric Relightable Faces. 1148-1166 - Soohyun Kim, Jongbeom Baek, Jihye Park, Eunjae Ha, Homin Jung, Taeyoung Lee, Seungryong Kim:
InstaFormer++: Multi-Domain Instance-Aware Image-to-Image Translation with Transformer. 1167-1186 - Libo Zhang, Xin Gu, Congcong Li, Tiejian Luo, Heng Fan:
Local Compressed Video Stream Learning for Generic Event Boundary Detection. 1187-1204 - Burak Tasdemir, Mustafa Goktan Gudukbay, Dogac Eldenk, Adil Meric, Aysegul Dundar:
Learning Portrait Drawing with Unsupervised Parts. 1205-1218 - Cong Yang, Bipin Indurkhya, John See, Bo Gao, Yan Ke, Zeyd Boukhers, Zhenyu Yang, Marcin Grzegorzek:
Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks. 1219-1241 - Yaokun Li, Guang Tan, Chao Gou:
Cascaded Iterative Transformer for Jointly Predicting Facial Landmark, Occlusion Probability and Head Pose. 1242-1257 - Feng Lin, Wenze Hu, Yaowei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang:
Universal Object Detection with Large Vision Model. 1258-1276 - Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin:
Harmonizing Base and Novel Classes: A Class-Contrastive Approach for Generalized Few-Shot Segmentation. 1277-1291 - Yuecong Xu, Haozhi Cao, Jianxiong Yin, Zhenghua Chen, Xiaoli Li, Zhengguo Li, Qianwen Xu, Jianfei Yang:
Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study. 1292-1309 - Nishant Jain, Suryansh Kumar, Luc Van Gool:
Learning Robust Multi-scale Representation for Neural Radiance Fields from Unposed Images. 1310-1335 - Nan Yang, Xin Luan, Huidi Jia, Zhi Han, Xiaofeng Li, Yandong Tang:
CCR: Facial Image Editing with Continuity, Consistency and Reversibility. 1336-1349 - Daniel Wilson, Xiaohan Zhang, Waqas Sultani, Safwan Wshah:
Image and Object Geo-Localization. 1350-1392 - Tingyu Weng, Jun Xiao, Hao Pan, Haiyong Jiang:
PartCom: Part Composition Learning for 3D Open-Set Recognition. 1393-1416 - Mixue Xie, Shuang Li, Kaixiong Gong, Yulin Wang, Gao Huang:
Adapting Across Domains via Target-Oriented Transferable Semantic Augmentation Under Prototype Constraint. 1417-1441
Volume 132, Number 5, May 2024
- Weitao Feng, Lei Bai, Yongqiang Yao, Fengwei Yu, Wanli Ouyang:
Towards Frame Rate Agnostic Multi-object Tracking. 1443-1462 - Chongwei Liu, Haojie Li, Zhi-Hui Wang:
FastTrack: A Highly Efficient and Generic GPU-Based Multi-object Tracking Method with Parallel Kalman Filter. 1463-1483 - Rongcheng Wu, Mingzhe Wang, Zhidong Li, Jianlong Zhou, Fang Chen, Xuan Wang, Changming Sun:
Few-Shot Stereo Matching with High Domain Adaptability Based on Adaptive Recursive Network. 1484-1501 - Dong Zhang, Yi Lin, Jinhui Tang, Kwang-Ting Cheng:
CAE-GReaT: Convolutional-Auxiliary Efficient Graph Reasoning Transformer for Dense Image Predictions. 1502-1520 - Wei-Hong Li, Xialei Liu, Hakan Bilen:
Universal Representations: A Unified Look at Multiple Task and Domain Learning. 1521-1545 - Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. 1546-1556 - Yang Yang, Chaoyue Wang, Xiaojie Guo, Dacheng Tao:
Robust Unpaired Image Dehazing via Density and Depth Decomposition. 1557-1577 - Jie Ma, Jun Liu, Qi Chai, Pinghui Wang, Jing Tao:
Diagram Perception Networks for Textbook Question Answering via Joint Optimization. 1578-1591 - Yifan Zhang, Junhui Hou, Yixuan Yuan:
A Comprehensive Study of the Robustness for LiDAR-Based 3D Object Detectors Against Adversarial Attacks. 1592-1624 - Huafeng Li, Junyu Liu, Yafei Zhang, Yu Liu:
A Deep Learning Framework for Infrared and Visible Image Fusion Without Strict Registration. 1625-1644 - Shaochuan Zhao, Tianyang Xu, Xiaojun Wu, Josef Kittler:
A Spatio-Temporal Robust Tracker with Spatial-Channel Transformer and Jitter Suppression. 1645-1658 - Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, Jiadong Li:
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision. 1659-1684 - Xiaofeng Mao, Yufeng Chen, Xiaojun Jia, Rong Zhang, Hui Xue, Zhao Li:
Context-Aware Robust Fine-Tuning. 1685-1700 - Marcella Cornia, Lorenzo Baraldi, Giuseppe Fiameni, Rita Cucchiara:
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets. 1701-1720 - Youfa Liu, Bo Du, Yongyong Chen, Lefei Zhang, Mingming Gong, Dacheng Tao:
Convex-Concave Tensor Robust Principal Component Analysis. 1721-1747 - Jinyuan Liu, Runjia Lin, Guanyao Wu, Risheng Liu, Zhongxuan Luo, Xin Fan:
CoCoNet: Coupled Contrastive Learning Network with Multi-level Feature Ensemble for Multi-modality Image Fusion. 1748-1775 - Editor's Note: Special Issue on BMVC 2021. 1776
- Kongming Liang, Zijin Yin, Min Min, Yan Liu, Zhanyu Ma, Jun Guo:
Learning Dynamic Prototypes for Visual Pattern Debiasing. 1777-1799 - Yifan Wang, Lin Zhang, Ran Song, Hongliang Li, Paul L. Rosin, Wei Zhang:
Exploiting Inter-Sample Affinity for Knowability-Aware Universal Domain Adaptation. 1800-1816 - Wenqi Ren, Senyou Deng, Kaihao Zhang, Fenglong Song, Xiaochun Cao, Ming-Hsuan Yang:
Fast Ultra High-Definition Video Deblurring via Multi-scale Separable Network. 1817-1834 - Azin Jahedi, Maximilian Luz, Marc Rivinius, Lukas Mehl, Andrés Bruhn:
MS-RAFT+: High Resolution Multi-Scale RAFT. 1835-1856 - Jiqing Zhang, Bo Dong, Yingkai Fu, Yuanchen Wang, Xiaopeng Wei, Baocai Yin, Xin Yang:
A Universal Event-Based Plug-In Module for Visual Object Tracking in Degraded Conditions. 1857-1879 - Shiyu Hu, Xin Zhao, Kaiqi Huang:
Correction: SOTVerse: A User-Defined Task Space of Single Object Tracking. 1880
Volume 132, Number 6, June 2024
- Aishan Liu, Shiyu Tang, Xinyun Chen, Lei Huang, Haotong Qin, Xianglong Liu, Dacheng Tao:
Towards Defending Multiple ℓ p-Norm Bounded Adversarial Perturbations via Gated Batch Normalization. 1881-1898 - Xiang Wang, Shiwei Zhang, Jun Cen, Changxin Gao, Yingya Zhang, Deli Zhao, Nong Sang:
CLIP-guided Prototype Modulating for Few-shot Action Recognition. 1899-1912 - Chang Liu, Gaurav Mittal, Nikolaos Karianakis, Victor Fragoso, Ye Yu, Yun Fu, Mei Chen:
HyperSTAR: Task-Aware Hyperparameter Recommendation for Training and Compression. 1913-1927 - Xingxing Wei, Jie Yu, Yao Huang:
Infrared Adversarial Patches with Learnable Shapes and Locations in the Physical World. 1928-1944 - Hongchen Luo, Wei Zhai, Jing Zhang, Yang Cao, Dacheng Tao:
Grounded Affordance from Exocentric View. 1945-1969 - Yuki Fujimura, Masaaki Iiyama, Takuya Funatomi, Yasuhiro Mukaigawa:
Deep Depth from Focal Stack with Defocus Model for Camera-Setting Invariance. 1970-1985 - Moira Shooter, Charles Malleson, Adrian Hilton:
SyDog-Video: A Synthetic Dog Video Dataset for Temporal Pose Estimation. 1986-2002 - Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji:
Joint Learning of Audio-Visual Saliency Prediction and Sound Source Localization on Multi-face Videos. 2003-2025 - Sachit Menon, Ishaan Preetam Chandratreya, Carl Vondrick:
Task Bias in Contrastive Vision-Language Models. 2026-2040 - Namhyuk Ahn, Jaejun Yoo, Kyung-Ah Sohn:
Data Augmentation for Low-Level Vision: CutBlur and Mixture-of-Augmentation. 2041-2059 - Yang Guo, Wei Gao, Ge Li:
Interpretable Task-inspired Adaptive Filter Pruning for Neural Networks Under Multiple Constraints. 2060-2076 - Yafei Yang, Bo Yang:
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-World Single Images. 2077-2113 - Liang Zhao, Yao Teng, Limin Wang:
Logit Normalization for Long-Tail Object Detection. 2114-2134 - Agniva Sengupta, Adrien Bartoli:
ToTem NRSfM: Object-Wise Non-rigid Structure-from-Motion with a Topological Template. 2135-2176 - Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam:
S2P3: Self-Supervised Polarimetric Pose Prediction. 2177-2194 - Chafic Abou Akar, Rachelle Abdel Massih, Anthony Yaghi, Joe Khalil, Marc Kamradt, Abdallah Makhoul:
Generative Adversarial Network Applications in Industry 4.0: A Review. 2195-2254 - Xianqiang Lyu, Junhui Hou:
Probabilistic-Based Feature Embedding of 4-D Light Fields for Compressive Imaging and Denoising. 2255-2275 - Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai:
Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation. 2276-2289 - Kohei Uehara, Tatsuya Harada:
Learning by Asking Questions for Knowledge-Based Novel Object Recognition. 2290-2309