Xiatian Zhu
2020 – today
- 2024
- [j40]Jie Liu, Jinzong Cui, Mao Ye, Xiatian Zhu, Song Tang:
Shooting condition insensitive unmanned aerial vehicle object detection. Expert Syst. Appl. 246: 123221 (2024) - [j39]Xingbing Fu, Chaofan Jiang, Chaorong Li, Jiangtao Li, Xiatian Zhu, Fagen Li:
A hybrid approach for Android malware detection using improved multi-scale convolutional neural networks and residual networks. Expert Syst. Appl. 249: 123675 (2024) - [j38]Song Tang, An Chang, Fabian Zhang, Xiatian Zhu, Mao Ye, Changshui Zhang:
Source-Free Domain Adaptation via Target Prediction Distribution Searching. Int. J. Comput. Vis. 132(3): 654-672 (2024) - [j37]Jiachen Lu, Junge Zhang, Xiatian Zhu, Jianfeng Feng, Tao Xiang, Li Zhang:
Softmax-Free Linear Transformers. Int. J. Comput. Vis. 132(8): 3355-3374 (2024) - [j36]Xi Chen, Haosen Yang, Huicong Zhang, Hongxun Yao, Xiatian Zhu:
Uncertainty-aware pseudo-label filtering for source-free unsupervised domain adaptation. Neurocomputing 575: 127190 (2024) - [j35]Yanbei Chen, Massimiliano Mancini, Xiatian Zhu, Zeynep Akata:
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1327-1347 (2024) - [j34]Hu Wang, Mao Ye, Xiatian Zhu, Shuai Li, Xue Li, Ce Zhu:
Compressed-SDR to HDR Video Reconstruction. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3679-3691 (2024) - [j33]Lihua Zhou, Nianxin Li, Mao Ye, Xiatian Zhu, Song Tang:
Source-free domain adaptation with Class Prototype Discovery. Pattern Recognit. 145: 109974 (2024) - [c90]Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu:
DiffSED: Sound Event Detection with Denoising Diffusion. AAAI 2024: 792-800 - [c89]Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery. CVPR Workshops 2024: 1211-1223 - [c88]Song Tang, Wenxin Su, Mao Ye, Xiatian Zhu:
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model. CVPR 2024: 23711-23720 - [c87]Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
PartCraft: Crafting Creative Objects by Parts. ECCV (9) 2024: 420-437 - [c86]Zijie Pan, Jiachen Lu, Xiatian Zhu, Li Zhang:
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping. ICLR 2024 - [c85]Siying Xiao, Mao Ye, Qichen He, Shuaifeng Li, Song Tang, Xiatian Zhu:
Adversarial Experts Model for Black-box Domain Adaptation. ACM Multimedia 2024: 8982-8991 - [i116]Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang:
Fast Dynamic 3D Object Generation from a Single-view Video. CoRR abs/2401.08742 (2024) - [i115]Anindya Mondal, Sauradip Nag, Xiatian Zhu, Anjan Dutta:
OmniCount: Multi-label Object Counting with Semantic-Geometric Priors. CoRR abs/2403.05435 (2024) - [i114]Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu:
Unified Source-Free Domain Adaptation. CoRR abs/2403.07601 (2024) - [i113]Xi Chen, Haosen Yang, Huicong Zhang, Hongxun Yao, Xiatian Zhu:
Uncertainty-Aware Pseudo-Label Filtering for Source-Free Unsupervised Domain Adaptation. CoRR abs/2403.11256 (2024) - [i112]Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu:
Unsupervised Audio-Visual Segmentation with Modality Alignment. CoRR abs/2403.14203 (2024) - [i111]Chaitali Bhattacharyya, Hanxiao Wang, Feng Zhang, Sungho Kim, Xiatian Zhu:
Diffusion Deepfake. CoRR abs/2404.01579 (2024) - [i110]Li Zhang, Yuankun Yang, Ziyang Xie, Zhiyuan Yuan, Jianfeng Feng, Xiatian Zhu, Yu-Gang Jiang:
Automating the Diagnosis of Human Vision Disorders by Cross-modal 3D Generation. CoRR abs/2405.15239 (2024) - [i109]Chun Gu, Zeyu Yang, Zijie Pan, Xiatian Zhu, Li Zhang:
Tetrahedron Splatting for 3D Generation. CoRR abs/2406.01579 (2024) - [i108]Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu:
Proxy Denoising for Source-Free Domain Adaptation. CoRR abs/2406.01658 (2024) - [i107]Haosen Yang, Chenhao Zhang, Wenqing Wang, Marco Volino, Adrian Hilton, Li Zhang, Xiatian Zhu:
Gaussian Splatting with Localized Points Management. CoRR abs/2406.04251 (2024) - [i106]Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery. CoRR abs/2406.08457 (2024) - [i105]Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu:
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis. CoRR abs/2406.08920 (2024) - [i104]Song Tang, Shaxu Yan, Xiaozhi Qi, Jianxin Gao, Mao Ye, Jianwei Zhang, Xiatian Zhu:
Few-Shot Medical Image Segmentation with High-Fidelity Prototypes. CoRR abs/2406.18074 (2024) - [i103]Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
PartCraft: Crafting Creative Objects by Parts. CoRR abs/2407.04604 (2024) - [i102]Zhi Qin Tan, Olga Isupova, Gustavo Carneiro, Xiatian Zhu, Yunpeng Li:
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations. CoRR abs/2407.07958 (2024) - [i101]Zeyu Yang, Nan Song, Wei Li, Xiatian Zhu, Li Zhang, Philip H. S. Torr:
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving. CoRR abs/2408.05075 (2024) - [i100]Guoan Xu, Wenfeng Huang, Tao Wu, Ligeng Chen, Wenjing Jia, Guangwei Gao, Xiatian Zhu, Stuart W. Perry:
MacFormer: Semantic Segmentation with Fine Object Boundaries. CoRR abs/2408.05699 (2024) - [i99]Xi Chen, Haosen Yang, Sheng Jin, Xiatian Zhu, Hongxun Yao:
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation. CoRR abs/2409.03525 (2024) - [i98]Wenqing Wang, Haosen Yang, Josef Kittler, Xiatian Zhu:
Single Image, Any Face: Generalisable 3D Face Generation. CoRR abs/2409.16990 (2024)
- 2023
- [j32]Peng Xu, Xiatian Zhu, David A. Clifton:
Multimodal Learning With Transformers: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12113-12132 (2023) - [j31]Wei Li, Shaogang Gong, Xiatian Zhu:
Neural operator search. Pattern Recognit. 136: 109215 (2023) - [j30]Lihua Zhou, Siying Xiao, Mao Ye, Xiatian Zhu, Shuaifeng Li:
Adaptive Mutual Learning for Unsupervised Domain Adaptation. IEEE Trans. Circuits Syst. Video Technol. 33(11): 6622-6634 (2023) - [c84]Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang:
PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer. AAAI 2023: 1042-1050 - [c83]Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang:
Unsupervised Hashing with Similarity Distribution Calibration. BMVC 2023: 53-69 - [c82]Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang:
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks. CVPR 2023: 2669-2680 - [c81]Jiaqi Chen, Jiachen Lu, Xiatian Zhu, Li Zhang:
Generative Semantic Segmentation. CVPR 2023: 7111-7120 - [c80]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Post-Processing Temporal Action Detection. CVPR 2023: 18837-18845 - [c79]Sauradip Nag, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang:
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion. ICCV 2023: 10328-10340 - [c78]Peng Xu, Xiatian Zhu:
DeepChange: A Long-Term Person Re-Identification Benchmark with Clothes Change. ICCV 2023: 11162-11171 - [c77]Lihua Zhou, Mao Ye, Xiatian Zhu, Siying Xiao, Xuqian Fan, Ferrante Neri:
Homeomorphism Alignment for Unsupervised Domain Adaptation. ICCV 2023: 18653-18664 - [c76]Xiao Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang:
Controllable Person Image Synthesis with Pose-Constrained Latent Diffusion. ICCV 2023: 22711-22720 - [c75]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. ICCV (Workshops) 2023: 272-283 - [c74]Anindya Mondal, Sauradip Nag, Joaquin M. Prada, Xiatian Zhu, Anjan Dutta:
Actor-agnostic Multi-label Action Recognition with Multi-modal Query. ICCV (Workshops) 2023: 784-794 - [c73]Kongming Liang, Xinran Wang, Rui Wang, Donghui Gao, Ling Jin, Weidong Liu, Xiatian Zhu, Zhanyu Ma, Jun Guo:
Vision-Language Assisted Attribute Learning. IC-NIDC 2023: 1-5 - [c72]Qichen He, Siying Xiao, Mao Ye, Xiatian Zhu, Ferrante Neri, Dongde Hou:
Independent Feature Decomposition and Instance Alignment for Unsupervised Domain Adaptation. IJCAI 2023: 819-827 - [c71]Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, Kwan-Yee K. Wong:
HeadSculpt: Crafting 3D Head Avatars with Text. NeurIPS 2023 - [i97]Li Zhang, Hengyuan Ma, Xiatian Zhu, Jianfeng Feng:
Preconditioned Score-based Generative Models. CoRR abs/2302.06504 (2023) - [i96]Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang:
Unsupervised Hashing via Similarity Distribution Calibration. CoRR abs/2302.07669 (2023) - [i95]Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang:
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks. CoRR abs/2303.02483 (2023) - [i94]Anran Qi, Sauradip Nag, Xiatian Zhu, Ariel Shamir:
PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds. CoRR abs/2303.09695 (2023) - [i93]Jiaqi Chen, Jiachen Lu, Xiatian Zhu, Li Zhang:
Generative Semantic Segmentation. CoRR abs/2303.11316 (2023) - [i92]Sauradip Nag, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang:
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion. CoRR abs/2303.14863 (2023) - [i91]Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, Kwan-Yee K. Wong:
HeadSculpt: Crafting 3D Head Avatars with Text. CoRR abs/2306.03038 (2023) - [i90]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. CoRR abs/2306.11732 (2023) - [i89]Anindya Mondal, Sauradip Nag, Joaquin M. Prada, Xiatian Zhu, Anjan Dutta:
MSQNet: Actor-agnostic Action Recognition with Multi-modal Query. CoRR abs/2307.10763 (2023) - [i88]Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu:
DiffSED: Sound Event Detection with Denoising Diffusion. CoRR abs/2308.07293 (2023) - [i87]Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Xiatian Zhu:
Leveraging Foundation models for Unsupervised Audio-Visual Segmentation. CoRR abs/2309.06728 (2023) - [i86]Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia:
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection. CoRR abs/2310.01430 (2023) - [i85]Zeyu Yang, Hongye Yang, Zijie Pan, Xiatian Zhu, Li Zhang:
Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting. CoRR abs/2310.10642 (2023) - [i84]Zijie Pan, Jiachen Lu, Xiatian Zhu, Li Zhang:
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping. CoRR abs/2310.12474 (2023) - [i83]Haosen Yang, Chuofan Ma, Bin Wen, Yi Jiang, Zehuan Yuan, Xiatian Zhu:
Recognize Any Regions. CoRR abs/2311.01373 (2023) - [i82]Jay Gala, Sauradip Nag, Huichou Huang, Ruirui Liu, Xiatian Zhu:
Adaptive-Labeling for Enhancing Remote Sensing Cloud Understanding. CoRR abs/2311.05198 (2023) - [i81]Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination. CoRR abs/2311.15477 (2023) - [i80]Huanxin Chen, Pengshuai Yin, Huichou Huang, Qingyao Wu, Ruirui Liu, Xiatian Zhu:
Typhoon Intensity Prediction with Vision Transformer. CoRR abs/2311.16450 (2023) - [i79]Song Tang, Wenxin Su, Mao Ye, Xiatian Zhu:
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model. CoRR abs/2311.16510 (2023) - [i78]Yurui Chen, Chun Gu, Junzhe Jiang, Xiatian Zhu, Li Zhang:
Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering. CoRR abs/2311.18561 (2023) - [i77]Kongming Liang, Xinran Wang, Rui Wang, Donghui Gao, Ling Jin, Weidong Liu, Xiatian Zhu, Zhanyu Ma, Jun Guo:
Vision-language Assisted Attribute Learning. CoRR abs/2312.07009 (2023)
- 2022
- [j29]Wei-Shi Zheng, Jincheng Hong, Jiening Jiao, Ancong Wu, Xiatian Zhu, Shaogang Gong, Jiayin Qin, Jianhuang Lai:
Joint Bilateral-Resolution Identity Modeling for Cross-Resolution Person Re-Identification. Int. J. Comput. Vis. 130(1): 136-156 (2022) - [j28]Feng Zhang, Xiatian Zhu, Chen Wang:
A Comprehensive Survey on Single-Person Pose Estimation in Social Robotics. Int. J. Soc. Robotics 14(9): 1995-2008 (2022) - [j27]Hui Tang, Xiatian Zhu, Ke Chen, Kui Jia, C. L. Philip Chen:
Towards Uncovering the Intrinsic Data Structures for Unsupervised Domain Adaptation Using Structurally Regularized Deep Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6517-6533 (2022) - [j26]Guile Wu, Xiatian Zhu, Shaogang Gong:
Learning hybrid ranking representation for person re-identification. Pattern Recognit. 121: 108239 (2022) - [j25]Xu Lan, Xiatian Zhu, Shaogang Gong:
Unsupervised cross-domain person re-identification by instance and distribution alignment. Pattern Recognit. 124: 108514 (2022) - [j24]Chen Wang, Feng Zhang, Xiatian Zhu, Shuzhi Sam Ge:
Low-resolution human pose estimation. Pattern Recognit. 126: 108579 (2022) - [j23]Mantun Chen, Yongjun Wang, Xiatian Zhu:
Few-shot Website Fingerprinting attack with Meta-Bias Learning. Pattern Recognit. 130: 108739 (2022) - [c70]Shuaifeng Li, Mao Ye, Xiatian Zhu, Lihua Zhou, Lin Xiong:
Source-Free Object Detection by Learning to Overlook Domain Style. CVPR 2022: 8004-8013 - [c69]Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng:
Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling. ECCV (23) 2022: 1-16 - [c68]Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang:
Learning Ego 3D Representation as Ray Tracing. ECCV (26) 2022: 129-144 - [c67]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers. ECCV (11) 2022: 294-311 - [c66]Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martínez:
SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition. ECCV (13) 2022: 604-620 - [c65]Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang:
FashionViL: Fashion-Focused Vision-and-Language Representation Learning. ECCV (35) 2022: 634-651 - [c64]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning. ECCV (3) 2022: 645-662 - [c63]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Semi-supervised Temporal Action Detection with Proposal-Free Masking. ECCV (3) 2022: 663-680 - [c62]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Zero-Shot Temporal Action Detection via Vision-Language Prompting. ECCV (3) 2022: 681-697 - [c61]Xian Shi, Xun Xu, Wanyue Zhang, Xiatian Zhu, Chuan Sheng Foo, Kui Jia:
Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding. ICPR 2022: 5045-5051 - [c60]Hu Wang, Mao Ye, Xiatian Zhu, Shuai Li, Ce Zhu, Xue Li:
KUNet: Imaging Knowledge-Inspired Single HDR Image Reconstruction. IJCAI 2022: 1408-1414 - [c59]Han Gao, Jinzhong Cui, Mao Ye, Shuai Li, Yu Zhao, Xiatian Zhu:
Structure-Preserving Motion Estimation for Learned Video Compression. ACM Multimedia 2022: 3055-3063 - [c58]Lihua Zhou, Mao Ye, Xiatian Zhu, Shuaifeng Li, Yiguang Liu:
Class Discriminative Adversarial Learning for Unsupervised Domain Adaptation. ACM Multimedia 2022: 4318-4326 - [c57]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning. NeurIPS 2022 - [c56]Zhenbin Wang, Mao Ye, Xiatian Zhu, Liuhan Peng, Liang Tian, Yingying Zhu:
MetaTeacher: Coordinating Multi-Model Domain Adaptation for Medical Image Classification. NeurIPS 2022 - [c55]Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang:
DeepInteraction: 3D Object Detection via Modality Interaction. NeurIPS 2022 - [i76]Mingkun Li, Peng Xu, Xiatian Zhu, Jun Guo:
Unsupervised Long-Term Person Re-Identification with Clothes Change. CoRR abs/2202.03087 (2022) - [i75]Mantun Chen, Yongxin Chen, Yongjun Wang, Peidai Xie, Shaojing Fu, Xiatian Zhu:
End-to-End Multi-Tab Website Fingerprinting Attack: A Detection Perspective. CoRR abs/2203.06376 (2022) - [i74]Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martínez:
SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition. CoRR abs/2204.04796 (2022) - [i73]Xian Shi, Xun Xu, Wanyue Zhang, Xiatian Zhu, Chuan Sheng Foo, Kui Jia:
Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding. CoRR abs/2205.01006 (2022) - [i72]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers. CoRR abs/2205.03436 (2022) - [i71]Jing Yang, Xiatian Zhu, Adrian Bulat, Brais Martínez, Georgios Tzimiropoulos:
Knowledge Distillation Meets Open-Set Semi-Supervised Learning. CoRR abs/2205.06701 (2022) - [i70]Hengyuan Ma, Li Zhang, Xiatian Zhu, Jingfeng Zhang, Jianfeng Feng:
Accelerating Score-based Generative Models for High-Resolution Image Synthesis. CoRR abs/2206.04029 (2022) - [i69]Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang:
Learning Ego 3D Representation as Ray Tracing. CoRR abs/2206.04042 (2022) - [i68]Peng Xu, Xiatian Zhu, David A. Clifton:
Multimodal Learning with Transformers: A Survey. CoRR abs/2206.06488 (2022) - [i67]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition. CoRR abs/2206.13559 (2022) - [i66]Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang:
PolarFormer: Multi-camera 3D Object Detection with Polar Transformers. CoRR abs/2206.15398 (2022) - [i65]Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng:
Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling. CoRR abs/2207.02196 (2022) - [i64]Jiachen Lu, Li Zhang, Junge Zhang, Xiatian Zhu, Hang Xu, Jianfeng Feng:
Softmax-free Linear Transformers. CoRR abs/2207.03341 (2022) - [i63]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Temporal Action Detection with Global Segmentation Mask Learning. CoRR abs/2207.06580 (2022) - [i62]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Semi-Supervised Temporal Action Detection with Proposal-Free Masking. CoRR abs/2207.07059 (2022) - [i61]Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang:
FashionViL: Fashion-Focused Vision-and-Language Representation Learning. CoRR abs/2207.08150 (2022) - [i60]Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang:
Zero-Shot Temporal Action Detection via Vision-Language Prompting. CoRR abs/2207.08184 (2022) - [i59]Li Zhang, Sixiao Zheng, Jiachen Lu, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng:
Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective. CoRR abs/2207.09339 (2022) - [i58]Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang:
DeepInteraction: 3D Object Detection via Modality Interaction. CoRR abs/2208.11112 (2022) - [i57]