default search action
Enze Xie
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j10]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Deeply Unsupervised Patch Re-Identification for Pre-Training Object Detectors. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1348-1361 (2024) - [j9]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao:
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2151-2170 (2024) - [j8]Yangguang Li, Bin Huang, Zeren Chen, Yufeng Cui, Feng Liang, Mingzhu Shen, Fenggang Liu, Enze Xie, Lu Sheng, Wanli Ouyang, Jing Shao:
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 8665-8679 (2024) - [j7]Zhenhua Xu, Yujia Zhang, Enze Xie, Zhen Zhao, Yong Guo, Kwan-Yee K. Wong, Zhenguo Li, Hengshuang Zhao:
DriveGPT4: Interpretable End-to-End Autonomous Driving Via Large Language Model. IEEE Robotics Autom. Lett. 9(10): 8186-8193 (2024) - [j6]Chuanyang Zheng, Haiming Wang, Enze Xie, Zhengying Liu, Jiankai Sun, Huajian Xin, Jianhao Shen, Zhenguo Li, Yu Li:
Lyra: Orchestrating Dual Correction in Automated Theorem Proving. Trans. Mach. Learn. Res. 2024 (2024) - [c46]Tianqi Wang, Sukmin Kim, Wenxuan Ji, Enze Xie, Chongjian Ge, Junsong Chen, Zhenguo Li, Ping Luo:
DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving. AAAI 2024: 5599-5606 - [c45]Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze Xie, Zhenguo Li:
Accelerating Diffusion Sampling with Optimized Time Steps. CVPR 2024: 8292-8301 - [c44]Junsong Chen, Chongjian Ge, Enze Xie, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li:
PIXART-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation. ECCV (32) 2024: 74-91 - [c43]Shentong Mo, Enze Xie, Yue Wu, Junsong Chen, Matthias Nießner, Zhenguo Li:
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation. ECCV (84) 2024: 354-370 - [c42]Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo:
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts. ECCV (84) 2024: 407-423 - [c41]Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung, Qiang Xu:
MagicDrive: Street View Generation with Diverse 3D Geometry Control. ICLR 2024 - [c40]Kai Chen, Enze Xie, Zhe Chen, Yibo Wang, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung:
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation. ICLR 2024 - [c39]Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Zhongdao Wang, James T. Kwok, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis. ICLR 2024 - [c38]Yuanfeng Ji, Chongjian Ge, Weikai Kong, Enze Xie, Zhengying Liu, Zhenguo Li, Ping Luo:
Large Language Models as Automated Aligners for benchmarking Vision-Language Models. ICLR 2024 - [c37]Haiming Wang, Huajian Xin, Chuanyang Zheng, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. ICLR 2024 - [c36]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. ICLR 2024 - [c35]Renhao Wang, Zhiding Yu, Shiyi Lan, Enze Xie, Ke Chen, Anima Anandkumar, José M. Álvarez:
SF3D: SlowFast Temporal 3D Object Detection. IV 2024: 1280-1285 - [i79]Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li:
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models. CoRR abs/2401.05252 (2024) - [i78]Zhao Wang, Aoxue Li, Enze Xie, Lingting Zhu, Yong Guo, Qi Dou, Zhenguo Li:
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects. CoRR abs/2401.09962 (2024) - [i77]Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li:
Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation. CoRR abs/2401.15688 (2024) - [i76]Yihang Gao, Chuanyang Zheng, Enze Xie, Han Shi, Tianyang Hu, Yu Li, Michael K. Ng, Zhenguo Li, Zhaoqiang Liu:
On the Expressive Power of a Variant of the Looped Transformer. CoRR abs/2402.13572 (2024) - [i75]Shuchen Xue, Zhaoqiang Liu, Fei Chen, Shifeng Zhang, Tianyang Hu, Enze Xie, Zhenguo Li:
Accelerating Diffusion Sampling with Optimized Time Steps. CoRR abs/2402.17376 (2024) - [i74]Junsong Chen, Chongjian Ge, Enze Xie, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation. CoRR abs/2403.04692 (2024) - [i73]Jiahao Lyu, Jin Wei, Gangyan Zeng, Zeng Li, Enze Xie, Wei Wang, Yu Zhou:
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model. CoRR abs/2403.10047 (2024) - [i72]Tianwei Xiong, Yue Wu, Enze Xie, Yue Wu, Zhenguo Li, Xihui Liu:
Editing Massive Concepts in Text-to-Image Diffusion Models. CoRR abs/2403.13807 (2024) - [i71]Tianqi Wang, Enze Xie, Ruihang Chu, Zhenguo Li, Ping Luo:
DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving. CoRR abs/2403.16996 (2024) - [i70]Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo:
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts. CoRR abs/2407.11382 (2024) - [i69]Yecheng Wu, Zhuoyang Zhang, Junyu Chen, Haotian Tang, Dacheng Li, Yunhao Fang, Ligeng Zhu, Enze Xie, Hongxu Yin, Li Yi, Song Han, Yao Lu:
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation. CoRR abs/2409.04429 (2024) - [i68]Enze Xie, Junsong Chen, Junyu Chen, Han Cai, Haotian Tang, Yujun Lin, Zhekai Zhang, Muyang Li, Ligeng Zhu, Yao Lu, Song Han:
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers. CoRR abs/2410.10629 (2024) - [i67]Junyu Chen, Han Cai, Junsong Chen, Enze Xie, Shang Yang, Haotian Tang, Muyang Li, Yao Lu, Song Han:
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models. CoRR abs/2410.10733 (2024) - [i66]Haotian Tang, Yecheng Wu, Shang Yang, Enze Xie, Junsong Chen, Junyu Chen, Zhuoyang Zhang, Han Cai, Yao Lu, Song Han:
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer. CoRR abs/2410.10812 (2024) - 2023
- [j5]Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo:
CycleMLP: A MLP-Like Architecture for Dense Visual Predictions. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14284-14300 (2023) - [c34]Haiming Wang, Ye Yuan, Zhengying Liu, Jianhao Shen, Yichun Yin, Jing Xiong, Enze Xie, Han Shi, Yujun Li, Lin Li, Jian Yin, Zhenguo Li, Xiaodan Liang:
DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function. ACL (1) 2023: 12632-12646 - [c33]Yutao Hu, Qixiong Wang, Wenqi Shao, Enze Xie, Zhenguo Li, Jungong Han, Ping Luo:
Beyond One-to-One: Rethinking the Referring Image Segmentation. ICCV 2023: 4044-4054 - [c32]Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li:
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning. ICCV 2023: 4207-4216 - [c31]Jiayu Yang, Enze Xie, Miaomiao Liu, José M. Álvarez:
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye View. ICCV 2023: 8449-8458 - [c30]Chongjian Ge, Junsong Chen, Enze Xie, Zhongdao Wang, Lanqing Hong, Huchuan Lu, Zhenguo Li, Ping Luo:
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation. ICCV 2023: 8687-8697 - [c29]Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo:
DDP: Diffusion Model for Dense Visual Prediction. ICCV 2023: 21684-21695 - [c28]Ruihang Chu, Enze Xie, Shentong Mo, Zhenguo Li, Matthias Nießner, Chi-Wing Fu, Jiaya Jia:
DiffComplete: Diffusion-based Generative 3D Shape Completion. NeurIPS 2023 - [c27]Kaiyi Huang, Kaiyue Sun, Enze Xie, Zhenguo Li, Xihui Liu:
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation. NeurIPS 2023 - [c26]Shentong Mo, Enze Xie, Ruihang Chu, Lanqing Hong, Matthias Nießner, Zhenguo Li:
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation. NeurIPS 2023 - [c25]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Ping Luo, Zaiqing Nie:
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection. NeurIPS 2023 - [c24]Xurui Sun, Jiahao Lyu, Yifei Zhang, Gangyan Zeng, Bo Fang, Yu Zhou, Enze Xie, Can Ma:
Feature Enhancement with Text-Specific Region Contrast for Scene Text Detection. PRCV (7) 2023: 3-14 - [i65]Bin Huang, Yangguang Li, Enze Xie, Feng Liang, Luya Wang, Mingzhu Shen, Fenggang Liu, Tianqi Wang, Ping Luo, Jing Shao:
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception. CoRR abs/2301.07870 (2023) - [i64]Yangguang Li, Bin Huang, Zeren Chen, Yufeng Cui, Feng Liang, Mingzhu Shen, Fenggang Liu, Enze Xie, Lu Sheng, Wanli Ouyang, Jing Shao:
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline. CoRR abs/2301.12511 (2023) - [i63]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Jirui Yuan, Ping Luo, Zaiqing Nie:
Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction. CoRR abs/2303.10552 (2023) - [i62]Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo:
DDP: Diffusion Model for Dense Visual Prediction. CoRR abs/2303.17559 (2023) - [i61]Tianqi Wang, Sukmin Kim, Wenxuan Ji, Enze Xie, Chongjian Ge, Junsong Chen, Zhenguo Li, Ping Luo:
DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving. CoRR abs/2304.01168 (2023) - [i60]Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li:
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning. CoRR abs/2304.06648 (2023) - [i59]Chuanyang Zheng, Zhengying Liu, Enze Xie, Zhenguo Li, Yu Li:
Progressive-Hint Prompting Improves Reasoning in Large Language Models. CoRR abs/2304.09797 (2023) - [i58]Chongjian Ge, Junsong Chen, Enze Xie, Zhongdao Wang, Lanqing Hong, Huchuan Lu, Zhenguo Li, Ping Luo:
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation. CoRR abs/2304.09801 (2023) - [i57]Yuyang Zhao, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee:
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts. CoRR abs/2305.08850 (2023) - [i56]Kai Chen, Enze Xie, Zhe Chen, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung:
Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt. CoRR abs/2306.04607 (2023) - [i55]Ruihang Chu, Enze Xie, Shentong Mo, Zhenguo Li, Matthias Nießner, Chi-Wing Fu, Jiaya Jia:
DiffComplete: Diffusion-based Generative 3D Shape Completion. CoRR abs/2306.16329 (2023) - [i54]Shentong Mo, Enze Xie, Ruihang Chu, Lewei Yao, Lanqing Hong, Matthias Nießner, Zhenguo Li:
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation. CoRR abs/2307.01831 (2023) - [i53]Jingwei Zhang, Han Shi, Jincheng Yu, Enze Xie, Zhenguo Li:
DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks. CoRR abs/2307.02159 (2023) - [i52]Jiayu Yang, Enze Xie, Miaomiao Liu, José M. Álvarez:
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye View. CoRR abs/2307.04106 (2023) - [i51]Kaiyi Huang, Kaiyue Sun, Enze Xie, Zhenguo Li, Xihui Liu:
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation. CoRR abs/2307.06350 (2023) - [i50]Yutao Hu, Qixiong Wang, Wenqi Shao, Enze Xie, Zhenguo Li, Jungong Han, Ping Luo:
Beyond One-to-One: Rethinking the Referring Image Segmentation. CoRR abs/2308.13853 (2023) - [i49]Chuanyang Zheng, Haiming Wang, Enze Xie, Zhengying Liu, Jiankai Sun, Huajian Xin, Jianhao Shen, Zhenguo Li, Yu Li:
Lyra: Orchestrating Dual Correction in Automated Theorem Proving. CoRR abs/2309.15806 (2023) - [i48]Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James T. Kwok, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis. CoRR abs/2310.00426 (2023) - [i47]Haiming Wang, Huajian Xin, Chuanyang Zheng, Lin Li, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Heng Liao, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. CoRR abs/2310.00656 (2023) - [i46]Zhenhua Xu, Yujia Zhang, Enze Xie, Zhen Zhao, Yong Guo, Kwan-Yee K. Wong, Zhenguo Li, Hengshuang Zhao:
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model. CoRR abs/2310.01412 (2023) - [i45]Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung, Qiang Xu:
MagicDrive: Street View Generation with Diverse 3D Geometry Control. CoRR abs/2310.02601 (2023) - [i44]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. CoRR abs/2310.02954 (2023) - [i43]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Ping Luo, Zaiqing Nie:
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection. CoRR abs/2311.01682 (2023) - [i42]Yuanfeng Ji, Chongjian Ge, Weikai Kong, Enze Xie, Zhengying Liu, Zhengguo Li, Ping Luo:
Large Language Models as Automated Aligners for benchmarking Vision-Language Models. CoRR abs/2311.14580 (2023) - [i41]Yuyang Zhao, Zhiwen Yan, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee:
Animate124: Animating One Image to 4D Dynamic Scene. CoRR abs/2311.14603 (2023) - [i40]Yao Teng, Enze Xie, Yue Wu, Haoyu Han, Zhenguo Li, Xihui Liu:
Drag-A-Video: Non-rigid Video Editing with Point-based Interaction. CoRR abs/2312.02936 (2023) - [i39]Shentong Mo, Enze Xie, Yue Wu, Junsong Chen, Matthias Nießner, Zhenguo Li:
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation. CoRR abs/2312.07231 (2023) - [i38]Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng-Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li:
A Survey of Reasoning with Foundation Models. CoRR abs/2312.11562 (2023) - [i37]Kaichen Zhou, Lanqing Hong, Enze Xie, Yongxin Yang, Zhenguo Li, Wei Zhang:
SERF: Fine-Grained Interactive 3D Segmentation and Editing with Radiance Fields. CoRR abs/2312.15856 (2023) - 2022
- [j4]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
PVT v2: Improved baselines with Pyramid Vision Transformer. Comput. Vis. Media 8(3): 415-424 (2022) - [j3]Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen:
PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5349-5367 (2022) - [j2]Enze Xie, Wenhai Wang, Mingyu Ding, Ruimao Zhang, Ping Luo:
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5385-5400 (2022) - [j1]Libo Sun, Wei Yin, Enze Xie, Zhengrong Li, Changming Sun, Chunhua Shen:
Improving Monocular Visual Odometry Using Learned Depth. IEEE Trans. Robotics 38(5): 3173-3186 (2022) - [c23]Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo:
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization. AAAI 2022: 393-400 - [c22]Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo, Tong Lu:
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers. CVPR 2022: 1270-1279 - [c21]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. ECCV (9) 2022: 1-18 - [c20]Weijia Wu, Enze Xie, Ruimao Zhang, Wenhai Wang, Ping Luo, Hong Zhou:
Polygon-Free: Unconstrained Scene Text Detection with Box Annotations. ICIP 2022: 1226-1230 - [c19]Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo:
CycleMLP: A MLP-like Architecture for Dense Prediction. ICLR 2022 - [c18]Youhui Guo, Yu Zhou, Xugong Qin, Enze Xie, Weiping Wang:
UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection. ICME 2022: 1-6 - [c17]Daquan Zhou, Zhiding Yu, Enze Xie, Chaowei Xiao, Animashree Anandkumar, Jiashi Feng, José M. Álvarez:
Understanding The Robustness in Vision Transformers. ICML 2022: 27378-27394 - [i36]Chunmeng Liu, Enze Xie, Wenjia Wang, Wenhai Wang, Guangyao Li, Ping Luo:
WegFormer: Transformers for Weakly Supervised Semantic Segmentation. CoRR abs/2203.08421 (2022) - [i35]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Qiao Yu, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers. CoRR abs/2203.17270 (2022) - [i34]Libo Sun, Wei Yin, Enze Xie, Zhengrong Li, Changming Sun, Chunhua Shen:
Improving Monocular Visual Odometry Using Learned Depth. CoRR abs/2204.01268 (2022) - [i33]Enze Xie, Zhiding Yu, Daquan Zhou, Jonah Philion, Anima Anandkumar, Sanja Fidler, Ping Luo, José M. Álvarez:
M2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation. CoRR abs/2204.05088 (2022) - [i32]Daquan Zhou, Zhiding Yu, Enze Xie, Chaowei Xiao, Anima Anandkumar, Jiashi Feng, José M. Álvarez:
Understanding The Robustness in Vision Transformers. CoRR abs/2204.12451 (2022) - [i31]Youhui Guo, Yu Zhou, Xugong Qin, Enze Xie, Weiping Wang:
UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection. CoRR abs/2205.04683 (2022) - [i30]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo:
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe. CoRR abs/2209.05324 (2022) - 2021
- [c16]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. ICCV 2021: 548-558 - [c15]Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo:
Watch Only Once: An End-to-End Video Action Detection Framework. ICCV 2021: 8158-8167 - [c14]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. ICCV 2021: 8372-8381 - [c13]Peize Sun, Yi Jiang, Enze Xie, Wenqi Shao, Zehuan Yuan, Changhu Wang, Ping Luo:
What Makes for End-to-End Object Detection? ICML 2021: 9934-9944 - [c12]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Segmenting Transparent Objects in the Wild with Transformer. IJCAI 2021: 1194-1200 - [c11]Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo:
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. NeurIPS 2021: 12077-12090 - [i29]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Trans2Seg: Transparent Object Segmentation with Transformer. CoRR abs/2101.08461 (2021) - [i28]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. CoRR abs/2102.04803 (2021) - [i27]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. CoRR abs/2102.12122 (2021) - [i26]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Unsupervised Pretraining for Object Detection by Patch Reidentification. CoRR abs/2103.04814 (2021) - [i25]Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo:
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization. CoRR abs/2103.11784 (2021) - [i24]Yang Cao, Zhengqiang Zhang, Enze Xie, Qibin Hou, Kai Zhao, Xiangui Luo, Jian Tuo:
FakeMix Augmentation Improves Transparent Object Detection. CoRR abs/2103.13279 (2021) - [i23]Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen:
PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text. CoRR abs/2105.00405 (2021) - [i22]Enze Xie, Wenhai Wang, Mingyu Ding, Ruimao Zhang, Ping Luo:
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond. CoRR abs/2105.02184 (2021) - [i21]Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo:
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. CoRR abs/2105.15203 (2021) - [i20]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan,