default search action

combined dblp search
author search
venue search
publication search

ask others

Peng Gao 0007

> Home > Persons

Person information

affiliation: Shanghai Artificial Intelligence Laboratory, China
affiliation (PhD 2021): Chinese University of Hong Kong, Hong Kong

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/eswa/FuGLQGW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eswa/FuGLQGW24
Kexue Fu, Peng Gao, Shaolei Liu, Linhao Qu, Longxiang Gao, Manning Wang:
POS-BERT: Point cloud one-stage BERT pre-training. Expert Syst. Appl. 240: 122563 (2024)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcv/GaoGZMFZLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/GaoGZMFZLQ24
Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/ijcv/GaoLZFLLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/GaoLZFLLQ24
Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/FangGZCLDL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/FangGZCLDL24
Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6402-6415 (2024)
2023
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/bib/LuWLLTPLGXY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bib/LuWLLTPLGXY23
Ruiqiang Lu, Jun Wang, Pengyong Li, Yuquan Li, Shuoyan Tan, Yiting Pan, Huanxiang Liu, Peng Gao, Guotong Xie, Xiaojun Yao:
Improving drug-target affinity prediction via feature fusion and knowledge distillation. Briefings Bioinform. 24(3) (2023)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/LiWZGSLLQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/LiWZGSLLQ23
Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/SuWLGQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/SuWLGQ23
Weicong Su, Yali Wang, Kunchang Li, Peng Gao, Yu Qiao:
Hybrid token transformer for deep face recognition. Pattern Recognit. 139: 109443 (2023)
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/remotesensing/WangCCZZZDG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/remotesensing/WangCCZZZDG23
Guanqun Wang, He Chen, Liang Chen, Yin Zhuang, Shanghang Zhang, Tong Zhang, Hao Dong, Peng Gao:
P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification. Remote. Sens. 15(7): 1773 (2023)
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/staeors/ZhangZCCWGD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/staeors/ZhangZCCWGD23
Tong Zhang, Yin Zhuang, He Chen, Liang Chen, Guanqun Wang, Peng Gao, Hao Dong:
Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 16: 5013-5025 (2023)
2022
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/remotesensing/LiZDGDCCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/remotesensing/LiZDGDCCL22
Jianhao Li, Yin Zhuang, Shan Dong, Peng Gao, Hao Dong, He Chen, Liang Chen, Lianlin Li:
Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery. Remote. Sens. 14(7): 1767 (2022)
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/remotesensing/ZhangGDZWZC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/remotesensing/ZhangGDZWZC22
Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang, He Chen:
Consecutive Pre-Training: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain. Remote. Sens. 14(22): 5675 (2022)
2021
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/bib/LiWQCYYGXS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/bib/LiWQCYYGXS21
Pengyong Li, Jun Wang, Yixuan Qiao, Hao Chen, Yihuan Yu, Xiaojun Yao, Peng Gao, Guotong Xie, Sen Song:
An effective self-supervised framework for learning expressive molecular global representations to drug discovery. Briefings Bioinform. 22(6) (2021)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijon/ZhangWHGX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijon/ZhangWHGX21
Cheng Zhang, Jun Wang, Jian He, Peng Gao, Guotong Xie:
Automated vertebral landmarks and spinal curvature estimation using non-directional part affinity fields. Neurocomputing 438: 280-289 (2021)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/YanZGCZ0Q0H024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YanZGCZ0Q0H024
Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Hao Dong, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. AAAI 2024: 6449-6457
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/MengSLGZQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/MengSLGZQL24
Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. ACL (Findings) 2024: 7775-7803
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/CaiJQGZLMWWYPFD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/CaiJQGZLMWWYPFD24
Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Liang Pan, Xiangyu Fan, Han Du, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu:
Digital Life Project: Autonomous 3D Characters with Social Intelligence. CVPR 2024: 582-592
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhuZHGLXF0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhuZHGLXF0G24
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation. CVPR 2024: 3838-3847
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HanGZ0ZL00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HanGZ0ZL00024
Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CVPR 2024: 26574-26585
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LinLZGQXQSCHHZHQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LinLZGQXQSCHHZHQL24
Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. ECCV (62) 2024: 36-55
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/XiaoZZGZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/XiaoZZGZL24
Han Xiao, Wenzhao Zheng, Sicheng Zuo, Peng Gao, Jie Zhou, Jiwen Lu:
SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding. ECCV (13) 2024: 37-54
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/ZhangJZLGQZLCQGL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/ZhangJZLGQZLCQGL24
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Yu Qiao, Peng Gao, Hongsheng Li:
MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? ECCV (8) 2024: 169-186
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/TangZLGZWGLWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/TangZLGZWGLWL24
Yiwen Tang, Ray Zhang, Jiaming Liu, Zoey Guo, Bin Zhao, Zhigang Wang, Peng Gao, Hongsheng Li, Dong Wang, Xuelong Li:
Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding. ECCV (36) 2024: 456-473
[c67]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/ZhaoZLWZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhaoZLWZ024
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. EMNLP (Findings) 2024: 10152-10163
[c66]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ShaoC0XZLZ00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ShaoC0XZLZ00024
Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. ICLR 2024
[c65]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/XuSCTZ0A0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XuSCTZ0A0024
Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. ICLR 2024
[c64]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Zhang0GYP000024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Zhang0GYP000024
Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. ICLR 2024
[c63]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZhangHLZL00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhangHLZL00024
Renrui Zhang, Jiaming Han, Chris Liu, Aojun Zhou, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention. ICLR 2024
[c62]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/0001ZCHLYHZ0GZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0001ZCHLYHZ0GZ24
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024
[c61]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangHW0C0YYLGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangHW0C0YYLGZ24
Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024
[c60]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LiuZQHLZGLJZSXH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuZQHLZGLJZSXH24
Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024
[c59]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LuZXZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LuZXZ0024
Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. ICML 2024
[c58]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YingMWLLYZZLLLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YingMWLLYZZLLLL24
Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. ICML 2024
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/CaiHCLGS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/CaiHCLGS024
Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong:
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill. ICRA 2024: 5228-5234
2023
[c56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XuLML000L23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuLML000L23
Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lu:
Resilient Binary Neural Network. AAAI 2023: 10620-10628
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/XuLLGGL023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/XuLLGGL023
Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CVPR 2023: 3842-3851
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangWWGLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangWWGLS23
Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CVPR 2023: 5344-5353
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangHLHDQGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangHLHDQGL23
Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li:
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangWQGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangWQGL23
Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders. CVPR 2023: 21769-21780
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/XueGLQSLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/XueGLQSLL23
Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CVPR 2023: 22732-22741
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWTGFX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWTGFX23
Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie:
Filter Pruning Via Filters Similarity in Consecutive Layers. ICASSP 2023: 1-5
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ZhuZHZWZ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhuZHZWZ023
Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. ICCV 2023: 2605-2615
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ZhuZHGZQZG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhuZHGZQZG23
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao:
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning. ICCV 2023: 2639-2650
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/Zhang0WGCQLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/Zhang0WGCQLG23
Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao:
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ZhouLQLPZZGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhouLQLPZZGL23
Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li:
SparseMAE: Sparse Training Meets Masked Autoencoders. ICCV 2023: 16130-16140
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/igarss/CuiZDZGCC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/igarss/CuiZDZGCC23
Yongjing Cui, Yin Zhuang, Shan Dong, Xinyi Zhang, Peng Gao, He Chen, Liang Chen:
Hybrid Transformer Network for Change Detection Under Self-Supervised Pretraining. IGARSS 2023: 6652-6655
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangZSLLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangZSLLG23
Siyuan Huang, Bo Zhang, Botian Shi, Hongsheng Li, Yikang Li, Peng Gao:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. ACM Multimedia 2023: 8644-8652
2022
[c43]
- view
  - electronic edition @ mpg.de
  - details & citations
- export record
  dblp key:
  - conf/bmvc/CuiL0SG00H22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/CuiL0SG00H22
Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure Correction. BMVC 2022: 238
[c42]
- view
  - electronic edition @ mpg.de
  - details & citations
- export record
  dblp key:
  - conf/bmvc/MaGWX00G022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/MaGWX00G022
Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangGZLM0QG022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangGZLM0QG022
Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/XuLWMZGQLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/XuLWMZGQLG22
Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. ECCV (24) 2022: 19-35
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/JiangLYGWTW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/JiangLYGWTW22
Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang:
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation. ECCV (34) 2022: 36-54
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/XuLZMZCGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/XuLZMZCGL22
Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü:
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors. ECCV (11) 2022: 346-361
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/LinGZGMWDQL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LinGZGMWDQL22
Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/ZhangZFGLDQL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/ZhangZFGLDQL22
Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShahGGCHMRH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShahGGCHMRH22
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning. ICASSP 2022: 7732-7736
[c34]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Li00S00022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Li00S00022
Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/igarss/LiuZ0GWZCCL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/igarss/LiuZ0GWZCCL22
Shanjunyu Liu, Yin Zhuang, Hao Dong, Peng Gao, Guanqun Wang, Tong Zhang, Liang Chen, He Chen, Lianlin Li:
Adaptive Local Context Embedding for Small Vehicle Detection from Aerial Optical Remote Sensing Images. IGARSS 2022: 1712-1715
[c32]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/GaoMLLDQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GaoMLLDQ22
Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022
[c31]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiX000G22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiX000G22
Yanjing Li, Sheng Xu, Baochang Zhang, Xianbin Cao, Peng Gao, Guodong Guo:
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer. NeurIPS 2022
[c30]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangG0FZW0022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangG0FZW0022
Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/semeval/HouWQJGXLWJWX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/semeval/HouWQJGXLWJWX22
Changyu Hou, Jun Wang, Yixuan Qiao, Peng Jiang, Peng Gao, Guotong Xie, Qizhi Lin, Xiaopeng Wang, Xiandi Jiang, Benqi Wang, Qifeng Xiao:
SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models. SemEval@NAACL 2022: 1593-1596
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/semweb/WangLHTQFLGX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/semweb/WangLHTQFLGX22
Jun Wang, Weixun Li, Changyu Hou, Xin Tang, Yixuan Qiao, Rui Fang, Pengyong Li, Peng Gao, Guotong Xie:
HCL: Improving Graph Representation with Hierarchical Contrastive Learning. ISWC 2022: 108-124
2021
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/GengGCHRZLC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GengGCHRZLC21
Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian:
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers. AAAI 2021: 1415-1423
[c26]
- view
  - electronic edition @ bmvc2021-virtualconference.com (open access)
  - details & citations
- export record
  dblp key:
  - conf/bmvc/Zheng0ZL0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/Zheng0ZL0021
Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. BMVC 2021: 226
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/0007Z0D021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/0007Z0D021
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. ICCV 2021: 3601-3610
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/LiWLQLMGSX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LiWLQLMGSX21
Pengyong Li, Jun Wang, Ziliang Li, Yixuan Qiao, Xianggen Liu, Fei Ma, Peng Gao, Sen Song, Guotong Xie:
Pairwise Half-graph Discrimination: A Simple Graph-level Self-supervised Strategy for Pre-training Graph Neural Networks. IJCAI 2021: 2694-2700
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ShiSGGFMCS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ShiSGGFMCS21
Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su:
Dense Contrastive Visual-Linguistic Pretraining. ACM Multimedia 2021: 5203-5212
[c22]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/GaoLLMK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GaoLLMK21
Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Networks. NeurIPS 2021: 19160-19171
[c21]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MaoGZZMPDZH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MaoGZZMPDZH21
Mingyuan Mao, Peng Gao, Renrui Zhang, Honghui Zheng, Teli Ma, Yan Peng, Errui Ding, Baochang Zhang, Shumin Han:
Dual-stream Network for Visual Recognition. NeurIPS 2021: 25346-25358
[c20]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/trec/QiaoC0GXLYX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/trec/QiaoC0GXLYX21
Yixuan Qiao, Hao Chen, Tuozhen Liu, Xianbin Ye, Jun Wang, Peng Gao, Guotong Xie:
PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stageRankingtrack: DL. TREC 2021
2020
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiLZGX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiLZGX20
Ge Li, Changsheng Li, Chan Zeng, Peng Gao, Guotong Xie:
Region Focus Network for Joint Optic Disc and Cup Segmentation. AAAI 2020: 751-758
[c18]
- view
  - electronic edition @ bmvc2020-conference.com (open access)
  - details & citations
- export record
  dblp key:
  - conf/bmvc/WangWYCZGXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/WangWYCZGXL20
Jun Wang, Shaoguo Wen, Jianghua Yu, Kaixing Chen, Xin Zhou, Peng Gao, Guotong Xie, Changsheng Li:
Semi-supervised Active Learning for Instance Segmentation via Scoring Predictions. BMVC 2020
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/JiangLYLG0XP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/JiangLYLG0XP20
Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan:
Learning Where to Focus for Efficient Video Object Detection. ECCV (16) 2020: 18-34
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangSWYGX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangSWYGX20
Yijun Wang, Changzhi Sun, Yuanbin Wu, Junchi Yan, Peng Gao, Guotong Xie:
Pre-training Entity Relation Encoder with Intra-span and Inter-span Information. EMNLP (1) 2020: 1692-1705
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0002GSHLGS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0002GSHLGS20
Lei Shi, Shijie Geng, Kai Shuang, Chiori Hori, Songxiang Liu, Peng Gao, Sen Su:
Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering. ICASSP 2020: 4412-4416
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/ZhangZYGX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/ZhangZYGX20
Zhexi Zhang, Wei Zhu, Junchi Yan, Peng Gao, Guotong Xie:
Automatic Student Network Search for Knowledge Distillation. ICPR 2020: 2446-2453
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/isbi/YangZWXLGL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isbi/YangZWXLGL20
Suhui Yang, Xia Zhou, Jun Wang, Guotong Xie, Chuanfeng Lv, Peng Gao, Bin Lv:
Unsupervised Domain Adaptation for Cross-Device OCT Lesion Detection via Learning Adaptive Features. ISBI 2020: 1570-1573
[c12]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/trec/CaoQCGNX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/trec/CaoQCGNX20
Liyu Cao, Yixuan Qiao, Hao Chen, Peng Gao, Yuan Ni, Guo Tong Xie:
A Multiple Models Ensembling Method in TREC Deep Learning. TREC 2020
[c11]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/trec/QiaoCCCLWGNXCLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/trec/QiaoCCCLWGNXCLX20
Yixuan Qiao, Hao Chen, Liyu Cao, Liping Chen, Pengyong Li, Jun Wang, Peng Gao, Yuan Ni, Guotong Xie:
PASH at TREC 2020 Deep Learning Track: Dense Matching for Nested Ranking. TREC 2020
2019
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/JiangGGZXP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/JiangGGZXP19
Zhengkai Jiang, Peng Gao, Chaoxu Guo, Qian Zhang, Shiming Xiang, Chunhong Pan:
Video Object Detection with Locally-Weighted Deformable Neighbors. AAAI 2019: 8529-8536
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/GaoJYLHWL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/GaoJYLHWL19
Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven C. H. Hoi, Xiaogang Wang, Hongsheng Li:
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering. CVPR 2019: 6639-6648
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/GaoYZWL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/GaoYZWL19
Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-Modality Latent Interaction Network for Visual Question Answering. ICCV 2019: 5824-5834
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/miccai/GuoWYWGXLL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/miccai/GuoWYWGXLL19
Yan Guo, Kang Wang, Suhui Yang, Yue Wang, Peng Gao, Guotong Xie, Chuanfeng Lv, Bin Lv:
Structure-Aware Noise Reduction Generative Adversarial Network for Optical Coherence Tomography Image. OMIA@MICCAI 2019: 9-17
2018
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/GaoLLLLHW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/GaoLLLLHW18
Peng Gao, Hongsheng Li, Shuang Li, Pan Lu, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. ECCV (1) 2018: 485-501
2017
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icws/HuZDG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icws/HuZDG17
Guoqiang Hu, Xin Zhang, Ning Duan, Peng Gao:
Towards Reliable Online Services Analyzing Mobile Sensor Big Data. ICWS 2017: 849-852
2016
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/soli/MaZGDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/soli/MaZGDL16
Chunyang Ma, Xin Zhang, Peng Gao, Weishan Dong, Changsheng Li:
Space-map-matching-based candidate selection for GPS map matching. SOLI 2016: 77-82
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/soli/0001ZDGHDWZJMH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/soli/0001ZDGHDWZJMH16
Wei Sun, Jun Zhu, Ning Duan, Peng Gao, Guo Qiang Hu, Weishan Dong, Zhi Hu Wang, Xin Zhang, Peng Ji, Chunyang Ma, Jing Chang Huang:
Moving object map analytics: A framework enabling contextual spatial-temporal analytics of Internet of Things applications. SOLI 2016: 101-106
2014
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cikm/DongYMLSWWGY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cikm/DongYMLSWWGY14
Weishan Dong, Renjie Yao, Chunyang Ma, Changsheng Li, Lei Shi, Lu Wang, Yu Wang, Peng Gao, Junchi Yan:
Maximizing Multi-scale Spatial Statistical Discrepancy. CIKM 2014: 471-480
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mobserv/0008HDGDZ14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mobserv/0008HDGDZ14
Xin Zhang, Guoqiang Hu, Ning Duan, Peng Gao, Weishan Dong, Jun Zhu:
Scalable Mobile Data Streaming with Trajectory Preserving Partitioning. IEEE MS 2014: 16-23

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-02384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02384
Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. CoRR abs/2401.02384 (2024)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-03327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03327
Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang:
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models. CoRR abs/2402.03327 (2024)
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05935
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05935
Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16570
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16570
Peng Gao, Xiao Liu, Yu Wang, Ru-Yue Yuan:
Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking. CoRR abs/2402.16570 (2024)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16880
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16880
Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. CoRR abs/2402.16880 (2024)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-11289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11289
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong:
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models. CoRR abs/2403.11289 (2024)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-14624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-14624
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li:
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? CoRR abs/2403.14624 (2024)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-20271
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-20271
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-04050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-04050
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao:
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation. CoRR abs/2404.04050 (2024)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-16006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-16006
Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. CoRR abs/2404.16006 (2024)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-04883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-04883
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-05945
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-05945
Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14854
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14854
Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li:
TerDiT: Ternary Diffusion Models with Transformers. CoRR abs/2405.14854 (2024)
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16057
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16057
Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. CoRR abs/2405.16057 (2024)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18407
Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang:
Phased Consistency Model. CoRR abs/2405.18407 (2024)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07549
Siyuan Huang, Haonan Chang, Yuhan Liu, Yimeng Zhu, Hao Dong, Peng Gao, Abdeslam Boularias, Hongsheng Li:
A3VLM: Actionable Articulation-Aware Vision Language Model. CoRR abs/2406.07549 (2024)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18583
Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-07667
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-07667
Jingwen He, Tianfan Xue, Dongyang Liu, Xinqi Lin, Peng Gao, Dahua Lin, Yu Qiao, Wanli Ouyang, Ziwei Liu:
VEnhancer: Generative Space-Time Enhancement for Video Generation. CoRR abs/2407.07667 (2024)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-08739
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-08739
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-11062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-11062
Mengzhao Chen, Wenqi Shao, Peng Xu, Jiahao Wang, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models. CoRR abs/2407.11062 (2024)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-17490
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-17490
Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Dingyu Zhang, Peng Gao, Shuai Ren, Hongsheng Li:
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents. CoRR abs/2407.17490 (2024)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-02657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02657
Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. CoRR abs/2408.02657 (2024)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16768
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16768
Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Chengzhuo Tong, Peng Gao, Chunyuan Li, Pheng-Ann Heng:
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners. CoRR abs/2408.16768 (2024)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-12959
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12959
Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanmin Wu, Jiayi Lei, Pengshuo Qiu, Pan Lu, Zehui Chen, Guanglu Song, Peng Gao, Yu Liu, Chunyuan Li, Hongsheng Li:
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines. CoRR abs/2409.12959 (2024)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15278
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15278
Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xi, Yu Qiao, Peng Gao, Hongsheng Li:
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions. CoRR abs/2409.15278 (2024)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18082
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18082
Xin Li, Siyuan Huang, Qiaojun Yu, Zhengkai Jiang, Ce Hao, Yimeng Zhu, Hongsheng Li, Peng Gao, Cewu Lu:
SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation. CoRR abs/2409.18082 (2024)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-20551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-20551
Qiaojun Yu, Siyuan Huang, Xibin Yuan, Zhengkai Jiang, Ce Hao, Xin Li, Haonan Chang, Junbo Wang, Liu Liu, Hongsheng Li, Peng Gao, Cewu Lu:
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models. CoRR abs/2409.20551 (2024)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-00363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-00363
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. CoRR abs/2410.00363 (2024)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-07536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-07536
Ruoyi Du, Dongyang Liu, Le Zhuo, Qin Qi, Hongsheng Li, Zhanyu Ma, Peng Gao:
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow. CoRR abs/2410.07536 (2024)
2023
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-00956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-00956
Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lv:
Resilient Binary Neural Network. CoRR abs/2302.00956 (2023)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01503
Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-02151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-02151
Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-05475
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-05475
Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08134
Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16199
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16199
Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-00253
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-00253
Sheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lu, Baochang Zhang:
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer. CoRR abs/2304.00253 (2023)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-01195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-01195
Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao:
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement. CoRR abs/2304.01195 (2023)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-13397
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-13397
Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie:
Filter Pruning via Filters Similarity in Consecutive Layers. CoRR abs/2304.13397 (2023)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-15010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-15010
Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-03048
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-03048
Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09160
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09160
Siyuan Huang, Bo Zhang, Botian Shi, Peng Gao, Yikang Li, Hongsheng Li:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. CoRR abs/2305.09160 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-11176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-11176
Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16318
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16318
Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao:
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. CoRR abs/2305.16318 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09265
Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-03729
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-03729
Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-12961
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-12961
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao:
Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks. CoRR abs/2308.12961 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-13137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-13137
Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-00615
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00615
Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng:
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following. CoRR abs/2309.00615 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-03905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-03905
Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao:
ImageBind-LLM: Multi-modality Instruction Tuning. CoRR abs/2309.03905 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10309
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10309
Wenzhe Cai, Siyuan Huang, Guangran Cheng, Yuxing Long, Peng Gao, Changyin Sun, Hao Dong:
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill. CoRR abs/2309.10309 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06311
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06311
Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris N. Metaxas:
Improving Compositional Text-to-image Generation with Large Vision-Language Models. CoRR abs/2310.06311 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-07575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-07575
Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao:
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-17963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17963
Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo:
ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model. CoRR abs/2311.17963 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03700
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03700
Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CoRR abs/2312.03700 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04547
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04547
Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu:
Digital Life Project: Autonomous 3D Characters with Social Intelligence. CoRR abs/2312.04547 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09738
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09738
Dingning Liu, Xiaomeng Dong, Renrui Zhang, Xu Luo, Peng Gao, Xiaoshui Huang, Yongshun Gong, Zhihui Wang:
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V. CoRR abs/2312.09738 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-12436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12436
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun:
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-14074
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14074
Senqiao Yang, Jiaming Liu, Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang:
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding. CoRR abs/2312.14074 (2023)
2022
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-02314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-02314
Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Peng Gao, Zenghui Zhang, Tatsuya Harada:
RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images. CoRR abs/2201.02314 (2022)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-04676
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-04676
Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning. CoRR abs/2201.04676 (2022)
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-08050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-08050
Sheng Xu, Yanjing Li, Teli Ma, Bohan Zeng, Baochang Zhang, Peng Gao, Jinhu Lv:
TerViT: An Efficient Ternary Vision Transformer. CoRR abs/2201.08050 (2022)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-09450
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-09450
Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-attention for Visual Recognition. CoRR abs/2201.09450 (2022)
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-04241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04241
Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang:
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-00836
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00836
Xianbin Ye, Ziliang Li, Fei Ma, Zongbi Yi, Pengyong Li, Jun Wang, Peng Gao, Yixuan Qiao, Guotong Xie:
CandidateDrug4Cancer: An Open Molecular Graph Learning Benchmark on Drug Discovery for Cancer. CoRR abs/2203.00836 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-13310
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-13310
Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li:
MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00989
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00989
Kexue Fu, Peng Gao, Shaolei Liu, Renrui Zhang, Yu Qiao, Manning Wang:
POS-BERT: Point Cloud One-Stage BERT Pre-Training. CoRR abs/2204.00989 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-03892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-03892
Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-11245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11245
Yixuan Qiao, Hao Chen, Yongquan Lai, Jun Wang, Tuozhen Liu, Xianbin Ye, Rui Fang, Peng Gao, Wenfeng Xie, Guotong Xie:
PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking. CoRR abs/2205.11245 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14401
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14401
Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14660
Changyu Hou, Jun Wang, Yixuan Qiao, Peng Jiang, Peng Gao, Guotong Xie, Qizhi Lin, Xiaopeng Wang, Xiandi Jiang, Benqi Wang, Qifeng Xiao:
SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models. CoRR abs/2205.14660 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14871
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14871
Ziteng Cui, Kunchang Li, Lin Gu, Shenghan Su, Peng Gao, Zhengkai Jiang, Yu Qiao, Tatsuya Harada:
Illumination Adaptive Transformer. CoRR abs/2205.14871 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-03860
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-03860
Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang, He Chen:
Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain. CoRR abs/2207.03860 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-06654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-06654
Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang:
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation. CoRR abs/2207.06654 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-09519
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-09519
Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-03550
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-03550
Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-01542
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-01542
Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lv, Guodong Guo:
Recurrent Bilinear Optimization for Binary Neural Networks. CoRR abs/2209.01542 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-12255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-12255
Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao:
Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-03477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-03477
Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lv:
IDa-Det: An Information Discrepancy-aware Distillation for 1-bit Detectors. CoRR abs/2210.03477 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-06707
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-06707
Yanjing Li, Sheng Xu, Baochang Zhang, Xianbin Cao, Peng Gao, Guodong Guo:
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer. CoRR abs/2210.06707 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-12020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12020
Jun Wang, Weixun Li, Changyu Hou, Xin Tang, Yixuan Qiao, Rui Fang, Pengyong Li, Peng Gao, Guotong Xie:
HCL: Improving Graph Representation with Hierarchical Contrastive Learning. CoRR abs/2210.12020 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08887
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08887
Hongwei Xue, Peng Gao, Hongyang Li, Yu Qiao, Hao Sun, Houqiang Li, Jiebo Luo:
Stare at What You See: Masked Image Modeling without Reconstruction. CoRR abs/2211.08887 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-11682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11682
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyao Zeng, Shanghang Zhang, Peng Gao:
PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning. CoRR abs/2211.11682 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-06785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-06785
Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022)
2021
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-07448
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-07448
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2101.07448 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-09755
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-09755
Shijie Geng, Peng Gao, Zuohui Fu, Yongfeng Zhang:
RomeBERT: Robust Training of Multi-Exit BERT. CoRR abs/2101.09755 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-14734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-14734
Mingyuan Mao, Renrui Zhang, Honghui Zheng, Peng Gao, Teli Ma, Yan Peng, Errui Ding, Shumin Han:
Dual-stream Network for Visual Recognition. CoRR abs/2105.14734 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-01401
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-01401
Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Network. CoRR abs/2106.01401 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-02242
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02242
Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03146
Teli Ma, Mingyuan Mao, Honghui Zheng, Peng Gao, Xiaodi Wang, Shumin Han, Errui Ding, Baochang Zhang, David S. Doermann:
Oriented Object Detection with Transformer. CoRR abs/2106.03146 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-15332
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-15332
Yixuan Qiao, Hao Chen, Jun Wang, Yihao Chen, Xianbin Ye, Ziliang Li, Xianbiao Qi, Peng Gao, Guotong Xie:
Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model. CoRR abs/2106.15332 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02404
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02404
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2108.02404 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-11778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-11778
Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su:
Dense Contrastive Visual-Linguistic Pretraining. CoRR abs/2109.11778 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04544
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04544
Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. CoRR abs/2110.04544 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06894
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning. CoRR abs/2110.06894 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13567
Pengyong Li, Jun Wang, Ziliang Li, Yixuan Qiao, Xianggen Liu, Fei Ma, Peng Gao, Seng Song, Guotong Xie:
Pairwise Half-graph Discrimination: A Simple Graph-level Self-supervised Strategy for Pre-training Graph Neural Networks. CoRR abs/2110.13567 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-03930
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-03930
Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-14745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-14745
Teli Ma, Shijie Geng, Mengmeng Wang, Jing Shao, Jiasen Lu, Hongsheng Li, Peng Gao, Yu Qiao:
A Simple Long-Tailed Recognition Baseline via Vision-Language Model. CoRR abs/2111.14745 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-02413
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-02413
Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-04744
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04744
Jun Wang, Zhoujing Li, Yixuan Qiao, Qiming Qin, Peng Gao, Guotong Xie:
Superpixel-Based Building Damage Detection from Post-earthquake Very High Resolution Imagery Using Deep Neural Networks. CoRR abs/2112.04744 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-12053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-12053
Liang Pan, Tong Wu, Zhongang Cai, Ziwei Liu, Xumin Yu, Yongming Rao, Jiwen Lu, Jie Zhou, Mingye Xu, Xiaoyuan Luo, Kexue Fu, Peng Gao, Manning Wang, Yali Wang, Yu Qiao, Junsheng Zhou, Xin Wen, Peng Xiang, Yu-Shen Liu, Zhizhong Han, Yuanjie Yan, Junyi An, Lifa Zhu, Changwei Lin, Dongrui Liu, Xin Li, Francisco Gómez Fernández, Qinlong Wang, Yang Yang:
Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results. CoRR abs/2112.12053 (2021)
2020
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-05840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-05840
Lei Shi, Shijie Geng, Kai Shuang, Chiori Hori, Songxiang Liu, Peng Gao, Sen Su:
Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering. CoRR abs/2001.05840 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08001
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08001
Keqi Wang, Peng Gao, Steven C. H. Hoi, Qian Guo, Yuhua Qian:
Extreme Low-Light Imaging with Multi-granulation Cooperative Networks. CoRR abs/2005.08001 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08646
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08646
Shijie Geng, Ji Zhang, Zuohui Fu, Peng Gao, Hang Zhang, Gerard de Melo:
Character Matters: Video Story Understanding with Character-Aware Relations. CoRR abs/2005.08646 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-03848
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-03848
Shijie Geng, Peng Gao, Chiori Hori, Jonathan Le Roux, Anoop Cherian:
Spatio-Temporal Scene Graphs for Video Dialog. CoRR abs/2007.03848 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-12942
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-12942
Peng Su, Shixiang Tang, Peng Gao, Di Qiu, Ni Zhao, Xiaogang Wang:
Gradient Regularized Contrastive Learning for Continual Domain Adaptation. CoRR abs/2007.12942 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13135
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13135
Lei Shi, Kai Shuang, Shijie Geng, Peng Su, Zhengkai Jiang, Peng Gao, Zuohui Fu, Gerard de Melo, Sen Su:
Contrastive Visual-Linguistic Pretraining. CoRR abs/2007.13135 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-11382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-11382
Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux:
Multi-Pass Transformer for Machine Translation. CoRR abs/2009.11382 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-09315
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-09315
Minghang Zheng, Peng Gao, Xiaogang Wang, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. CoRR abs/2011.09315 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-04829
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-04829
Jun Wang, Shaoguo Wen, Kaixing Chen, Jianghua Yu, Xin Zhou, Peng Gao, Changsheng Li, Guotong Xie:
Semi-supervised Active Learning for Instance Segmentation via Scoring Predictions. CoRR abs/2012.04829 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-11175
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-11175
Pengyong Li, Jun Wang, Yixuan Qiao, Hao Chen, Yihuan Yu, Xiaojun Yao, Peng Gao, Guotong Xie, Sen Song:
Learn molecular representations from large-scale unlabeled molecules for drug discovery. CoRR abs/2012.11175 (2020)
2019
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-04289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-04289
Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-modality Latent Interaction Network for Visual Question Answering. CoRR abs/1908.04289 (2019)
2018
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02632
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02632
Peng Gao, Pan Lu, Hongsheng Li, Shuang Li, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. CoRR abs/1808.02632 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-05252
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-05252
Peng Gao, Hongsheng Li, Haoxuan You, Zhengkai Jiang, Pan Lu, Steven C. H. Hoi, Xiaogang Wang:
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering. CoRR abs/1812.05252 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.