


Остановите войну!
for scientists:


default search action
Jingdong Wang 0001
Person information

- affiliation: Baidu, AI Group, Sunnyvale, CA, USA
- affiliation (former): Microsoft Research Asia, Beijing, China
- affiliation (PhD 2007): Hong Kong University of Science and Technology, Hong Kong
- affiliation (1997 - 2004): Tsinghua University, Department of Automation, Beijing, China
Other persons with the same name
- Jingdong Wang 0002
— Northeast Electric Power University, School of Computer Science, Jilin City, China
- Jingdong Wang 0003 — Beijing Institute of Technology, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j77]Yifan Liu
, Changyong Shu, Jingdong Wang
, Chunhua Shen
:
Structured Knowledge Distillation for Dense Prediction. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7035-7049 (2023) - [j76]Jiankai Sun
, Yan Xu
, Mingyu Ding
, Hongwei Yi, Chen Wang
, Jingdong Wang, Liangjun Zhang
, Mac Schwager
:
NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields. IEEE Robotics Autom. Lett. 8(8): 5244-5250 (2023) - [c168]Kaisiyuan Wang, Changcheng Liang, Hang Zhou, Jiaxiang Tang, Qianyi Wu, Dongliang He, Zhibin Hong, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
Robust Video Portrait Reenactment via Personalized Representation Quantization. AAAI 2023: 2564-2572 - [c167]Haixiao Yue, Keyao Wang, Guosheng Zhang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Cyclically Disentangled Feature Translation for Face Anti-spoofing. AAAI 2023: 3358-3366 - [c166]Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator. CVPR 2023: 1505-1515 - [c165]Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang:
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models. CVPR 2023: 6620-6630 - [c164]Sifan Long, Zhen Zhao, Jimin Pi, Shengsheng Wang, Jingdong Wang:
Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers. CVPR 2023: 10334-10343 - [c163]Wenhao Wu, Haipeng Luo, Bo Fang, Jingdong Wang, Wanli Ouyang:
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? CVPR 2023: 10704-10713 - [c162]Zhen Zhao, Lihe Yang, Sifan Long, Jimin Pi, Luping Zhou, Jingdong Wang:
Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation. CVPR 2023: 11350-11359 - [c161]Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang:
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection. CVPR 2023: 15579-15588 - [c160]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang:
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation with Progressive Video Transformers. CVPR 2023: 21254-21263 - [c159]Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai:
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection. CVPR 2023: 21570-21579 - [c158]Zhen Zhao, Sifan Long, Jimin Pi, Jingdong Wang, Luping Zhou:
Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation. CVPR 2023: 23705-23714 - [c157]Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers. CVPR 2023: 23809-23818 - [c156]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. ICDAR (2) 2023: 536-552 - [c155]Xiaohu Huang, Hao Zhou, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng:
Graph Contrastive Learning for Skeleton-based Action Recognition. ICLR 2023 - [c154]Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training. ICLR 2023 - [c153]Kaisiyuan Wang
, Hang Zhou
, Qianyi Wu
, Jiaxiang Tang
, Zhiliang Xu
, Borong Liang
, Tianshu Hu
, Errui Ding
, Jingtuo Liu
, Ziwei Liu
, Jingdong Wang
:
Efficient Video Portrait Reenactment via Grid-based Codebook. SIGGRAPH (Conference Paper Track) 2023: 66:1-66:9 - [i142]Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang:
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models. CoRR abs/2301.00182 (2023) - [i141]Wenhao Wu, Haipeng Luo, Bo Fang, Jingdong Wang, Wanli Ouyang:
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? CoRR abs/2301.00184 (2023) - [i140]Xiaohu Huang, Hao Zhou, Bin Feng, Xinggang Wang, Wenyu Liu, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
Graph Contrastive Learning for Skeleton-based Action Recognition. CoRR abs/2301.10900 (2023) - [i139]Jie Zhu, Jiyang Qi, Mingyu Ding, Xiaokang Chen, Ping Luo, Xinggang Wang, Wenyu Liu, Leye Wang, Jingdong Wang:
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning. CoRR abs/2301.11915 (2023) - [i138]Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Dongliang He, Jingtuo Liu, Errui Ding, Jingdong Wang, Shio Miyafuji, Ziwei Liu, Hideki Koike:
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation. CoRR abs/2302.06857 (2023) - [i137]Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training. CoRR abs/2303.00289 (2023) - [i136]Jiaxiang Tang, Hang Zhou, Xiaokang Chen, Tianshu Hu, Errui Ding, Jingdong Wang, Gang Zeng:
Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement. CoRR abs/2303.02091 (2023) - [i135]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang:
PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers. CoRR abs/2303.09187 (2023) - [i134]Kaixin Xiong, Shi Gong, Xiaoqing Ye, Xiao Tan, Ji Wan, Errui Ding, Jingdong Wang, Xiang Bai:
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection. CoRR abs/2303.10209 (2023) - [i133]Chang Liu, Weiming Zhang, Xiangru Lin, Wei Zhang, Xiao Tan, Junyu Han, Xiaomao Li, Errui Ding, Jingdong Wang:
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection. CoRR abs/2303.14960 (2023) - [i132]Yifu Zhang, Xinggang Wang, Xiaoqing Ye, Wei Zhang, Jincheng Lu, Xiao Tan, Errui Ding, Peize Sun, Jingdong Wang:
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box. CoRR abs/2303.15334 (2023) - [i131]Sifan Long, Zhen Zhao, Junkun Yuan, Zichang Tan, Jiangjiang Liu, Luping Zhou
, Shengsheng Wang, Jingdong Wang:
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models. CoRR abs/2303.17169 (2023) - [i130]Yanpeng Sun, Qiang Chen, Jian Wang, Jingdong Wang, Zechao Li:
Exploring Effective Factors for Improving Visual In-Context Learning. CoRR abs/2304.04748 (2023) - [i129]Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu, Jingdong Wang:
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator. CoRR abs/2305.05445 (2023) - [i128]Zhe Liu, Xiaoqing Ye, Zhikang Zou, Xinwei He, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai:
Multi-Modal 3D Object Detection by Box Matching. CoRR abs/2305.07713 (2023) - [i127]Jiang-Tian Zhai, Ze Feng, Jinhao Du, Yongqiang Mao, Jiang-Jiang Liu, Zichang Tan, Yifu Zhang, Xiaoqing Ye, Jingdong Wang:
Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes. CoRR abs/2305.10430 (2023) - [i126]Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai:
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images. CoRR abs/2306.03287 (2023) - [i125]Haiyang Xu, Zhichao Zhou, Dongliang He, Fu Li, Jingdong Wang:
Vision Transformer with Attention Map Hallucination and FFN Compaction. CoRR abs/2306.10875 (2023) - [i124]Zhongwei Qiu, Qiansheng Yang, Jian Wang, Xiyu Wang, Chang Xu, Dongmei Fu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation. CoRR abs/2306.17074 (2023) - [i123]Jiacheng Zhang, Xiangru Lin, Wei Zhang, Kuo Wang, Xiao Tan, Junyu Han, Errui Ding, Jingdong Wang, Guanbin Li:
Semi-DETR: Semi-Supervised Object Detection with Detection Transformers. CoRR abs/2307.08095 (2023) - [i122]Wenhao Wu, Yuxin Song, Zhun Sun, Jingdong Wang, Chang Xu, Wanli Ouyang:
What Can Simple Arithmetic Operations Do for Temporal Modeling? CoRR abs/2307.08908 (2023) - [i121]Lizhao Liu, Zhuangwei Zhuang, Shangxin Huang, Xunlong Xiao, Tianhang Xiang, Cen Chen, Jingdong Wang, Mingkui Tan:
CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation. CoRR abs/2307.10316 (2023) - [i120]Yiqun Chen, Qiang Chen, Peize Sun, Shoufa Chen, Jingdong Wang, Jian Cheng:
Enhancing Your Trained DETRs with Box Refinement. CoRR abs/2307.11828 (2023) - [i119]Jiazheng Xing, Mengmeng Wang, Xiaojun Hou, Guang Dai, Jingdong Wang, Yong Liu:
Multimodal Adaptation of CLIP for Few-Shot Action Recognition. CoRR abs/2308.01532 (2023) - [i118]Feng Chen, Jiajia Liu, Kaixiang Ji, Wang Ren, Jian Wang, Jingdong Wang:
Learning Implicit Entity-object Relations by Bidirectional Generative Alignment for Multimodal NER. CoRR abs/2308.02570 (2023) - [i117]Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang:
Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation. CoRR abs/2308.07313 (2023) - [i116]Jiazheng Xing, Mengmeng Wang, Yudi Ruan, Bofan Chen, Yaowei Guo, Boyu Mu, Guang Dai, Jingdong Wang, Yong Liu:
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching. CoRR abs/2308.09346 (2023) - [i115]Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang:
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation. CoRR abs/2308.10156 (2023) - [i114]Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang:
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation. CoRR abs/2309.00398 (2023) - [i113]Zhiyin Shao, Xinyu Zhang, Changxing Ding, Jian Wang, Jingdong Wang:
Unified Pre-training with Pseudo Texts for Text-To-Image Person Re-identification. CoRR abs/2309.01420 (2023) - [i112]Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang:
Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation. CoRR abs/2309.09667 (2023) - [i111]Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Jingdong Wang, Qinghua Zheng:
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement. CoRR abs/2309.11125 (2023) - 2022
- [j75]Jingdong Wang, Zhuowen Tu, Jianlong Fu
, Nicu Sebe, Serge J. Belongie
:
Guest Editorial: Introduction to the Special Section on Fine-Grained Visual Categorization. IEEE Trans. Pattern Anal. Mach. Intell. 44(2): 560-562 (2022) - [j74]Yan Huang
, Jingdong Wang
, Liang Wang:
Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory. IEEE Trans. Pattern Anal. Mach. Intell. 44(6): 2968-2983 (2022) - [j73]Jianming Ye
, Jingdong Wang
, Shiliang Zhang
:
Distillation-Guided Residual Learning for Binary Convolutional Neural Networks. IEEE Trans. Neural Networks Learn. Syst. 33(12): 7765-7777 (2022) - [c152]Borong Liang
, Yan Pan, Zhizhi Guo, Hang Zhou, Zhibin Hong, Xiaoguang Han, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Expressive Talking Head Generation with Granular Audio-Visual Control. CVPR 2022: 3377-3386 - [c151]Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval. CVPR 2022: 5174-5183 - [c150]Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang:
MixFormer: Mixing Features across Windows and Dimensions. CVPR 2022: 5239-5249 - [c149]Xinyu Zhang, Dongdong Li, Zhigang Wang, Jian Wang, Errui Ding, Javen Qinfeng Shi
, Zhaoxiang Zhang, Jingdong Wang:
Implicit Sample Extension for Unsupervised Person Re-Identification. CVPR 2022: 7359-7368 - [c148]Licheng Tang, Yiyang Cai, Jiaming Liu, Zhibin Hong, Mingming Gong, Minhu Fan, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Few-Shot Font Generation by Learning Fine-Grained Local Styles. CVPR 2022: 7885-7894 - [c147]Changyong Shu, Hemao Wu, Hang Zhou, Jiaming Liu, Zhibin Hong, Changxing Ding, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Few-Shot Head Swapping in the Wild. CVPR 2022: 10779-10788 - [c146]Desen Zhou, Zhichao Liu, Jian Wang, Leshan Wang, Tao Hu, Errui Ding, Jingdong Wang:
Human-Object Interaction Detection via Disentangled Transformer. CVPR 2022: 19546-19555 - [c145]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. ECCV (24) 2022: 74-92 - [c144]Shi Gong, Xiaoqing Ye, Xiao Tan, Jingdong Wang, Errui Ding, Yu Zhou, Xiang Bai:
GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation. ECCV (1) 2022: 396-411 - [c143]Yang Bai
, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang:
Action Quality Assessment with Temporal Parsing Transformer. ECCV (4) 2022: 422-438 - [c142]Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
UFO: Unified Feature Optimization. ECCV (26) 2022: 472-488 - [c141]Linfeng Li, Minyue Jiang, Yue Yu, Wei Zhang, Xiangru Lin, Yingying Li, Xiao Tan, Jingdong Wang, Errui Ding:
Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection. ECCV (30) 2022: 640-655 - [c140]Zhiliang Xu
, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
StyleSwap: Style-Based Generator Empowers Robust Face Swapping. ECCV (14) 2022: 661-677 - [c139]Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang:
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval. ECCV (36) 2022: 700-716 - [c138]Zhigang Wang, Jingdong Wang, Zhengqiu Huang, Lixin Ren:
Fatigue Life Evaluation of Rubber Tyred Gantry Crane based on Minner Criterion. ICCIR 2022: 675-679 - [c137]Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo:
Learning Versatile Neural Architectures by Propagating Network Codes. ICLR 2022 - [c136]Qi Han, Zejia Fan, Qi Dai, Lei Sun, Ming-Ming Cheng, Jiaying Liu, Jingdong Wang:
On the Connection between Local Attention and Dynamic Depth-wise Convolution. ICLR 2022 - [c135]Dongdong Li, Zhigang Wang, Jian Wang, Xinyu Zhang, Errui Ding, Jingdong Wang, Zhaoxiang Zhang:
Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification. IJCAI 2022: 1067-1073 - [c134]Bo Ju, Zhikang Zou, Xiaoqing Ye, Minyue Jiang, Xiao Tan, Errui Ding, Jingdong Wang:
Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network. ACM Multimedia 2022: 5639-5648 - [c133]Jian Wang, Chenhui Gou, Qiman Wu, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer. NeurIPS 2022 - [c132]Jiazhi Guan, Hang Zhou, Zhibin Hong, Errui Ding, Jingdong Wang, Chengbin Quan, Youjian Zhao:
Delving into Sequential Patches for Deepfake Detection. NeurIPS 2022 - [c131]Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong Wang:
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning. NeurIPS 2022 - [c130]Yasheng Sun
, Hang Zhou
, Kaisiyuan Wang
, Qianyi Wu
, Zhibin Hong
, Jingtuo Liu
, Errui Ding
, Jingdong Wang
, Ziwei Liu
, Hideki Koike
:
Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers. SIGGRAPH Asia 2022: 17:1-17:9 - [i110]Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang:
Context Autoencoder for Self-Supervised Representation Learning. CoRR abs/2202.03026 (2022) - [i109]Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang:
Efficient Video Segmentation Models with Per-frame Inference. CoRR abs/2202.12427 (2022) - [i108]Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval. CoRR abs/2203.16778 (2022) - [i107]Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang:
MixFormer: Mixing Features across Windows and Dimensions. CoRR abs/2204.02557 (2022) - [i106]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. CoRR abs/2204.03645 (2022) - [i105]Xinyu Zhang, Dongdong Li, Zhigang Wang, Jian Wang, Errui Ding, Javen Qinfeng Shi
, Zhaoxiang Zhang, Jingdong Wang:
Implicit Sample Extension for Unsupervised Person Re-Identification. CoRR abs/2204.06892 (2022) - [i104]Shi Gong, Xiaoqing Ye, Xiao Tan, Jingdong Wang, Errui Ding, Yu Zhou, Xiang Bai:
GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation. CoRR abs/2204.07733 (2022) - [i103]Desen Zhou, Zhichao Liu, Jian Wang, Leshan Wang, Tao Hu, Errui Ding, Jingdong Wang:
Human-Object Interaction Detection via Disentangled Transformer. CoRR abs/2204.09290 (2022) - [i102]Changyong Shu, Hemao Wu, Hang Zhou, Jiaming Liu, Zhibin Hong, Changxing Ding, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Few-Shot Head Swapping in the Wild. CoRR abs/2204.13100 (2022) - [i101]Harsha Vardhan Simhadri, George Williams, Martin Aumüller, Matthijs Douze, Artem Babenko, Dmitry Baranchuk, Qi Chen, Lucas Hosseini, Ravishankar Krishnaswamy, Gopal Srinivasa, Suhas Jayaram Subramanya, Jingdong Wang:
Results of the NeurIPS'21 Challenge on Billion-Scale Approximate Nearest Neighbor Search. CoRR abs/2205.03763 (2022) - [i100]Licheng Tang, Yiyang Cai, Jiaming Liu, Zhibin Hong, Mingming Gong, Minhu Fan, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
Few-Shot Font Generation by Learning Fine-Grained Local Styles. CoRR abs/2205.09965 (2022) - [i99]Pengyuan Lyu
, Chengquan Zhang, Shanshan Liu, Meina Qiao, Yangliu Xu, Liang Wu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining. CoRR abs/2206.00311 (2022) - [i98]Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong Wang:
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning. CoRR abs/2206.06122 (2022) - [i97]Jiazhi Guan, Hang Zhou, Zhibin Hong, Errui Ding, Jingdong Wang, Chengbin Quan, Youjian Zhao:
Delving into Sequential Patches for Deepfake Detection. CoRR abs/2207.02803 (2022) - [i96]Bo Ju, Zhikang Zou, Xiaoqing Ye, Minyue Jiang, Xiao Tan, Errui Ding, Jingdong Wang:
Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network. CoRR abs/2207.05497 (2022) - [i95]Yong Guo, Jingdong Wang, Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, Jian Chen, Mingkui Tan:
Towards Lightweight Super-Resolution with Dual Regression Learning. CoRR abs/2207.07929 (2022) - [i94]Xiaokang Chen, Fangyun Wei, Gang Zeng, Jingdong Wang:
Conditional DETR V2: Efficient Detection Transformer with Box Queries. CoRR abs/2207.08914 (2022) - [i93]Yang Bai, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang:
Action Quality Assessment with Temporal Parsing Transformer. CoRR abs/2207.09270 (2022) - [i92]Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
UFO: Unified Feature Optimization. CoRR abs/2207.10341 (2022) - [i91]Jiazhi Guan, Hang Zhou, Mingming Gong, Youjian Zhao, Errui Ding, Jingdong Wang:
Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption. CoRR abs/2207.10402 (2022) - [i90]Qiang Chen, Xiaokang Chen, Gang Zeng, Jingdong Wang:
Group DETR: Fast Training Convergence with Decoupled One-to-Many Label Assignment. CoRR abs/2207.13085 (2022) - [i89]Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang:
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval. CoRR abs/2208.09843 (2022) - [i88]Zengyuan Guo, Yuechen Yu, Pengyuan Lv
, Chengquan Zhang, Haojie Li, Zhihui Wang, Kun Yao, Jingtuo Liu, Jingdong Wang:
TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers. CoRR abs/2208.14687 (2022) - [i87]Jiankai Sun, Yan Xu, Mingyu Ding, Hongwei Yi, Jingdong Wang, Liangjun Zhang, Mac Schwager:
NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields. CoRR abs/2209.12068 (2022) - [i86]Zhiliang Xu, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang:
StyleSwap: Style-Based Generator Empowers Robust Face Swapping. CoRR abs/2209.13514 (2022) - [i85]Yuxin Song, Min Yang, Wenhao Wu, Dongliang He, Fu Li, Jingdong Wang:
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training. CoRR abs/2210.05234 (2022) - [i84]Jian Wang, Chenhui Gou, Qiman Wu, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang:
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer. CoRR abs/2210.07124 (2022) - [i83]Qiang Chen, Jian Wang, Chuchu Han, Shan Zhang, Zexian Li, Xiaokang Chen, Jiahui Chen, Xiaodi Wang, Shuming Han, Gang Zhang, Haocheng Feng, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining. CoRR abs/2211.03594 (2022) - [i82]Xinyu Zhang, Jiahui Chen, Junkun Yuan, Qiang Chen, Jian Wang, Xiaodi Wang, Shumin Han, Xiaokang Chen, Jimin Pi, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang:
CAE v2: Context Autoencoder with CLIP Target. CoRR abs/2211.09799 (2022) - [i81]