default search action
Xintao Wang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j8]Yanli Hou, Xilin Gai, Xintao Wang, Yongqiang Zhang:
SiamMFF: UAV Object Tracking Algorithm Based on Multi-Scale Feature Fusion. IEEE Access 12: 24725-24734 (2024) - [j7]Yihao Liu, Hengyuan Zhao, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Yu Qiao, Chao Dong:
Temporally consistent video colorization with deep feature propagation and self-regularization learning. Comput. Vis. Media 10(2): 375-395 (2024) - [c61]Yue Ma, Yingqing He, Xiaodong Cun, Xintao Wang, Siran Chen, Xiu Li, Qifeng Chen:
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos. AAAI 2024: 4117-4125 - [c60]Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, Ying Shan:
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models. AAAI 2024: 4296-4304 - [c59]Tao Wu, Xuewei Li, Zhongang Qi, Di Hu, Xintao Wang, Ying Shan, Xi Li:
SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model. AAAI 2024: 6126-6134 - [c58]Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao:
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews. ACL (1) 2024: 1840-1873 - [c57]Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao:
Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models. ACL (Findings) 2024: 13379-13389 - [c56]Yazhou Xing, Yingqing He, Zeyue Tian, Xintao Wang, Qifeng Chen:
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners. CVPR 2024: 7151-7161 - [c55]Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan:
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models. CVPR 2024: 7310-7320 - [c54]Yuchao Gu, Xintao Wang, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis. CVPR 2024: 7631-7640 - [c53]Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan:
SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models. CVPR 2024: 8362-8371 - [c52]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-Based Image Editing. CVPR 2024: 8488-8497 - [c51]Zhen Li, Mingdeng Cao, Xintao Wang, Zhongang Qi, Ming-Ming Cheng, Ying Shan:
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding. CVPR 2024: 8640-8650 - [c50]Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou:
X- Adapter: Universal Compatibility of Plugins for Upgraded Diffusion Model. CVPR 2024: 8775-8784 - [c49]Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond H. Chan, Ying Shan:
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models. CVPR 2024: 22139-22149 - [c48]Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong:
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. CVPR 2024: 25669-25680 - [c47]Xintao Wang, Rize Jin, Shibo Qi:
Reinforced Multi-teacher Knowledge Distillation for Unsupervised Sentence Representation. ICANN (7) 2024: 320-332 - [c46]Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan:
Making LLaMA SEE and Draw with SEED Tokenizer. ICLR 2024 - [c45]Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan:
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models. ICLR 2024 - [c44]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models. ICLR 2024 - [c43]Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu:
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling. ICLR 2024 - [c42]Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Unifying Image Processing as Visual Prompting Question Answering. ICML 2024 - [c41]Zhouxia Wang, Ziyang Yuan, Xintao Wang, Yaowei Li, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan:
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. SIGGRAPH (Conference Paper Track) 2024: 114 - [i100]Xintao Wang, Zhouhong Gu, Jiaqing Liang, Dakuan Lu, Yanghua Xiao, Wei Wang:
ConcEPT: Concept-Enhanced Pre-Training for Language Models. CoRR abs/2401.05669 (2024) - [i99]Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou:
Towards A Better Metric for Text-to-Video Generation. CoRR abs/2401.07781 (2024) - [i98]Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan:
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models. CoRR abs/2401.09047 (2024) - [i97]Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong:
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. CoRR abs/2401.13627 (2024) - [i96]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing. CoRR abs/2402.02583 (2024) - [i95]Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen:
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation. CoRR abs/2402.10491 (2024) - [i94]Yazhou Xing, Yingqing He, Zeyue Tian, Xintao Wang, Qifeng Chen:
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners. CoRR abs/2402.17723 (2024) - [i93]Xuan Ju, Xian Liu, Xintao Wang, Yuxuan Bian, Ying Shan, Qiang Xu:
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion. CoRR abs/2403.06976 (2024) - [i92]Tao Wu, Xuewei Li, Zhongang Qi, Di Hu, Xintao Wang, Ying Shan, Xi Li:
SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model. CoRR abs/2403.10044 (2024) - [i91]Xintao Wang, Jiangjie Chen, Nianqi Li, Lida Chen, Xinfeng Yuan, Wei Shi, Xuyang Ge, Rui Xu, Yanghua Xiao:
SurveyAgent: A Conversational System for Personalized and Efficient Research Survey. CoRR abs/2404.06364 (2024) - [i90]Jiale Xu, Weihao Cheng, Yiming Gao, Xintao Wang, Shenghua Gao, Ying Shan:
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models. CoRR abs/2404.07191 (2024) - [i89]Rui Xu, Xintao Wang, Jiangjie Chen, Siyu Yuan, Xinfeng Yuan, Jiaqing Liang, Zulong Chen, Xiaoqing Dong, Yanghua Xiao:
Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing? CoRR abs/2404.12138 (2024) - [i88]Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang:
Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works. CoRR abs/2404.12726 (2024) - [i87]Jiangjie Chen, Xintao Wang, Rui Xu, Siyu Yuan, Yikai Zhang, Wei Shi, Jian Xie, Shuang Li, Ruihan Yang, Tinghui Zhu, Aili Chen, Nianqi Li, Lida Chen, Caiyu Hu, Siye Wu, Scott Ren, Ziquan Fu, Yanghua Xiao:
From Persona to Personalization: A Survey on Role-Playing Language Agents. CoRR abs/2404.18231 (2024) - [i86]Chong Mou, Mingdeng Cao, Xintao Wang, Zhaoyang Zhang, Ying Shan, Jian Zhang:
ReVideo: Remake a Video with Motion and Content Control. CoRR abs/2405.13865 (2024) - [i85]Jinbo Xing, Hanyuan Liu, Menghan Xia, Yong Zhang, Xintao Wang, Ying Shan, Tien-Tsin Wong:
ToonCrafter: Generative Cartoon Interpolation. CoRR abs/2405.17933 (2024) - [i84]Muyao Niu, Xiaodong Cun, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng:
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. CoRR abs/2405.20222 (2024) - [i83]Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. CoRR abs/2406.04277 (2024) - [i82]Lida Chen, Zujie Liang, Xintao Wang, Jiaqing Liang, Yanghua Xiao, Feng Wei, Jinglei Chen, Zhenghong Hao, Bing Han, Wei Wang:
Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals. CoRR abs/2406.10881 (2024) - [i81]Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao:
Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models. CoRR abs/2406.10902 (2024) - [i80]Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Yuexian Zou, Ying Shan:
Image Conductor: Precision Control for Interactive Video Synthesis. CoRR abs/2406.15339 (2024) - [i79]Yiting Ran, Xintao Wang, Rui Xu, Xinfeng Yuan, Jiaqing Liang, Yanghua Xiao, Deqing Yang:
Capturing Minds, Not Just Words: Enhancing Role-Playing Language Models with Personality-Indicative Data. CoRR abs/2406.18921 (2024) - [i78]Yifei Zhang, Xintao Wang, Jiaqing Liang, Sirui Xia, Lida Chen, Yanghua Xiao:
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs. CoRR abs/2407.00653 (2024) - [i77]Sirui Xia, Xintao Wang, Jiaqing Liang, Yifei Zhang, Weikang Zhou, Jiaji Deng, Fei Yu, Yanghua Xiao:
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation. CoRR abs/2407.01796 (2024) - [i76]Rui Xu, Dakuan Lu, Xiaoyu Tan, Xintao Wang, Siyu Yuan, Jiangjie Chen, Wei Chu, Yinghui Xu:
MINDECHO: Role-Playing Language Agents for Key Opinion Leaders. CoRR abs/2407.05305 (2024) - [i75]Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan:
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions. CoRR abs/2407.06358 (2024) - [i74]Yuzhou Huang, Yiran Qin, Shunlin Lu, Xintao Wang, Rui Huang, Ying Shan, Ruimao Zhang:
Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models. CoRR abs/2408.11801 (2024) - [i73]Tao Wu, Yong Zhang, Xintao Wang, Xianpan Zhou, Guangcong Zheng, Zhongang Qi, Ying Shan, Xi Li:
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities. CoRR abs/2408.13239 (2024) - 2023
- [j6]Kelvin C. K. Chan, Xiangyu Xu, Xintao Wang, Jinwei Gu, Chen Change Loy:
GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3154-3168 (2023) - [j5]Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu:
Reference-Based Image and Video Super-Resolution via $C^{2}$-Matching. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8874-8887 (2023) - [j4]Xintao Wang, Jiaqing Liang, Yanghua Xiao, Wei Wang:
Prototypical Concept Representation. IEEE Trans. Knowl. Data Eng. 35(7): 7357-7370 (2023) - [c40]Lijian Lin, Xintao Wang, Zhongang Qi, Ying Shan:
Accelerating the Training of Video Super-resolution Models. AAAI 2023: 1595-1603 - [c39]Liangbin Xie, Xintao Wang, Shuwei Shi, Jinjin Gu, Chao Dong, Ying Shan:
Mitigating Artifacts in Real-World Video Super-resolution Models. AAAI 2023: 2956-2964 - [c38]Qianyu He, Xintao Wang, Jiaqing Liang, Yanghua Xiao:
MAPS-KB: A Million-Scale Probabilistic Simile Knowledge Base. AAAI 2023: 6398-6406 - [c37]Mingdeng Cao, Chong Mou, Fanghua Yu, Xintao Wang, Yinqiang Zheng, Jian Zhang, Chao Dong, Gen Li, Ying Shan, Radu Timofte, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Bin Chen, Haoyu Ma, Ming Cheng, Shijie Zhao, Wanwan Cui, Tianyu Xu, Chunyang Li, Long Bao, Heng Sun, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Renlong Wu, Yi Yang, Zhilu Zhang, Shuohao Zhang, Junyi Li, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qian Wang, Hao-Hsiang Yang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Zebin Zhang, Jiaqi Zhang, Yuhui Wang, Shuhao Cui, Junshi Huang, Li Zhu, Shuman Tian, Wei Yu, Bingchun Luo:
NTIRE 2023 Challenge on 360° Omnidirectional Image and Video Super-Resolution: Datasets, Methods and Results. CVPR Workshops 2023: 1731-1745 - [c36]Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong:
OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer. CVPR 2023: 13283-13292 - [c35]Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao:
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models. CVPR 2023: 20908-20918 - [c34]Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Activating More Pixels in Image Super-Resolution Transformer. CVPR 2023: 22367-22377 - [c33]Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Stan Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou:
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation. ICCV 2023: 7589-7599 - [c32]Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang, Ying Shan, Qifeng Chen:
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing. ICCV 2023: 15886-15896 - [c31]Mingdeng Cao, Xintao Wang, Zhongang Qi, Ying Shan, Xiaohu Qie, Yinqiang Zheng:
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing. ICCV 2023: 22503-22513 - [c30]Liangbin Xie, Xintao Wang, Xiangyu Chen, Gen Li, Ying Shan, Jiantao Zhou, Chao Dong:
DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models. ICML 2023: 38204-38226 - [c29]Yuchao Gu, Xintao Wang, Jay Zhangjie Wu, Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models. NeurIPS 2023 - [c28]Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng:
Inserting Anybody in Diffusion Models via Celeb Basis. NeurIPS 2023 - [c27]Yuan Gong, Youxin Pang, Xiaodong Cun, Menghan Xia, Yingqing He, Haoxin Chen, Longyue Wang, Yong Zhang, Xintao Wang, Ying Shan, Yujiu Yang:
Interactive Story Visualization with Multiple Characters. SIGGRAPH Asia 2023: 101:1-101:10 - [i72]Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong:
OSRT: Omnidirectional Image Super-Resolution with Distortion-aware Transformer. CoRR abs/2302.03453 (2023) - [i71]Chong Mou, Xintao Wang, Liangbin Xie, Jian Zhang, Zhongang Qi, Ying Shan, Xiaohu Qie:
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models. CoRR abs/2302.08453 (2023) - [i70]Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang, Ying Shan, Qifeng Chen:
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing. CoRR abs/2303.09535 (2023) - [i69]Yue Ma, Yingqing He, Xiaodong Cun, Xintao Wang, Ying Shan, Xiu Li, Qifeng Chen:
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos. CoRR abs/2304.01186 (2023) - [i68]Mingdeng Cao, Xintao Wang, Zhongang Qi, Ying Shan, Xiaohu Qie, Yinqiang Zheng:
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing. CoRR abs/2304.08465 (2023) - [i67]Yuan Gong, Youxin Pang, Xiaodong Cun, Menghan Xia, Yingqing He, Haoxin Chen, Longyue Wang, Yong Zhang, Xintao Wang, Ying Shan, Yujiu Yang:
TaleCrafter: Interactive Story Visualization with Multiple Characters. CoRR abs/2305.18247 (2023) - [i66]Yuchao Gu, Xintao Wang, Jay Zhangjie Wu, Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, Mike Zheng Shou:
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models. CoRR abs/2305.18292 (2023) - [i65]Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng:
Inserting Anybody in Diffusion Models via Celeb Basis. CoRR abs/2306.00926 (2023) - [i64]Jinbo Xing, Menghan Xia, Yuxin Liu, Yuechen Zhang, Yong Zhang, Yingqing He, Hanyuan Liu, Haoxin Chen, Xiaodong Cun, Xintao Wang, Ying Shan, Tien-Tsin Wong:
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance. CoRR abs/2306.00943 (2023) - [i63]Jiale Xu, Xintao Wang, Yan-Pei Cao, Weihao Cheng, Ying Shan, Shenghua Gao:
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions. CoRR abs/2306.07154 (2023) - [i62]Yunpeng Bai, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan:
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals. CoRR abs/2306.16934 (2023) - [i61]Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang:
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models. CoRR abs/2307.02421 (2023) - [i60]Liangbin Xie, Xintao Wang, Xiangyu Chen, Gen Li, Ying Shan, Jiantao Zhou, Chao Dong:
DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models. CoRR abs/2307.02457 (2023) - [i59]Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen:
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation. CoRR abs/2307.06940 (2023) - [i58]Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan:
Planting a SEED of Vision in Large Language Model. CoRR abs/2307.08041 (2023) - [i57]Fanghua Yu, Xintao Wang, Zheyuan Li, Yan-Pei Cao, Ying Shan, Chao Dong:
GET3D-: Learning GET3D from Unconstrained Image Collections. CoRR abs/2307.14918 (2023) - [i56]Xintao Wang, Qianwen Yang, Yongting Qiu, Jiaqing Liang, Qianyu He, Zhouhong Gu, Yanghua Xiao, Wei Wang:
KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases. CoRR abs/2308.11761 (2023) - [i55]Zhouxia Wang, Xintao Wang, Liangbin Xie, Zhongang Qi, Ying Shan, Wenping Wang, Ping Luo:
StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation. CoRR abs/2309.01770 (2023) - [i54]Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, Chao Dong:
HAT: Hybrid Attention Transformer for Image Restoration. CoRR abs/2309.05239 (2023) - [i53]Qianyu He, Jie Zeng, Wenhao Huang, Lina Chen, Jin Xiao, Qianxi He, Xunzhe Zhou, Lida Chen, Xintao Wang, Yuncheng Huang, Haoning Ye, Zihan Li, Shisong Chen, Yikai Zhang, Zhouhong Gu, Jiaqing Liang, Yanghua Xiao:
Can Large Language Models Understand Real-World Complex Instructions? CoRR abs/2309.09150 (2023) - [i52]Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan:
Making LLaMA SEE and Draw with SEED Tokenizer. CoRR abs/2310.01218 (2023) - [i51]Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan:
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models. CoRR abs/2310.07702 (2023) - [i50]Yihao Liu, Xiangyu Chen, Xianzheng Ma, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong:
Unifying Image Processing as Visual Prompting Question Answering. CoRR abs/2310.10513 (2023) - [i49]Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond H. Chan, Ying Shan:
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models. CoRR abs/2310.11440 (2023) - [i48]Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Xintao Wang, Tien-Tsin Wong, Ying Shan:
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors. CoRR abs/2310.12190 (2023) - [i47]Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu:
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling. CoRR abs/2310.15169 (2023) - [i46]Xintao Wang, Quan Tu, Yaying Fei, Ziang Leng, Cheng Li:
Does Role-Playing Chatbots Capture the Character Personalities? Assessing Personality Traits for Role-Playing Chatbots. CoRR abs/2310.17976 (2023) - [i45]Qun Zhao, Xintao Wang, Menghui Yang:
New Boolean satisfiability problem heuristic strategy: Minimal Positive Negative Product Strategy. CoRR abs/2310.18370 (2023) - [i44]Haoxin Chen, Menghan Xia, Yingqing He, Yong Zhang, Xiaodong Cun, Shaoshu Yang, Jinbo Xing, Yaofang Liu, Qifeng Chen, Xintao Wang, Chao Weng, Ying Shan:
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation. CoRR abs/2310.19512 (2023) - [i43]Ziyang Yuan, Mingdeng Cao, Xintao Wang, Zhongang Qi, Chun Yuan, Ying Shan:
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models. CoRR abs/2310.19784 (2023) - [i42]Yipei Xu, Dakuan Lu, Jiaqing Liang, Xintao Wang, Yipeng Geng, Yingsi Xin, Hengkui Wu, Ken Chen, ruiji zhang, Yanghua Xiao:
Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources. CoRR abs/2311.09732 (2023) - [i41]Gongye Liu, Menghan Xia, Yong Zhang, Haoxin Chen, Jinbo Xing, Xintao Wang, Yujiu Yang, Ying Shan:
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter. CoRR abs/2312.00330 (2023) - [i40]Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou:
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model. CoRR abs/2312.02238 (2023) - [i39]Yue Ma, Xiaodong Cun, Yingqing He, Chenyang Qi, Xintao Wang, Ying Shan, Xiu Li, Qifeng Chen:
MagicStick: Controllable Video Editing via Control Handle Transformations. CoRR abs/2312.03047 (2023) - [i38]Zhouxia Wang, Ziyang Yuan, Xintao Wang, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan:
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. CoRR abs/2312.03641 (2023) - [i37]Jiwen Yu, Xiaodong Cun, Chenyang Qi, Yong Zhang, Xintao Wang, Ying Shan, Jian Zhang:
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators. CoRR abs/2312.03793 (2023) - [i36]Zhen Li, Mingdeng Cao, Xintao Wang, Zhongang Qi, Ming-Ming Cheng, Ying Shan:
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding. CoRR abs/2312.04461 (2023) - [i35]