


default search action
Ming-Hsuan Yang 0001
Person information
- affiliation: University of California, Merced, Electrical Engineering and Computer Science, Merced, CA, USA
- affiliation (former): Honda Fundamental Research Labs, Mountain View, CA, USA
- affiliation (former, PhD 2000): University of Illinois at Urbana-Champaign, IL, USA
Other persons with the same name
- Ming-Hsuan Yang
- Ming-Hsuan Yang 0002 — Arizona University, Tucson, AZ, USA
- Ming-Hsuan Yang 0003 — Google Research, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
[j191]Jiangning Zhang
, Xuhai Chen, Yabiao Wang
, Chengjie Wang
, Yong Liu, Xiangtai Li
, Ming-Hsuan Yang, Dacheng Tao
:
Exploring plain ViT features for multi-class unsupervised visual anomaly detection. Comput. Vis. Image Underst. 253: 104308 (2025)
[j190]Tiantian Wang
, Xinxin Zuo
, Fangzhou Mu
, Jian Wang
, Ming-Hsuan Yang
:
Towards 4D human video stylization. Comput. Vis. Image Underst. 262: 104532 (2025)
[j189]Zhun Zhong, Hong Liu, Yin Cui, Shin'ichi Satoh, Nicu Sebe, Ming-Hsuan Yang:
Guest Editorial: Special Issue on Open-World Visual Recognition. Int. J. Comput. Vis. 133(2): 985-988 (2025)
[j188]Shuai Jia
, Chao Ma, Yibing Song, Xiaokang Yang, Ming-Hsuan Yang
:
Robust Deep Object Tracking against Adversarial Attacks. Int. J. Comput. Vis. 133(3): 1238-1257 (2025)
[j187]Shengfeng He, Lin Gao, Hongbo Fu, Varun Jampani, Lu Jiang, Ming-Hsuan Yang:
Guest Editorial: Special Issue on Large-Scale Generative Models for Content Creation and Manipulation. Int. J. Comput. Vis. 133(7): 4962-4965 (2025)
[j186]Chengzhuan Yang, Qian Yu, Hui Wei, Fei Wu, Yunliang Jiang, Zhonglong Zheng, Ming-Hsuan Yang
:
A Fast and Lightweight 3D Keypoint Detector. Int. J. Comput. Vis. 133(8): 5216-5237 (2025)
[j185]Runsheng Xu, Chia-Ju Chen, Zhengzhong Tu
, Ming-Hsuan Yang
:
V2X-ViTv2: Improved Vision Transformers for Vehicle-to-Everything Cooperative Perception. IEEE Trans. Pattern Anal. Mach. Intell. 47(1): 650-662 (2025)
[j184]Muhammad Awais
, Muzammal Naseer
, Salman Khan
, Rao Muhammad Anwer
, Hisham Cholakkal
, Mubarak Shah
, Ming-Hsuan Yang
, Fahad Shahbaz Khan
:
Foundation Models Defining a New Era in Vision: A Survey and Outlook. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2245-2264 (2025)
[j183]Yong Du
, Jiahui Zhan
, Xinzhe Li
, Junyu Dong
, Sheng Chen
, Ming-Hsuan Yang
, Shengfeng He
:
One-for-All: Towards Universal Domain Translation With a Single StyleGAN. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2865-2881 (2025)
[j182]Hao Zhou
, Lu Qi
, Tiancheng Shen, Hai Huang
, Xu Yang
, Xiangtai Li
, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 47(8): 6780-6796 (2025)
[j181]Akshay Dudhane
, Syed Waqas Zamir
, Salman Khan
, Fahad Shahbaz Khan
, Ming-Hsuan Yang
:
Burst Image Restoration and Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 9454-9467 (2025)
[j180]Xin Lin, Yuyan Zhou
, Jingtong Yue, Chao Ren
, Kelvin C. K. Chan
, Lu Qi
, Ming-Hsuan Yang
:
Re-Boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 9827-9844 (2025)
[j179]Liang Li
, Gaoxiang Cong
, Yuankai Qi
, Zheng-Jun Zha
, Qi Wu, Quan Z. Sheng
, Qingming Huang
, Ming-Hsuan Yang
:
Dubbing Movies via Hierarchical Phoneme Modeling and Acoustic Diffusion Denoising. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 10361-10377 (2025)
[j178]Chang Wan
, Ming-Hsuan Yang
, Minglu Li
, Yunliang Jiang
, Zhonglong Zheng
:
Nested Annealed Training Scheme for Generative Adversarial Networks. IEEE Trans. Circuits Syst. Video Technol. 35(1): 670-683 (2025)
[j177]Huajie Jiang
, Zhengxian Li, Yongli Hu
, Baocai Yin
, Jian Yang
, Anton van den Hengel
, Ming-Hsuan Yang
, Yuankai Qi
:
Dual Prototype Contrastive Network for Generalized Zero-Shot Learning. IEEE Trans. Circuits Syst. Video Technol. 35(2): 1111-1122 (2025)
[j176]Xin Lin, Jingtong Yue, Sixian Ding, Chao Ren
, Lu Qi, Ming-Hsuan Yang
:
Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark. IEEE Trans. Circuits Syst. Video Technol. 35(3): 2461-2473 (2025)
[j175]Pin-Hung Kuo
, Jinshan Pan
, Shao-Yi Chien
, Ming-Hsuan Yang
:
Efficient Non-Blind Image Deblurring With Discriminative Shrinkage Deep Networks. IEEE Trans. Circuits Syst. Video Technol. 35(9): 8545-8558 (2025)
[j174]Liyuan Chen
, Ming-Hsuan Yang, Jian Pu
, Zhonglong Zheng
:
TripleNet: Exploiting Complementary Features and Pseudo-Labels for Semi-Supervised Salient Object Detection. IEEE Trans. Image Process. 34: 5628-5641 (2025)
[j173]Chen Zhang
, Guorong Li
, Yuankai Qi
, Hanhua Ye
, Laiyun Qing
, Ming-Hsuan Yang
, Qingming Huang
:
Dynamic Erasing Network With Adaptive Temporal Modeling for Weakly Supervised Video Anomaly Detection. IEEE Trans. Neural Networks Learn. Syst. 36(9): 16706-16720 (2025)
[c421]Botao Ye, Sifei Liu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang:
Synthesizing Consistent Novel Views Via 3D Epipolar Attention Without Re-Training. 3DV 2025: 337-346
[c420]Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. 3DV 2025: 1177-1186
[c419]Tianlong Zhang, Zhe Xue, Adnan Mahmood
, Junping Du, Yuchen Dong, Shilong Ou, Lang Feng, Ming-Hsuan Yang, Yuankai Qi
:
Generating Synthetic Data for Unsupervised Federated Learning of Cross-Modal Retrieval. AAAI 2025: 22569-22577
[c418]Yuqian Fu, Xingyu Qiu, Bin Ren, Yanwei Fu, Radu Timofte, Nicu Sebe, Ming-Hsuan Yang, Luc Van Gool, Kaijin Zhang, Qingpeng Nong, Xiugang Dong, Hong Gao, Xiangsheng Zhou, Jiancheng Pan, Yanxing Liu, Xiao He, Jiahao Li, Yuze Sun, Xiaomeng Huang, Zhenyu Zhang, Ran Ma, Yuhan Liu, Zijian Zhuang, Shuai Yi, Yixiong Zou, Lingyi Hong, Mingxi Chen, Runze Li, Xingdong Sheng, Wenqiang Zhang, Weisen Chen, Yongxin Yan, Xinguo Chen, Yuanjie Shao, Zhengrong Zuo, Nong Sang, Hao Wu, Haoran Sun, Shuming Hu, Yan Zhang, Zhiguang Shi, Yu Zhang, Chao Chen, Tao Wang, Da Feng, Linhai Zhuo, Ziming Lin, Yali Huang, Jie Me, Yiming Yang, Mi Guo, Mingyuan Jiu, Mingliang Xu, Maomao Xiong, Qunshu Zhang, Xinyu Cao, Yuqing Yang, Dianmo Sheng, Xuanpu Zhao, Zhiyu Li, Xuyang Ding, Wenqian Li:
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results. CVPR Workshops 2025: 1048-1069
[c417]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Yuwei Fang, Kwot Sin Lee, Ivan Skorokhodov, Kfir Aberman, Jun-Yan Zhu, Ming-Hsuan Yang, Sergey Tulyakov:
Multi-subject Open-set Personalization in Video Generation. CVPR 2025: 6099-6110
[c416]Jinxiu Liu, Shaoheng Lin, Yinxiao Li, Ming-Hsuan Yang:
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes. CVPR 2025: 6144-6153
[c415]Lingshun Kong, Jiangxin Dong, Jinhui Tang, Ming-Hsuan Yang, Jinshan Pan:
Efficient Visual State Space Model for Image Deblurring. CVPR 2025: 12710-12719
[c414]Chanyoung Kim, Dayun Ju, Woojung Han, Ming-Hsuan Yang, Seong Jae Hwang:
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation. CVPR 2025: 15033-15042
[c413]I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Yuan-Chun Chiang, Sy-Yen Kuo, Ming-Hsuan Yang:
UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior. CVPR 2025: 17969-17979
[c412]Kyungmin Lee, Xiahong Li, Qifei Wang, Junfeng He, Junjie Ke, Ming-Hsuan Yang, Irfan Essa, Jinwoo Shin, Feng Yang, Yinxiao Li:
Calibrated Multi-Preference Optimization for Aligning Diffusion Models. CVPR 2025: 18465-18475
[c411]Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang, Difan Liu, Feng Liu, Ming-Hsuan Yang, Zhan Xu:
Move-in-2D: 2D-Conditioned Human Motion Generation. CVPR 2025: 22766-22775
[c410]Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CVPR 2025: 28963-28973
[c409]Seung Hyun Lee, Jijun Jiang, Yiran Xu, Zhuofang Li, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang:
Cropper: Vision-Language Model for Image Cropping through In-Context Learning. CVPR 2025: 30010-30019
[c408]Junyi Zhang, Charles Herrmann, Junhwa Hur, Varun Jampani, Trevor Darrell, Forrester Cole, Deqing Sun, Ming-Hsuan Yang:
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion. ICLR 2025
[c407]Xin Li, Deshui Miao, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
Learning Spatial-Semantic Features for Robust Video Object Segmentation. ICLR 2025
[c406]Lichang Chen, Hexiang Hu, Mingda Zhang, Yiwen Chen, Zifeng Wang, Yandong Li, Pranav Shyam, Tianyi Zhou, Heng Huang, Ming-Hsuan Yang, Boqing Gong:
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities. ICLR 2025
[c405]Xirui Li, Charles Herrmann, Kelvin C. K. Chan, Yinxiao Li, Deqing Sun, Chao Ma, Ming-Hsuan Yang:
A Simple Approach to Unifying Diffusion-based Conditional Generation. ICLR 2025
[c404]Xin Lin, Shi Luo, Xiaojun Shan, Xiaoyu Zhou, Chao Ren, Lu Qi, Ming-Hsuan Yang, Nuno Vasconcelos:
HQGS: High-Quality Novel View Synthesis with Gaussian Splatting in Degraded Scenes. ICLR 2025
[c403]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything. ICLR 2025
[c402]Botao Ye, Sifei Liu, Haofei Xu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang, Songyou Peng:
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images. ICLR 2025
[c401]Wei-Hsiang Yu, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Ranking-aware adapter for text-driven image ordering with CLIP. ICLR 2025
[c400]Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. ICLR 2025
[c399]Junwei Zhou, Xueting Li, Lu Qi, Ming-Hsuan Yang:
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint. ICLR 2025
[c398]Yoonseok Choi, Sunyoung Jung, Mohammed A. Al-masni, Ming-Hsuan Yang, Dong-Hyun Kim:
TESLA: Test-Time Reference-Free Through-Plane Super-Resolution for Multi-Contrast Brain MRI. MICCAI (13) 2025: 584-593
[c397]Tao Tu, Ming-Feng Li, Chieh Hubert Lin, Yen-Chi Cheng, Min Sun, Ming-Hsuan Yang:
DreaMo: Articulated 3D Reconstruction from a Single Casual Video. WACV 2025: 2269-2279
[c396]Hsin-Ping Huang, Yu-Chuan Su, Deqing Sun, Lu Jiang, Xuhui Jia, Yukun Zhu, Ming-Hsuan Yang:
Fine-grained Controllable Video Generation via Object Appearance and Context. WACV 2025: 3698-3708
[c395]Hsin-Ping Huang, Yu-Chuan Su, Ming-Hsuan Yang:
Generating Long-Take Videos via Effective Keyframes and Guidance. WACV 2025: 3709-3720
[c394]Siddharth Seth, Rishabh Dabral, Diogo C. Luvizon, Marc Habermann, Ming-Hsuan Yang, Christian Theobalt, Adam Kortylewski:
PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing. WACV 2025: 5197-5206
[c393]Abdelrahman M. Shaker, Syed Talal Wasim, Martin Danelljan, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Efficient Video Object Segmentation via Modulated Cross-Attention Memory. WACV 2025: 8681-8690
[i400]Zhaoliang Wan, Yonggen Ling, Senlin Yi, Lu Qi, Wangwei Lee, Minglei Lu, Sicheng Yang, Xiao Teng, Peng Lu, Xu Yang, Ming-Hsuan Yang, Hui Cheng:
VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception. CoRR abs/2501.00510 (2025)
[i399]Haobo Yuan, Xiangtai Li, Tao Zhang, Zilong Huang, Shilin Xu, Shunping Ji, Yunhai Tong, Lu Qi, Jiashi Feng, Ming-Hsuan Yang:
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos. CoRR abs/2501.04001 (2025)
[i398]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Yuwei Fang, Kwot Sin Lee, Ivan Skorokhodov, Kfir Aberman, Jun-Yan Zhu, Ming-Hsuan Yang, Sergey Tulyakov:
Multi-subject Open-set Personalization in Video Generation. CoRR abs/2501.06187 (2025)
[i397]Chang Wan, Ming-Hsuan Yang, Minglu Li, Yunliang Jiang, Zhonglong Zheng:
Nested Annealed Training Scheme for Generative Adversarial Networks. CoRR abs/2501.11318 (2025)
[i396]Yu-Chu Yu, Chieh Hubert Lin, Hsin-Ying Lee, Chaoyang Wang, Yu-Chiang Frank Wang, Ming-Hsuan Yang:
Towards Affordance-Aware Articulation Synthesis for Rigged Objects. CoRR abs/2501.12393 (2025)
[i395]I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Yuan-Chun Chiang, Sy-Yen Kuo, Ming-Hsuan Yang:
UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior. CoRR abs/2501.13134 (2025)
[i394]Junhao Cheng, Wei-Ting Chen, Xi Lu, Ming-Hsuan Yang:
BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled Learning. CoRR abs/2502.01522 (2025)
[i393]Kyungmin Lee, Xiaohang Li, Qifei Wang, Junfeng He, Junjie Ke, Ming-Hsuan Yang, Irfan Essa, Jinwoo Shin, Feng Yang, Yinxiao Li:
Calibrated Multi-Preference Optimization for Aligning Diffusion Models. CoRR abs/2502.02588 (2025)
[i392]Samyak Rawlekar, Yujun Cai
, Yiwei Wang, Ming-Hsuan Yang, Narendra Ahuja:
Disentangling CLIP Features for Enhanced Localized Understanding. CoRR abs/2502.02977 (2025)
[i391]Yu Qiu, Xin Lin, Jingbo Wang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
UMC: Unified Resilient Controller for Legged Robots with Joint Malfunctions. CoRR abs/2502.03035 (2025)
[i390]Wenhao You, Bryan Hooi, Yiwei Wang, Euijin Choo, Ming-Hsuan Yang, Junsong Yuan, Zi Huang, Yujun Cai
:
Lost in Edits? A λ-Compass for AIGC Provenance. CoRR abs/2502.04364 (2025)
[i389]Xiaoyu Zhou, Jingqi Wang, Yongtao Wang, Yufei Wei, Nan Dong, Ming-Hsuan Yang:
OccGS: Zero-shot 3D Occupancy Reconstruction with Semantic and Geometric-Aware Gaussian Splatting. CoRR abs/2502.04981 (2025)
[i388]Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. CoRR abs/2502.13071 (2025)
[i387]Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang
, Jun Tang, Humen Zhong, Yuanzhi Zhu, Ming-Hsuan Yang, Zhaohai Li, Jianqiang Wan, Pengfei Wang, Wei Ding, Zheren Fu, Yiheng Xu, Jiabo Ye, Xi Zhang, Tianbao Xie, Zesen Cheng, Hang Zhang, Zhibo Yang, Haiyang Xu, Junyang Lin:
Qwen2.5-VL Technical Report. CoRR abs/2502.13923 (2025)
[i386]Dengjie Li, Tiancheng Shen, Yao Zhou, Baisong Yang, Zhongying Liu, Masheng Yang, Bernard Ghanem, Yibo Yang, Yujie Zhong, Ming-Hsuan Yang:
Optimizing Singular Spectrum for Large Language Model Compression. CoRR abs/2502.15092 (2025)
[i385]Botao Ye, Sifei Liu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang:
Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training. CoRR abs/2502.18219 (2025)
[i384]Komal Kumar, Tajamul Ashraf, Omkar Thawakar, Rao Muhammad Anwer
, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H. S. Torr, Salman H. Khan, Fahad Shahbaz Khan:
LLM Post-Training: A Deep Dive into Reasoning Large Language Models. CoRR abs/2502.21321 (2025)
[i383]Yuheng Liu, Xinke Li
, Yuning Zhang, Lu Qi, Xin Li, Wenping Wang, Chongshou Li, Xueting Li, Ming-Hsuan Yang:
Controllable 3D Outdoor Scene Generation via Scene Graphs. CoRR abs/2503.07152 (2025)
[i382]Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CoRR abs/2503.09344 (2025)
[i381]Ming-Hsuan Yang, Zhi-An Huang, Zhihang Zheng, Yuqiao Liu, Shichen Zhang, Pengfei Zhang, Hui Xiong, Shaojun Tang:
HiCMamba: Enhancing Hi-C Resolution and Identifying 3D Genome Structures with State Space Modeling. CoRR abs/2503.10713 (2025)
[i380]Shuyang Hao, Yiwei Wang, Bryan Hooi, Ming-Hsuan Yang, Jun Liu, Chengcheng Tang, Zi Huang, Yujun Cai
:
Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense. CoRR abs/2503.11619 (2025)
[i379]Zhuo Tao, Liang Li, Qi Chen, Yunbin Tu, Zheng-Jun Zha, Ming-Hsuan Yang, Yuankai Qi, Qingming Huang:
Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization. CoRR abs/2503.17651 (2025)
[i378]Yawei Li
, Bin Ren, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Nicu Sebe, Ming-Hsuan Yang, Luca Benini:
Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration. CoRR abs/2503.17825 (2025)
[i377]Wenhao You, Bryan Hooi, Yiwei Wang, Youke Wang, Zong Ke, Ming-Hsuan Yang, Zi Huang, Yujun Cai
:
MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks. CoRR abs/2503.19134 (2025)
[i376]Hsin-Ying Lee, Kelvin C. K. Chan, Ming-Hsuan Yang:
Consistent Subject Generation via Contrastive Instantiated Concepts. CoRR abs/2503.24387 (2025)
[i375]Haobo Yuan, Tao Zhang, Xiangtai Li, Lu Qi, Zilong Huang, Shilin Xu, Jiashi Feng, Ming-Hsuan Yang:
4th PVUW MeViS 3rd Place Report: Sa2VA. CoRR abs/2504.00476 (2025)
[i374]Zhaochen Wang, Bryan Hooi, Yiwei Wang, Ming-Hsuan Yang, Zi Huang, Yujun Cai
:
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models. CoRR abs/2504.01589 (2025)
[i373]Yiran Xu, Siqi Xie, Zhuofang Li, Harris Shadmany, Yinxiao Li, Luciano Sbaiz, Miaosen Wang, Junjie Ke, José Lezama, Hang Qi, Han Zhang, Jesse Berent, Ming-Hsuan Yang, Irfan Essa, Jiabin Huang, Feng Yang:
HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations. CoRR abs/2504.03026 (2025)
[i372]Qi Mao, Lan Chen, Yuchao Gu, Mike Zheng Shou, Ming-Hsuan Yang:
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model. CoRR abs/2504.05594 (2025)
[i371]Yuqian Fu, Xingyu Qiu, Bin Ren, Yanwei Fu, Radu Timofte, Nicu Sebe, Ming-Hsuan Yang, Luc Van Gool, Kaijin Zhang, Qingpeng Nong, Xiugang Dong, Hong Gao, Xiangsheng Zhou, Jiancheng Pan, Yanxing Liu, Xiao He, Jiahao Li, Yuze Sun, Xiaomeng Huang, Zhenyu Zhang, Ran Ma, Yuhan Liu, Zijian Zhuang, Shuai Yi, Yixiong Zou, Lingyi Hong, Mingxi Chen, Runze Li, Xingdong Sheng, Wenqiang Zhang, Weisen Chen, Yongxin Yan, Xinguo Chen, Yuanjie Shao, Zhengrong Zuo, Nong Sang, Hao Wu, Haoran Sun, Shuming Hu, Yan Zhang, Zhiguang Shi, Yu Zhang, Chao Chen, Tao Wang, Da Feng, Linhai Zhuo, Ziming Lin, Yali Huang, Jie Me, Yiming Yang, Mi Guo, Mingyuan Jiu, Mingliang Xu, Maomao Xiong, Qunshu Zhang, Xinyu Cao, Yuqing Yang, Dianmo Sheng, Xuanpu Zhao, Zhiyu Li, Xuyang Ding, Wenqian Li:
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results. CoRR abs/2504.10685 (2025)
[i370]Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, Philip Torr, Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang, Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma, Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Chen, Wei Zhang, Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu, Haobo Yuan, Xiangtai Li, Tao Zhang, Lu Qi, Ming-Hsuan Yang:
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild. CoRR abs/2504.11326 (2025)
[i369]Mengshi Qi, Pengfei Zhu, Xiangtai Li, Xiaoyang Bi, Lu Qi, Huadong Ma, Ming-Hsuan Yang:
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency. CoRR abs/2504.12080 (2025)
[i368]Bin Ren, Eduard Zamfir, Zongwei Wu, Yawei Li, Yidi Li, Danda Pani Paudel, Radu Timofte, Ming-Hsuan Yang, Luc Van Gool, Nicu Sebe:
Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation. CoRR abs/2504.14249 (2025)
[i367]Yu-Ju Tsai, Brian L. Price, Qing Liu, Luis Figueroa, Daniil Pakhomov, Zhihong Ding, Scott Cohen, Ming-Hsuan Yang:
CompleteMe: Reference-based Human Image Completion. CoRR abs/2504.20042 (2025)
[i366]Fangling Jiang, Qi Li, Bing Liu, Weining Wang, Caifeng Shan, Zhenan Sun, Ming-Hsuan Yang:
Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection. CoRR abs/2505.03610 (2025)
[i365]Yuping Wang, Shuo Xing, Cui Can, Renjie Li, Hongyuan Hua, Kexin Tian, Zhaobin Mo, Xiangbo Gao, Keshu Wu, Sulong Zhou, Hengxu You, Juntong Peng, Junge Zhang, Zehao Wang, Rui Song, Mingxuan Yan, Walter Zimmer, Xingcheng Zhou, Peiran Li, Zhaohan Lu, Chia-Ju Chen, Yue Huang, Ryan A. Rossi, Lichao Sun, Hongkai Yu, Zhiwen Fan, Hao (Frank) Yang, Yuhao Kang, Ross Greer, Chenxi Liu, Eun Hak Lee, Xuan Di, Xinyue Ye, Liu Ren, Alois Knoll, Xiaopeng Li, Shuiwang Ji, Masayoshi Tomizuka, Marco Pavone, Laurence Tianruo Yang, Jing Du, Ming-Hsuan Yang, Hua Wei, Ziran Wang, Yang Zhou, Jiachen Li, Zhengzhong Tu:
Generative AI for Autonomous Driving: Frontiers and Opportunities. CoRR abs/2505.08854 (2025)
[i364]Yongliang Wu, Zonghui Li, Xinting Hu, Xinyu Ye, Xianfang Zeng, Gang Yu, Wenbo Zhu, Bernt Schiele, Ming-Hsuan Yang, Xu Yang:
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models. CoRR abs/2505.16707 (2025)
[i363]Bin Ren, Yawei Li, Xu Zheng, Yuqian Fu, Danda Pani Paudel, Ming-Hsuan Yang, Luc Van Gool, Nicu Sebe:
Manifold-aware Representation Learning for Degradation-agnostic Image Restoration. CoRR abs/2505.18679 (2025)
[i362]Guofeng Mei, Bin Ren, Juan Liu, Luigi Riz, Xiaoshui Huang, Xu Zheng, Yongshun Gong, Ming-Hsuan Yang, Nicu Sebe, Fabio Poiesi:
Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding. CoRR abs/2505.18819 (2025)
[i361]Zheng Chu, Huiming Fan, Jingchang Chen, Qianyu Wang, Ming-Hsuan Yang, Jiafeng Liang, Zhongjie Wang, Hao Li, Guo Tang, Ming Liu, Bing Qin:
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering. CoRR abs/2505.19112 (2025)
[i360]Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Ronald Clark, Ming-Hsuan Yang:
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation. CoRR abs/2506.03150 (2025)
[i359]Yuyang Wanyan, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Jiabo Ye, Yutong Kou, Ming-Hsuan Yang, Fei Huang, Xiaoshan Yang, Weiming Dong, Changsheng Xu:
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation. CoRR abs/2506.04614 (2025)
[i358]Hanoona Abdul Rasheed, Abdelrahman M. Shaker, Anqi Tang, Muhammad Maaz, Ming-Hsuan Yang, Salman H. Khan, Fahad Shahbaz Khan:
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos. CoRR abs/2506.05349 (2025)
[i357]Zhaoliang Wan, Zetong Bi, Zida Zhou, Hao Ren, Yiming Zeng, Yihan Li, Lu Qi, Xu Yang, Ming-Hsuan Yang, Hui Cheng:
RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy. CoRR abs/2506.07490 (2025)
[i356]Chieh Hubert Lin, Zhaoyang Lv, Songyin Wu, Zhen Xu, Thu Nguyen-Phuoc, Hung-Yu Tseng, Julian Straub, Numair Khan, Lei Xiao, Ming-Hsuan Yang, Yuheng Ren, Richard A. Newcombe, Zhao Dong, Zhengqin Li:
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos. CoRR abs/2506.09997 (2025)
[i355]Junqi You, Chieh Hubert Lin, Weijie Lyu, Zhengbo Zhang, Ming-Hsuan Yang:
InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model. CoRR abs/2506.10980 (2025)
[i354]Yibo Yang, Si Liu, Chuan Rao, Bang An, Tiancheng Shen, Philip H. S. Torr, Ming-Hsuan Yang, Bernard Ghanem:
Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence. CoRR abs/2506.13187 (2025)
[i353]Xiaoyuan Wang, Yizhou Zhao, Botao Ye, Xiaojun Shan, Weijie Lyu, Lu Qi, Kelvin C. K. Chan, Yinxiao Li, Ming-Hsuan Yang:
HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis. CoRR abs/2506.19291 (2025)
[i352]Junwei Zhou, Xueting Li, Lu Qi, Ming-Hsuan Yang:
CoCo4D: Comprehensive and Complex 4D Scene Generation. CoRR abs/2506.19798 (2025)
[i351]Chang Liu, Hongkai Chen, Yujun Cai
, Hang Wu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang:
Structured Attention Matters to Multimodal LLMs in Document Understanding. CoRR abs/2506.21600 (2025)
[i350]Hang Wu, Hongkai Chen, Yujun Cai
, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang:
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning. CoRR abs/2507.00008 (2025)
[i349]Chengxu Liu, Lu Qi, Jinshan Pan, Xueming Qian, Ming-Hsuan Yang:
Frequency Domain-Based Diffusion Model for Unpaired Image Dehazing. CoRR abs/2507.01275 (2025)
[i348]Qiguang Chen, Ming-Hsuan Yang, Libo Qin, Jinhao Liu, Zheng Yan, Jiannan Guan, Dengyun Peng, Yiyan Ji, Hanjing Li, Mengkang Hu, Yimeng Zhang, Yihao Liang, Yu Zhou, Jiaqi Wang, Zhi Chen, Wanxiang Che:
AI4Research: A Survey of Artificial Intelligence for Scientific Research. CoRR abs/2507.01903 (2025)
[i347]Yushen Zuo, Qi Zheng, Mingyang Wu, Xinrui Jiang, Renjie Li, Jian Wang, Yide Zhang, Gengchen Mai, Lihong V. Wang, James Zou, Xiaoyu Wang, Ming-Hsuan Yang, Zhengzhong Tu:
4KAgent: Agentic Any Image to 4K Super-Resolution. CoRR abs/2507.07105 (2025)
[i346]Chengxu Liu, Lu Qi, Jinshan Pan, Xueming Qian, Ming-Hsuan Yang:
Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model. CoRR abs/2507.13599 (2025)
[i345]Jindong Li, Yali Fu, Jiahong Liu, Linxiao Cao, Wei Ji, Menglin Yang, Irwin King, Ming-Hsuan Yang:
Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey. CoRR abs/2507.22920 (2025)
[i344]Yizhou Zhao, Haoyu Chen, Chunjiang Liu, Zhenyang Li, Charles Herrmann, Junhwa Hur, Yinxiao Li, Ming-Hsuan Yang, Bhiksha Raj, Min Xu:
MASIV: Toward Material-Agnostic System Identification from Videos. CoRR abs/2508.01112 (2025)
[i343]Shuangkang Fang, I-Chao Shen, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Shuchang Zhou, Wenrui Ding, Takeo Igarashi, Ming-Hsuan Yang:
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh. CoRR abs/2508.01242 (2025)
[i342]Jianbo Ma, Hui Luo, Qi Chen, Yuankai Qi, Yumei Sun, Amin Beheshti, Jianlin Zhang, Ming-Hsuan Yang:
Tracking the Unstable: Appearance-Guided Motion Modeling for Robust Multi-Object Tracking in UAV-Captured Videos. CoRR abs/2508.01730 (2025)
[i341]Yongliang Wu, Yizhou Zhou, Zhou Ziheng, Yingzhe Peng, Xinyu Ye, Xinting Hu, Wenbo Zhu, Lu Qi, Ming-Hsuan Yang, Xu Yang:
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification. CoRR abs/2508.05629 (2025)
[i340]Jixuan He, Chieh Hubert Lin, Lu Qi, Ming-Hsuan Yang:
Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video. CoRR abs/2508.06715 (2025)
[i339]Haonan Ge
, Yiwei Wang, Ming-Hsuan Yang, Yujun Cai
:
MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs. CoRR abs/2508.10264 (2025)
[i338]Mengyuan Liu, Xinshun Wang, Zhongbin Fang, Deheng Ye, Xia Li, Tao Tang, Songtao Wu, Xiangtai Li, Ming-Hsuan Yang:
Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning. CoRR abs/2508.10897 (2025)
[i337]Bowen Sun, Yujun Cai
, Ming-Hsuan Yang, Yiwei Wang:
Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding. CoRR abs/2508.19529 (2025)
[i336]Yajiao Xiong, Xiaoyu Zhou, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes. CoRR abs/2508.20965 (2025)
[i335]Xin Lin, Xian Ge, Dizhe Zhang, Zhaoliang Wan, Xianshun Wang, Xiangtai Li, Wenjie Jiang, Bo Du, Dacheng Tao, Ming-Hsuan Yang, Lu Qi:
One Flight Over the Gap: A Survey from Perspective to Panoramic Vision. CoRR abs/2509.04444 (2025)
[i334]Zhengxi Lu, Jiabo Ye, Fei Tang, Yongliang Shen, Haiyang Xu, Ziwei Zheng, Weiming Lu, Ming-Hsuan Yang, Fei Huang, Jun Xiao, Yueting Zhuang:
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning. CoRR abs/2509.11543 (2025)
[i333]Ling Lo, Kelvin C. K. Chan, Wen-Huang Cheng, Ming-Hsuan Yang:
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition. CoRR abs/2509.19690 (2025)
[i332]Hao Zhou, Yiyan Ma, Dan Fei, Weirong Liu, Zhengyu Zhang, Ming-Hsuan Yang, Guoyu Ma, Yunlong Lu, Ruisi He, Guoyu Wang, Cheng Li, Zhaohui Song, Bo Ai:
Delay-Doppler Domain Channel Measurements and Modeling in High-Speed Railways. CoRR abs/2509.25854 (2025)
[i331]Hang Wu, Yujun Cai, Haonan Ge, Hongkai Chen, Ming-Hsuan Yang, Yiwei Wang:
RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation. CoRR abs/2510.02423 (2025)
[i330]Qiguang Chen, Zheng Yan, Ming-Hsuan Yang, Libo Qin, Yixin Yuan, Hanjing Li, Jinhao Liu, Yiyan Ji, Dengyun Peng, Jiannan Guan, Mengkang Hu, Yantao Du, Wanxiang Che:
AutoPR: Let's Automate Your Academic Promotion! CoRR abs/2510.09558 (2025)
[i329]Shiqi Zhang, Xinbei Ma, Yunqing Xu, Zouying Cao, Pengrui Lu, Haobo Yuan, Tiancheng Shen, Zhuosheng Zhang, Hai Zhao, Ming-Hsuan Yang:
ParaCook: On Time-Efficient Planning for Multi-Agent Systems. CoRR abs/2510.11608 (2025)
[i328]Jungbin Cho, Minsu Kim, Jisoo Kim, Ce Zheng, László Attila Jeni, Ming-Hsuan Yang, Youngjae Yu, Seonjoo Kim:
SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion. CoRR abs/2510.13044 (2025)
[i327]Wenhao Wang, Longqi Cai, Taihong Xiao, Yuxiao Wang, Ming-Hsuan Yang:
Scaling Laws for Deepfake Detection. CoRR abs/2510.16320 (2025)
[i326]Jinbin Bai, Yu Lei, Hecong Wu, Yuchen Zhu, Shufan Li, Yi Xin, Xiangtai Li, Molei Tao, Aditya Grover, Ming-Hsuan Yang:
From Masks to Worlds: A Hitchhiker's Guide to World Models. CoRR abs/2510.20668 (2025)
[i325]Xiaoyu Zhou, Jingqi Wang, Yuang Jia, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
EA3D: Online Open-World 3D Object Extraction from Streaming Videos. CoRR abs/2510.25146 (2025)- 2024
[j172]Ling Yang
, Zhilong Zhang
, Yang Song
, Shenda Hong
, Runsheng Xu
, Yue Zhao
, Wentao Zhang
, Bin Cui
, Ming-Hsuan Yang
:
Diffusion Models: A Comprehensive Survey of Methods and Applications. ACM Comput. Surv. 56(4): 105:1-105:39 (2024)
[j171]Zhiwei Lin, Tingting Liang, Taihong Xiao
, Yongtao Wang, Ming-Hsuan Yang
:
FlowNAS: Neural Architecture Search for Optical Flow Estimation. Int. J. Comput. Vis. 132(4): 1055-1074 (2024)
[j170]Wenqi Ren, Senyou Deng, Kaihao Zhang, Fenglong Song, Xiaochun Cao, Ming-Hsuan Yang
:
Fast Ultra High-Definition Video Deblurring via Multi-scale Separable Network. Int. J. Comput. Vis. 132(5): 1817-1834 (2024)
[j169]Guorong Li
, Hanhua Ye
, Yuankai Qi
, Shuhui Wang
, Laiyun Qing
, Qingming Huang
, Ming-Hsuan Yang
:
Learning Hierarchical Modular Networks for Video Captioning. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 1049-1064 (2024)
[j168]Lu Zhang
, Lu Qi
, Xu Yang
, Hong Qiao
, Ming-Hsuan Yang
, Zhiyong Liu
:
Automatically Discovering Novel Visual Categories With Adaptive Prototype Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2533-2544 (2024)
[j167]Jun Luo
, Yunfeng Nie
, Wenqi Ren
, Xiaochun Cao
, Ming-Hsuan Yang
:
Correcting Optical Aberration via Depth-Aware Point Spread Functions. IEEE Trans. Pattern Anal. Mach. Intell. 46(8): 5541-5555 (2024)
[j166]Qi Li
, Weining Wang
, Chengzhong Xu
, Zhenan Sun
, Ming-Hsuan Yang
:
Learning Disentangled Representation for One-Shot Progressive Face Swapping. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 8348-8364 (2024)
[j165]Yue Han
, Jiangning Zhang
, Yabiao Wang
, Chengjie Wang
, Yong Liu
, Lu Qi
, Ming-Hsuan Yang
, Xiangtai Li
:
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9221-9238 (2024)
[j164]Lei Huang
, Yunhao Ni
, Xi Weng
, Rao Muhammad Anwer
, Salman Khan
, Ming-Hsuan Yang
, Fahad Khan
:
Understanding Whitening Loss in Self-Supervised Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9479-9492 (2024)
[j163]Xiangtai Li
, Shilin Xu
, Yibo Yang
, Haobo Yuan
, Guangliang Cheng
, Yunhai Tong
, Zhouchen Lin
, Ming-Hsuan Yang
, Dacheng Tao
:
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11087-11103 (2024)
[j162]Ziheng Yan
, Yuankai Qi
, Guorong Li
, Xinyan Liu
, Weigang Zhang
, Ming-Hsuan Yang
, Qingming Huang
:
Progressive Multi-Resolution Loss for Crowd Counting. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3232-3244 (2024)
[j161]Kaihao Zhang
, Tao Wang
, Wenhan Luo
, Wenqi Ren
, Björn Stenger, Wei Liu
, Hongdong Li
, Ming-Hsuan Yang
:
MC-Blur: A Comprehensive Benchmark for Image Deblurring. IEEE Trans. Circuits Syst. Video Technol. 34(5): 3755-3767 (2024)
[j160]Abdelrahman M. Shaker
, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan
, Ming-Hsuan Yang
, Fahad Shahbaz Khan
:
UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation. IEEE Trans. Medical Imaging 43(9): 3377-3390 (2024)
[j159]Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong:
VideoGLUE: Video General Understanding Evaluation of Foundation Models. Trans. Mach. Learn. Res. 2024 (2024)
[j158]Xin Li
, Wenjie Pei
, Yaowei Wang
, Zhenyu He
, Huchuan Lu
, Ming-Hsuan Yang
:
Self-Supervised Tracking via Target-Aware Data Synthesis. IEEE Trans. Neural Networks Learn. Syst. 35(7): 9186-9197 (2024)
[c392]Youming Deng, Xueting Li, Sifei Liu, Ming-Hsuan Yang:
Physics-based Indirect Illumination for Inverse Rendering. 3DV 2024: 1249-1258
[c391]Zhiwei Lin, Yongtao Wang, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios. AAAI 2024: 3531-3539
[c390]Hao Zhang, Fang Li, Lu Qi, Ming-Hsuan Yang, Narendra Ahuja:
CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen. AAAI 2024: 7078-7086
[c389]Gaoxiang Cong, Yuankai Qi
, Liang Li, Amin Beheshti
, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang:
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing. ACL (Findings) 2024: 6767-6779
[c388]Junyi Zhang, Charles Herrmann, Junhwa Hur, Eric Chen, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence. CVPR 2024: 3076-3085
[c387]Lu Qi, Lehan Yang, Weidong Guo, Yu Xu, Bo Du, Varun Jampani, Ming-Hsuan Yang:
UniGS: Unified Representation for Image Generation and Segmentation. CVPR 2024: 6305-6315
[c386]Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang:
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance. CVPR 2024: 6733-6742
[c385]Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Lu Jiang, Ming-Hsuan Yang:
Text-Driven Image Editing via Learnable Regions. CVPR 2024: 7059-7068
[c384]Xirui Li
, Chao Ma, Xiaokang Yang, Ming-Hsuan Yang:
VidToMe: Video Token Merging for Zero-Shot Video Editing. CVPR 2024: 7486-7495
[c383]Hsin-Ying Lee, Hung-Yu Tseng, Hsin-Ying Lee, Ming-Hsuan Yang:
Exploiting Diffusion Prior for Generalizable Dense Prediction. CVPR 2024: 7861-7871
[c382]Hanoona Abdul Rasheed, Muhammad Maaz, Sahal Shaji Mullappilly, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer
, Eric P. Xing, Ming-Hsuan Yang, Fahad Shahbaz Khan:
GLaMM: Pixel Grounding Large Multimodal Model. CVPR 2024: 13009-13018
[c381]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, Sergey Tulyakov:
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers. CVPR 2024: 13320-13331
[c380]Kuan-Chih Huang, Weijie Lyu, Ming-Hsuan Yang, Yi-Hsuan Tsai:
PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection. CVPR 2024: 14938-14947
[c379]Syed Talal Wasim, Muzammal Naseer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding. CVPR 2024: 18909-18918
[c378]Yuqing Huang
, Xin Li, Zikun Zhou
, Yaowei Wang, Zhenyu He, Ming-Hsuan Yang:
RTracker: Recoverable Tracking via PN Tree Structured Memory. CVPR 2024: 19038-19047
[c377]Xinyan Liu, Guorong Li, Yuankai Qi
, Ziheng Yan, Zhenjun Han, Anton van den Hengel, Ming-Hsuan Yang, Qingming Huang:
Weakly Supervised Video Individual Counting. CVPR 2024: 19228-19237
[c376]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes. CVPR 2024: 21634-21643
[c375]Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang:
Motion-Adaptive Separable Collaborative Filters for Blind Motion Deblurring. CVPR 2024: 25595-25605
[c374]Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang:
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation. CVPR 2024: 28056-28065
[c373]Yuheng Liu, Xinke Li
, Xueting Li, Lu Qi
, Chongshou Li
, Ming-Hsuan Yang
:
Pyramid Diffusion for Fine 3D Large Scene Generation. ECCV (69) 2024: 71-87
[c372]Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang:
Spatial-Temporal Multi-level Association for Video Object Segmentation. ECCV (67) 2024: 91-107
[c371]Chieh Hubert Lin, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng:
Taming Latent Diffusion Model for Neural Radiance Field Inpainting. ECCV (3) 2024: 149-165
[c370]Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Weakly Supervised 3D Object Detection via Multi-level Visual Guidance. ECCV (1) 2024: 175-191
[c369]Shuangkang Fang
, Yufeng Wang
, Yi-Hsuan Tsai
, Yi Yang, Wenrui Ding, Shuchang Zhou
, Ming-Hsuan Yang
:
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts. ECCV (42) 2024: 199-216
[c368]Matej Kristan, Jirí Matas, Pavel Tokmakov, Michael Felsberg, Luka Cehovin Zajc, Alan Lukezic, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Hyung Jin Chang, Gustavo Fernández, Minasadat Attari, Antoni B. Chan, Liang Chen, Xin Chen, Jaired Collins, Yutao Cui, Ganesh Sai Manas Devarapu, Yinglong Du, Heng Fan, Wan-Cyuan Fan, Zhenhua Feng, Mingqi Gao, Rama Krishna Gorthi, Raghav Goyal, Jungong Han, Bijaya Kumar Hatuwal
, Zhenyu He, Xiantao Hu, Xingsen Huang, Yuqing Huang, Dongmei Jiang, Ben Kang, Kannappan Palaniappan, Josef Kittler, Simiao Lai, Ning Li, Xiaohai Li, Xin Li, Cheng Liang, Liting Lin, Haibin Ling, Ting Liu, Ziquan Liu, Huchuan Lu, Yifei Luo, Deshui Miao, Juan Mogollon
, Ziqi Pang, Jaswanth Reddy Pochimireddy, Viktor Prutyanov
, Gani Rahmon, Aleksandr Romanov
, Liangtao Shi, Mennatullah Siam, Leonid Sigal, Arun Kumar Sivapuram, Roman A. Solovyev
, Elham Soltani Kazemi, Imad Eddine Toubal, Jia Wan, Limin Wang, Xinying Wang, Yaowei Wang, Yu-Xiong Wang, Zhiquan Wang, Gangshan Wu, Qiangqiang Wu, Xiaojun Wu, Zihao Xia, Jinxia Xie, Chenlong Xu, Tianyang Xu, Yong Xu, Chaocan Xue
, Chao Yang, Jinyu Yang, Ming-Hsuan Yang, Chenyang Yu, Ke Yu, Chunhui Zhang, Jiaming Zhang, Zhipeng Zhang, Feng Zheng, Yaozong Zheng, Bineng Zhong, Jinglin Zhou, Junbao Zhou, Yong Zhou, Zikun Zhou, Guibo Zhu, Jiawen Zhu, Xuefeng Zhu, Vladimir V. Zunin
:
The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results. ECCV Workshops (7) 2024: 357-383
[c367]Henghui Ding
, Chang Liu, Yunchao Wei
, Nikhila Ravi, Shuting He
, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng Zheng, Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu, Feiyu Pan, Hao Fang, Xiankai Lu:
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results. ECCV Workshops (10) 2024: 361-377
[c366]Zhongyu Xia, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
HENet: Hybrid Encoding for End-to-End Multi-task 3D Perception from Multi-view Cameras. ECCV (50) 2024: 376-392
[c365]Henghui Ding
, Lingyi Hong, Chang Liu, Ning Xu, Linjie Yang, Yuchen Fan, Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Jinming Chai, Qin Ma, Junpei Zhang, Licheng Jiao, Fang Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Xu Liu, Lingling Li, Hao Fang, Feiyu Pan, Xiankai Lu, Wei Zhang, Runmin Cong, Tuyen Tran, Bin Cao, Yisi Zhang, Hanyi Wang, Xingjian He, Jing Liu:
LSVOS Challenge Report: Large-Scale Complex and Long Video Object Segmentation. ECCV Workshops (10) 2024: 378-394
[c364]I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo:
Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance. ECCV (24) 2024: 428-444
[c363]Yu-Ju Tsai, Yu-Lun Liu, Lu Qi, Kelvin C. K. Chan, Ming-Hsuan Yang:
Dual Associated Encoder for Face Restoration. ICLR 2024
[c362]Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan:
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding. ICLR 2024
[c361]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is key to visual generation. ICLR 2024
[c360]Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong:
VideoPrism: A Foundational Visual Encoder for Video Understanding. ICML 2024
[c359]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Joshua V. Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. ICML 2024
[c358]Zhaoliang Wan, Yonggen Ling, Senlin Yi, Lu Qi, Wang Wei Lee, Minglei Lu, Sicheng Yang, Xiao Teng, Peng Lu, Xu Yang, Ming-Hsuan Yang, Hui Cheng:
VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception. ICML 2024
[c357]Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting. ICML 2024
[c356]Nitesh Bharadwaj Gundavarapu, Luke Friedman, Raghav Goyal, Chaitra Hegde, Eirikur Agustsson, Sagar Waghmare, Mikhail Sirotenko, Ming-Hsuan Yang, Tobias Weyand, Boqing Gong, Leonid Sigal:
Extending Video Masked Autoencoders to 128 frames. NeurIPS 2024
[c355]Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Ming-Hsuan Yang, Nicu Sebe:
Sharing Key Semantics in Transformer Makes Efficient Image Restoration. NeurIPS 2024
[c354]Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. NeurIPS 2024
[c353]Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova:
Beyond SOT: Tracking Multiple Generic Objects at Once. WACV 2024: 6812-6822
[i324]Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding. CoRR abs/2401.00901 (2024)
[i323]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024)
[i322]Tao Wang, Wanglong Lu
, Kaihao Zhang, Wenhan Luo
, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yang:
PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal. CoRR abs/2402.02374 (2024)
[i321]Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yang:
Generalizable Entity Grounding via Assistance of Large Language Model. CoRR abs/2402.02555 (2024)
[i320]Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong
, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting. CoRR abs/2402.07207 (2024)
[i319]Divin Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tang:
Training Class-Imbalanced Diffusion Model Via Overlap Optimization. CoRR abs/2402.10821 (2024)
[i318]Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang:
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing. CoRR abs/2402.12636 (2024)
[i317]Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong:
VideoPrism: A Foundational Visual Encoder for Video Understanding. CoRR abs/2402.13217 (2024)
[i316]Zhengxue Wang, Zhiqiang Yan
, Ming-Hsuan Yang, Jinshan Pan, Jian Yang, Ying Tai, Guangwei Gao:
Scene Prior Filtering for Depth Map Super-Resolution. CoRR abs/2402.13876 (2024)
[i315]Hankyul Kang, Ming-Hsuan Yang, Jongbin Ryu:
Interactive Multi-Head Self-Attention with Linear Complexity. CoRR abs/2402.17507 (2024)
[i314]Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, Sergey Tulyakov:
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers. CoRR abs/2402.19479 (2024)
[i313]Abdelrahman M. Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Efficient Video Object Segmentation via Modulated Cross-Attention Memory. CoRR abs/2403.17937 (2024)
[i312]Yuqing Huang, Xin Li, Zikun Zhou, Yaowei Wang, Zhenyu He, Ming-Hsuan Yang:
RTracker: Recoverable Tracking via PN Tree Structured Memory. CoRR abs/2403.19242 (2024)
[i311]Akshay Dudhane, Omkar Thawakar, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration. CoRR abs/2404.02154 (2024)
[i310]Zhongyu Xia, Zhiwei Lin, Xinhao Wang
, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang:
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras. CoRR abs/2404.02517 (2024)
[i309]Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang:
Mansformer: Efficient Transformer of Mixed Attention for Image Deblurring and Beyond. CoRR abs/2404.06135 (2024)
[i308]Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang:
Spatial-Temporal Multi-level Association for Video Object Segmentation. CoRR abs/2404.06265 (2024)
[i307]Bohao Peng, Zhuotao Tian, Shu Liu, Ming-Hsuan Yang, Jiaya Jia:
Scalable Language Model with Generalized Continual Learning. CoRR abs/2404.07470 (2024)
[i306]Weijie Lyu, Xueting Li, Abhijit Kundu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Gaga: Group Any Gaussians via 3D-aware Memory Bank. CoRR abs/2404.07977 (2024)
[i305]Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang:
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation. CoRR abs/2404.09993 (2024)
[i304]Chieh Hubert Lin, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng:
Taming Latent Diffusion Model for Neural Radiance Field Inpainting. CoRR abs/2404.09995 (2024)
[i303]Hao-Wei Chen, Yu-Syuan Xu, Kelvin C. K. Chan, Hsien-Kai Kuo, Chun-Yi Lee, Ming-Hsuan Yang:
AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters. CoRR abs/2404.11475 (2024)
[i302]Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang:
Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring. CoRR abs/2404.13153 (2024)
[i301]Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang:
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance. CoRR abs/2405.01356 (2024)
[i300]I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo:
Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance. CoRR abs/2405.10589 (2024)
[i299]Lingshun Kong, Jiangxin Dong, Ming-Hsuan Yang, Jinshan Pan:
Efficient Visual State Space Model for Image Deblurring. CoRR abs/2405.14343 (2024)
[i298]Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. CoRR abs/2405.17427 (2024)
[i297]Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Ming-Hsuan Yang, Nicu Sebe:
Sharing Key Semantics in Transformer Makes Efficient Image Restoration. CoRR abs/2405.20008 (2024)
[i296]Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. CoRR abs/2405.20282 (2024)
[i295]Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang:
1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation. CoRR abs/2406.04600 (2024)
[i294]Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng Zheng, Bin Cao, Yisi Zhang, Xuanxu Lin, Xingjian He, Bo Zhao, Jing Liu, Feiyu Pan, Hao Fang, Xiankai Lu:
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results. CoRR abs/2406.17005 (2024)
[i293]Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy:
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. CoRR abs/2406.19369 (2024)
[i292]Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang:
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts. CoRR abs/2407.06842 (2024)
[i291]Xin Li, Deshui Miao, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
Learning Spatial-Semantic Features for Robust Video Object Segmentation. CoRR abs/2407.07760 (2024)
[i290]Mingyang Zhao, Xiaohong Jia, Lei Ma, Yuke Shi, Jingen Jiang, Qizhai Li, Ming-Hsuan Yang, Tiejun Huang:
A Bayesian Approach Toward Robust Multidimensional Ellipsoid-Specific Fitting. CoRR abs/2407.19269 (2024)
[i289]Shilin Xu, Xiangtai Li, Haobo Yuan, Lu Qi, Yunhai Tong, Ming-Hsuan Yang:
LLAVADI: What Matters For Multimodal Large Language Models Distillation. CoRR abs/2407.19409 (2024)
[i288]Seung Hyun Lee, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang:
Cropper: Vision-Language Model for Image Cropping through In-Context Learning. CoRR abs/2408.07790 (2024)
[i287]Xin Lin, Yuyan Zhou, Jingtong Yue, Chao Ren, Kelvin C. K. Chan, Lu Qi, Ming-Hsuan Yang:
Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration. CoRR abs/2408.09241 (2024)
[i286]Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang:
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS. CoRR abs/2408.16431 (2024)
[i285]Henghui Ding, Lingyi Hong, Chang Liu, Ning Xu, Linjie Yang, Yuchen Fan, Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Jinming Chai, Qin Ma, Junpei Zhang, Licheng Jiao, Fang Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Xu Liu, Lingling Li, Hao Fang, Feiyu Pan, Xiankai Lu, Wei Zhang, Runmin Cong, Tuyen Tran, Bin Cao, Yisi Zhang, Hanyi Wang, Xingjian He, Jing Liu:
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation. CoRR abs/2409.05847 (2024)
[i284]Junyi Zhang, Charles Herrmann, Junhwa Hur, Varun Jampani, Trevor Darrell, Forrester Cole, Deqing Sun, Ming-Hsuan Yang:
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion. CoRR abs/2410.03825 (2024)
[i283]Yujin Tang, Lu Qi, Fei Xie, Xiangtai Li, Chao Ma, Ming-Hsuan Yang:
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners. CoRR abs/2410.04733 (2024)
[i282]Jingzhi Bao, Xueting Li, Ming-Hsuan Yang:
Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models. CoRR abs/2410.10821 (2024)
[i281]Xirui Li, Charles Herrmann, Kelvin C. K. Chan, Yinxiao Li, Deqing Sun, Chao Ma, Ming-Hsuan Yang:
A Simple Approach to Unifying Diffusion-based Conditional Generation. CoRR abs/2410.11439 (2024)
[i280]Hsin-Ping Huang, Xinyi Wang, Yonatan Bitton, Hagai Taitelbaum, Gaurav Singh Tomar, Ming-Wei Chang, Xuhui Jia, Kelvin C. K. Chan, Hexiang Hu, Yu-Chuan Su, Ming-Hsuan Yang:
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities. CoRR abs/2410.11824 (2024)
[i279]Lichang Chen, Hexiang Hu, Mingda Zhang, Yiwen Chen, Zifeng Wang, Yandong Li, Pranav Shyam, Tianyi Zhou
, Heng Huang, Ming-Hsuan Yang, Boqing Gong:
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities. CoRR abs/2410.12219 (2024)
[i278]Junwei Zhou
, Xueting Li, Lu Qi, Ming-Hsuan Yang:
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint. CoRR abs/2410.15391 (2024)
[i277]Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li, Ming-Hsuan Yang:
RelationBooth: Towards Relation-Aware Customized Object Generation. CoRR abs/2410.23280 (2024)
[i276]Botao Ye, Sifei Liu, Haofei Xu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang, Songyou Peng:
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images. CoRR abs/2410.24207 (2024)
[i275]Siddharth Seth, Rishabh Dabral, Diogo C. Luvizon, Marc Habermann, Ming-Hsuan Yang, Christian Theobalt
, Adam Kortylewski:
PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing. CoRR abs/2411.04249 (2024)
[i274]Nitesh Bharadwaj Gundavarapu, Luke Friedman, Raghav Goyal, Chaitra Hegde, Eirikur Agustsson, Sagar M. Waghmare, Mikhail Sirotenko, Ming-Hsuan Yang, Tobias Weyand, Boqing Gong, Leonid Sigal:
Extending Video Masked Autoencoders to 128 frames. CoRR abs/2411.13683 (2024)
[i273]Chanyoung Kim, Dayun Ju, Woojung Han, Ming-Hsuan Yang, Seong Jae Hwang:
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation. CoRR abs/2411.17150 (2024)
[i272]Zhongyu Xia, Jishuo Li, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Ming-Hsuan Yang:
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection. CoRR abs/2411.17761 (2024)
[i271]Yawei Li, Bin Ren, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Nicu Sebe, Ming-Hsuan Yang, Luca Benini:
Hierarchical Information Flow for Generalized Efficient Image Restoration. CoRR abs/2411.18588 (2024)
[i270]Li-Yuan Tsao, Hao-Wei Chen, Hao-Wei Chung, Deqing Sun, Chun-Yi Lee, Kelvin C. K. Chan, Ming-Hsuan Yang:
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior. CoRR abs/2411.18662 (2024)
[i269]Wei-Hsiang Yu, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Ranking-aware adapter for text-driven image ordering with CLIP. CoRR abs/2412.06760 (2024)
[i268]Baocai Yin, Ji Zhao, Huajie Jiang, Ningning Hou, Yongli Hu, Amin Beheshti, Ming-Hsuan Yang, Yuankai Qi:
Adapter-Enhanced Semantic Prompting for Continual Learning. CoRR abs/2412.11074 (2024)
[i267]Jinxiu Liu, Shaoheng Lin, Yinxiao Li, Ming-Hsuan Yang:
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes. CoRR abs/2412.11100 (2024)
[i266]Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang, Difan Liu, Feng Liu, Ming-Hsuan Yang, Zhan Xu:
Move-in-2D: 2D-Conditioned Human Motion Generation. CoRR abs/2412.13185 (2024)
[i265]Weijie Lyu, Yi Zhou, Ming-Hsuan Yang, Zhixin Shu:
FaceLift: Single Image to 3D Head with View Generation and GS-LRM. CoRR abs/2412.17812 (2024)- 2023
[j157]Yan-Bo Lin, Hung-Yu Tseng, Hsin-Ying Lee, Yen-Yu Lin
, Ming-Hsuan Yang
:
Unsupervised sound localization via iterative contrastive learning. Comput. Vis. Image Underst. 227: 103602 (2023)
[j156]Syed Waqas Zamir
, Aditya Arora, Salman Khan
, Munawar Hayat, Fahad Shahbaz Khan
, Ming-Hsuan Yang
, Ling Shao
:
Learning Enriched Features for Fast Image Restoration and Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 1934-1948 (2023)
[j155]Weitao Wan, Cheng Yu, Jiansheng Chen
, Tong Wu, Yuanyi Zhong
, Ming-Hsuan Yang
:
Shaping Deep Feature Space Towards Gaussian Mixture for Visual Classification. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2430-2444 (2023)
[j154]Weihao Xia
, Yulun Zhang
, Yujiu Yang
, Jing-Hao Xue
, Bolei Zhou, Ming-Hsuan Yang
:
GAN Inversion: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3121-3138 (2023)
[j153]Shanghua Gao
, Zhong-Yu Li
, Ming-Hsuan Yang
, Ming-Ming Cheng
, Junwei Han
, Philip H. S. Torr:
Large-Scale Unsupervised Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7457-7476 (2023)
[j152]Jie Cao
, Mandi Luo
, Junchi Yu
, Ming-Hsuan Yang
, Ran He
:
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8920-8935 (2023)
[j151]Jinshan Pan
, Boming Xu, Haoran Bai
, Jinhui Tang
, Ming-Hsuan Yang
:
Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-Local Spatial-Temporal Similarity. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 9411-9425 (2023)
[j150]Salman H. Khan
, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming-Hsuan Yang, Mubarak Shah:
Guest Editorial Introduction to the Special Section on Transformer Models in Vision. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12721-12725 (2023)
[j149]Yunfan Liu
, Qi Li
, Qiyao Deng
, Zhenan Sun
, Ming-Hsuan Yang
:
GAN-Based Facial Attribute Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14590-14610 (2023)
[j148]Zheheng Jiang
, Zhihua Liu
, Long Chen
, Lei Tong
, Xiangrong Zhang
, Xiangyuan Lan
, Danny Crookes
, Ming-Hsuan Yang
, Huiyu Zhou
:
Detecting and Tracking of Multiple Mice Using Part Proposal Networks. IEEE Trans. Neural Networks Learn. Syst. 34(12): 9806-9820 (2023)
[c352]Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble. CVPR 2023: 4853-4862
[c351]Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. CVPR 2023: 5703-5712
[c350]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CVPR 2023: 10459-10469
[c349]Yunhao Ge, Jie Ren, Andrew Gallagher, Yuxiao Wang, Ming-Hsuan Yang, Hartwig Adam, Laurent Itti, Balaji Lakshminarayanan, Jiaping Zhao:
Improving Zero-shot Generalization and Robustness of Multi-Modal Models. CVPR 2023: 11093-11101
[c348]Hsin-Ping Huang, Charles Herrmann, Junhwa Hur, Erika Lu, Kyle Sargent, Austin Stone, Ming-Hsuan Yang, Deqing Sun:
Self-supervised AutoFlow. CVPR 2023: 11412-11421
[c347]Gaoxiang Cong, Liang Li, Yuankai Qi
, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang:
Learning to Dub Movies via Hierarchical Prosody Models. CVPR 2023: 14687-14697
[c346]Chen Zhang
, Guorong Li, Yuankai Qi
, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang:
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection. CVPR 2023: 16271-16280
[c345]Botao Ye, Sifei Liu, Xueting Li, Ming-Hsuan Yang:
Self-Supervised Super-Plane for Neural 3D Reconstruction. CVPR 2023: 21415-21424
[c344]Lu Qi, Jason Kuen, Tiancheng Shen, Jiuxiang Gu, Wenbo Li, Weidong Guo, Jiaya Jia
, Zhe Lin, Ming-Hsuan Yang:
High Quality Entity Segmentation. ICCV 2023: 4024-4033
[c343]Kuan-Chih Huang, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Delving into Motion-Aware Matching for Monocular 3D Object Tracking. ICCV 2023: 6886-6895
[c342]Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu:
Unified Visual Relationship Detection with Vision and Language Models. ICCV 2023: 6939-6950
[c341]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer
, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Generative Multiplane Neural Radiance for 3D-Aware Image Generation. ICCV 2023: 7354-7364
[c340]Xin Li, Yuqing Huang
, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
CiteTracker: Correlating Image and Text for Visual Tracking. ICCV 2023: 9940-9949
[c339]Joungbin An, Hyolim Kang, Su Ho Han, Ming-Hsuan Yang, Seon Joo Kim:
MiniROAD: Minimal RNN Framework for Online Action Detection. ICCV 2023: 10307-10316
[c338]Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Self-regulating Prompts: Foundational Model Adaptation without Forgetting. ICCV 2023: 15144-15154
[c337]Abdelrahman M. Shaker
, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications. ICCV 2023: 17379-17390
[c336]Yunhao Ge, Yuecheng Li, Shuo Ni, Jiaping Zhao, Ming-Hsuan Yang, Laurent Itti:
CLR: Channel-wise Lightweight Reprogramming for Continual Learning. ICCV 2023: 18752-18762
[c335]Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin, Ming-Hsuan Yang, Sergey Tulyakov:
InfiniCity: Infinite-Scale City Synthesis. ICCV 2023: 22751-22761
[c334]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image. ICCV 2023: 22773-22783
[c333]Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Sy-Yen Kuo, Ming-Hsuan Yang:
Counting Crowds in Bad Weather. ICCV 2023: 23251-23262
[c332]Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu
, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo:
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results. ICCV (Workshops) 2023: 1788-1810
[c331]Huiwen Chang, Han Zhang, Jarred Barber, Aaron Maschinot, José Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Patrick Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan:
Muse: Text-To-Image Generation via Masked Generative Transformers. ICML 2023: 4055-4075
[c330]Chieh Hubert Lin, Hung-Yu Tseng, Hsin-Ying Lee, Maneesh Kumar Singh, Ming-Hsuan Yang:
Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features. ICML 2023: 21204-21222
[c329]Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou:
Module-wise Adaptive Distillation for Multimodality Foundation Models. NeurIPS 2023
[c328]Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection. NeurIPS 2023
[c327]Meng Liu, Mingda Zhang, Jialu Liu, Hanjun Dai, Ming-Hsuan Yang, Shuiwang Ji, Zheyun Feng, Boqing Gong:
Video Timeline Modeling For News Story Understanding. NeurIPS 2023
[c326]Lu Qi, Jason Kuen, Weidong Guo, Jiuxiang Gu, Zhe Lin, Bo Du, Yu Xu, Ming-Hsuan Yang:
AIMS: All-Inclusive Multi-Level Segmentation for Anything. NeurIPS 2023
[c325]Chun-Han Yao, Amit Raj, Wei-Chih Hung, Michael Rubinstein, Yuanzhen Li, Ming-Hsuan Yang, Varun Jampani:
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections. NeurIPS 2023
[c324]Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin P. Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. NeurIPS 2023
[c323]Junyi Zhang, Charles Herrmann, Junhwa Hur, Luisa Polania Cabrera, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence. NeurIPS 2023
[i264]Huiwen Chang, Han Zhang, Jarred Barber, Aaron Maschinot, José Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan:
Muse: Text-To-Image Generation via Masked Generative Transformers. CoRR abs/2301.00704 (2023)
[i263]Chieh Hubert Lin, Hsin-Ying Lee, Willi Menapace, Menglei Chai, Aliaksandr Siarohin, Ming-Hsuan Yang, Sergey Tulyakov:
InfiniCity: Infinite-Scale City Synthesis. CoRR abs/2301.09637 (2023)
[i262]Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu:
Unified Visual Relationship Detection with Vision and Language Models. CoRR abs/2303.08998 (2023)
[i261]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications. CoRR abs/2303.15446 (2023)
[i260]Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan:
Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding. CoRR abs/2303.16341 (2023)
[i259]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer
, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Generative Multiplane Neural Radiance for 3D-Aware Image Generation. CoRR abs/2304.01172 (2023)
[i258]Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. CoRR abs/2304.01194 (2023)
[i257]Hsin-Ping Huang, Yu-Chuan Su, Ming-Hsuan Yang:
Video Generation Beyond a Single Clip. CoRR abs/2304.07483 (2023)
[i256]Tsai-Shien Chen, Chieh Hubert Lin, Hung-Yu Tseng, Tsung-Yi Lin, Ming-Hsuan Yang:
Motion-Conditioned Diffusion Model for Controllable Video Synthesis. CoRR abs/2304.14404 (2023)
[i255]Junyi Zhang
, Charles Herrmann, Junhwa Hur, Luisa Polania Cabrera, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence. CoRR abs/2305.15347 (2023)
[i254]Lu Qi, Jason Kuen, Weidong Guo, Jiuxiang Gu, Zhe Lin, Bo Du, Yu Xu, Ming-Hsuan Yang:
AIMS: All-Inclusive Multi-Level Segmentation. CoRR abs/2305.17768 (2023)
[i253]Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Sy-Yen Kuo, Ming-Hsuan Yang:
Counting Crowds in Bad Weather. CoRR abs/2306.01209 (2023)
[i252]Chun-Han Yao, Amit Raj, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections. CoRR abs/2306.04619 (2023)
[i251]Lijun Yu
, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. CoRR abs/2306.17842 (2023)
[i250]Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong:
VideoGLUE: Video General Understanding Evaluation of Foundation Models. CoRR abs/2307.03166 (2023)
[i249]Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Self-regulating Prompts: Foundational Model Adaptation without Forgetting. CoRR abs/2307.06948 (2023)
[i248]Yunhao Ge, Yuecheng Li, Shuo Ni, Jiaping Zhao, Ming-Hsuan Yang, Laurent Itti:
CLR: Channel-wise Lightweight Reprogramming for Continual Learning. CoRR abs/2307.11386 (2023)
[i247]Muhammad Awais, Muzammal Naseer, Salman H. Khan, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Foundational Models Defining a New Era in Vision: A Survey and Outlook. CoRR abs/2307.13721 (2023)
[i246]Yu-Ju Tsai, Yu-Lun Liu, Lu Qi, Kelvin C. K. Chan, Ming-Hsuan Yang:
Dual Associated Encoder for Face Restoration. CoRR abs/2308.07314 (2023)
[i245]Xin Li, Yuqing Huang, Zhenyu He, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
CiteTracker: Correlating Image and Text for Visual Tracking. CoRR abs/2308.11322 (2023)
[i244]Kuan-Chih Huang, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Delving into Motion-Aware Matching for Monocular 3D Object Tracking. CoRR abs/2308.11607 (2023)
[i243]Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding, Ming-Hsuan Yang, Shuchang Zhou:
Text-driven Editing of 3D Scenes without Retraining. CoRR abs/2309.04917 (2023)
[i242]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image. CoRR abs/2309.06323 (2023)
[i241]Meng Liu, Mingda Zhang, Jialu Liu, Hanjun Dai, Ming-Hsuan Yang, Shuiwang Ji, Zheyun Feng, Boqing Gong:
Video Timeline Modeling For News Story Understanding. CoRR abs/2309.13446 (2023)
[i240]Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou
:
Module-wise Adaptive Distillation for Multimodality Foundation Models. CoRR abs/2310.04550 (2023)
[i239]Lijun Yu
, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is Key to Visual Generation. CoRR abs/2310.05737 (2023)
[i238]Yong Du, Jiahui Zhan, Shengfeng He, Xinzhe Li, Junyu Dong, Sheng Chen, Ming-Hsuan Yang:
One-for-All: Towards Universal Domain Translation with a Single StyleGAN. CoRR abs/2310.14222 (2023)
[i237]Hao Zhou, Tiancheng Shen, Xu Yang, Hai Huang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion. CoRR abs/2311.03352 (2023)
[i236]Hanoona Abdul Rasheed, Muhammad Maaz, Sahal Shaji Mullappilly, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer
, Eric P. Xing, Ming-Hsuan Yang, Fahad Shahbaz Khan:
GLaMM: Pixel Grounding Large Multimodal Model. CoRR abs/2311.03356 (2023)
[i235]Yuheng Liu, Xinke Li
, Xueting Li, Lu Qi, Chongshou Li, Ming-Hsuan Yang:
Pyramid Diffusion for Fine 3D Large Scene Generation. CoRR abs/2311.12085 (2023)
[i234]Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai, Lu Jiang, Ming-Hsuan Yang:
Text-Driven Image Editing via Learnable Regions. CoRR abs/2311.16432 (2023)
[i233]Junyi Zhang
, Charles Herrmann, Junhwa Hur, Eric Chen, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence. CoRR abs/2311.17034 (2023)
[i232]Hsin-Ying Lee, Hung-Yu Tseng, Hsin-Ying Lee, Ming-Hsuan Yang:
Exploiting Diffusion Prior for Generalizable Pixel-Level Semantic Prediction. CoRR abs/2311.18832 (2023)
[i231]Yunhao Liu
, Lu Qi, Yu-Ju Tsai, Xiangtai Li, Kelvin C. K. Chan, Ming-Hsuan Yang:
Effective Adapter for Face Recognition in the Wild. CoRR abs/2312.01734 (2023)
[i230]Chen Zhang, Guorong Li, Yuankai Qi, Hanhua Ye, Laiyun Qing, Ming-Hsuan Yang, Qingming Huang:
Dynamic Erasing Network Based on Multi-Scale Temporal Features for Weakly Supervised Video Anomaly Detection. CoRR abs/2312.01764 (2023)
[i229]Lu Qi, Lehan Yang, Weidong Guo, Yu Xu, Bo Du, Varun Jampani, Ming-Hsuan Yang:
UniGS: Unified Representation for Image Generation and Segmentation. CoRR abs/2312.01985 (2023)
[i228]Tao Tu, Ming-Feng Li, Chieh Hubert Lin, Yen-Chi Cheng, Min Sun, Ming-Hsuan Yang:
DreaMo: Articulated 3D Reconstruction From A Single Casual Video. CoRR abs/2312.02617 (2023)
[i227]Hsin-Ping Huang, Yu-Chuan Su, Deqing Sun, Lu Jiang, Xuhui Jia, Yukun Zhu, Ming-Hsuan Yang:
Fine-grained Controllable Video Generation via Object Appearance and Context. CoRR abs/2312.02919 (2023)
[i226]Cheng-Ju Ho, Chen-Hsuan Tai, Yen-Yu Lin, Ming-Hsuan Yang, Yi-Hsuan Tsai:
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection. CoRR abs/2312.02966 (2023)
[i225]Tiantian Wang, Xinxin Zuo, Fangzhou Mu, Jian Wang, Ming-Hsuan Yang:
Towards 4D Human Video Stylization. CoRR abs/2312.04143 (2023)
[i224]Hao Zhang, Fang Li, Lu Qi, Ming-Hsuan Yang, Narendra Ahuja:
CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen. CoRR abs/2312.05538 (2023)
[i223]Xinyan Liu, Guorong Li, Yuankai Qi, Ziheng Yan, Zhenjun Han, Anton van den Hengel, Ming-Hsuan Yang, Qingming Huang:
Weakly Supervised Video Individual CountingWeakly Supervised Video Individual Counting. CoRR abs/2312.05923 (2023)
[i222]Jiangning Zhang
, Xuhai Chen, Yabiao Wang
, Chengjie Wang
, Yong Liu, Xiangtai Li, Ming-Hsuan Yang, Dacheng Tao
:
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection. CoRR abs/2312.07495 (2023)
[i221]Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance. CoRR abs/2312.07530 (2023)
[i220]Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang:
DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes. CoRR abs/2312.07920 (2023)
[i219]Kuan-Chih Huang, Weijie Lyu, Ming-Hsuan Yang, Yi-Hsuan Tsai:
PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection. CoRR abs/2312.08371 (2023)
[i218]Xirui Li, Chao Ma, Xiaokang Yang, Ming-Hsuan Yang:
VidToMe: Video Token Merging for Zero-Shot Video Editing. CoRR abs/2312.10656 (2023)
[i217]Dan Kondratyuk, Lijun Yu
, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Joshua V. Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez
, David Minnen, David A. Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. CoRR abs/2312.14125 (2023)- 2022
[j147]Yu-Chuan Su
, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay
, Cho-Jui Hsieh, Lior Shapira, Radu Soricut
, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang
, Boqing Gong:
2.5D visual relationship detection. Comput. Vis. Image Underst. 224: 103557 (2022)
[j146]Qi Mao, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Siwei Ma, Ming-Hsuan Yang
:
Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors. Int. J. Comput. Vis. 130(2): 517-549 (2022)
[j145]Taihong Xiao, Sifei Liu, Shalini De Mello, Zhiding Yu, Jan Kautz, Ming-Hsuan Yang
:
Learning Contrastive Representation for Semantic Correspondence. Int. J. Comput. Vis. 130(5): 1293-1309 (2022)
[j144]Jinshan Pan, Deqing Sun, Jiawei Zhang, Jinhui Tang, Jian Yang, Yu-Wing Tai
, Ming-Hsuan Yang
:
Dual Convolutional Neural Networks for Low-Level Vision. Int. J. Comput. Vis. 130(6): 1440-1458 (2022)
[j143]Taihong Xiao, Sifei Liu, Shalini De Mello, Zhiding Yu, Jan Kautz, Ming-Hsuan Yang
:
Correction to: Learning Contrastive Representation for Semantic Correspondence. Int. J. Comput. Vis. 130(6): 1607 (2022)
[j142]Kaihao Zhang, Wenqi Ren, Wenhan Luo
, Wei-Sheng Lai, Björn Stenger, Ming-Hsuan Yang
, Hongdong Li:
Deep Image Deblurring: A Survey. Int. J. Comput. Vis. 130(9): 2103-2130 (2022)
[j141]Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang
:
Understanding Synonymous Referring Expressions via Contrastive Features. Int. J. Comput. Vis. 130(10): 2501-2516 (2022)
[j140]Weitao Wan, Jiansheng Chen, Ming-Hsuan Yang, Huimin Ma:
Co-attention dictionary network for weakly-supervised semantic segmentation. Neurocomputing 486: 272-285 (2022)
[j139]Junyi Feng
, Songyuan Li, Xi Li
, Fei Wu
, Qi Tian
, Ming-Hsuan Yang
, Haibin Ling:
TapLab: A Fast Framework for Semantic Video Segmentation Tapping Into Compressed-Domain Knowledge. IEEE Trans. Pattern Anal. Mach. Intell. 44(3): 1591-1603 (2022)
[j138]Xiangyu Xu
, Yongrui Ma, Wenxiu Sun, Ming-Hsuan Yang
:
Exploiting Raw Images for Real-Scene Super-Resolution. IEEE Trans. Pattern Anal. Mach. Intell. 44(4): 1905-1921 (2022)
[j137]Xiankai Lu
, Chao Ma
, Jianbing Shen
, Xiaokang Yang
, Ian Reid
, Ming-Hsuan Yang
:
Deep Object Tracking With Shrinkage Loss. IEEE Trans. Pattern Anal. Mach. Intell. 44(5): 2386-2401 (2022)
[j136]Wenqi Ren, Jiawei Zhang
, Jinshan Pan
, Sifei Liu
, Jimmy S. Ren
, Junping Du
, Xiaochun Cao
, Ming-Hsuan Yang
:
Deblurring Dynamic Scenes via Spatially Varying Recurrent Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(8): 3974-3987 (2022)
[j135]Dingwen Zhang, Junwei Han
, Gong Cheng
, Ming-Hsuan Yang
:
Weakly Supervised Object Localization and Detection: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5866-5885 (2022)
[j134]Jiangxin Dong
, Jinshan Pan
, Jimmy S. Ren, Liang Lin
, Jinhui Tang
, Ming-Hsuan Yang
:
Learning Spatially Variant Linear Representation Models for Joint Filtering. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 8355-8370 (2022)
[j133]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang
, Yung-Yu Chuang
, Jia-Bin Huang
:
Learning to See Through Obstructions With Layered Decomposition. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 8387-8402 (2022)
[j132]Sanghyun Son, Jaeha Kim, Wei-Sheng Lai, Ming-Hsuan Yang
, Kyoung Mu Lee
:
Toward Real-World Super-Resolution via Adaptive Downsampling Models. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 8657-8670 (2022)
[j131]Wei-Sheng Lai
, Yi-Chang Shih, Chia-Kai Liang
, Ming-Hsuan Yang
:
Correcting Face Distortion in Wide-Angle Videos. IEEE Trans. Image Process. 31: 366-378 (2022)
[c322]Cheng-Ju Ho, Chen-Hsuan Tai, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang:
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection. BMVC 2022: 185
[c321]Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge J. Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui:
On Temporal Granularity in Self-Supervised Video Representation Learning. BMVC 2022: 541
[c320]Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Restormer: Efficient Transformer for High-Resolution Image Restoration. CVPR 2022: 5718-5729
[c319]Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burst Image Restoration and Enhancement. CVPR 2022: 5749-5758
[c318]Yen-Chi Cheng
, Chieh Hubert Lin, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Ming-Hsuan Yang:
InOut: Diverse Image Outpainting via GAN Inversion. CVPR 2022: 11421-11430
[c317]Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu:
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision. CVPR 2022: 13957-13966
[c316]Zhihao Shi, Xiangyu Xu, Xiaohong Liu
, Jun Chen, Ming-Hsuan Yang:
Video Frame Interpolation Transformer. CVPR 2022: 17461-17470
[c315]Hanhua Ye, Guorong Li, Yuankai Qi
, Shuhui Wang, Qingming Huang, Ming-Hsuan Yang:
Hierarchical Modular Network for Video Captioning. CVPR 2022: 17918-17927
[c314]Hsin-Ping Huang, Deqing Sun, Yaojie Liu, Wen-Sheng Chu, Taihong Xiao, Jinwei Yuan, Hartwig Adam, Ming-Hsuan Yang:
Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing. ECCV (13) 2022: 37-54
[c313]Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, Ming-Hsuan Yang, Jiaya Jia
:
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation. ECCV (31) 2022: 59-77
[c312]An-Chieh Cheng, Xueting Li, Sifei Liu, Min Sun, Ming-Hsuan Yang:
Autoregressive 3D Shape Generation via Canonical Mapping. ECCV (3) 2022: 89-104
[c311]Runsheng Xu
, Hao Xiang
, Zhengzhong Tu
, Xin Xia
, Ming-Hsuan Yang
, Jiaqi Ma
:
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer. ECCV (39) 2022: 107-124
[c310]Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang:
Learning Discriminative Shrinkage Deep Networks for Image Deconvolution. ECCV (19) 2022: 217-234
[c309]Xueting Li, Xiaolong Wang, Ming-Hsuan Yang, Alexei A. Efros, Sifei Liu:
Scraping Textures from Natural Images for Synthesis and Editing. ECCV (15) 2022: 391-408
[c308]Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang:
Learning Visibility for Robust Dense Human Body Estimation. ECCV (1) 2022: 412-428
[c307]Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer
, Ming-Hsuan Yang:
Class-Agnostic Object Detection with Multi-modal Transformer. ECCV (10) 2022: 512-531
[c306]Rakesh Jasti, Varun Jampani, Deqing Sun, Ming-Hsuan Yang:
Multi-Frame Video Prediction with Learnable Motion Encodings. ICIP 2022: 4198-4202
[c305]Tsai-Shien Chen, Wei-Chih Hung, Hung-Yu Tseng, Shao-Yi Chien, Ming-Hsuan Yang:
Incremental False Negative Detection for Contrastive Learning. ICLR 2022
[c304]Xueting Li, Shalini De Mello, Xiaolong Wang, Ming-Hsuan Yang, Jan Kautz, Sifei Liu:
Learning Continuous Environment Fields via Implicit Functions. ICLR 2022
[c303]Chieh Hubert Lin, Hsin-Ying Lee, Yen-Chi Cheng, Sergey Tulyakov, Ming-Hsuan Yang:
InfinityGAN: Towards Infinite-Pixel Image Synthesis. ICLR 2022
[c302]Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang:
ViDT: An Efficient and Effective Fully Transformer-based Object Detector. ICLR 2022
[c301]Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
LASSIE: Learning Articulated Shapes from Sparse Image Ensemble via 3D Part Discovery. NeurIPS 2022
[c300]Yi-Wen Chen, Xiaojie Jin, Xiaohui Shen, Ming-Hsuan Yang:
Video Salient Object Detection via Contrastive Features and Attention Modules. WACV 2022: 536-545
[c299]Chun-Han Yao, Boqing Gong, Hang Qi, Yin Cui, Yukun Zhu, Ming-Hsuan Yang:
Federated Multi-Target Domain Adaptation. WACV 2022: 1081-1090
[c298]Yufeng Wang, Yi-Hsuan Tsai, Wei-Chih Hung, Wenrui Ding, Shuo Liu, Ming-Hsuan Yang:
Semi-supervised Multi-task Learning for Semantics and Depth. WACV 2022: 2663-2672
[i216]Kaihao Zhang, Wenqi Ren, Wenhan Luo, Wei-Sheng Lai, Björn Stenger, Ming-Hsuan Yang, Hongdong Li:
Deep Image Deblurring: A Survey. CoRR abs/2201.10700 (2022)
[i215]Runsheng Xu, Hao Xiang, Zhengzhong Tu, Xin Xia, Ming-Hsuan Yang, Jiaqi Ma:
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer. CoRR abs/2203.10638 (2022)
[i214]Hsin-Ping Huang, Deqing Sun, Yaojie Liu, Wen-Sheng Chu, Taihong Xiao, Jinwei Yuan, Hartwig Adam, Ming-Hsuan Yang:
Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing. CoRR abs/2203.12175 (2022)
[i213]Tiantian Wang, Nikolaos Sarafianos, Ming-Hsuan Yang, Tony Tung:
Animatable Neural Radiance Fields from Monocular RGB-D. CoRR abs/2204.01218 (2022)
[i212]An-Chieh Cheng, Xueting Li, Sifei Liu, Min Sun, Ming-Hsuan Yang:
Autoregressive 3D Shape Generation via Canonical Mapping. CoRR abs/2204.01955 (2022)
[i211]Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang:
An Extendable, Efficient and Effective Transformer-based Object Detector. CoRR abs/2204.07962 (2022)
[i210]Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Learning Enriched Features for Fast Image Restoration and Enhancement. CoRR abs/2205.01649 (2022)
[i209]Chieh Hubert Lin, Hsin-Ying Lee, Hung-Yu Tseng, Maneesh Kumar Singh, Ming-Hsuan Yang:
Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features. CoRR abs/2206.01202 (2022)
[i208]Zhiwei Lin, Tingting Liang, Taihong Xiao, Yongtao Wang, Zhi Tang, Ming-Hsuan Yang:
FlowNAS: Neural Architecture Search for Optical Flow Estimation. CoRR abs/2207.01271 (2022)
[i207]Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
LASSIE: Learning Articulated Shapes from Sparse Image Ensemble via 3D Part Discovery. CoRR abs/2207.03434 (2022)
[i206]Rui Qian, Yeqing Li, Zheng Xu, Ming-Hsuan Yang, Serge J. Belongie, Yin Cui:
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models. CoRR abs/2207.07646 (2022)
[i205]Lu Zhang, Lu Qi, Xu Yang, Hong Qiao, Ming-Hsuan Yang, Zhiyong Liu:
Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning. CoRR abs/2208.00979 (2022)
[i204]Jean Lahoud
, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Muhammad Anwer
, Salman Khan, Ming-Hsuan Yang:
3D Vision with Transformers: A Survey. CoRR abs/2208.04309 (2022)
[i203]Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang:
Learning Visibility for Robust Dense Human Body Estimation. CoRR abs/2208.10652 (2022)
[i202]Ling Yang, Zhilong Zhang, Yang Song, Shenda Hong, Runsheng Xu, Yue Zhao, Yingxia Shao, Wentao Zhang, Ming-Hsuan Yang, Bin Cui:
Diffusion Models: A Comprehensive Survey of Methods and Applications. CoRR abs/2209.00796 (2022)
[i201]Yunfan Liu, Qi Li, Qiyao Deng, Zhenan Sun, Ming-Hsuan Yang:
GAN-based Facial Attribute Manipulation. CoRR abs/2210.12683 (2022)
[i200]Jie Cao, Mandi Luo, Junchi Yu, Ming-Hsuan Yang, Ran He:
ScoreMix: A Scalable Augmentation Strategy for Training GANs with Limited Data. CoRR abs/2210.15137 (2022)
[i199]Lu Qi, Jason Kuen, Weidong Guo, Tiancheng Shen, Jiuxiang Gu, Wenbo Li, Jiaya Jia, Zhe Lin, Ming-Hsuan Yang:
Fine-Grained Entity Segmentation. CoRR abs/2211.05776 (2022)
[i198]Ling Yang, Zhilin Huang, Yang Song, Shenda Hong, Guohao Li, Wentao Zhang, Bin Cui, Bernard Ghanem, Ming-Hsuan Yang:
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training. CoRR abs/2211.11138 (2022)
[i197]Taihong Xiao, Zirui Wang, Liangliang Cao, Jiahui Yu, Shengyang Dai, Ming-Hsuan Yang:
Exploiting Category Names for Few-Shot Classification with Vision-Language Models. CoRR abs/2211.16594 (2022)
[i196]Yunhao Ge, Jie Ren, Yuxiao Wang, Andrew Gallagher, Ming-Hsuan Yang, Laurent Itti, Hartwig Adam, Balaji Lakshminarayanan, Jiaping Zhao:
Improving Zero-shot Generalization and Robustness of Multi-modal Models. CoRR abs/2212.01758 (2022)
[i195]Hsin-Ping Huang, Charles Herrmann, Junhwa Hur, Erika Lu, Kyle Sargent, Austin Stone, Ming-Hsuan Yang, Deqing Sun:
Self-supervised AutoFlow. CoRR abs/2212.01762 (2022)
[i194]Gaoxiang Cong, Liang Li, Yuankai Qi, Zhengjun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang:
Learning to Dub Movies via Hierarchical Prosody Models. CoRR abs/2212.04054 (2022)
[i193]Xinyan Liu, Guorong Li, Yuankai Qi, Zhenjun Han, Qingming Huang, Ming-Hsuan Yang, Nicu Sebe:
Consistency-Aware Anchor Pyramid Network for Crowd Localization. CoRR abs/2212.04067 (2022)
[i192]Chen Zhang, Guorong Li, Yuankai Qi, Shuhui Wang, Laiyun Qing, Qingming Huang, Ming-Hsuan Yang:
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection. CoRR abs/2212.04090 (2022)
[i191]Ziheng Yan, Yuankai Qi, Guorong Li, Xinyan Liu, Weigang Zhang, Qingming Huang, Ming-Hsuan Yang:
Progressive Multi-resolution Loss for Crowd Counting. CoRR abs/2212.04127 (2022)
[i190]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation. CoRR abs/2212.04497 (2022)
[i189]Youming Deng
, Xueting Li, Sifei Liu, Ming-Hsuan Yang:
DIP: Differentiable Interreflection-aware Physics-based Inverse Rendering. CoRR abs/2212.04705 (2022)
[i188]Lijun Yu
, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CoRR abs/2212.05199 (2022)
[i187]Cheng-Ju Ho, Chen-Hsuan Tai, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang:
Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection. CoRR abs/2212.09273 (2022)
[i186]Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani:
Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble. CoRR abs/2212.11042 (2022)
[i185]Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova:
Beyond SOT: It's Time to Track Multiple Generic Objects at Once. CoRR abs/2212.11920 (2022)- 2021
[j130]Jinshan Pan, Deqing Sun, Jian Yang, Wangmeng Zuo, Paolo Favaro, Yasuyuki Matsushita
, Ming-Hsuan Yang:
Editorial for CVIU_DL for image restoration. Comput. Vis. Image Underst. 208-209: 103222 (2021)
[j129]Yun-Chun Chen
, Marco Piccirilli, Robinson Piramuthu
, Ming-Hsuan Yang
:
Self-attentive 3D human pose and shape estimation from videos. Comput. Vis. Image Underst. 213: 103305 (2021)
[j128]Wenqing Chu, Wei-Chih Hung, Yi-Hsuan Tsai, Yu-Ting Chang, Yijun Li, Deng Cai, Ming-Hsuan Yang
:
Learning to Caricature via Semantic Shape Transform. Int. J. Comput. Vis. 129(9): 2663-2679 (2021)
[j127]Jongbin Ryu, Ming-Hsuan Yang, Jongwoo Lim:
Unsupervised feature learning for self-tuning neural networks. Neural Networks 133: 103-111 (2021)
[j126]Dongwei Ren
, Wangmeng Zuo
, David Zhang
, Lei Zhang
, Ming-Hsuan Yang
:
Simultaneous Fidelity and Regularization Learning for Image Restoration. IEEE Trans. Pattern Anal. Mach. Intell. 43(1): 284-299 (2021)
[j125]Shanghua Gao
, Ming-Ming Cheng
, Kai Zhao
, Xin-Yu Zhang
, Ming-Hsuan Yang
, Philip H. S. Torr:
Res2Net: A New Multi-Scale Backbone Architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2): 652-662 (2021)
[j124]Wenbo Bao
, Wei-Sheng Lai
, Xiaoyun Zhang
, Zhiyong Gao, Ming-Hsuan Yang
:
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 43(3): 933-948 (2021)
[j123]Arda Senocak
, Tae-Hyun Oh
, Junsik Kim
, Ming-Hsuan Yang
, In So Kweon:
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications. IEEE Trans. Pattern Anal. Mach. Intell. 43(5): 1605-1619 (2021)
[j122]Jinshan Pan
, Jiangxin Dong
, Yang Liu, Jiawei Zhang, Jimmy S. J. Ren
, Jinhui Tang
, Yu-Wing Tai
, Ming-Hsuan Yang
:
Physics-Based Generative Adversarial Models for Image Restoration and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 43(7): 2449-2462 (2021)
[j121]Yun-Chun Chen
, Yen-Yu Lin
, Ming-Hsuan Yang
, Jia-Bin Huang
:
Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 43(10): 3632-3647 (2021)
[j120]Shi Pu
, Yibing Song
, Chao Ma
, Honggang Zhang
, Ming-Hsuan Yang
:
Learning Recurrent Memory Activation Networks for Visual Tracking. IEEE Trans. Image Process. 30: 725-738 (2021)
[j119]Peiyu Yang
, Guofeng Zhang, Lu Wang
, Lisheng Xu
, Qingxu Deng, Ming-Hsuan Yang
:
A Part-Aware Multi-Scale Fully Convolutional Network for Pedestrian Detection. IEEE Trans. Intell. Transp. Syst. 22(2): 1125-1137 (2021)
[c297]Taihong Xiao, Xin-Yu Zhang, Hao-Lin Jia, Ming-Ming Cheng, Ming-Hsuan Yang:
Semi-Supervised Learning with Meta-Gradient. AISTATS 2021: 73-81
[c296]Hwanjun Song, Eunyoung Kim, Varun Jampani, Deqing Sun, Jae-Gil Lee, Ming-Hsuan Yang:
Exploiting Scene Depth for Object Detection with Multimodal Transformers. BMVC 2021: 265
[c295]Jingkai Zhou, Varun Jampani, Zhixiong Pi, Qiong Liu, Ming-Hsuan Yang:
Decoupled Dynamic Filter Networks. CVPR 2021: 6647-6656
[c294]Rui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge J. Belongie
, Yin Cui:
Spatiotemporal Contrastive Video Representation Learning. CVPR 2021: 6964-6974
[c293]Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang:
Regularizing Generative Adversarial Networks Under Limited Data. CVPR 2021: 7921-7931
[c292]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat
, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Multi-Stage Progressive Image Restoration. CVPR 2021: 14821-14831
[c291]Jie Cao, Luanxuan Hou, Ming-Hsuan Yang, Ran He, Zhenan Sun:
ReMix: Towards Image-to-Image Translation With Limited Data. CVPR 2021: 15018-15027
[c290]Yuankai Qi
, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. ICCV 2021: 1635-1644
[c289]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Hybrid Neural Fusion for Full-frame Video Stabilization. ICCV 2021: 2279-2288
[c288]Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar:
COMISR: Compression-Informed Video Super-Resolution. ICCV 2021: 2523-2532
[c287]Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang:
Video Matting via Consistency-Regularized Graph Neural Networks. ICCV 2021: 4882-4891
[c286]Chun-Han Yao, Wei-Chih Hung, Varun Jampani, Ming-Hsuan Yang:
Discovering 3D Parts from Image Collections. ICCV 2021: 12961-12970
[c285]Sanath Narayan, Hisham Cholakkal, Munawar Hayat
, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations. ICCV 2021: 13588-13597
[c284]Hsin-Ping Huang, Hung-Yu Tseng, Saurabh Saini
, Maneesh Singh, Ming-Hsuan Yang:
Learning to Stylize Novel Views. ICCV 2021: 13849-13858
[c283]Kaihao Zhang, Dongxu Li, Wenhan Luo
, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang:
Benchmarking Ultra-High-Definition Image Super-resolution. ICCV 2021: 14749-14758
[c282]An-Chieh Cheng, Xueting Li, Min Sun, Ming-Hsuan Yang, Sifei Liu:
Learning 3D Dense Correspondence via Canonical Point Autoencoder. NeurIPS 2021: 6608-6620
[c281]Yan-Bo Lin, Hung-Yu Tseng, Hsin-Ying Lee, Yen-Yu Lin, Ming-Hsuan Yang:
Exploring Cross-Video and Cross-Modality Signals for Weakly-Supervised Audio-Visual Video Parsing. NeurIPS 2021: 11449-11461
[c280]Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Intriguing Properties of Vision Transformers. NeurIPS 2021: 23296-23308
[c279]Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang:
End-to-end Multi-modal Video Temporal Grounding. NeurIPS 2021: 28442-28453
[c278]Xin-Yu Zhang, Kai Zhao, Taihong Xiao, Ming-Ming Cheng, Ming-Hsuan Yang:
Structured sparsification with joint optimization of group convolution and channel shuffle. UAI 2021: 440-450
[c277]Yijun Li, Lu Jiang, Ming-Hsuan Yang:
Controllable and Progressive Image Extrapolation. WACV 2021: 2139-2148
[c276]Qifei Wang, Junjie Ke, Joshua Greaves, Grace Chu, Gabriel Bender, Luciano Sbaiz, Alec Go, Andrew G. Howard, Ming-Hsuan Yang, Jeff Gilbert, Peyman Milanfar, Feng Yang:
Multi-path Neural Networks for On-device Multi-domain Visual Classification. WACV 2021: 3018-3027
[i184]Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, Ming-Hsuan Yang:
Low Light Image Enhancement via Global and Local Context Modeling. CoRR abs/2101.00850 (2021)
[i183]Weihao Xia, Yulun Zhang, Yujiu Yang, Jing-Hao Xue, Bolei Zhou, Ming-Hsuan Yang:
GAN Inversion: A Survey. CoRR abs/2101.05278 (2021)
[i182]Xiangyu Xu, Muchen Li, Wenxiu Sun, Ming-Hsuan Yang:
Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising. CoRR abs/2101.10760 (2021)
[i181]Xiangyu Xu, Yongrui Ma, Wenxiu Sun, Ming-Hsuan Yang:
Exploiting Raw Images for Real-Scene Super-Resolution. CoRR abs/2102.01579 (2021)
[i180]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Multi-Stage Progressive Image Restoration. CoRR abs/2102.02808 (2021)
[i179]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Neural Re-rendering for Full-frame Video Stabilization. CoRR abs/2102.06205 (2021)
[i178]Yun-Chun Chen, Marco Piccirilli, Robinson Piramuthu, Ming-Hsuan Yang:
Self-Attentive 3D Human Pose and Shape Estimation from Videos. CoRR abs/2103.14182 (2021)
[i177]Jie Cao, Luanxuan Hou, Ming-Hsuan Yang, Ran He, Zhenan Sun:
ReMix: Towards Image-to-Image Translation with Limited Data. CoRR abs/2103.16835 (2021)
[i176]Yan-Bo Lin, Hung-Yu Tseng, Hsin-Ying Lee, Yen-Yu Lin, Ming-Hsuan Yang:
Unsupervised Sound Localization via Iterative Contrastive Learning. CoRR abs/2104.00315 (2021)
[i175]Yen-Chi Cheng, Chieh Hubert Lin, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Ming-Hsuan Yang:
In&Out : Diverse Image Outpainting via GAN Inversion. CoRR abs/2104.00675 (2021)
[i174]Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang:
Regularizing Generative Adversarial Networks under Limited Data. CoRR abs/2104.03310 (2021)
[i173]Chieh Hubert Lin, Hsin-Ying Lee, Yen-Chi Cheng, Sergey Tulyakov, Ming-Hsuan Yang:
InfinityGAN: Towards Infinite-Resolution Image Synthesis. CoRR abs/2104.03963 (2021)
[i172]Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. CoRR abs/2104.04167 (2021)
[i171]Dingwen Zhang, Junwei Han, Gong Cheng, Ming-Hsuan Yang:
Weakly Supervised Object Localization and Detection: A Survey. CoRR abs/2104.07918 (2021)
[i170]Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Understanding Synonymous Referring Expressions via Contrastive Features. CoRR abs/2104.10156 (2021)
[i169]Yu-Chuan Su, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay, Cho-Jui Hsieh, Lior Shapira, Radu Soricut, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang, Boqing Gong:
2.5D Visual Relationship Detection. CoRR abs/2104.12727 (2021)
[i168]Jingkai Zhou, Varun Jampani, Zhixiong Pi, Qiong Liu, Ming-Hsuan Yang:
Decoupled Dynamic Filter Networks. CoRR abs/2104.14107 (2021)
[i167]Yinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang, Peyman Milanfar:
COMISR: Compression-Informed Video Super-Resolution. CoRR abs/2105.01237 (2021)
[i166]Muzammal Naseer, Kanchana Ranasinghe
, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Intriguing Properties of Vision Transformers. CoRR abs/2105.10497 (2021)
[i165]Hsin-Ping Huang, Hung-Yu Tseng, Saurabh Saini, Maneesh Kumar Singh, Ming-Hsuan Yang:
Learning to Stylize Novel Views. CoRR abs/2105.13509 (2021)
[i164]Shanghua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip H. S. Torr:
Large-scale Unsupervised Semantic Segmentation. CoRR abs/2106.03149 (2021)
[i163]Tsai-Shien Chen, Wei-Chih Hung, Hung-Yu Tseng, Shao-Yi Chien, Ming-Hsuan Yang:
Incremental False Negative Detection for Contrastive Learning. CoRR abs/2106.03719 (2021)
[i162]Xin Li, Wenjie Pei, Zikun Zhou, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang:
Crop-Transform-Paste: Self-Supervised Learning for Visual Tracking. CoRR abs/2106.10900 (2021)
[i161]An-Chieh Cheng, Xueting Li, Min Sun, Ming-Hsuan Yang, Sifei Liu:
Learning 3D Dense Correspondence via Canonical Point Autoencoder. CoRR abs/2107.04867 (2021)
[i160]Yi-Wen Chen, Yi-Hsuan Tsai, Ming-Hsuan Yang:
End-to-end Multi-modal Video Temporal Grounding. CoRR abs/2107.05624 (2021)
[i159]Chun-Han Yao, Wei-Chih Hung, Varun Jampani, Ming-Hsuan Yang:
Discovering 3D Parts from Image Collections. CoRR abs/2107.13629 (2021)
[i158]Chun-Han Yao, Boqing Gong, Yin Cui, Hang Qi, Yukun Zhu, Ming-Hsuan Yang:
Federated Multi-Target Domain Adaptation. CoRR abs/2108.07792 (2021)
[i157]Sanghyun Son, Jaeha Kim, Wei-Sheng Lai, Ming-Hsuan Yang, Kyoung Mu Lee:
Toward Real-World Super-Resolution via Adaptive Downsampling Models. CoRR abs/2109.03444 (2021)
[i156]Taihong Xiao, Sifei Liu, Shalini De Mello, Zhiding Yu, Jan Kautz, Ming-Hsuan Yang:
Learning Contrastive Representation for Semantic Correspondence. CoRR abs/2109.10967 (2021)
[i155]Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burst Image Restoration and Enhancement. CoRR abs/2110.03680 (2021)
[i154]Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang:
ViDT: An Efficient and Effective Fully Transformer-based Object Detector. CoRR abs/2110.03921 (2021)
[i153]Yufeng Wang, Yi-Hsuan Tsai, Wei-Chih Hung, Wenrui Ding, Shuo Liu, Ming-Hsuan Yang:
Semi-supervised Multi-task Learning for Semantics and Depth. CoRR abs/2110.07197 (2021)
[i152]Yi-Wen Chen, Xiaojie Jin, Xiaohui Shen, Ming-Hsuan Yang:
Video Salient Object Detection via Contrastive Features and Attention Modules. CoRR abs/2111.02368 (2021)
[i151]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Restormer: Efficient Transformer for High-Resolution Image Restoration. CoRR abs/2111.09881 (2021)
[i150]Wei-Sheng Lai, Yi-Chang Shih, Chia-Kai Liang, Ming-Hsuan Yang:
Correcting Face Distortion in Wide-Angle Videos. CoRR abs/2111.09950 (2021)
[i149]Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Ming-Hsuan Yang:
Multi-modal Transformers Excel at Class-agnostic Object Detection. CoRR abs/2111.11430 (2021)
[i148]Hanhua Ye, Guorong Li, Yuankai Qi, Shuhui Wang, Qingming Huang, Ming-Hsuan Yang:
Hierarchical Modular Network for Video Captioning. CoRR abs/2111.12476 (2021)
[i147]Zhihao Shi, Xiangyu Xu, Xiaohong Liu, Jun Chen, Ming-Hsuan Yang:
Video Frame Interpolation Transformer. CoRR abs/2111.13817 (2021)
[i146]Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang:
Learning Discriminative Shrinkage Deep Networks for Image Deconvolution. CoRR abs/2111.13876 (2021)
[i145]Xueting Li, Shalini De Mello, Xiaolong Wang, Ming-Hsuan Yang, Jan Kautz, Sifei Liu:
Learning Continuous Environment Fields via Implicit Functions. CoRR abs/2111.13997 (2021)
[i144]Kaihao Zhang, Wenhan Luo, Boheng Chen, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang:
Benchmarking Deep Deblurring Algorithms: A Large-Scale Multi-Cause Dataset and A New Baseline Model. CoRR abs/2112.00234 (2021)
[i143]Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge J. Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui:
Exploring Temporal Granularity in Self-Supervised Video Representation Learning. CoRR abs/2112.04480 (2021)
[i142]Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu:
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision. CoRR abs/2112.05181 (2021)
[i141]Xin Li, Qiao Liu, Wenjie Pei, Qiuhong Shen, Yaowei Wang, Huchuan Lu, Ming-Hsuan Yang:
An Informative Tracking Benchmark. CoRR abs/2112.06467 (2021)
[i140]Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown:
Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text. CoRR abs/2112.07074 (2021)- 2020
[j118]Longyin Wen, Dawei Du, Zhaowei Cai, Zhen Lei, Ming-Ching Chang, Honggang Qi, Jongwoo Lim, Ming-Hsuan Yang
, Siwei Lyu:
UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking. Comput. Vis. Image Underst. 193: 102907 (2020)
[j117]Shun Zhang
, Jia-Bin Huang, Jongwoo Lim, Yihong Gong, Jinjun Wang, Narendra Ahuja, Ming-Hsuan Yang
:
Tracking Persons-of-Interest via Unsupervised Representation Adaptation. Int. J. Comput. Vis. 128(1): 96-120 (2020)
[j116]Wenqi Ren, Jinshan Pan, Hua Zhang, Xiaochun Cao, Ming-Hsuan Yang
:
Single Image Dehazing via Multi-scale Convolutional Neural Networks with Holistic Edges. Int. J. Comput. Vis. 128(1): 240-259 (2020)
[j115]Yi-Wen Chen, Yi-Hsuan Tsai, Yen-Yu Lin
, Ming-Hsuan Yang
:
VOSTR: Video Object Segmentation via Transferable Representations. Int. J. Comput. Vis. 128(4): 931-949 (2020)
[j114]Xinyi Zhang
, Hang Dong, Zhe Hu, Wei-Sheng Lai, Fei Wang
, Ming-Hsuan Yang
:
Gated Fusion Network for Degraded Image Super Resolution. Int. J. Comput. Vis. 128(6): 1699-1721 (2020)
[j113]Xiang Wang, Sifei Liu
, Huimin Ma, Ming-Hsuan Yang
:
Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning. Int. J. Comput. Vis. 128(6): 1736-1749 (2020)
[j112]Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, Jan Kautz, Ming-Hsuan Yang
:
Exploiting Semantics for Face Image Deblurring. Int. J. Comput. Vis. 128(7): 1829-1846 (2020)
[j111]Hsin-Ying Lee, Hung-Yu Tseng, Qi Mao, Jia-Bin Huang, Yu-Ding Lu, Maneesh Singh, Ming-Hsuan Yang
:
DRIT++: Diverse Image-to-Image Translation via Disentangled Representations. Int. J. Comput. Vis. 128(10): 2402-2417 (2020)
[j110]Xuhong Li, Yves Grandvalet, Franck Davoine
, Jingchun Cheng, Yin Cui, Han Zhang, Serge J. Belongie
, Yi-Hsuan Tsai, Ming-Hsuan Yang
:
Transfer learning in computer vision tasks: Remember where you come from. Image Vis. Comput. 93: 103853 (2020)
[j109]Dong Li
, Jia-Bin Huang
, Yali Li, Shengjin Wang
, Ming-Hsuan Yang
:
Progressive Representation Adaptation for Weakly Supervised Object Localization. IEEE Trans. Pattern Anal. Mach. Intell. 42(6): 1424-1438 (2020)
[j108]Jufeng Yang
, Jie Liang
, Kai Wang
, Paul L. Rosin
, Ming-Hsuan Yang
:
Subspace Clustering via Good Neighbors. IEEE Trans. Pattern Anal. Mach. Intell. 42(6): 1537-1544 (2020)
[j107]Lerenhan Li
, Yunlong Dong
, Wenqi Ren
, Jinshan Pan
, Changxin Gao
, Nong Sang
, Ming-Hsuan Yang
:
Semi-Supervised Image Dehazing. IEEE Trans. Image Process. 29: 2766-2779 (2020)
[j106]Jingyu Liu
, Wei Wang, Liang Wang, Ming-Hsuan Yang
:
Attribute-Guided Attention for Referring Expression Generation and Comprehension. IEEE Trans. Image Process. 29: 5244-5258 (2020)
[j105]Lerenhan Li
, Jinshan Pan
, Wei-Sheng Lai
, Changxin Gao
, Nong Sang
, Ming-Hsuan Yang
:
Dynamic Scene Deblurring by Depth Guided Model. IEEE Trans. Image Process. 29: 5273-5288 (2020)
[j104]Nian Liu
, Junwei Han
, Ming-Hsuan Yang
:
PiCANet: Pixel-Wise Contextual Attention Learning for Accurate Saliency Detection. IEEE Trans. Image Process. 29: 6438-6451 (2020)
[j103]Xiangyu Xu
, Muchen Li, Wenxiu Sun
, Ming-Hsuan Yang
:
Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising. IEEE Trans. Image Process. 29: 7153-7165 (2020)
[c275]Taihong Xiao, Yi-Hsuan Tsai, Kihyuk Sohn, Manmohan Chandraker, Ming-Hsuan Yang:
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations. AAAI 2020: 12434-12441
[c274]Hung-Yu Tseng, Yi-Wen Chen, Yi-Hsuan Tsai, Sifei Liu
, Yen-Yu Lin
, Ming-Hsuan Yang:
Regularizing Meta-learning via Gradient Dropout. ACCV (4) 2020: 218-234
[c273]Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang:
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization. BMVC 2020
[c272]Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization. BMVC 2020
[c271]Yu-Lun Liu, Wei-Sheng Lai, Yu-Sheng Chen
, Yi-Lung Kao, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline. CVPR 2020: 1648-1657
[c270]Huan Wang, Yijun Li, Yuehai Wang, Haoji Hu, Ming-Hsuan Yang:
Collaborative Distillation for Ultra-Resolution Universal Style Transfer. CVPR 2020: 1857-1866
[c269]Hang Dong, Jinshan Pan, Lei Xiang, Zhe Hu, Xinyi Zhang
, Fei Wang
, Ming-Hsuan Yang:
Multi-Scale Boosted Dehazing Network With Dense Feature Fusion. CVPR 2020: 2154-2164
[c268]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
CycleISP: Real Image Restoration via Improved Data Synthesis. CVPR 2020: 2693-2702
[c267]Debang Li, Junge Zhang, Kaiqi Huang, Ming-Hsuan Yang:
Composing Good Shots by Exploiting Mutual Relations. CVPR 2020: 4212-4221
[c266]Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong:
Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition From a Domain Adaptation Perspective. CVPR 2020: 7607-7616
[c265]Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Weakly-Supervised Semantic Segmentation via Sub-Category Exploration. CVPR 2020: 8988-8997
[c264]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Learning to See Through Obstructions. CVPR 2020: 14203-14212
[c263]Hung-Yu Tseng, Matthew Fisher, Jingwan Lu, Yijun Li, Vladimir G. Kim, Ming-Hsuan Yang:
Modeling Artistic Workflows for Image Generation and Editing. ECCV (18) 2020: 158-174
[c262]Yen-Chi Cheng
, Hsin-Ying Lee, Min Sun, Ming-Hsuan Yang:
Controllable Image Synthesis via SegVAE. ECCV (7) 2020: 159-174
[c261]Chun-Han Yao, Chen Fang, Xiaohui Shen, Yangyue Wan, Ming-Hsuan Yang:
Video Object Detection via Object-Level Temporal Aggregation. ECCV (14) 2020: 160-177
[c260]Hung-Yu Tseng, Hsin-Ying Lee, Lu Jiang, Ming-Hsuan Yang, Weilong Yang:
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval. ECCV (8) 2020: 242-257
[c259]Taihong Xiao
, Jinwei Yuan, Deqing Sun
, Qifei Wang, Xin-Yu Zhang, Kehan Xu, Ming-Hsuan Yang
:
Learnable Cost Volume Using the Cayley Representation. ECCV (9) 2020: 483-499
[c258]Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B. Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang:
Neural Design Network: Graphic Layout Generation with Constraints. ECCV (3) 2020: 491-506
[c257]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat
, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Learning Enriched Features for Real Image Restoration and Enhancement. ECCV (25) 2020: 492-511
[c256]Xueting Li, Sifei Liu
, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz:
Self-supervised Single-View 3D Reconstruction via Semantic Consistency. ECCV (14) 2020: 677-693
[c255]Dawei Du, Longyin Wen, Pengfei Zhu, Heng Fan, Qinghua Hu, Haibin Ling, Mubarak Shah
, Junwen Pan
, Apostolos Axenopoulos
, Arne Schumann, Athanasios Psaltis
, Ayush Jain, Bin Dong, Changlin Li, Chen Chen, Chengzhen Duan, Chongyang Zhang, Daniel Stadler, Dheeraj Reddy Pailla, Dong Yin, Faizan Khan, Fanman Meng, Guangyu Gao, Guosheng Zhang, Hansheng Chen, Hao Zhou, Haonian Xie, Heqian Qiu, Hongliang Li, Ioannis Athanasiadis
, Jincai Cui, Jingkai Zhou, Jong Hwan Ko, Joo Chan Lee
, Jun Yu, Jungyeop Yoo, Lars Wilko Sommer, Lu Xiong, Michael Schleiss, Ming-Hsuan Yang, Mingyu Liu, Minjian Zhang, Murari Mandal
, Petros Daras, Pratik Narang, Qiong Liu, Qiu Shi, Qizhang Lin, Rohit Ramaprasad, Sai Wang, Sarvesh Mehta, Shuai Li, Shuqin Huang, Sungtae Moon, Taijin Zhao, Ting Sun, Wei Guo, Wei Tian, Weida Qin, Weiping Yu, Wenxiang Lin, Xi Zhao, Xiaogang Jia, Xin He, Xingjie Zhao, Xuanxin Liu, Yan Ding, Yan Luo, Yang Xiao, Yi Wang, Yingjie Liu
, Yongwoo Kim, Yu Sun, Yuehan Yao, Yuyao Huang, Zehui Gong, Zhenyu Xu, Zhipeng Luo, Zhiguo Cao, Zhiwei Wei, Zhongjie Fan, Zichen Song, Ziming Liu:
VisDrone-DET2020: The Vision Meets Drone Object Detection in Image Challenge Results. ECCV Workshops (4) 2020: 692-712
[c254]Cheng-Chun Hsu, Yi-Hsuan Tsai, Yen-Yu Lin
, Ming-Hsuan Yang:
Every Pixel Matters: Center-Aware Feature Alignment for Domain Adaptive Object Detector. ECCV (9) 2020: 733-748
[c253]Weitao Wan, Jiansheng Chen, Ming-Hsuan Yang:
Adversarial Training with Bi-directional Likelihood Regularization for Visual Classification. ECCV (24) 2020: 785-800
[c252]Jongbin Ryu, Gitaek Kwon, Ming-Hsuan Yang, Jongwoo Lim:
Generalized Convolutional Forest Networks for Domain Generalization and Visual Recognition. ICLR 2020
[c251]Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Ming-Hsuan Yang:
Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation. ICLR 2020
[c250]Mohammad K. Ebrahimpour, J. Benjamin Falandays, Samuel Spevack, Ming-Hsuan Yang, David C. Noelle:
WW-Nets: Dual Neural Networks for Object Detection. IJCNN 2020: 1-8
[c249]Xueting Li, Sifei Liu, Shalini De Mello, Kihwan Kim, Xiaolong Wang, Ming-Hsuan Yang, Jan Kautz:
Online Adaptation for Consistent Mesh Reconstruction in the Wild. NeurIPS 2020
[c248]Han-Kai Hsu, Chun-Han Yao, Yi-Hsuan Tsai, Wei-Chih Hung, Hung-Yu Tseng, Maneesh Kumar Singh, Ming-Hsuan Yang:
Progressive Domain Adaptation for Object Detection. WACV 2020: 738-746
[c247]Shih-Han Chou, Wei-Lun Chao, Wei-Sheng Lai, Min Sun, Ming-Hsuan Yang:
Visual Question Answering on 360° Images. WACV 2020: 1596-1605
[c246]Weixiang Hong, Yu-Ting Chang, Haifang Qin
, Wei-Chih Hung, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Image Hashing via Linear Discriminant Learning. WACV 2020: 2520-2528
[c245]Wei Gao, Yijun Li, Yihang Yin, Ming-Hsuan Yang:
Fast Video Multi-Style Transfer. WACV 2020: 3211-3219
[i139]Yun-Chun Chen, Yen-Yu Lin, Ming-Hsuan Yang, Jia-Bin Huang:
CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency. CoRR abs/2001.03182 (2020)
[i138]Shih-Han Chou, Wei-Lun Chao, Wei-Sheng Lai, Min Sun, Ming-Hsuan Yang:
Visual Question Answering on 360° Images. CoRR abs/2001.03339 (2020)
[i137]Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, Jan Kautz, Ming-Hsuan Yang:
Deep Semantic Face Deblurring. CoRR abs/2001.06822 (2020)
[i136]Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Ming-Hsuan Yang:
Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation. CoRR abs/2001.08735 (2020)
[i135]Xiang Wang, Sifei Liu, Huimin Ma, Ming-Hsuan Yang:
Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning. CoRR abs/2002.08098 (2020)
[i134]Xin-Yu Zhang, Kai Zhao, Taihong Xiao, Ming-Ming Cheng, Ming-Hsuan Yang:
Model-Agnostic Structured Sparsification with Learnable Channel Shuffle. CoRR abs/2002.08127 (2020)
[i133]Xinyi Zhang, Hang Dong, Zhe Hu, Wei-Sheng Lai, Fei Wang, Ming-Hsuan Yang:
Gated Fusion Network for Degraded Image Super Resolution. CoRR abs/2003.00893 (2020)
[i132]Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz:
Self-supervised Single-view 3D Reconstruction via Semantic Consistency. CoRR abs/2003.06473 (2020)
[i131]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Learning Enriched Features for Real Image Restoration and Enhancement. CoRR abs/2003.06792 (2020)
[i130]Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
CycleISP: Real Image Restoration via Improved Data Synthesis. CoRR abs/2003.07761 (2020)
[i129]Huan Wang, Yijun Li, Yuehai Wang, Haoji Hu, Ming-Hsuan Yang:
Collaborative Distillation for Ultra-Resolution Universal Style Transfer. CoRR abs/2003.08436 (2020)
[i128]Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong:
Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective. CoRR abs/2003.10780 (2020)
[i127]Junyi Feng, Songyuan Li, Xi Li, Fei Wu, Qi Tian, Ming-Hsuan Yang, Haibin Ling:
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge. CoRR abs/2003.13260 (2020)
[i126]Yun-Chun Chen, Po-Hsiang Huang, Li-Yu Yu, Jia-Bin Huang, Ming-Hsuan Yang, Yen-Yu Lin:
Deep Semantic Matching with Foreground Detection and Cycle-Consistency. CoRR abs/2004.00144 (2020)
[i125]Yu-Lun Liu, Wei-Sheng Lai, Yu-Sheng Chen, Yi-Lung Kao, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline. CoRR abs/2004.01179 (2020)
[i124]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Learning to See Through Obstructions. CoRR abs/2004.01180 (2020)
[i123]Hung-Yu Tseng, Yi-Wen Chen, Yi-Hsuan Tsai, Sifei Liu, Yen-Yu Lin, Ming-Hsuan Yang:
Regularizing Meta-Learning via Gradient Dropout. CoRR abs/2004.05859 (2020)
[i122]Hang Dong, Jinshan Pan, Lei Xiang, Zhe Hu, Xinyi Zhang, Fei Wang, Ming-Hsuan Yang:
Multi-Scale Boosted Dehazing Network with Dense Feature Fusion. CoRR abs/2004.13388 (2020)
[i121]Mohammad K. Ebrahimpour, J. Benjamin Falandays, Samuel Spevack, Ming-Hsuan Yang, David C. Noelle:
WW-Nets: Dual Neural Networks for Object Detection. CoRR abs/2005.07787 (2020)
[i120]Mohammad K. Ebrahimpour, Jiayun Li, Yen-Yun Yu, Jackson L. Reese, Azadeh Moghtaderi, Ming-Hsuan Yang, David C. Noelle:
Ventral-Dorsal Neural Networks: Object Detection via Selective Attention. CoRR abs/2005.09727 (2020)
[i119]Xin-Yu Zhang, Hao-Lin Jia, Taihong Xiao, Ming-Ming Cheng, Ming-Hsuan Yang:
Semi-Supervised Learning with Meta-Gradient. CoRR abs/2007.03966 (2020)
[i118]Hung-Yu Tseng, Matthew Fisher, Jingwan Lu, Yijun Li, Vladimir G. Kim, Ming-Hsuan Yang:
Modeling Artistic Workflows for Image Generation and Editing. CoRR abs/2007.07238 (2020)
[i117]Yen-Chi Cheng, Hsin-Ying Lee, Min Sun, Ming-Hsuan Yang:
Controllable Image Synthesis via SegVAE. CoRR abs/2007.08397 (2020)
[i116]Hung-Yu Tseng, Hsin-Ying Lee, Lu Jiang, Ming-Hsuan Yang, Weilong Yang:
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval. CoRR abs/2007.08513 (2020)
[i115]Taihong Xiao, Jinwei Yuan, Deqing Sun, Qifei Wang, Xin-Yu Zhang, Kehan Xu, Ming-Hsuan Yang:
Learnable Cost Volume Using the Cayley Representation. CoRR abs/2007.11431 (2020)
[i114]Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Weakly-Supervised Semantic Segmentation via Sub-category Exploration. CoRR abs/2008.01183 (2020)
[i113]Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung, Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang:
Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization. CoRR abs/2008.01201 (2020)
[i112]Rui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge J. Belongie, Yin Cui:
Spatiotemporal Contrastive Video Representation Learning. CoRR abs/2008.03800 (2020)
[i111]Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang:
Learning to See Through Obstructions with Layered Decomposition. CoRR abs/2008.04902 (2020)
[i110]Wenqing Chu, Wei-Chih Hung, Yi-Hsuan Tsai, Yu-Ting Chang, Yijun Li, Deng Cai, Ming-Hsuan Yang:
Learning to Caricature via Semantic Shape Transform. CoRR abs/2008.05090 (2020)
[i109]Wei-Chih Hung, Henrik Kretzschmar, Tsung-Yi Lin, Yuning Chai, Ruichi Yu, Ming-Hsuan Yang, Dragomir Anguelov:
SoDA: Multi-Object Tracking with Soft Data Association. CoRR abs/2008.07725 (2020)
[i108]Cheng-Chun Hsu, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang:
Every Pixel Matters: Center-aware Feature Alignment for Domain Adaptive Object Detector. CoRR abs/2008.08574 (2020)
[i107]Qifei Wang, Junjie Ke, Joshua Greaves, Grace Chu, Gabriel Bender, Luciano Sbaiz, Alec Go, Andrew Howard, Feng Yang, Ming-Hsuan Yang, Jeff Gilbert, Peyman Milanfar:
Multi-path Neural Networks for On-device Multi-domain Visual Classification. CoRR abs/2010.04904 (2020)
[i106]Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang:
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization. CoRR abs/2010.09211 (2020)
[i105]Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Siwei Ma, Ming-Hsuan Yang:
Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors. CoRR abs/2011.01215 (2020)
[i104]


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID