18th ECCV 2024: Milan, Italy - Part XVIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XVIII. Lecture Notes in Computer Science 15076, Springer 2025, ISBN 978-3-031-72648-4
- Cheng Shi, Yulin Zhang, Bin Yang, Jiajin Tang, Yuexin Ma, Sibei Yang:
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation. 1-18
- Risa Shinoda, Kaede Shiohara:
PetFace: A Large-Scale Dataset and Benchmark for Animal Identification. 19-36
- Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu:
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. 37-53
- Davide Cozzolino, Giovanni Poggi, Matthias Nießner, Luisa Verdoliva:
Zero-Shot Detection of AI-Generated Images. 54-72
- Kecheng Zheng, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen:
DreamLIP: Language-Image Pre-training with Long Captions. 73-90
- Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu:
GKGNet: Group K-Nearest Neighbor Based Graph Convolutional Network for Multi-label Image Recognition. 91-107
- Xinyu Xu, Shengcheng Luo, Yanchao Yang, Yong-Lu Li, Cewu Lu:
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-Level Control. 108-125
- Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo:
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception. 126-146
- Jiaqi Xu, Mengyang Wu, Xiaohu You, Chi-Wing Fu, Qi Dou, Pheng-Ann Heng:
Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models. 147-164
- Yifan Li, Anh Dao, Wentao Bao, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong:
Facial Affective Behavior Analysis with Instruction Tuning. 165-186
- Xiaoyi Bao, Siyang Sun, Shuailei Ma, Kecheng Zheng, Yuxin Guo, Guosheng Zhao, Yun Zheng, Xingang Wang:
CoReS: Orchestrating the Dance of Reasoning and Segmentation. 187-204
- Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing. 205-221
- Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia:
MambaIR: A Simple Baseline for Image Restoration with State-Space Model. 222-241
- Ishan Khatri, Kyle Vedder, Neehar Peri, Deva Ramanan, James Hays:
I Can't Believe It's Not Scene Flow! 242-257
- Zhonghang Liu, Panzhong Lu, Guoyang Xie, Zhichao Lu, Wen-Yan Lin:
Rethinking Unsupervised Outlier Detection via Multiple Thresholding. 258-275
- Bowen Zhang, Tianyu Yang, Yu Li, Lei Zhang, Xi Zhao:
Compress3D: A Compressed Latent Space for 3D Generation from a Single Image. 276-292
- Nhat Le, Khoa Do, Xuan Bui, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen:
Scalable Group Choreography via Variational Phase Manifold Learning. 293-311
- Mingfang Zhang, Yifei Huang, Ruicong Liu, Yoichi Sato:
Masked Video and Body-Worn IMU Autoencoder for Egocentric Action Recognition. 312-330
- Jian Ma, Wenguan Wang, Yi Yang, Feng Zheng:
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-Driven Diffusion. 331-349
- Huankang Guan, Rynson W. H. Lau:
PoseSOR: Human Pose Can Guide Our Attention. 350-366
- Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. 367-384
- Minjung Kim, Hyung Suk Lim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim:
Bi-directional Contextual Attention for 3D Dense Captioning. 385-401
- Peng Xiao, Yi Xie, Xuemiao Xu, Weihong Chen, Huaidong Zhang:
Multi-person Pose Forecasting with Individual Interaction Perceptron and Prior Learning. 402-419
- Fangcen Liu, Chenqiang Gao, Yaming Zhang, Junjie Guo, Jinghao Wang, Deyu Meng:
InfMAE: A Foundation Model in the Infrared Modality. 420-437
- Bin-Shih Wu, Hong-En Chen, Sheng-Yu Huang, Yu-Chiang Frank Wang:
TPA3D: Triplane Attention for Fast Text-to-3D Generation. 438-455
- Jiangming Shi, Xiangbo Yin, Yeyun Chen, Yachao Zhang, Zhizhong Zhang, Yuan Xie, Yanyun Qu:
Multi-memory Matching for Unsupervised Visible-Infrared Person Re-identification. 456-474
- Xi Chen, Zhiheng Liu, Mengting Chen, Yutong Feng, Yu Liu, Yujun Shen, Hengshuang Zhao:
LivePhoto: Real Image Animation with Text-Guided Motion Control. 475-491