


default search action
18th ECCV 2024: Milan, Italy - Part XV
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XV. Lecture Notes in Computer Science 15073, Springer 2025, ISBN 978-3-031-72632-3 - Yinghao Xu, Zifan Shi, Yifan Wang, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein:
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation. 1-20 - Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Jingdong Wang, Baining Guo:
IRGen: Generative Modeling for Image Retrieval. 21-41 - Kyu Ri Park
, Hong Joo Lee
, Jung Uk Kim
:
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality. 42-59 - Florian Langer, Jihong Ju, Georgi Dikov, Gerhard Reitmayr, Mohsen Ghafoorian:
FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos. 60-77 - Wouter Van Gansbeke, Bert De Brabandere:
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting. 78-97 - Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang, Weidi Xie, Efstratios Gavves:
VISA: Reasoning Video Object Segmentation via Large Language Models. 98-115 - Saman Motamed
, Danda Pani Paudel
, Luc Van Gool
:
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models. 116-133 - Yuanhao Zhai
, Kevin Lin
, Linjie Li
, Chung-Ching Lin
, Jianfeng Wang
, Zhengyuan Yang
, David S. Doermann
, Junsong Yuan
, Zicheng Liu
, Lijuan Wang
:
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation. 134-152 - Ryo Nakamura
, Ryu Tadokoro
, Ryosuke Yamada
, Yuki M. Asano
, Iro Laina
, Christian Rupprecht
, Nakamasa Inoue
, Rio Yokota
, Hirokatsu Kataoka
:
Scaling Backwards: Minimal Synthetic Pre-Training? 153-171 - Ekkasit Pinyoanuntapong
, Muhammad Usama Saleem
, Pu Wang
, Minwoo Lee
, Srijan Das
, Chen Chen
:
BAMM: Bidirectional Autoregressive Motion Model. 172-190 - Jiahui Yuan, Hebei Li, Yansong Peng, Jin Wang, Yuheng Jiang, Yueyi Zhang, Xiaoyan Sun:
Event-Based Head Pose Estimation: Benchmark and Method. 191-208 - Ekta Prashnani, Koki Nagano, Shalini De Mello, David Luebke, Orazio Gallo:
Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos. 209-228 - Guangyu Sun
, Matías Mendieta
, Aritra Dutta
, Xin Li
, Chen Chen
:
Towards Multi-modal Transformers in Federated Learning. 229-246 - Wenke Huang
, Mang Ye
, Zekun Shi
, Bo Du
, Dacheng Tao
:
Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning. 247-265 - Pengbo Guo
, Chengxu Liu
, Xingsong Hou
, Xueming Qian
:
QueryCDR: Query-Based Controllable Distortion Rectification Network for Fisheye Images. 266-284 - Shishira R. Maiya
, Anubhav Gupta
, Matthew Gwilliam
, Max Ehrlich
, Abhinav Shrivastava
:
Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics. 285-302 - Shrey Singh
, Prateek Keserwani
, Masakazu Iwamura
, Partha Pratim Roy
:
DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution. 303-320 - Jeongmin Bae
, Seoha Kim
, Youngsik Yun
, Hahyun Lee
, Gun Bang
, Youngjung Uh
:
Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting. 321-335 - Liao Shen
, Tianqi Liu
, Huiqiang Sun
, Xinyi Ye
, Baopu Li
, Jianming Zhang
, Zhiguo Cao
:
DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion. 336-353 - Shuang Hao
, Chunlin Zhong
, He Tang
:
CoLA: Conditional Dropout and Language-Driven Robust Dual-Modal Salient Object Detection. 354-371 - Zhiyu Wu, Jinshi Cui:
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-supervised Learning. 372-388 - Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng:
RPBG: Towards Robust Neural Point-Based Graphics in the Wild. 389-406 - Jiahao Chang, Yinglin Xu, Yihao Li, Yuantao Chen, Wensen Feng, Xiaoguang Han:
GaussReg: Fast 3D Registration with Gaussian Splatting. 407-423 - Yifan Pu
, Zhuofan Xia, Jiayi Guo
, Dongchen Han, Qixiu Li
, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang, Li Xiu:
Efficient Diffusion Transformer with Step-Wise Dynamic Attention Mediators. 424-441 - Pengfei Wang
, Yuxi Wang
, Shuai Li
, Zhaoxiang Zhang, Zhen Lei, Lei Zhang
:
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation. 442-460 - Kihwan Yoon
, Yong Han Kim
, Sungjei Kim
, Jinwoo Jeong
:
IAM-VFI : Interpolate Any Motion for Video Frame Interpolation with Motion Complexity Map. 461-477 - Siyi Du
, Shaoming Zheng
, Yinsong Wang
, Wenjia Bai
, Declan P. O'Regan
, Chen Qin
:
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data. 478-496

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.