


default search action
18th ECCV 2024: Milan, Italy - Part XLIII
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLIII. Lecture Notes in Computer Science 15101, Springer 2025, ISBN 978-3-031-72774-0 - Jiaxin Ge, Sanjay Subramanian, Baifeng Shi, Roei Herzig, Trevor Darrell:
Recursive Visual Programming. 1-18 - Hao Zhang, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Leizhang, Chunyuan Li, Jainwei Yang:
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models. 19-35 - Hunmin Yang
, Jongoh Jeong
, Kuk-Jin Yoon
:
Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks. 36-53 - Xidong Peng, Runnan Chen, Feng Qiao, Lingdong Kong, Youquan Liu, Yujing Sun, Tai Wang, Xinge Zhu
, Yuexin Ma:
Learning to Adapt SAM for Segmenting Cross-Domain Point Clouds. 54-71 - In Cho
, Hyunbo Shim
, Seon Joo Kim
:
Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging. 72-89 - Jinke Li
, Xiao He
, Chonghua Zhou, Xiaoqiang Cheng
, Yang Wen, Dan Zhang:
ViewFormer: Exploring Spatiotemporal Modeling for Multi-view 3D Occupancy Perception via View-Guided Transformers. 90-106 - Ziwei Zheng
, Lijun He
, Le Yang
, Fan Li
:
Fine-Grained Dynamic Network for Generic Event Boundary Detection. 107-123 - Mingyu Zhang
, Jiting Cai
, Mingyu Liu
, Yue Xu
, Cewu Lu
, Yong-Lu Li
:
Take a Step Back: Rethinking the Two Stages in Visual Reasoning. 124-141 - Jiannan Ge, Lingxi Xie, Hongtao Xie, Pandeng Li, Xiaopeng Zhang, Yong-Dong Zhang, Qi Tian:
AlignZeg: Mitigating Objective Misalignment for Zero-Shot Semantic Segmentation. 142-161 - Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang
:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. 162-180 - Weilong Chai
, Dandan Zheng
, Jiajiong Cao
, Zhiquan Chen
, Changbao Wang
, Chenguang Ma
:
SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models. 181-196 - Jiakang Yuan
, Bo Zhang
, Kaixiong Gong, Xiangyu Yue, Botian Shi, Yu Qiao
, Tao Chen
:
Reg-TTA3D: Better Regression Makes Better Test-Time Adaptive 3D Object Detection. 197-213 - Zekun Qi
, Runpei Dong
, Shaochen Zhang
, Haoran Geng
, Chunrui Han
, Zheng Ge
, Li Yi
, Kaisheng Ma
:
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction. 214-238 - Weihang Liu
, Xue Xian Zheng
, Jingyi Yu
, Xin Lou
:
Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization. 239-256 - Alberto Hojel, Yutong Bai, Trevor Darrell, Amir Globerson, Amir Bar:
Finding Visual Task Vectors. 257-273 - Zongrui Li
, Minghui Hu
, Qian Zheng
, Xudong Jiang
:
Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation. 274-291 - Yan Yang, Liyuan Pan, Liu Liu:
Event Camera Data Dense Pre-training. 292-310 - Yunbin Tu, Liang Li, Li Su, Chenggang Yan, Qingming Huang:
Distractors-Immune Representation Learning with Cross-Modal Contrastive Regularization for Change Captioning. 311-328 - Rui Qian
, Shuangrui Ding
, Dahua Lin
:
Rethinking Image-to-Video Adaptation: An Object-Centric Perspective. 329-348 - Seitaro Otsuki
, Tsumugi Iida, Félix Doublet, Tsubasa Hirakawa
, Takayoshi Yamashita
, Hironobu Fujiyoshi
, Komei Sugiura
:
Layer-Wise Relevance Propagation with Conservation Property for ResNet. 349-364 - Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen
:
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism. 365-381 - Qiao Gu
, Zhaoyang Lv
, Duncan P. Frost
, Simon Green
, Julian Straub
, Chris Sweeney
:
EgoLifter: Open-World 3D Segmentation for Egocentric Perception. 382-400 - Gyeongrok Oh
, Jaehwan Jeong
, Sieun Kim
, Wonmin Byeon
, Jinkyu Kim
, Sungwoong Kim
, Sangpil Kim
:
MEVG: Multi-event Video Generation with Text-to-Video Models. 401-418 - Haobo Yuan
, Xiangtai Li
, Chong Zhou
, Yining Li
, Kai Chen
, Chen Change Loy
:
Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively. 419-437 - Ahmad Sajedi
, Samir Khaki
, Lucy Z. Liu, Ehsan Amjadian
, Yuri A. Lawryshyn, Konstantinos N. Plataniotis
:
Data-to-Model Distillation: Data-Efficient Learning Framework. 438-457 - Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang:
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays. 458-476 - Yuxi Li
, Fuyuan Cheng
, Wangbo Yu
, Guangshuo Wang
, Guibo Luo
, Yuesheng Zhu
:
AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-Aware Transformer Network. 477-493

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.