WACV 2021: Waikoloa, HI, USA
- IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021. IEEE 2021, ISBN 978-1-6654-0477-8
Human Applications: Faces, Driving, Etc.
- Hao Chen, Benoit Lagadec, François Brémond:
Enhancing Diversity in Teacher-Student Networks via Asymmetric branches for Unsupervised Person Re-identification. 1-10
- Harsimran Kaur, Roberto Manduchi:
Subject Guided Eye Image Synthesis with Application to Gaze Redirection. 11-20
- Siwei Zhang, Zhiwu Huang, Danda Pani Paudel, Luc Van Gool:
Facial Emotion Recognition with Noisy Multi-task Annotations. 21-31
- Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade:
Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder. 32-40
- Alexander Richard, Colin Lea, Shugao Ma, Juergen Gall, Fernando De la Torre, Yaser Sheikh:
Audio- and Gaze-driven Facial Animation of Codec Avatars. 41-50
- Abdelhak Loukkal, Yves Grandvalet, Tom Drummond, You Li:
Driving among Flatmobiles: Bird-Eye-View occupancy grids from a monocular camera for holistic trajectory planning. 51-60
- Varun Ravi Kumar, Marvin Klingner, Senthil Kumar Yogamani, Stefan Milz, Tim Fingscheidt, Patrick Mäder:
SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized with Semantic Segmentation for Autonomous Driving. 61-71
- Heng Zhang, Élisa Fromont, Sébastien Lefèvre, Bruno Avignon:
Guided Attentive Feature Fusion for Multispectral Pedestrian Detection. 72-80
- Matthew Shere, Hansung Kim, Adrian Hilton:
Temporally Consistent 3D Human Pose Estimation Using Dual 360° Cameras. 81-90
- Okan Köpüklü, Jiapeng Zheng, Hang Xu, Gerhard Rigoll:
Driver Anomaly Detection: A Dataset and Contrastive Learning Approach. 91-100
3D, Domain Adaptation, Video, etc.
- Tobias Ringwald, Rainer Stiefelhagen:
Adaptiope: A Modern Benchmark for Unsupervised Domain Adaptation. 101-110
- Peri Akiva, Matthew Purri, Kristin J. Dana, Beth Tellman, Tyler Anderson:
H2O-Net: Self-Supervised Flood Segmentation via Adversarial Domain Adaptation and Label Refinement. 111-122
- Idan Achituve, Haggai Maron, Gal Chechik:
Self-Supervised Learning for Domain Adaptation on Point Clouds. 123-133
- Zhangsihao Yang, Or Litany, Tolga Birdal, Srinath Sridhar, Leonidas J. Guibas:
Continuous Geodesic Convolutions for Learning on 3D Shapes. 134-144
- Lê Minh Ngô, Wei Wang, Burak Mandira, Sezer Karaoglu, Henri Bouma, Hamdi Dibeklioglu, Theo Gevers:
Identity Unbiased Deception Detection by 2D-to-3D Face Reconstruction. 145-154
- Yang Wang, Gedas Bertasius, Tae-Hyun Oh, Abhinav Gupta, Minh Hoai, Lorenzo Torresani:
Supervoxel Attention Graphs for Long-Range Video Modeling. 155-166
- Xiang Hao, Kripa Chettiar, Ben Cheung, Vernon Germano, Raffay Hamid:
Intro and Recap Detection for Movies and TV Series. 167-176
- Rob Romijnders, Aravindh Mahendran, Michael Tschannen, Josip Djolonga, Marvin Ritter, Neil Houlsby, Mario Lucic:
Representation learning from videos in-the-wild: An object-centric approach. 177-187
- Gil Ben-Artzi:
Separable Four Points Fundamental Matrix. 188-196
- René Schuster, Oliver Wasenmüller, Christian Unger, Didier Stricker:
SSGP: Sparse Spatial Guided Propagation for Robust and Generic Interpolation. 197-206
Synthesis, Reconstruction, Recognition, Learning
- Jacob Shermeyer, Thomas Hossler, Adam Van Etten, Daniel Hogan, Ryan Lewis, Daeil Kim:
RarePlanes: Synthetic Data Takes Flight. 207-217
- Abhijith Punnappurath, Michael S. Brown:
Spatially Aware Metadata for Raw Reconstruction. 218-226
- Yash Patel, Srikar Appalaraju, R. Manmatha:
Saliency Driven Perceptual Image Compression. 227-236
- Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang:
Text-to-Image Generation Grounded by Fine-Grained User Attention. 237-246
- René Schuster, Christian Unger, Didier Stricker:
A Deep Temporal Fusion Framework for Scene Flow Using a Learnable Motion Model and Occlusions. 247-255
- David Peer, Sebastian Stabinger, Antonio Jose Rodríguez-Sánchez:
Conflicting Bundles: Adapting Architectures Towards the Improved Training of Deep Neural Networks. 256-265
- Jinglun Feng, Liang Yang, Haiyan Wang, Yingli Tian, Jizhong Xiao:
Subsurface Pipes Detection Using DNN-based Back Projection on GPR Data. 266-275
- Daniel Stanley Tan, Yi-Chun Chen, Trista Pei-Chun Chen, Wei-Chao Chen:
TrustMAE: A Noise-Resilient Defect Classification Framework using Memory-Augmented Auto-Encoders with Trust Regions. 276-285
- Dvir Samuel, Yuval Atzmon, Gal Chechik:
From generalized zero-shot learning to long-tail with class descriptors. 286-295
- Zeqian Li, Michael Mozer, Jacob Whitehill:
Compositional Embeddings for Multi-Label One-Shot Learning. 296-304
Segmentation, Image Manipulation, Image Processing
- Jun Hao Liew, Scott Cohen, Brian L. Price, Long Mai, Jiashi Feng:
Deep Interactive Thin Object Selection. 305-314
- Kratarth Goel, Praveen Srinivasan, Sarah Tariq, James Philbin:
QuadroNet: Multi-Task Learning for Real-Time Semantic Depth Aware Instance Segmentation. 315-324
- Tianyu Ma, Hang Zhang, Hanley Ong, Amar Vora, Thanh D. Nguyen, Ajay Gupta, Yi Wang, Mert R. Sabuncu:
Ensembling Low Precision Models for Binary Biomedical Image Segmentation. 325-334
- Anqi Yang, Feng Pan, Vishwanath Saragadam, Duy Dao, Zhuo Hui, Jen-Hao Rick Chang, Aswin C. Sankaranarayanan:
SliceNets - A Scalable Approach for Object Detection in 3D CT Scans. 335-344
- Zichen Liu, Jun Hao Liew, Xiangyu Chen, Jiashi Feng:
DANCE : A Deep Attentive Contour Model for Efficient Instance Segmentation. 345-354
- Weimin Chen, Yuqing Ma, Xianglong Liu, Yi Yuan:
Hierarchical Generative Adversarial Networks for Single Image Super-Resolution. 355-364
- He Zhang, Jianming Zhang, Federico Perazzi, Zhe Lin, Vishal M. Patel:
Deep Image Compositing. 365-374
- Myung-Joon Kwon, In-Jae Yu, Seung-Hun Nam, Heung-Kyu Lee:
CAT-Net: Compression Artifact Tracing Network for Detection and Localization of Image Splicing. 375-384
- Chang Liu, Henghui Ding, Xudong Jiang:
Towards Enhancing Fine-grained Details for Image Matting. 385-393
- Mahdiar Nekoui, Fidel Omar Tito Cruz, Li Cheng:
EAGLE-Eye: Extreme-pose Action Grader using detaiL bird's-Eye view. 394-402
- Joshua D. Rego, Karthik Kulkarni, Suren Jayasuriya:
Robust Lensless Image Reconstruction via PSF Estimation. 403-412
- Aditya Mehta, Harsh Sinha, Murari Mandal, Pratik Narang:
Domain-Aware Unsupervised Hyperspectral Reconstruction for Aerial Image Dehazing. 413-422
- Thomas Hartley, Kirill A. Sidorov, Christopher Willis, A. David Marshall:
SWAG: Superpixels Weighted by Average Gradients for Explanations of CNNs. 423-432
- Chenhao Li, Yuta Taniguchi, Min Lu, Shin'ichi Konomi:
Few-shot Font Style Transfer between Different Languages. 433-442
- Tunai Porto Marques, Alexandra Branzan Albu, Patrick O'Hara, Norma Serra, Ben Morrow, Lauren McWhinnie, Rosaline Canessa:
Size-invariant Detection of Marine Vessels From Visual Time Series. 443-453
Domain Adaptation, Saliency, Segmentation, Captioning, Tracking, Image Processing
- Tongxin Wang, Zhengming Ding, Wei Shao, Haixu Tang, Kun Huang:
Towards Fair Cross-Domain Adaptation via Generative Learning. 454-463
- Pengfei Fang, Pan Ji, Lars Petersson, Mehrtash Harandi:
Set Augmented Triplet Loss for Video Person Re-Identification. 464-473
- Hao-Wei Yeh, Baoyao Yang, Pong C. Yuen, Tatsuya Harada:
SoFA: Source-data-free Feature Alignment for Unsupervised Domain Adaptation. 474-483
- Yifeng Zhang, Ming Jiang, Qi Zhao:
Saliency Prediction with External Knowledge. 484-493
- Philipp Benz, Chaoning Zhang, Adil Karjauv, In So Kweon:
Revisiting Batch Normalization for Improving Corruption Robustness. 494-503
- Yizhou Wang, Zhongyu Jiang, Xiangyu Gao, Jenq-Neng Hwang, Guanbin Xing, Hui Liu:
RODNet: Radar Object Detection using Cross-Modal Supervision. 504-513
- Jinyu Yang, Weizhi An, Chaochao Yan, Peilin Zhao, Junzhou Huang:
Context-Aware Domain Adaptation in Semantic Segmentation. 514-524
- Haochen Wang, Yandan Yang, Xianbin Cao, Xiantong Zhen, Cees Snoek, Ling Shao:
Variational Prototype Inference for Few-Shot Semantic Segmentation. 525-534
- Laura Sevilla-Lara, Shengxin Zha, Zhicheng Yan, Vedanuj Goswami, Matt Feiszli, Lorenzo Torresani:
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling. 535-544
- Xianyu Chen, Ming Jiang, Qi Zhao:
Self-Distillation for Few-Shot Image Captioning. 545-555
- Camilo Pestana, Wei Liu, David G. Glance, Ajmal Mian:
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty. 556-565
- Heng Fan, Haibin Ling:
MART: Motion-Aware Recurrent Neural Network for Robust Visual Tracking. 566-575
- Badri N. Patro, Mayank Lunayach, Deepankar Srivastava, Sarvesh, Hunar Singh, Vinay P. Namboodiri:
Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms. 576-585
- Ge Liu, Linglan Zhao, Wei Li, Dashan Guo, Xiangzhong Fang:
Class-wise Metric Scaling for Improved Few-Shot Classification. 586-595
- Jinsoo Choi, Jaesik Park, In So Kweon:
High-quality Frame Interpolation via Tridirectional Inference. 596-604
Domain Adaptation, Representation, Visual Analytics, Uncertainty and Attention
- Taotao Jing, Zhengming Ding:
Adversarial Dual Distinct Classifiers for Unsupervised Domain Adaptation. 605-614
- Vinod K. Kurmi, Venkatesh K. Subramanian, Vinay P. Namboodiri:
Domain Impression: A Source Data Free Domain Adaptation Method. 615-625
- Yandan Yang, Lu Sheng, Xiaolong Jiang, Haochen Wang, Dong Xu, Xianbin Cao:
IncreACO: Incrementally Learned Automatic Check-out with Photorealistic Exemplar Augmentation. 626-634
- Youshan Zhang, Hui Ye, Brian D. Davison:
Adversarial Reinforcement Learning for Unsupervised Domain Adaptation. 635-644
- Or Litany, Ari Morcos, Srinath Sridhar, Leonidas J. Guibas, Judy Hoffman:
Representation Learning Through Latent Canonicalizations. 645-654
- Wenhu Chen, Zhe Gan, Linjie Li, Yu Cheng, William Yang Wang, Jingjing Liu:
Meta Module Network for Compositional Visual Reasoning. 655-664
- Jianan Wang, Boyang Li, Xiangyu Fan, Jing Lin, Yanwei Fu:
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions. 665-675
- Olga Moskvyak, Frédéric Maire, Feras Dayoub, Mahsa Baktashmotlagh:
Keypoint-Aligned Embeddings for Image Retrieval and Re-identification. 676-685
- Hao Guo, Brian Dolhansky, Eric Hsin, Phong Dinh, Cristian Canton-Ferrer, Song Wang:
Deep Poisoning: Towards Robust Image Data Sharing against Visual Disclosure. 686-696
- Xinyi Zheng, Douglas Burdick, Lucian Popa, Xu Zhong, Nancy Xin Ru Wang:
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context. 697-706
- Yichen Shen, Zhilu Zhang, Mert R. Sabuncu, Lin Sun:
Real-Time Uncertainty Estimation in Computer Vision via Uncertainty-Aware Distribution Distillation. 707-716
- Saurabh Satish Desai, Stefan Lee:
Auxiliary Tasks for Efficient Learning of Point-Goal Navigation. 717-725
- Badri N. Patro, G. S. Kasturi, Ansh Jain, Vinay P. Namboodiri:
Self Supervision for Attention Networks. 726-735
- Vinod K. Kurmi, Badri N. Patro, Venkatesh K. Subramanian, Vinay P. Namboodiri:
Do not Forget to Attend to Uncertainty while Mitigating Catastrophic Forgetting. 736-745
- Jeya Maria Jose Valanarasu, Vishal M. Patel:
Overcomplete Deep Subspace Clustering Networks. 746-755
Rectification and Tracking, 3D and Action, Motion and Tracking
- Sijie Zhu, Taojiannan Yang, Chen Chen:
Revisiting Street-to-Aerial View Image Geo-localization and Orientation Estimation. 756-765
- Michal Uricár, Ganesh Sistu, Hazem Rashed, Antonín Vobecký, Varun Ravi Kumar, Pavel Krízek, Fabian Bürger, Senthil Kumar Yogamani:
Let's Get Dirty: GAN Based Data Augmentation for Camera Lens Soiling Detection in Autonomous Driving. 766-775
- Luis Bermudez, Nadine L. Dabby, Yingxi Adelle Lin, Sara Hilmarsdottir, Narayan Sundararajan, Swarnendu Kar:
A Learning-Based Approach to Parametric Rotoscoping of Multi-Shape Systems. 776-785
- Pranav Verma, Dominique E. Meyer, Hanyang Xu, Falko Kuester:
Splatty- A Unified Image Demosaicing and Rectification Method. 786-795
- Hung Tran, Vuong Le, Truyen Tran:
Goal-driven Long-Term Trajectory Prediction. 796-805
- Rodrigo Santa Cruz, Léo Lebrat, Pierrick Bourgeat, Clinton Fookes, Jurgen Fripp, Olivier Salvado:
DeepCSR: A 3D Deep Learning Approach for Cortical Surface Reconstruction. 806-815
- Yu Lin, Yigong Wang, Yifan Li, Yang Gao, Zhuoyi Wang, Latifur Khan:
Attention-Based Spatial Guidance for Image-to-Image Translation. 816-825
- Chenxi Xiao, Juan P. Wachs:
Triangle-Net: Towards Robustness in Point Cloud Learning. 826-835
- Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Yen-Yu Lin, Xiaohui Xie:
MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation. 836-845
- Yizhak Ben-Shabat, Xin Yu, Fatemeh Sadat Saleh, Dylan Campbell, Cristian Rodriguez Opazo, Hongdong Li, Stephen Gould:
The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose. 846-858
- Kakani Katija, Paul L. D. Roberts, Joost Daniels, Alexandra Lapides, Kevin Barnard, Mike Risi, Ben Y. Ranaan, Benjamin G. Woodward, Jonathan Takahashi:
Visual tracking of deepwater animals using machine learning-controlled robotic underwater vehicles. 859-868
- Shuo-Diao Yang, Hung-Ting Su, Winston H. Hsu, Wen-Chin Chen:
Class-agnostic Few-shot Object Counting. 869-877
- Neeraj Battan, Yudhik Agrawal, Sai Soorya Rao, Aman Goel, Avinash Sharma:
GlocalNet: Class-aware Long-term Human Motion Synthesis. 878-887
Detection and Recognition, Segmentation and Tracking, Low-Level Vision
- Kai Yang, Zihao Xu, Jingjing Fei:
DualSANet: Dual Spatial Attention Network for Iris Recognition. 888-896
- Jongmin Lee, Yoonwoo Jeong, Seungwook Kim, Juhong Min, Minsu Cho:
Learning to Distill Convolutional Features into Compact Local Descriptors. 897-907
- Yanguang Bi, Zhiqiang Hu:
Disentangled Contour Learning for Quadrilateral Text Detection. 908-917
- Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Premkumar Natarajan:
Class-agnostic Object Detection. 918-927
- Myungchul Kim, Sanghyun Woo, Dahun Kim, In So Kweon:
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation. 928-937
- Hao Tang, Xingwei Liu, Kun Han, Xiaohui Xie, Xuming Chen, Qian Huang, Yong Liu, Shanlin Sun, Narisu Bai:
Spatial Context-Aware Self-Attention Model For Multi-Organ Segmentation. 938-948
- Yimian Dai, Yiquan Wu, Fei Zhou, Kobus Barnard:
Asymmetric Contextual Modulation for Infrared Small Target Detection. 949-958
- Wei He, Meiqing Wu, Mingfu Liang, Siew-Kei Lam:
CAP: Context-Aware Pruning for Semantic Segmentation. 959-968
- Heng Fan, Fan Yang, Peng Chu, Yuewei Lin, Lin Yuan, Haibin Ling:
TracKlinic: Diagnosis of Challenge Factors in Visual Tracking. 969-978
- Mehrdad Hosseinzadeh, Yang Wang:
Video Captioning of Future Frames. 979-988
- Alireza Shafaei, James J. Little, Mark Schmidt:
AutoRetouch: Automatic Professional Face Retouching. 989-997
- Satish Kumar, A. S. M. Iftekhar, Michael Goebel, Tom Bullock, Mary H. MacLean, Michael B. Miller, Tyler Santander, Barry Giesbrecht, Scott T. Grafton, B. S. Manjunath:
StressNet: Detecting Stress in Thermal Videos. 998-1008
- Sadbhavana Babar, Sukhendu Das:
Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization. 1009-1018
- Jaedong Hwang, Seohyun Kim, Jeany Son, Bohyung Han:
Weakly Supervised Instance Segmentation by Deep Community Learning. 1019-1028
- Kangning Liu, Shuhang Gu, Andrés Romero, Radu Timofte:
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning. 1029-1039
3D, Video Processing, Detection and Recognition
- Arwen Bradley, Jason Klivington, Joseph Triscari, Rudolph van der Merwe:
Cinematic-L1 Video Stabilization with a Log-Homography Model. 1040-1048
- Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Yen-Yu Lin, Xiaohui Xie:
Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos. 1049-1058
- Kellie Corona, Katie Osterdahl, Roderic Collins, Anthony Hoogs:
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection. 1059-1067
- Kyle Min, Jason J. Corso:
Integrating Human Gaze into Attention for Egocentric Activity Recognition. 1068-1077
- Cristian Rodriguez Opazo, Edison Marrese-Taylor, Basura Fernando, Hongdong Li, Stephen Gould:
DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video. 1078-1087
- Xide Xia, Tianfan Xue, Wei-Sheng Lai, Zheng Sun, Abby Chang, Brian Kulis, Jiawen Chen:
Real-time Localized Photorealistic Video Style Transfer. 1088-1097
- Simon Niklaus, Long Mai, Oliver Wang:
Revisiting Adaptive Convolutions for Video Frame Interpolation. 1098-1108
- Longlong Jing, Toufiq Parag, Zhe Wu, Yingli Tian, Hongcheng Wang:
VideoSSL: Semi-Supervised Learning for Video Classification. 1109-1118
- Zhenqiang Li, Weimin Wang, Zuoyue Li, Yifei Huang, Yoichi Sato:
Towards Visually Explaining Video Understanding Networks with Perturbation. 1119-1128
- Shaojie Wang, Wentian Zhao, Ziyi Kou, Jing Shi, Chenliang Xu:
How to Make a BLT Sandwich? Learning VQA towards Understanding Web Instructional Videos. 1129-1138
- Muhammad Umer Anwaar, Egor Labintcev, Martin Kleinsteuber:
Compositional Learning of Image-Text Query for Image Retrieval. 1139-1148
- Ahmed-Shehab Khan, Zhiyuan Li, Jie Cai, Yan Tong:
Regional Attention Networks with Context-aware Fusion for Group Emotion Recognition. 1149-1158
- Yuqi Gong, Xuehui Yu, Yao Ding, Xiaoke Peng, Jian Zhao, Zhenjun Han:
Effective Fusion Factor in FPN for Tiny Object Detection. 1159-1167