default search action
Fahad Shahbaz Khan
Person information
- affiliation: Linköping University, Sweden
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j50]Shahina K. Kunhimon, Abdelrahman M. Shaker, Muzammal Naseer, Salman H. Khan, Fahad Shahbaz Khan:
Learnable weight initialization for volumetric medical image segmentation. Artif. Intell. Medicine 151: 102863 (2024) - [j49]Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah:
CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation. Comput. Vis. Image Underst. 238: 103860 (2024) - [j48]Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Dana Dascalescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah:
Lightning fast video anomaly detection via multi-scale adversarial distillation. Comput. Vis. Image Underst. 247: 104074 (2024) - [j47]Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, Jian Yang, Joost van de Weijer:
MineGAN++: Mining Generative Models for Efficient Knowledge Transfer to Limited Data Domains. Int. J. Comput. Vis. 132(2): 490-514 (2024) - [j46]Mohammed Hassanin, Saeed Anwar, Ibrahim Radwan, Fahad Shahbaz Khan, Ajmal Mian:
Visual attention methods in deep learning: An in-depth survey. Inf. Fusion 108: 102417 (2024) - [j45]Long Li, Junwei Han, Nian Liu, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan:
Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 479-496 (2024) - [j44]Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah:
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 525-542 (2024) - [j43]Mustansar Fiaz, Mubashir Noman, Hisham Cholakkal, Rao Muhammad Anwer, Jacob Hanna, Fahad Shahbaz Khan:
Guided-attention and gated-aggregation network for medical image segmentation. Pattern Recognit. 156: 110812 (2024) - [j42]Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Salman H. Khan, Fahad Shahbaz Khan:
ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection. IEEE Trans. Geosci. Remote. Sens. 62: 1-11 (2024) - [j41]Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan:
Remote Sensing Change Detection With Transformers Trained From Scratch. IEEE Trans. Geosci. Remote. Sens. 62: 1-14 (2024) - [j40]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation. IEEE Trans. Medical Imaging 43(9): 3377-3390 (2024) - [j39]Muzammal Naseer, Salman H. Khan, Fatih Porikli, Fahad Shahbaz Khan:
Guidance Through Surrogate: Toward a Generic Diagnostic Attack. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2042-2053 (2024) - [j38]Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan:
Effectiveness assessment of recent large vision-language models. Vis. Intell. 2(1): 17 (2024) - [c171]Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal:
Semi-supervised Open-World Object Detection. AAAI 2024: 4305-4314 - [c170]Sheng Zhang, Muzammal Naseer, Guangyi Chen, Zhiqiang Shen, Salman H. Khan, Kun Zhang, Fahad Shahbaz Khan:
S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment. AAAI 2024: 7278-7286 - [c169]Yang Bai, Xinxing Xu, Yong Liu, Salman Khan, Fahad Shahbaz Khan, Wangmeng Zuo, Rick Siow Mong Goh, Chun-Mei Feng:
Sentence-level Prompts Benefit Composed Image Retrieval. ICLR 2024 - [c168]Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang:
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models. ICLR 2024 - [c167]Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan, Lei Huang:
Modulate Your Spectrum in Self-Supervised Learning. ICLR 2024 - [c166]Yuanwei Liu, Junwei Han, Xiwen Yao, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Nian Liu, Fahad Shahbaz Khan:
Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation. ICML 2024 - [c165]Jean Lahoud, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan:
Long-Tailed 3D Semantic Segmentation with Adaptive Weight Constraint and Sampling. ICRA 2024: 5037-5044 - [c164]Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Language Guided Domain Generalized Medical Image Segmentation. ISBI 2024: 1-5 - [i197]Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding. CoRR abs/2401.00901 (2024) - [i196]Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Shahbaz Khan:
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes. CoRR abs/2401.01164 (2024) - [i195]Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang:
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models. CoRR abs/2402.05375 (2024) - [i194]Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman H. Khan, Timothy Baldwin, Hisham Cholakkal:
BiMediX: Bilingual Medical Mixture of Experts LLM. CoRR abs/2402.13253 (2024) - [i193]Muhammad Maaz, Hanoona Abdul Rasheed, Abdelrahman M. Shaker, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Tim Baldwin, Michael Felsberg, Fahad Shahbaz Khan:
PALO: A Polyglot Large Multimodal Model for 5B People. CoRR abs/2402.14818 (2024) - [i192]Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal:
Semi-supervised Open-World Object Detection. CoRR abs/2402.16013 (2024) - [i191]Omkar Thawakar, Ashmal Vayani, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Tim Baldwin, Eric P. Xing, Fahad Shahbaz Khan:
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT. CoRR abs/2402.16840 (2024) - [i190]Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman Khan:
MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation. CoRR abs/2402.17725 (2024) - [i189]Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan:
Effectiveness Assessment of Recent Large Vision-Language Models. CoRR abs/2403.04306 (2024) - [i188]Hashmat Shadab Malik, Muhammad Huzaifa, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes. CoRR abs/2403.04701 (2024) - [i187]Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan:
Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery. CoRR abs/2403.05419 (2024) - [i186]Yuning Cui, Syed Waqas Zamir, Salman H. Khan, Alois Knoll, Mubarak Shah, Fahad Shahbaz Khan:
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation. CoRR abs/2403.14614 (2024) - [i185]Hasindri Watawana, Kanchana Ranasinghe, Tariq Mahmood, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning. CoRR abs/2403.14616 (2024) - [i184]Ahmad Mahmood, Ashmal Vayani, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding. CoRR abs/2403.14743 (2024) - [i183]Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer, Salman H. Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan:
Composed Video Retrieval via Enriched Context and Discriminative Embeddings. CoRR abs/2403.16997 (2024) - [i182]Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Salman Khan, Fahad Shahbaz Khan:
ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection. CoRR abs/2403.17909 (2024) - [i181]Abdelrahman M. Shaker, Syed Talal Wasim, Martin Danelljan, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Efficient Video Object Segmentation via Modulated Cross-Attention Memory. CoRR abs/2403.17937 (2024) - [i180]Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Language Guided Domain Generalized Medical Image Segmentation. CoRR abs/2404.01272 (2024) - [i179]Akshay Dudhane, Omkar Thawakar, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration. CoRR abs/2404.02154 (2024) - [i178]Shiming Chen, Wenjin Hou, Salman H. Khan, Fahad Shahbaz Khan:
Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning. CoRR abs/2404.07713 (2024) - [i177]Amaya Dharmasiri, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan:
Cross-Modal Self-Training: Aligning Images and Pointclouds to Learn Classification without Labels. CoRR abs/2404.10146 (2024) - [i176]Wenjin Hou, Shiming Chen, Shuhuang Chen, Ziming Hong, Yan Wang, Xuetao Feng, Salman Khan, Fahad Shahbaz Khan, Xinge You:
Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning. CoRR abs/2404.14808 (2024) - [i175]Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Jameel Hassan, Muzammal Naseer, Federico Tombari, Fahad Shahbaz Khan, Salman H. Khan:
How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs. CoRR abs/2405.03690 (2024) - [i174]Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan:
Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging. CoRR abs/2406.00449 (2024) - [i173]Mohamed El Amine Boudjoghra, Angela Dai, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation. CoRR abs/2406.02548 (2024) - [i172]Yuhao Li, Muzammal Naseer, Jiale Cao, Yu Zhu, Jinqiu Sun, Yanning Zhang, Fahad Shahbaz Khan:
Multi-Granularity Language-Guided Multi-Object Tracking. CoRR abs/2406.04844 (2024) - [i171]Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman H. Khan, Fahad Shahbaz Khan:
On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models. CoRR abs/2406.08486 (2024) - [i170]Hashmat Shadab Malik, Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar, Fahad Shahbaz Khan, Salman H. Khan:
Towards Evaluating the Robustness of Visual State Space Models. CoRR abs/2406.09407 (2024) - [i169]Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan:
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding. CoRR abs/2406.09418 (2024) - [i168]Rohit K. Bharadwaj, Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman H. Khan:
VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs. CoRR abs/2406.10326 (2024) - [i167]Akshita Gupta, Aditya Arora, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Graham W. Taylor:
Open-Vocabulary Temporal Action Localization using Multimodal Guidance. CoRR abs/2406.15556 (2024) - [i166]Jin Zhang, Ruiheng Zhang, Yanjiao Shi, Zhe Cao, Nian Liu, Fahad Shahbaz Khan:
Learning Camouflaged Object Detection from Noisy Pseudo Label. CoRR abs/2407.13157 (2024) - [i165]Abdelrahman M. Shaker, Syed Talal Wasim, Salman Khan, Juergen Gall, Fahad Shahbaz Khan:
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model. CoRR abs/2407.13772 (2024) - 2023
- [j37]Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob V. Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah:
SSMTL++: Revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229: 103656 (2023) - [j36]Haotong Qin, Ge-Peng Ji, Salman Khan, Deng-Ping Fan, Fahad Shahbaz Khan, Luc Van Gool:
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges. Mach. Intell. Res. 20(5): 605-613 (2023) - [j35]Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, Radu Tudor Ionescu:
CyTran: A cycle-consistent transformer with multi-level consistency for non-contrast to contrast CT translation. Neurocomputing 538: 126211 (2023) - [j34]Fahad Shamshad, Salman H. Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu:
Transformers in medical imaging: A survey. Medical Image Anal. 88: 102802 (2023) - [j33]Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao:
Learning Enriched Features for Fast Image Restoration and Enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 1934-1948 (2023) - [j32]Jiale Cao, Yanwei Pang, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Ling Shao:
SipMaskv2: Enhanced Fast Image and Video Instance Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 3798-3812 (2023) - [j31]Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Fatih Porikli:
Stylized Adversarial Defense. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6403-6414 (2023) - [j30]Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas:
Visual Object Tracking With Discriminative Filters and Siamese Networks: A Survey and Outlook. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6552-6574 (2023) - [j29]Salman H. Khan, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming-Hsuan Yang, Mubarak Shah:
Guest Editorial Introduction to the Special Section on Transformer Models in Vision. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12721-12725 (2023) - [j28]Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Joost van de Weijer:
Generative Multi-Label Zero-Shot Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14611-14624 (2023) - [j27]Abdulaziz Amer Aleissaee, Amandeep Kumar, Rao Muhammad Anwer, Salman Khan, Hisham Cholakkal, Gui-Song Xia, Fahad Shahbaz Khan:
Transformers in Remote Sensing: A Survey. Remote. Sens. 15(7): 1860 (2023) - [c163]Mamona Awan, Muhammad Haris Khan, Sanoojan Baliah, Muhammad Ahmad Waseem, Salman Khan, Fahad Shahbaz Khan, Arif Mahmood:
Unsupervised Landmark Discovery Using Consistency-Guided Bottleneck. BMVC 2023: 598-600 - [c162]Omkar Thawakar, Alexandre Rivkind, Ehud Ahissar, Fahad Shahbaz Khan:
Fast Video Instance Segmentation via Recurrent Encoder-Based Transformers. CAIP (1) 2023: 262-272 - [c161]Abdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan:
D3Former: Debiased Dual Distilled Transformer for Incremental Learning. CVPR Workshops 2023: 2421-2430 - [c160]Sheng Zhang, Salman H. Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Shahbaz Khan:
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery. CVPR 2023: 3479-3488 - [c159]Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. CVPR 2023: 5703-5712 - [c158]Ankan Kumar Bhunia, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan:
Person Image Synthesis via Denoising Diffusion Model. CVPR 2023: 5968-5976 - [c157]Hanoona Abdul Rasheed, Muhammad Uzair Khattak, Muhammad Maaz, Salman H. Khan, Fahad Shahbaz Khan:
Fine-tuned CLIP Models are Efficient Video Learners. CVPR 2023: 6545-6554 - [c156]Long Li, Junwei Han, Ni Zhang, Nian Liu, Salman H. Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan:
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection. CVPR 2023: 7247-7256 - [c155]Muhammad Akhtar Munir, Muhammad Haris Khan, Salman H. Khan, Fahad Shahbaz Khan:
Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection. CVPR 2023: 11474-11483 - [c154]Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang:
3D-Aware Multi-Class Image-to-Image Translation with NeRFs. CVPR 2023: 12652-12662 - [c153]Muhammad Uzair Khattak, Hanoona Abdul Rasheed, Muhammad Maaz, Salman H. Khan, Fahad Shahbaz Khan:
MaPLe: Multi-modal Prompt Learning. CVPR 2023: 19113-19122 - [c152]Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan:
Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement. CVPR 2023: 22201-22210 - [c151]Syed Talal Wasim, Muzammal Naseer, Salman H. Khan, Fahad Shahbaz Khan, Mubarak Shah:
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting. CVPR 2023: 23034-23044 - [c150]Sahal Shaji Mullappilly, Abdelrahman M. Shaker, Omkar Thawakar, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM. EMNLP (Findings) 2023: 14126-14136 - [c149]Salwa K. Al Khatib, Mohamed El Amine Boudjoghra, Jean Lahoud, Fahad Shahbaz Khan:
3D Instance Segmentation via Enhanced Spatial and Semantic Supervision. ICCV 2023: 541-550 - [c148]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Generative Multiplane Neural Radiance for 3D-Aware Image Generation. ICCV 2023: 7354-7364 - [c147]Syed Talal Wasim, Muhammad Uzair Khattak, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan:
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition. ICCV 2023: 13732-13743 - [c146]Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Self-regulating Prompts: Foundational Model Adaptation without Forgetting. ICCV 2023: 15144-15154 - [c145]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications. ICCV 2023: 17379-17390 - [c144]Nian Liu, Kepan Nan, Wangbo Zhao, Yuanwei Liu, Xiwen Yao, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Junwei Han, Fahad Shahbaz Khan:
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation. ICCV 2023: 18816-18825 - [c143]Muzammal Naseer, Ahmad Mahmood, Salman Khan, Fahad Shahbaz Khan:
Boosting Adversarial Transferability using Dynamic Cues. ICLR 2023 - [c142]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Fahad Shahbaz Khan:
Cross-Modulated Few-Shot Image Generation for Colorectal Tissue Classification. MICCAI (3) 2023: 128-137 - [c141]Asif Hanif, Muzammal Naseer, Salman H. Khan, Mubarak Shah, Fahad Shahbaz Khan:
Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation. MICCAI (2) 2023: 457-467 - [c140]Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, Fahad Shahbaz Khan:
A Spatial-Temporal Deformable Attention Based Framework for Breast Lesion Detection in Videos. MICCAI (2) 2023: 479-488 - [c139]Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, Fahad Shahbaz Khan:
3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers. MICCAI (8) 2023: 613-623 - [c138]Wafa Al Ghallabi, Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan:
Accelerated MRI Reconstruction via Dynamic Deformable Alignment Based Transformer. MLMI@MICCAI (1) 2023: 104-114 - [c137]Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Shahbaz Khan:
Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes. MLMI@MICCAI (2) 2023: 357-366 - [c136]Mohamed El Amine Boudjoghra, Salwa K. Al Khatib, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Fahad Shahbaz Khan:
3D Indoor Instance Segmentation in an Open-World. NeurIPS 2023 - [c135]Muhammad Akhtar Munir, Salman H. Khan, Muhammad Haris Khan, Mohsen Ali, Fahad Shahbaz Khan:
Cal-DETR: Calibrated Detection Transformer. NeurIPS 2023 - [c134]Vaishnav Potlapalli, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan:
PromptIR: Prompting for All-in-One Image Restoration. NeurIPS 2023 - [c133]Jameel Abdul Samadh, Hanan Gani, Noor Hussein, Muhammad Uzair Khattak, Muzammal Naseer, Fahad Shahbaz Khan, Salman H. Khan:
Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization. NeurIPS 2023 - [c132]Dmitry Demidov, Muhammad Hamza Sharif, Aliakbar Abdurahimov, Hisham Cholakkal, Fahad Shahbaz Khan:
Salient Mask-Guided Vision Transformer for Fine-Grained Classification. VISIGRAPP (4: VISAPP) 2023: 27-38 - [c131]Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan:
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution. WACV 2023: 2194-2204 - [c130]Mustansar Fiaz, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan:
SAT: Scale-Augmented Transformer for Person Search. WACV 2023: 4809-4818 - [i164]Muzammal Naseer, Ahmad Mahmood, Salman H. Khan, Fahad Shahbaz Khan:
Boosting Adversarial Transferability using Dynamic Cues. CoRR abs/2302.12252 (2023) - [i163]Zhiqiang Dong, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang:
LEAPS: End-to-End One-Step Person Search With Learnable Proposals. CoRR abs/2303.11859 (2023) - [i162]Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, Fahad Shahbaz Khan:
3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers. CoRR abs/2303.12073 (2023) - [i161]Muhammad Akhtar Munir, Muhammad Haris Khan, Salman H. Khan, Fahad Shahbaz Khan:
Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection. CoRR abs/2303.14404 (2023) - [i160]Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang:
3D-Aware Multi-Class Image-to-Image Translation with NeRFs. CoRR abs/2303.15012 (2023) - [i159]Abdelrahman M. Shaker, Muhammad Maaz, Hanoona Abdul Rasheed, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications. CoRR abs/2303.15446 (2023) - [i158]Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang:
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing. CoRR abs/2303.15649 (2023) - [i157]Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan:
Generative Multiplane Neural Radiance for 3D-Aware Image Generation. CoRR abs/2304.01172 (2023) - [i156]Akshay Dudhane, Syed Waqas Zamir, Salman H. Khan, Fahad Shahbaz Khan, Ming-Hsuan Yang:
Burstormer: Burst Image Restoration and Enhancement Transformer. CoRR abs/2304.01194 (2023) - [i155]