Search dblp
Full-text search
- case-insensitive prefix search: default
  e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
  e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
  e.g., codd model
- boolean or: connect words by pipe symbol (|)
  e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search results, and search terms preceded by a minus sign will be interpreted as regular (positive) search terms.
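The query rules above can be illustrated with a small Python sketch. This is only a toy model of the described matching behavior, not dblp's actual implementation: the function names, the tokenization on non-alphanumeric characters, and the assumption that the pipe binds more tightly than the space are all hypothetical. The disabled phrase (.) and not (-) operators are not modeled.

```python
import re


def tokenize(text):
    """Split text into lowercase words (assumed tokenization, not dblp's)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def term_matches(term, words):
    """Match one query term against a list of words.

    A trailing '$' requests an exact word match; otherwise the term
    is treated as a case-insensitive prefix.
    """
    term = term.lower()
    if term.endswith("$"):
        return term[:-1] in words
    return any(word.startswith(term) for word in words)


def query_matches(query, text):
    """Return True if text satisfies the query.

    Space-separated groups are combined with AND; alternatives joined
    by '|' inside a group are combined with OR (assumed precedence).
    """
    words = tokenize(text)
    return all(
        any(term_matches(alt, words) for alt in group.split("|") if alt)
        for group in query.split()
    )
```

For example, under these assumptions `query_matches("graph$", "Computer Graphics")` is false because "graphics" is not the exact word "graph", while `query_matches("graph|network", "Neural networks")` is true because "network" is a prefix of "networks".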
Author search results
no matches
Venue search results
no matches
Publication search results
found 445 matches
- 2024
- Riccardo Ricci, Yakoub Bazi, Farid Melgani:
  Machine-to-Machine Visual Dialoguing with ChatGPT for Enriched Textual Image Description. Remote. Sens. 16(3): 441 (2024)
- Zhili Feng, Anna Bair, J. Zico Kolter:
  Text Descriptions are Compressive and Invariant Representations for Visual Learning. Trans. Mach. Learn. Res. 2024 (2024)
- Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny:
  ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions. Trans. Mach. Learn. Res. 2024 (2024)
- Yangbing Ge, Lilian Zhang, Yuanxin Wu, Dewen Hu:
  PIPO-SLAM: Lightweight Visual-Inertial SLAM With Preintegration Merging Theory and Pose-Only Descriptions of Multiple View Geometry. IEEE Trans. Robotics 40: 2046-2059 (2024)
- Silvia Soler Gallego:
  Diversity of experience: action, sensation, and immersion in audio descriptions of (visual) art. Univers. Access Inf. Soc. 23(2): 609-619 (2024)
- María Eugenia Larreina-Morales, Carme Mangiron:
  Audio description in video games? Persons with visual disabilities weigh in. Univers. Access Inf. Soc. 23(2): 577-588 (2024)
- Wenhui Jiang, Yibo Cheng, Linxin Liu, Yuming Fang, Yuxin Peng, Yang Liu:
  Comprehensive Visual Grounding for Video Description. AAAI 2024: 2552-2560
- Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv:
  Zero-Shot Aerial Object Detection with Visual Description Regularization. AAAI 2024: 6926-6934
- David Mogrovejo, Thamar Solorio:
  Question-Instructed Visual Descriptions for Zero-Shot Video Answering. ACL (Findings) 2024: 9329-9339
- Antonio Serpa, Frédéric Vella, Nadine Vigouroux, Florence D'Inca:
  Comparative study of semantic description for locating objects for visually impaired people: Preliminary results: Étude comparative de description sémantique pour la localisation d'objets pour des personnes déficientes visuelles : Résultats préliminaires. IHM (Adjunct) 2024: 6:1-6:6
- Suhyun Kim, Semin Lee, Kyungok Kim, Uran Oh:
  Utilizing a Dense Video Captioning Technique for Generating Image Descriptions of Comics for People with Visual Impairments. IUI 2024: 750-760
- Michael Ogezi, Bradley Hauer, Grzegorz Kondrak:
  Semantically-Prompted Language Models Improve Visual Descriptions. NAACL-HLT (Findings) 2024: 4285-4302
- João Marcelo dos Santos Marques, Simone Bacellar Leal Ferreira:
  How to promote descriptions in dynamic charts in light of transparency? A study on the accessibility barriers faced by citizens with severe visual impairment: Como promover descrições em gráficos dinâmicos à luz da transparência? Um estudo sobre as barreiras de acessibilidade enfrentadas por cidadãos com deficiência visual grave. SBSI 2024: 6:1-6:10
- Muskan Sarvesh, Ryan Kang, Mehdi Marzban, Isaac Cho, Simon Park, Ron Hugo, Kangsoo Kim:
  Immersive 3D Digital Twin for Collaborative Hydrogen Pipeline Simulation and Visualization: A Project Description. VR Workshops 2024: 294-296
- He Wang, Pengcheng Guo, Wei Chen, Pan Zhou, Lei Xie:
  The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023. CoRR abs/2401.06788 (2024)
- David Romero, Thamar Solorio:
  Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering. CoRR abs/2402.10698 (2024)
- Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv:
  Zero-Shot Aerial Object Detection with Visual Description Regularization. CoRR abs/2402.18233 (2024)
- M. Z. Naser, Mohammad Khaled al-Bashiti, Arash Teymori Gharah Tapeh, Armin Dadras Eslamlou, Ahmed Naser, Venkatesh Kodur, Rami Hawileh, Jamal A. Abdalla, Nima Khodadadi, Amir H. Gandomi:
  A Review of 315 Benchmark and Test Functions for Machine Learning Optimization Algorithms and Metaheuristics with Mathematical and Visual Descriptions. CoRR abs/2406.09581 (2024)
- Nagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye:
  M3T: Multi-Modal Medical Transformer to bridge Clinical Context with Visual Insights for Retinal Image Medical Description Generation. CoRR abs/2406.13129 (2024)
- Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadipour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi:
  Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions. CoRR abs/2407.06723 (2024)
- He Wang, Lei Xie:
  The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024. CoRR abs/2408.02369 (2024)
- Yizhang Jin, Jian Li, Jiangning Zhang, Jianlong Hu, Zhenye Gan, Xin Tan, Yong Liu, Yabiao Wang, Chengjie Wang, Lizhuang Ma:
  LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description. CoRR abs/2408.04957 (2024)
- 2023
- Manuel Lange:
  Visual Odometry Using Line Features and Machine Learning Enhanced Line Description. University of Tübingen, Germany, 2023
- Sergey Nesteruk, Julia Agafonova, Igor Pavlov, Maxim Gerasimov, Nikolay Latyshev, Denis Dimitrov, Andrey Kuznetsov, Artur Kadurin, Pavel Plechov:
  MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description. Comput. Geosci. 178: 105414 (2023)
- Guillermo Martínez, Jose M. Saavedra, Nils Murrugarra-Llerena:
  VETE: improving visual embeddings through text descriptions for eCommerce search engines. Multim. Tools Appl. 82(26): 41343-41379 (2023)
- Varsha Singh, Uma Shanker Tiwary:
  Visual content generation from textual description using improved adversarial network. Multim. Tools Appl. 82(7): 10943-10960 (2023)
- Hanjing Ye, Weinan Chen, Jingwen Yu, Li He, Yisheng Guan, Hong Zhang:
  Condition-invariant and compact visual place description by convolutional autoencoder. Robotica 41(6): 1718-1732 (2023)
- José Martínez-Carranza, Delia Irazú Hernández Farías, Victoria Eugenia Vazquez-Meza, Leticia Oyuki Rojas-Perez, Aldrich Alfredo Cabrera-Ponce:
  A Study on Generative Models for Visual Recognition of Unknown Scenes Using a Textual Description. Sensors 23(21): 8757 (2023)
- Virginia Pinto Campos, Luiz M. G. Gonçalves, Wesnydy L. Ribeiro, Tiago Maritan Ugulino de Araújo, Thaís Gaudencio do Rêgo, Pedro H. V. Figueiredo, Suanny Vieira, Thiago F. S. Costa, Caio Moraes, Alexandre C. S. Cruz, Felipe Araújo, Guido Lemos de Souza Filho:
  Machine Generation of Audio Description for Blind and Visually Impaired People. ACM Trans. Access. Comput. 16(2): 14:1-14:28 (2023)
- Leni Yang, Cindy Xiong, Jason K. Wong, Aoyu Wu, Huamin Qu:
  Explaining With Examples: Lessons Learned From Crowdsourced Introductory Description of Information Visualizations. IEEE Trans. Vis. Comput. Graph. 29(3): 1638-1650 (2023)
skipping 415 more matches
retrieved on 2024-09-21 08:54 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license