Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

Publication search results

found 445 matches

2024
Riccardo Ricci, Yakoub Bazi, Farid Melgani:
Machine-to-Machine Visual Dialoguing with ChatGPT for Enriched Textual Image Description. Remote. Sens. 16(3): 441 (2024)
https://dblp.org/rec/journals/remotesensing/RicciBM24
Zhili Feng, Anna Bair, J. Zico Kolter:
Text Descriptions are Compressive and Invariant Representations for Visual Learning. Trans. Mach. Learn. Res. 2024 (2024)
https://dblp.org/rec/journals/tmlr/FengBK24
Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny:
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions. Trans. Mach. Learn. Res. 2024 (2024)
https://dblp.org/rec/journals/tmlr/Zhu0HSZE24
Yangbing Ge, Lilian Zhang, Yuanxin Wu, Dewen Hu:
PIPO-SLAM: Lightweight Visual-Inertial SLAM With Preintegration Merging Theory and Pose-Only Descriptions of Multiple View Geometry. IEEE Trans. Robotics 40: 2046-2059 (2024)
https://dblp.org/rec/journals/trob/GeZWH24
Silvia Soler Gallego:
Diversity of experience: action, sensation, and immersion in audio descriptions of (visual) art. Univers. Access Inf. Soc. 23(2): 609-619 (2024)
https://dblp.org/rec/journals/uais/Gallego24
María Eugenia Larreina-Morales, Carme Mangiron:
Audio description in video games? Persons with visual disabilities weigh in. Univers. Access Inf. Soc. 23(2): 577-588 (2024)
https://dblp.org/rec/journals/uais/LarreinaMoralesM24
Wenhui Jiang, Yibo Cheng, Linxin Liu, Yuming Fang, Yuxin Peng, Yang Liu:
Comprehensive Visual Grounding for Video Description. AAAI 2024: 2552-2560
https://dblp.org/rec/conf/aaai/JiangCLFPL24
Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv:
Zero-Shot Aerial Object Detection with Visual Description Regularization. AAAI 2024: 6926-6934
https://dblp.org/rec/conf/aaai/ZangLTW024
David Mogrovejo, Thamar Solorio:
Question-Instructed Visual Descriptions for Zero-Shot Video Answering. ACL (Findings) 2024: 9329-9339
https://dblp.org/rec/conf/acl/MogrovejoS24
Antonio Serpa, Frédéric Vella, Nadine Vigouroux, Florence D'Inca:
Comparative study of semantic description for locating objects for visually impaired people: Preliminary results: Étude comparative de description sémantique pour la localisation d'objets pour des personnes déficientes visuelles : Résultats préliminaires. IHM (Adjunct) 2024: 6:1-6:6
https://dblp.org/rec/conf/ihm/SerpaVVD24
Suhyun Kim, Semin Lee, Kyungok Kim, Uran Oh:
Utilizing a Dense Video Captioning Technique for Generating Image Descriptions of Comics for People with Visual Impairments. IUI 2024: 750-760
https://dblp.org/rec/conf/iui/KimLKO24
Michael Ogezi, Bradley Hauer, Grzegorz Kondrak:
Semantically-Prompted Language Models Improve Visual Descriptions. NAACL-HLT (Findings) 2024: 4285-4302
https://dblp.org/rec/conf/naacl/OgeziHK24
João Marcelo dos Santos Marques, Simone Bacellar Leal Ferreira:
How to promote descriptions in dynamic charts in light of transparency? A study on the accessibility barriers faced by citizens with severe visual impairment: Como promover descrições em gráficos dinâmicos à luz da transparência? Um estudo sobre as barreiras de acessibilidade enfrentadas por cidadãos com deficiência visual grave. SBSI 2024: 6:1-6:10
https://dblp.org/rec/conf/sbsi/MarquesF24
Muskan Sarvesh, Ryan Kang, Mehdi Marzban, Isaac Cho, Simon Park, Ron Hugo, Kangsoo Kim:
Immersive 3D Digital Twin for Collaborative Hydrogen Pipeline Simulation and Visualization: A Project Description. VR Workshops 2024: 294-296
https://dblp.org/rec/conf/vr/SarveshKMCPHK24
He Wang, Pengcheng Guo, Wei Chen, Pan Zhou, Lei Xie:
The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023. CoRR abs/2401.06788 (2024)
https://dblp.org/rec/journals/corr/abs-2401-06788
David Romero, Thamar Solorio:
Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering. CoRR abs/2402.10698 (2024)
https://dblp.org/rec/journals/corr/abs-2402-10698
Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv:
Zero-Shot Aerial Object Detection with Visual Description Regularization. CoRR abs/2402.18233 (2024)
https://dblp.org/rec/journals/corr/abs-2402-18233
M. Z. Naser, Mohammad Khaled al-Bashiti, Arash Teymori Gharah Tapeh, Armin Dadras Eslamlou, Ahmed Naser, Venkatesh Kodur, Rami Hawileh, Jamal A. Abdalla, Nima Khodadadi, Amir H. Gandomi:
A Review of 315 Benchmark and Test Functions for Machine Learning Optimization Algorithms and Metaheuristics with Mathematical and Visual Descriptions. CoRR abs/2406.09581 (2024)
https://dblp.org/rec/journals/corr/abs-2406-09581
Nagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye:
M3T: Multi-Modal Medical Transformer to bridge Clinical Context with Visual Insights for Retinal Image Medical Description Generation. CoRR abs/2406.13129 (2024)
https://dblp.org/rec/journals/corr/abs-2406-13129
Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadipour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi:
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions. CoRR abs/2407.06723 (2024)
https://dblp.org/rec/journals/corr/abs-2407-06723
He Wang, Lei Xie:
The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024. CoRR abs/2408.02369 (2024)
https://dblp.org/rec/journals/corr/abs-2408-02369
Yizhang Jin, Jian Li, Jiangning Zhang, Jianlong Hu, Zhenye Gan, Xin Tan, Yong Liu, Yabiao Wang, Chengjie Wang, Lizhuang Ma:
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description. CoRR abs/2408.04957 (2024)
https://dblp.org/rec/journals/corr/abs-2408-04957
2023
Manuel Lange:
Visual Odometry Using Line Features and Machine Learning Enhanced Line Description. University of Tübingen, Germany, 2023
https://dblp.org/rec/phd/dnb/Lange23
Sergey Nesteruk, Julia Agafonova, Igor Pavlov, Maxim Gerasimov, Nikolay Latyshev, Denis Dimitrov, Andrey Kuznetsov, Artur Kadurin, Pavel Plechov:
MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description. Comput. Geosci. 178: 105414 (2023)
https://dblp.org/rec/journals/gandc/NesterukAPGLDKKP23
Guillermo Martínez, Jose M. Saavedra, Nils Murrugarra-Llerena:
VETE: improving visual embeddings through text descriptions for eCommerce search engines. Multim. Tools Appl. 82(26): 41343-41379 (2023)
https://dblp.org/rec/journals/mta/MartinezSM23
Varsha Singh, Uma Shanker Tiwary:
Visual content generation from textual description using improved adversarial network. Multim. Tools Appl. 82(7): 10943-10960 (2023)
https://dblp.org/rec/journals/mta/SinghT23
Hanjing Ye, Weinan Chen, Jingwen Yu, Li He, Yisheng Guan, Hong Zhang:
Condition-invariant and compact visual place description by convolutional autoencoder. Robotica 41(6): 1718-1732 (2023)
https://dblp.org/rec/journals/robotica/YeCYHGZ23
José Martínez-Carranza, Delia Irazú Hernández Farías, Victoria Eugenia Vazquez-Meza, Leticia Oyuki Rojas-Perez, Aldrich Alfredo Cabrera-Ponce:
A Study on Generative Models for Visual Recognition of Unknown Scenes Using a Textual Description. Sensors 23(21): 8757 (2023)
https://dblp.org/rec/journals/sensors/MartinezCarranzaFVRC23
Virginia Pinto Campos, Luiz M. G. Gonçalves, Wesnydy L. Ribeiro, Tiago Maritan Ugulino de Araújo, Thaís Gaudencio do Rêgo, Pedro H. V. Figueiredo, Suanny Vieira, Thiago F. S. Costa, Caio Moraes, Alexandre C. S. Cruz, Felipe Araújo, Guido Lemos de Souza Filho:
Machine Generation of Audio Description for Blind and Visually Impaired People. ACM Trans. Access. Comput. 16(2): 14:1-14:28 (2023)
https://dblp.org/rec/journals/taccess/CamposGRARFVCMCAF23
Leni Yang, Cindy Xiong, Jason K. Wong, Aoyu Wu, Huamin Qu:
Explaining With Examples: Lessons Learned From Crowdsourced Introductory Description of Information Visualizations. IEEE Trans. Vis. Comput. Graph. 29(3): 1638-1650 (2023)
https://dblp.org/rec/journals/tvcg/YangXWWQ23

skipping 415 more matches
