Search dblp
- case-insensitive prefix search: default
  e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
  e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
  e.g., codd model
- boolean or: connect words by pipe symbol (|)
  e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search results, and search terms preceded by a minus sign will be interpreted as regular (positive) search terms.
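The query rules listed above can be illustrated with a small matcher. This is only a sketch of the described semantics, not dblp's actual implementation: the function names (`tokenize`, `term_matches`, `query_matches`) are hypothetical, splitting text into words on non-alphanumeric characters is an assumption, and so is the precedence (a pipe binds more tightly than a space). The disabled phrase and not operators are not modeled.

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (assumed tokenization)."""
    return re.findall(r"\w+", text.lower())


def term_matches(term, words):
    """Match one search term against a list of lowercase words.

    A trailing '$' requests an exact word match; otherwise the term
    is treated as a case-insensitive prefix.
    """
    term = term.lower()
    if term.endswith("$"):
        return term[:-1] in words
    return any(word.startswith(term) for word in words)


def query_matches(query, text):
    """Evaluate a query: spaces mean AND, pipes mean OR.

    Assumption: OR binds more tightly than AND, so 'a|b c'
    is read as '(a OR b) AND c'.
    """
    words = tokenize(text)
    return all(
        any(term_matches(alt, words) for alt in group.split("|") if alt)
        for group in query.split()
    )


if __name__ == "__main__":
    print(query_matches("sig", "SIGIR conference"))
    print(query_matches("graph$", "computer graphics"))
    print(query_matches("graph|network", "neural networks"))
```

Under these assumptions, a query such as `audio visual feat` would match any title containing words that start with "audio", "visual", and "feat", which is consistent with the titles in the result list on this page.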
Publication search results
found 249 matches
2024
- Elham Motamedi, Danial Khosh Kholgh, Sorush Saghari, Mehdi Elahi, Francesco Barile, Marko Tkalcic:
  Predicting movies' eudaimonic and hedonic scores: A machine learning approach using metadata, audio and visual features. Inf. Process. Manag. 61(2): 103610 (2024)
- Gülnaziye Bingöl, Simone Porcu, Alessandro Floris, Luigi Atzori:
  QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features. ACM Trans. Multim. Comput. Commun. Appl. 20(5): 130:1-130:23 (2024)
- Sze An Peter Tan, Guangyu Gao, Jia Zhao:
  Audio-Visual Segmentation by Leveraging Multi-scaled Features Learning. MMM (2) 2024: 156-169
- Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng:
  Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction. CoRR abs/2401.17796 (2024)

2023
- Yangke Li, Xinman Zhang:
  Lip landmark-based audio-visual speech enhancement with multimodal feature fusion network. Neurocomputing 549: 126432 (2023)
- Yiming Zhao, Hongdong Zhao, Xuezhi Zhang, Weina Liu:
  Vehicle classification based on audio-visual feature fusion with low-quality images and noise. J. Intell. Fuzzy Syst. 45(5): 8931-8944 (2023)
- Marouane Kihal, Lamia Hamza:
  Robust multimedia spam filtering based on visual, textual, and audio deep features and random forest. Multim. Tools Appl. 82(26): 40819-40837 (2023)
- Yogita D. Mistry, Gajanan K. Birajdar, Archana M. Khodke:
  Time-frequency visual representation and texture features for audio applications: a comprehensive review, recent trends, and challenges. Multim. Tools Appl. 82(23): 36143-36177 (2023)
- Guizhu Li, Min Fu, Mengnan Sun, Xuefeng Liu, Bing Zheng:
  A Facial Feature and Lip Movement Enhanced Audio-Visual Speech Separation Model. Sensors 23(21): 8770 (2023)
- Yu-Ching Chung, Ji-Yan Han, Bo-Sin Wang, Wei-Zhong Zheng, Kung-Yao Shen, Ying-Hui Lai:
  An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids. APSIPA ASC 2023: 1131-1137
- Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata:
  Text-to-Feature Diffusion for Audio-Visual Few-Shot Learning. DAGM 2023: 491-507
- Prerna Singh, Ayush Tripathi, Lalan Kumar, Tapan Kumar Gandhi:
  Brain Connectivity Features-based Age Group Classification using Temporal Asynchrony Audio-Visual Integration Task. EMBC 2023: 1-4
- Alessandro Ilic Mezza, Paolo Sani, Augusto Sarti:
  Automatic TV Genre Classification Based on Visually-Conditioned Deep Audio Features. EUSIPCO 2023: 166-170
- Kazuki Seto, Yumi Asahi:
  Sound Logo to Increase TV Advertising Effectiveness Based on Audio-Visual Features. HCI (5) 2023: 136-151
- Hongbo Chen, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Xiaolin Zhang, Jiamao Li:
  CM-CS: Cross-Modal Common-Specific Feature Learning For Audio-Visual Video Parsing. ICASSP 2023: 1-5
- Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee:
  Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. ICASSP 2023: 1-5
- Haitao Xu, Liangfa Wei, Jie Zhang, Jianming Yang, Yannan Wang, Tian Gao, Xin Fang, Li-Rong Dai:
  A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement. ICASSP 2023: 1-5
- Sunan Li, Hailun Lian, Cheng Lu, Yan Zhao, Chuangao Tang, Yuan Zong, Wenming Zheng:
  Audio-Visual Group-based Emotion Recognition using Local and Global Feature Aggregation based Multi-Task Learning. ICMI 2023: 741-745
- Moinak Bhattacharya, Prateek Prasanna:
  Audio-visual feature fusion for improved thoracic disease classification. Medical Imaging: Computer-Aided Diagnosis 2023
- Jinxin Wang, Chao Yang, Zhongwen Guo, Xiaomei Li, Weigang Wang:
  An End-to-End Mandarin Audio-Visual Speech Recognition Model with a Feature Enhancement Module. SMC 2023: 572-577
- Salam Nandakishor, Debadatta Pati:
  Improvement of Audio-Visual Keyword Spotting System Accuracy Using Excitation Source Feature. SPECOM (2) 2023: 344-356
- Prerna Singh, Ayush Tripathi, Lalan Kumar, Tapan Kumar Gandhi:
  Brain Connectivity Features-based Age Group Classification using Temporal Asynchrony Audio-Visual Integration Task. CoRR abs/2304.06315 (2023)
- Sagnik Majumder, Ziad Al-Halah, Kristen Grauman:
  Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos. CoRR abs/2307.04760 (2023)
- Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata:
  Text-to-feature diffusion for audio-visual few-shot learning. CoRR abs/2309.03869 (2023)
- Ju-Chieh Chou, Chung-Ming Chien, Karen Livescu:
  AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement. CoRR abs/2309.08030 (2023)
- Edward Fish, Jon Weinbren, Andrew Gilbert:
  Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization. CoRR abs/2310.03456 (2023)
- Sneha Muppalla, Shan Jia, Siwei Lyu:
  Integrating Audio-Visual Features for Multimodal Deepfake Detection. CoRR abs/2310.03827 (2023)

2022
- Lei Wang, Guodao Sun, Yunchao Wang, Ji Ma, Xiaomin Zhao, Ronghua Liang:
  AFExplorer: Visual analysis and interactive selection of audio features. Vis. Informatics 6(1): 47-55 (2022)
- Shangjun Lu, Xiaoxia Du, Juan Liu, Yu-Mei Zhang, Shaofeng Zhao, Rongfeng Su, Lan Wang, Nan Yan:
  A New Method for Predicting Severity Level of Dysarthric Speech Based on Joint Feature-Sample Selection using Audio-Visual Data. IALP 2022: 190-195
- Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro:
  Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2022: 2838-2842
skipping 219 more matches
retrieved on 2024-03-28 10:11 CET from data curated by the dblp team
all metadata released as open data under CC0 1.0 license