Search dblp
Full-text search
- case-insensitive prefix search: default
  e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
  e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
  e.g., codd model
- boolean or: connect words by pipe symbol (|)
  e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search results, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
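Taken together, the operators above admit a simple interpretation: space-separated groups are ANDed, |-separated alternatives within a group are ORed, a trailing $ forces an exact word match, and everything else is a case-insensitive prefix match. The sketch below is a hypothetical matcher illustrating that semantics, not dblp's actual implementation:

```python
def matches(query: str, title: str) -> bool:
    """Evaluate a dblp-style query against a title string.

    Space = boolean and, | = boolean or, trailing $ = exact word,
    otherwise case-insensitive prefix match (illustrative only).
    """
    words = title.lower().split()

    def term_ok(term: str) -> bool:
        term = term.lower()
        if term.endswith("$"):
            return term[:-1] in words                      # exact word
        return any(w.startswith(term) for w in words)      # prefix match

    # every space-separated group must match (AND);
    # within a group, any |-alternative suffices (OR)
    return all(any(term_ok(alt) for alt in group.split("|"))
               for group in query.split())


assert matches("sig", "SIGIR proceedings")        # prefix hit
assert matches("sig", "signal processing")        # prefix hit
assert matches("graph$", "graph theory")          # exact word hit
assert not matches("graph$", "graphics pipeline") # exact word miss
assert matches("codd model", "Codd's relational model")
assert matches("graph|network", "network flows")
```

Real queries go to the dblp search endpoints rather than a local matcher; this merely mirrors how the documented operators combine.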
Author search results
no matches
Refine list
- refine by author, venue, type, access, or year: no options (temporarily not available)
Publication search results
found 735 matches
- 2024
- Greeshma Arya, Mohammad Kamrul Hasan, Ashish Bagwari, Nurhizam Safie, Shayla Islam, Fatima Rayan Awad Ahmed, Aaishani De, Muhammad Attique Khan, Taher M. Ghazal: Multimodal Hate Speech Detection in Memes Using Contrastive Language-Image Pre-Training. IEEE Access 12: 22359-22375 (2024)
- Eesun Moon, A. S. M. Sharifuzzaman Sagar, Hyung Seok Kim: Multimodal Daily-Life Emotional Recognition Using Heart Rate and Speech Data From Wearables. IEEE Access 12: 96635-96648 (2024)
- Jianjun Lei, Jing Wang, Ying Wang: Multi-level attention fusion network assisted by relative entropy alignment for multimodal speech emotion recognition. Appl. Intell. 54(17-18): 8478-8490 (2024)
- Michael Neumann, Hardik Kothare, Vikram Ramanarayanan: Multimodal speech biomarkers for remote monitoring of ALS disease progression. Comput. Biol. Medicine 180: 108949 (2024)
- Ronghao Pan, José Antonio García-Díaz, Miguel Ángel Rodríguez-García, Rafael Valencia-García: Spanish MEACorpus 2023: A multimodal speech-text corpus for emotion analysis in Spanish from natural environments. Comput. Stand. Interfaces 90: 103856 (2024)
- Yujie Wan, Yuzhong Chen, Jiali Lin, Jiayuan Zhong, Chen Dong: A knowledge-augmented heterogeneous graph convolutional network for aspect-level multimodal sentiment analysis. Comput. Speech Lang. 85: 101587 (2024)
- Qi Zhou, Wannapon Suraworachet, Mutlu Cukurova: Detecting non-verbal speech and gaze behaviours with multimodal data and computer vision to interpret effective collaborative learning interactions. Educ. Inf. Technol. 29(1): 1071-1098 (2024)
- Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El-Saddik, Soonil Kwon: MSER: Multimodal speech emotion recognition using cross-attention with deep fusion. Expert Syst. Appl. 245: 122946 (2024)
- Gang Zhao, Yinan Zhang, Jie Chu: A multimodal teacher speech emotion recognition method in the smart classroom. Internet Things 25: 101069 (2024)
- Eniafe Festus Ayetiran, Özlem Özgöbek: An inter-modal attention-based deep learning framework using unified modality for multimodal fake news, hate speech and offensive language detection. Inf. Syst. 123: 102378 (2024)
- Puneet Kumar, Sarthak Malik, Balasubramanian Raman: Interpretable multimodal emotion recognition using hybrid fusion of speech and image data. Multim. Tools Appl. 83(10): 28373-28394 (2024)
- Maximilian Rosilius, Martin Spiertz, Benedikt Wirsing, Manuel Geuen, Volker Bräutigam, Bernd Ludwig: Impact of Industrial Noise on Speech Interaction Performance and User Acceptance when Using the MS HoloLens 2. Multimodal Technol. Interact. 8(2): 8 (2024)
- Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier: A multimodal dynamical variational autoencoder for audiovisual speech representation learning. Neural Networks 172: 106120 (2024)
- Auriane Boudin, Roxane Bertrand, Stéphane Rauzy, Magalie Ochs, Philippe Blache: A multimodal model for predicting feedback position and type during conversation. Speech Commun. 159: 103066 (2024)
- Zhe Chen, Hongcheng Liu, Yu Wang: DialogMCF: Multimodal Context Flow for Audio Visual Scene-Aware Dialog. IEEE ACM Trans. Audio Speech Lang. Process. 32: 753-764 (2024)
- Shiyao Cui, Jiangxia Cao, Xin Cong, Jiawei Sheng, Quangang Li, Tingwen Liu, Jinqiao Shi: Enhancing Multimodal Entity and Relation Extraction With Variational Information Bottleneck. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1274-1285 (2024)
- Changkai Lin, Hongju Cheng, Qiang Rao, Yang Yang: M³SA: Multimodal Sentiment Analysis Based on Multi-Scale Feature Extraction and Multi-Task Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1416-1429 (2024)
- Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang: WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3339-3354 (2024)
- Alejandro Santorum Varela, Svetlana Stoyanchev, Simon Keizer, Rama Doddipatla, Kate M. Knill: Entity Resolution in Situated Dialog With Unimodal and Multimodal Transformers. IEEE ACM Trans. Audio Speech Lang. Process. 32: 704-713 (2024)
- Ziqi Yuan, Jingliang Fang, Hua Xu, Kai Gao: Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3669-3683 (2024)
- Yash Khurana, Swamita Gupta, R. Sathyaraj, S. P. Raja: RobinNet: A Multimodal Speech Emotion Recognition System With Speaker Recognition for Social Interactions. IEEE Trans. Comput. Soc. Syst. 11(1): 478-487 (2024)
- Ze-Yuan Huang, Qiang He, Kevin T. Maher, Xiaoming Deng, Yu-Kun Lai, Cuixia Ma, Sheng Feng Qin, Yong-Jin Liu, Hongan Wang: SpeechMirror: A Multimodal Visual Analytics System for Personalized Reflection of Online Public Speaking Effectiveness. IEEE Trans. Vis. Comput. Graph. 30(1): 606-616 (2024)
- Jordan Voas, David Harwath, Raymond Mooney: Multimodal Contextualized Semantic Parsing from Speech. ACL (1) 2024: 7354-7369
- Ahmed El-Sayed, Omar Nasr: AAST-NLP at Multimodal Hate Speech Event Detection 2024: A Multimodal Approach for Classification of Text-Embedded Images Based on CLIP and BERT-Based Models. CASE 2024: 139-144
- Amrita Ganguly, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Md. Nishat Raihan, Dhiman Goswami, Marcos Zampieri: MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles. CASE 2024: 125-131
- Surendrabikram Thapa, Kritesh Rauniyar, Farhan Jafri, Hariram Veeramani, Raghav Jain, Sandesh Jain, Francielle Vargas, Ali Hürriyetoglu, Usman Naseem: Extended Multimodal Hate Speech Event Detection During Russia-Ukraine Crisis - Shared Task at CASE 2024. CASE 2024: 221-228
- Yeshan Wang, Ilia Markov: CLTL@Multimodal Hate Speech Event Detection 2024: The Winning Approach to Detecting Multimodal Hate Speech and Its Targets. CASE 2024: 73-78
- Yosuke Yamagishi: YYama@Multimodal Hate Speech Event Detection 2024: Simpler Prompts, Better Results - Enhancing Zero-shot Detection with a Large Multimodal Model. CASE 2024: 60-66
- Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperança-Rodier, Benjamin Lecouteux, Didier Schwab: A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation. LREC/COLING 2024: 839-849
- Sepideh Kalateh, Luis Alberto Estrada-Jimenez, Sanaz Nikghadam-Hojjati, José Barata: Multimodal Creativity State Detection from Speech and Voice. DoCEIS 2024: 123-136
skipping 705 more matches
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from unpaywall.org to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web pages which are no longer available, try to retrieve content from the Wayback Machine of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from crossref.org, opencitations.net, and semanticscholar.org to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from opencitations.net and semanticscholar.org to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from openalex.org.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-09-24 09:12 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint