Yui Sudo

2020 – today

2024
[c17]
Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe:
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification. ACL (1) 2024: 10192-10209
[c16]
Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe:
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation. ICASSP Workshops 2024: 570-574
[c15]
Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe:
Contextualized Automatic Speech Recognition With Attention-Based Bias Phrase Boosted Beam Search. ICASSP 2024: 10896-10900
[c14]
Takahiro Osaki, Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improving Noise Robustness of Automatic Speech Recognition Based on a Parallel Adapter Model with Near-Identity Initialization. IEA/AIE 2024: 454-466
[i11]
Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe:
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search. CoRR abs/2401.10449 (2024)
[i10]
Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, Yui Sudo, Muhammad Shakeel, Kwanghee Choi, Jiatong Shi, Xuankai Chang, Jee-weon Jung, Shinji Watanabe:
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer. CoRR abs/2401.16658 (2024)
[i9]
Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe:
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification. CoRR abs/2402.12654 (2024)
[i8]
Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Shinji Watanabe:
Contextualized Automatic Speech Recognition with Dynamic Vocabulary. CoRR abs/2405.13344 (2024)
[i7]
Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe:
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation. CoRR abs/2405.13514 (2024)
[i6]
Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Brian Yan, Jiatong Shi, Yifan Peng, Shinji Watanabe:
4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders. CoRR abs/2406.02950 (2024)
[i5]
Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe:
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss. CoRR abs/2406.16120 (2024)
2023
[c13]
Ryu Takeda, Yui Sudo, Kazunori Komatani:
Flexible Evidence Model to Reduce Uncertainty Mismatch Between Speech Enhancement and ASR Based on Encoder-Decoder Architecture. APSIPA ASC 2023: 1830-1837
[c12]
Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-weon Jung, Soumi Maiti, Shinji Watanabe:
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data. ASRU 2023: 1-8
[c11]
Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe:
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models. INTERSPEECH 2023: 62-66
[c10]
Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. INTERSPEECH 2023: 491-495
[c9]
Yui Sudo, Muhammad Shakeel, Brian Yan, Jiatong Shi, Shinji Watanabe:
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders. INTERSPEECH 2023: 3312-3316
[c8]
Yui Sudo, Muhammad Shakeel, Yifan Peng, Shinji Watanabe:
Time-synchronous one-pass Beam Search for Parallel Online and Offline Transducers with Dynamic Block Training. INTERSPEECH 2023: 4479-4483
[c7]
Yui Sudo, Masayuki Takigahira, Hideo Tsuru, Kazuhiro Nakadai, Hirofumi Nakajima:
Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation. RO-MAN 2023: 2058-2063
[i4]
Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe:
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models. CoRR abs/2305.17651 (2023)
[i3]
Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. CoRR abs/2305.17846 (2023)
[i2]
Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel, Jee-weon Jung, Soumi Maiti, Shinji Watanabe:
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data. CoRR abs/2309.13876 (2023)
2022
[c6]
Ryu Takeda, Yui Sudo, Kazuhiro Nakadai, Kazunori Komatani:
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model. INTERSPEECH 2022: 3789-3793
[c5]
Yui Sudo, Muhammad Shakeel, Kazuhiro Nakadai, Jiatong Shi, Shinji Watanabe:
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection. INTERSPEECH 2022: 4641-4645
[i1]
Yui Sudo, Muhammad Shakeel, Brian Yan, Jiatong Shi, Shinji Watanabe:
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders. CoRR abs/2212.10818 (2022)
2021
[j2]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multichannel environmental sound segmentation. Appl. Intell. 51(11): 8245-8259 (2021)
[c4]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental Sound Segmentation utilizing Sound Source Localization and Separation U-Net. SII 2021: 382-387
2020
[j1]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound event aware environmental sound segmentation with Mask U-Net. Adv. Robotics 34(20): 1280-1290 (2020)
[c3]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental sound segmentation. SII 2020: 820-825

2010 – 2019

2019
[c2]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improvement of DOA Estimation by using Quaternion Output in Sound Event Localization and Detection. DCASE 2019: 244-247
[c1]
Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Environmental sound segmentation utilizing Mask U-Net. IROS 2019: 5340-5345
