default search action
Josef V. Psutka
Person information
- affiliation: University of West Bohemia, NIIS, Pilsen, Czech Republic
- not to be confused with: Josef Psutka
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2020
- [j5]Ales Prazák, Zdenek Loose, Josef V. Psutka, Vlasta Radová, Josef Psutka:
Live TV subtitling through respeaking with remote cutting-edge technology. Multim. Tools Appl. 79(1-2): 1203-1220 (2020) - 2019
- [j4]Josef V. Psutka, Josef Psutka:
Sample size for maximum-likelihood estimates of Gaussian model depending on dimensionality of pattern space. Pattern Recognit. 91: 25-33 (2019) - 2012
- [j3]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors. IEEE Trans. Speech Audio Process. 20(6): 1818-1828 (2012) - 2011
- [j2]Josef Psutka, Jan Svec, Josef V. Psutka, Jan Vanek, Ales Prazák, Lubos Smídl, Pavel Ircing:
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive. EURASIP J. Audio Speech Music. Process. 2011: 10 (2011) - 2009
- [j1]Pavel Ircing, Josef V. Psutka, Josef Psutka:
Using Morphological Information for Robust Language Modeling in Czech ASR System. IEEE Trans. Speech Audio Process. 17(4): 840-847 (2009)
Conference and Workshop Papers
- 2023
- [c52]Jan Lehecka, Jan Svec, Josef V. Psutka, Pavel Ircing:
Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech. INTERSPEECH 2023: 201-205 - [c51]Jan Lehecka, Josef V. Psutka, Josef Psutka:
Transfer Learning of Transformer-Based Speech Recognition Models from Czech to Slovak. TSD 2023: 328-338 - 2022
- [c50]Jan Lehecka, Josef V. Psutka, Josef Psutka:
Transformer-Based Automatic Speech Recognition of Formal and Colloquial Czech in MALACH Project. TSD 2022: 301-312 - 2021
- [c49]Ales Prazák, Zdenek Loose, Josef V. Psutka, Vlasta Radová, Josef Psutka, Jan Svec:
Live TV Subtitling Through Respeaking. Interspeech 2021: 2339-2340 - [c48]Jan Svec, Lubos Smídl, Josef V. Psutka, Ales Prazák:
Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings. Interspeech 2021: 4398-4402 - [c47]Josef V. Psutka, Jan Vanek, Ales Prazák:
Various DNN-HMM Architectures Used in Acoustic Modeling with Single-Speaker and Single-Channel. SLSP 2021: 85-96 - [c46]Josef V. Psutka, Ales Prazák, Jan Vanek:
Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures. SPECOM 2021: 553-564 - [c45]Josef V. Psutka, Jan Svec, Ales Prazák:
CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task. TDS 2021: 523-533 - 2020
- [c44]Petr Stanislav, Josef V. Psutka, Josef Psutka:
Increasing the Accuracy of the ASR System by Prolonging Voiceless Phonemes in the Speech of Patients Using the Electrolarynx. SPECOM 2020: 562-571 - [c43]Zbynek Zajíc, Josef V. Psutka, Ludek Müller:
Diarization Based on Identification with X-Vectors. SPECOM 2020: 667-678 - [c42]Josef V. Psutka, Jan Vanek, Ales Prazák:
Complexity of the TDNN Acoustic Model with Respect to the HMM Topology. TDS 2020: 465-473 - 2019
- [c41]Zbynek Zajíc, Josef V. Psutka, Lucie Zajícová, Ludek Müller, Petr Salajka:
Diarization of the Language Consulting Center Telephone Calls. SPECOM 2019: 549-558 - 2018
- [c40]Jan Svec, Josef V. Psutka, Jan Trmal, Lubas Smfdl, Pavel Ircing, Jan Sedmidubský:
On the Use of Grapheme Models for Searching in Large Spoken Archives. ICASSP 2018: 6259-6263 - [c39]Zbynek Zajíc, Lucie Skorkovská, Petr Neduchal, Pavel Ircing, Josef V. Psutka, Marek Hrúz, Ales Prazák, Daniel Soutner, Jan Svec, Lukás Bures, Ludek Müller:
Towards Processing of the Oral History Interviews and Related Printed Documents. LREC 2018 - [c38]Zbynek Zajíc, Lucie Zajícová, Josef V. Psutka, Petr Salajka, Jaromír Novotný, Ales Prazák, Ludek Müller:
First Insight into the Processing of the Language Consulting Center Data. SPECOM 2018: 778-787 - 2017
- [c37]Jan Svec, Josef V. Psutka, Lubos Smídl, Jan Trmal:
A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings. INTERSPEECH 2017: 2934-2938 - [c36]Jan Svec, Lubos Smídl, Josef V. Psutka:
An Analysis of the RNN-Based Spoken Term Detection Training. SPECOM 2017: 119-129 - [c35]Petr Stanislav, Josef V. Psutka, Josef Psutka:
Recognition of the Electrolaryngeal Speech: Comparison Between Human and Machine. TSD 2017: 509-517 - 2015
- [c34]Josef V. Psutka, Josef Psutka:
Sample Size for Maximum Likelihood Estimates of Gaussian Model. CAIP (2) 2015: 462-469 - [c33]Josef V. Psutka:
Gaussian Mixture Model Selection Using Multiple Random Subsampling with Initialization. CAIP (1) 2015: 678-689 - 2014
- [c32]Josef V. Psutka, Ales Prazák, Josef Psutka, Vlasta Radová:
Captioning of Live TV Commentaries from the Olympic Games in Sochi: Some Interesting Insights. TSD 2014: 515-522 - 2013
- [c31]Ales Prazák, Josef V. Psutka, Josef Psutka, Zdenek Loose:
Towards Live Subtitling of TV Ice-hockey Commentary. SIGMAP 2013: 151-155 - [c30]Jan Vanek, Lukás Machlica, Josef V. Psutka, Josef Psutka:
Covariance Matrix Enhancement Approach to Train Robust Gaussian Mixture Models of Speech Data. SPECOM 2013: 92-99 - [c29]Pavel Campr, Ales Prazák, Josef V. Psutka, Josef Psutka:
Online Speaker Adaptation of an Acoustic Model Using Face Recognition. TSD 2013: 378-385 - 2012
- [c28]Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka:
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs. INTERSPEECH 2012: 1372-1375 - [c27]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Full covariance Gaussian mixture models evaluation on GPU. ISSPIT 2012: 203-207 - [c26]Petr Stanislav, Josef V. Psutka:
Influence of Different Phoneme Mappings on the Recognition Accuracy of Electrolaryngeal Speech. SIGMAP 2012: 204-207 - [c25]Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka:
Captioning of Live TV Programs through Speech Recognition and Re-speaking. TSD 2012: 513-519 - 2011
- [c24]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Optimization of the Gaussian Mixture Model Evaluation on GPU. INTERSPEECH 2011: 1737-1740 - [c23]Josef V. Psutka, Jan Vanek, Josef Psutka:
Speaker-Clustered Acoustic Models Evaluated on GPU for On-line Subtitling of Parliament Meetings. TSD 2011: 284-290 - 2010
- [c22]Josef Psutka, Jan Svec, Josef V. Psutka, Jan Vanek, Ales Prazák, Lubos Smídl:
Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions. TSD 2010: 385-391 - [c21]Jan Vanek, Josef V. Psutka:
Gender-Dependent Acoustic Models Fusion Developed for Automatic Subtitling of Parliament Meetings Broadcasted by the Czech TV. TSD 2010: 431-438 - 2009
- [c20]Ales Prazák, Zbynek Zajíc, Lukás Machlica, Josef V. Psutka:
Fast Speaker Adaptation in Automatic Online Subtitling. SIGMAP 2009: 126-130 - [c19]Jan Vanek, Josef V. Psutka, Ales Prazák, Josef Psutka:
Training of Speaker-clustered Acoustic Models for use in Real-time Recognizers. SIGMAP 2009: 131-135 - [c18]Jan Vanek, Josef V. Psutka, Jan Zelinka, Ales Prazák, Josef Psutka:
Discriminative Training of Gender-Dependent Acoustic Models. TSD 2009: 331-338 - 2007
- [c17]Pavel Ircing, Josef V. Psutka, Jan Vavruska:
What Can and Cannot Be Found in Czech Spontaneous Speech Using Document-Oriented IR Methods - UWB at CLEF 2007 CL-SR Track. CLEF 2007: 712-718 - [c16]Ales Prazák, Ludek Müller, Josef V. Psutka, Josef Psutka:
Live TV Subtitling - Fast 2-pass LVCSR System for Online Subtitling. SIGMAP 2007: 139-142 - [c15]Josef V. Psutka, Lubos Smídl, Ales Prazák:
Searching for a Robust MFCC-Based Parameterization for ASR Application. SIGMAP 2007: 196-199 - [c14]Josef V. Psutka, Ludek Müller, Lubos Smídl, Josef Psutka:
Feature space reduction and decorrelation in a large number of speech recognition experiments. SIP 2007: 151-156 - [c13]Josef V. Psutka:
Benefit of Maximum Likelihood Linear Transform (MLLT) Used at Different Levels of Covariance Matrices Clustering in ASR Systems. TSD 2007: 431-438 - 2006
- [c12]Ales Prazák, Josef V. Psutka, Jan Hoidekr, Jakub Kanis, Ludek Müller, Josef Psutka:
Adaptive language model in automatic online subtitling. Computational Intelligence 2006: 366-370 - [c11]Lubos Smídl, Josef V. Psutka:
Comparison of keyword spotting methods for searching in speech. INTERSPEECH 2006 - [c10]Jan Hoidekr, Josef V. Psutka, Ales Prazák, Josef Psutka:
Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries. LREC 2006: 2064-2067 - [c9]Ales Prazák, Josef V. Psutka, Jan Hoidekr, Jakub Kanis, Ludek Müller, Josef Psutka:
Automatic Online Subtitling of the Czech Parliament Meetings. TSD 2006: 501-508 - 2005
- [c8]Josef Psutka, Pavel Ircing, Josef V. Psutka, Jan Hajic, William J. Byrne, Jirí Mírovský:
Automatic transcription of Czech, Russian, and Slovak spontaneous speech in the MALACH project. INTERSPEECH 2005: 1349-1352 - 2004
- [c7]Josef Psutka, Pavel Ircing, Jan Hajic, Vlasta Radová, Josef V. Psutka, William J. Byrne, Samuel Gustman:
Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH project. LREC 2004 - 2003
- [c6]Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Jan Hajic, Jirí Mírovský, Samuel Gustman:
Large vocabulary ASR for spontaneous czech in the MALACH project. INTERSPEECH 2003: 1821-1824 - [c5]Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Veera Venkataramani, Jan Hajic, Samuel Gustman:
Towards Automatic Transcription of Spontaneous Czech Speech in the MALACH Project. TSD 2003: 214-219 - [c4]Josef Psutka, Ilja Iljuchin, Pavel Ircing, Josef V. Psutka, Václav Trejbal, William J. Byrne, Jan Hajic, Samuel Gustman:
Building LVCSR System for Transcription of Spontaneously Pronounced Russian Testimonies in the MALACH Project: Initial Steps and First Results. TSD 2003: 327-332 - 2002
- [c3]Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Jan Hajic, Samuel Gustman, Bhuvana Ramabhadran:
Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments. TSD 2002: 253-260 - 2001
- [c2]Josef Psutka, Ludek Müller, Josef V. Psutka:
Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition task. INTERSPEECH 2001: 1813-1816 - [c1]Josef Psutka, Ludek Müller, Josef V. Psutka:
The Influence of a Filter Shape in Telephone-Based Recognition Module Using PLP Parameterization. TSD 2001: 222-228
Informal and Other Publications
- 2024
- [i5]Jan Lehecka, Josef V. Psutka, Lubos Smídl, Pavel Ircing, Josef Psutka:
A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives. CoRR abs/2407.17160 (2024) - 2023
- [i4]Jan Lehecka, Josef V. Psutka, Josef Psutka:
Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak. CoRR abs/2306.04399 (2023) - 2022
- [i3]Jan Lehecka, Jan Svec, Ales Prazák, Josef V. Psutka:
Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech. CoRR abs/2206.07627 (2022) - [i2]Jan Lehecka, Josef V. Psutka, Josef Psutka:
Transformer-based Automatic Speech Recognition of Formal and Colloquial Czech in MALACH Project. CoRR abs/2206.07666 (2022) - [i1]Jan Svec, Lubos Smídl, Josef V. Psutka, Ales Prazák:
Spoken Term Detection and Relevance Score Estimation using Dot-Product of Pronunciation Embeddings. CoRR abs/2210.11895 (2022)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-20 22:51 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint