Mikolaj Kundegorski
Two-Microphone Dereverberation for Automatic Speech Recognition of Polish
Kundegorski, Mikolaj; Jackson, Philip J.B.; Ziółko, Bartosz
Authors
Philip J.B. Jackson
Bartosz Ziółko
Abstract
Reverberation is a common problem for many speech technologies, such as automatic speech recognition (ASR) systems. This paper investigates the novel combination of precedence, binaural and statistical independence cues for enhancing reverberant speech, prior to ASR, under these adverse acoustical conditions when two microphone signals are available. Results of the enhancement are evaluated in terms of relevant signal measures and accuracy for both English and Polish ASR tasks. These show inconsistencies between the signal and recognition measures, although in recognition the proposed method consistently outperforms all other combinations and the spectral-subtraction baseline.
Citation
Kundegorski, M., Jackson, P. J., & Ziółko, B. (2015). Two-Microphone Dereverberation for Automatic Speech Recognition of Polish. Archives of Acoustics, 39(3), 411-420. https://doi.org/10.2478/aoa-2014-0045
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 2, 2014 |
Publication Date | Mar 1, 2015 |
Deposit Date | Mar 25, 2015 |
Publicly Available Date | Apr 2, 2015 |
Journal | Archives of Acoustics |
Print ISSN | 0137-5075 |
Electronic ISSN | 2300-262X |
Publisher | De Gruyter Open |
Peer Reviewed | Peer Reviewed |
Volume | 39 |
Issue | 3 |
Pages | 411-420 |
DOI | https://doi.org/10.2478/aoa-2014-0045 |
Keywords | Speech enhancement, Reverberation, ASR, Polish. |
Public URL | https://durham-repository.worktribe.com/output/1413071 |
Files
Published Journal Article
(6 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
© 2014 Polish Academy of Sciences & Institute of Fundamental Technological Research (IPPT PAN). This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License. (CC BY-NC-ND 3.0)
You might also like
Transfer Learning Using Convolutional Neural Networks For Object Classification Within X-Ray Baggage Security Imagery
(2016)
Presentation / Conference Contribution
A Photogrammetric Approach for Real-time 3D Localization and Tracking of Pedestrians in Monocular Infrared Imagery
(2014)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search