A non-linear VAD for noisy environments
Other authors
Publication date
2010ISSN
1866-9956
Abstract
This paper deals with non-linear transformations for improving the
performance of an entropy-based voice activity detector (VAD). The idea to use
a non-linear transformation has already been applied in the field of speech
linear prediction, or linear predictive coding (LPC), based on source separation
techniques, where a score function is added to classical equations in order to
take into account the true distribution of the signal. We explore the possibility
of estimating the entropy of frames after calculating its score function, instead
of using original frames. We observe that if the signal is clean, the estimated
entropy is essentially the same; if the signal is noisy, however, the frames
transformed using the score function may give entropy that is different in
voiced frames as compared to nonvoiced ones. Experimental evidence is given
to show that this fact enables voice activity detection under high noise, where
the simple entropy method fails.
Document Type
Article
Language
English
Keywords
Veu, Processament de
Pages
11 p.
Publisher
Springer
Citation
Solé Casals, J. & Zaiats Protchenko, V. 2010, "A Non-Linear VAD for Noisy Environments", Cognitive Computation, vol. 2, no. 3, pp. 191-198.
This item appears in the following Collection(s)
- Articles [1389]
Rights
Tots els drets reservats
(c) Springer (The original publication is available at www.springerlink.com)