A non-linear VAD for noisy environments
Otros/as autores/as
Fecha de publicación
2010ISSN
1866-9956
Resumen
This paper deals with non-linear transformations for improving the
performance of an entropy-based voice activity detector (VAD). The idea to use
a non-linear transformation has already been applied in the field of speech
linear prediction, or linear predictive coding (LPC), based on source separation
techniques, where a score function is added to classical equations in order to
take into account the true distribution of the signal. We explore the possibility
of estimating the entropy of frames after calculating its score function, instead
of using original frames. We observe that if the signal is clean, the estimated
entropy is essentially the same; if the signal is noisy, however, the frames
transformed using the score function may give entropy that is different in
voiced frames as compared to nonvoiced ones. Experimental evidence is given
to show that this fact enables voice activity detection under high noise, where
the simple entropy method fails.
Tipo de documento
Artículo
Lengua
Inglés
Palabras clave
Veu, Processament de
Páginas
11 p.
Publicado por
Springer
Citación
Solé Casals, J. & Zaiats Protchenko, V. 2010, "A Non-Linear VAD for Noisy Environments", Cognitive Computation, vol. 2, no. 3, pp. 191-198.
Este ítem aparece en la(s) siguiente(s) colección(ones)
- Articles [1389]
Derechos
Tots els drets reservats
(c) Springer (The original publication is available at www.springerlink.com)