A non-linear VAD for noisy environments
Visualitza/Obre
Altres autors/es
Data de publicació
2010ISSN
1866-9956
Resum
This paper deals with non-linear transformations for improving the
performance of an entropy-based voice activity detector (VAD). The idea to use
a non-linear transformation has already been applied in the field of speech
linear prediction, or linear predictive coding (LPC), based on source separation
techniques, where a score function is added to classical equations in order to
take into account the true distribution of the signal. We explore the possibility
of estimating the entropy of frames after calculating its score function, instead
of using original frames. We observe that if the signal is clean, the estimated
entropy is essentially the same; if the signal is noisy, however, the frames
transformed using the score function may give entropy that is different in
voiced frames as compared to nonvoiced ones. Experimental evidence is given
to show that this fact enables voice activity detection under high noise, where
the simple entropy method fails.
Tipus de document
Article
Llengua
Anglès
Paraules clau
Veu, Processament de
Pàgines
11 p.
Publicat per
Springer
Citació
Solé Casals, J. & Zaiats Protchenko, V. 2010, "A Non-Linear VAD for Noisy Environments", Cognitive Computation, vol. 2, no. 3, pp. 191-198.
Aquest element apareix en la col·lecció o col·leccions següent(s)
- Articles [1389]
Drets
Tots els drets reservats
(c) Springer (The original publication is available at www.springerlink.com)