Exploring Non-linear Transformations for an Entropy-based Voice Activity Detector
Publication date
2009
ISBN
9788493618681
Abstract
In this paper we explore the use of non-linear transformations to
improve the performance of an entropy-based voice activity detector
(VAD). The idea of using a non-linear transformation comes from
previous work in the field of speech linear prediction (LPC) based on source
separation techniques, where the score function was added to the classical
equations in order to take into account the real distribution of the signal. We
explore the possibility of estimating the entropy of frames after calculating their
score function, instead of using the original frames. We observe that if the signal is
clean, the estimated entropy is essentially the same; but if the signal is noisy,
the transformed frames (with the score function) yield different entropy values
for voiced and unvoiced frames. Experimental results show that this
makes it possible to detect voice activity under high noise, where the simple entropy
method fails.
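
The abstract does not say how the entropy or the score function are estimated, so the following is only a minimal sketch of the general idea under stated assumptions: the entropy of each frame is taken from a histogram of its samples, and the score function psi(x) = -p'(x)/p(x) is estimated with a Gaussian kernel density estimate. The function names, the frame length, the hop size, the number of bins and the bandwidth rule are all illustrative choices, not the authors' settings.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames (one frame per row)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def histogram_entropy(frame, n_bins=32):
    """Shannon entropy (bits) of a histogram of the frame's samples.
    Assumption: histogram estimator; the paper may use a different one."""
    counts, _ = np.histogram(frame, bins=n_bins)
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def score_function(frame, bandwidth=None):
    """Estimate psi(x) = -p'(x)/p(x) at each sample of the frame.
    Assumption: Gaussian kernel density estimate of p and p'."""
    x = np.asarray(frame, dtype=float)
    n = x.size
    if bandwidth is None:
        # Silverman-style rule of thumb; the floor avoids a zero bandwidth
        bandwidth = max(1.06 * x.std() * n ** (-0.2), 1e-12)
    diff = x[:, None] - x[None, :]               # pairwise x_i - x_j
    k = np.exp(-0.5 * (diff / bandwidth) ** 2)   # Gaussian kernel weights
    p = k.sum(axis=1)                            # density (unnormalised)
    dp = (-diff / bandwidth ** 2 * k).sum(axis=1)  # derivative (same scale)
    return -dp / p                               # constants cancel in the ratio

def entropy_features(x, frame_len=256, hop=128, use_score=True):
    """Per-frame entropy, computed on raw or score-transformed frames."""
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    if use_score:
        frames = np.array([score_function(f) for f in frames])
    return np.array([histogram_entropy(f) for f in frames])
```

Calling `entropy_features(signal, use_score=False)` and `entropy_features(signal, use_score=True)` on the same recording gives the two per-frame entropy tracks that the abstract compares. How a voiced/unvoiced decision is derived from them (for example, a threshold) is not described in this record and is left open here.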
Document type
Book chapter or part
Language
English
Keywords
Speech processing
Pages
8 p.
Citation
J. Solé-Casals, P. Martí-Puig, R. Reig-Bolaño, "Exploring Non-linear Transformations for an Entropy based Voice Activity Detector", Workshop on Non-linear Speech Processing - NOLISP, Vic (Spain), 2009.
Rights
(c) Universitat de Vic
All rights reserved