Score Function for Voice Activity Detection
View/Open
Other authors
Publication date
2010ISBN
9783642115080
Abstract
In this paper we explore the use of non-linear transformations in order
to improve the performance of an entropy based voice activity detector
(VAD). The idea of using a non-linear transformation comes from some previous
work done in speech linear prediction (LPC) field based in source separation
techniques, where the score function was added into the classical equations
in order to take into account the real distribution of the signal. We explore the
possibility of estimating the entropy of frames after calculating its score function,
instead of using original frames. We observe that jf signal is clean, estimated
entropy is essentially the same; but if signal is noisy transformed frames
(with score function) are able to give different entropy if the frame is voiced
against unvoiced ones. Experimental results show that this fact permits to detect
voice activity under high noise, where simple entropy method fails.
Document Type
Object of conference
Language
English
Keywords
Processament de la parla
Pages
8 p.
Publisher
Springer
Citation
Solé casals, Jordi [et al.]. "Score Function for Voice Activity Detection". A: International conference on non-linear speech processing. "Advances in nonlinear speech processing : international conference on nonlinear speech processing, NOLISP 2009 : Vic, Spain, June 25-27, 2009 : revised selected papers". Vic: Springer, 2009, p. 76-83.
This item appears in the following Collection(s)
- Documents de Congressos [174]
Rights
(c) Springer (The original publication is available at www.springerlink.com)
Tots els drets reservats