Recent Advances in Robust Speech Recognition Technology

Using GARCH Process for Voice Activity Detection

Author(s): Rasool Tahmasbi

Pp: 13-29 (17)

DOI: 10.2174/978160805172411101010013

* (Excluding Mailing and Handling)

Abstract

GARCH (Generalized Autoregressive Conditional Heteroscedasticity) models are new statistical methods that are used especially in economic time series. There is a consensus that speech signals exhibit variance that changes through time. GARCH models are a popular choice to model these changing variances. In this chapter, we propose three methods for VAD, which are based on GARCH models. In the first method, heteroscedasticity will be modeled by GARCH process and hard detection is the result of comparing a Multiple Observation Likelihood Ratio Test (MOLRT) with a threshold function. In the second method, no distinct probability functions are assumed for speech and noise distributions and no LRT is employed. We will show that VAD is related to the parameter constancy test in GARCH process. For testing parameter constancy in GARCH models, the algorithm of the Cramer-von Mises (CVM) test is described. In the last method, the process of outlier detection in GARCH model is presented and the motivation for using it and its relation with VAD are discussed.


Keywords: voice activity detection, generalized autoregressive conditional heteroscedasticity process.

Related Journals
Related Books
© 2024 Bentham Science Publishers | Privacy Policy