Cogprints

EXTENDED BAUM TRANSFORMATIONS FOR GENERAL FUNCTIONS, II

Kanevsky, Dr Dimitri (2005) EXTENDED BAUM TRANSFORMATIONS FOR GENERAL FUNCTIONS, II. [Departmental Technical Report] (Unpublished)

Abstract

The discriminative technique for estimating the parameters of Gaussian mixtures that is based on the Extended Baum transformations (EB) has had significant impact on the speech recognition community. A proof that these transformations increase the value of an objective function at each iteration (i.e., that they are so-called "growth transformations") was presented by the author two years ago for diagonal Gaussian mixture densities. In this paper the proof is extended to multidimensional multivariate Gaussian mixtures. The proof is based on a linearization process and an explicit growth estimate for linear forms of Gaussian mixtures.

Item Type: Departmental Technical Report
Keywords: Multidimensional Multivariate Gaussian Mixture, Extended Baum-Welch transformations, discriminative training, estimation of statistical parameters, maximum mutual information estimation
Subjects: Computer Science > Statistical Models
ID Code: 5058
Deposited By: Kanevsky, Dr Dimitri
Deposited On: 08 Aug 2006
Last Modified: 11 Mar 2011 08:56

References in Article

S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative Training of Subspace Constrained GMMs for Speech Recognition," to be submitted to IEEE Transactions on Speech and Audio Processing.

L. E. Baum and J. A. Eagon, "An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology," Bull. Amer. Math. Soc., vol. 73, pp. 360-363, 1967.

A. Gunawardana and W. Byrne, "Discriminative Speaker Adaptation with Conditional Maximum Likelihood Linear Regression," ICASSP, 2002.

P. S. Gopalakrishnan, D. Kanevsky, D. Nahamoo and A. Nadas, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Information Theory, vol. 37, no. 1, January 1991.

D. Kanevsky, "Growth Transformations for General Functions," RC22919 (W0309-163), September 25, 2003.

D. Kanevsky, "Extended Baum transformations for general functions," in Proc. ICASSP, 2004.

Y. Normandin, "An improved MMIE Training Algorithm for Speaker Independent, Small Vocabulary, Continuous Speech Recognition," Proc. ICASSP'91, pp. 537-540, 1991.

R. Schlüter, W. Macherey, B. Müller and H. Ney, "Comparison of discriminative training criteria and optimization methods for speech recognition," Speech Communication, vol. 34, pp. 287-310, 2001.

V. Valtchev, P. C. Woodland and S. J. Young, "Lattice-based Discriminative Training for Large Vocabulary Speech Recognition Systems," Speech Communication, vol. 22, pp. 303-314, 1996.
