Cogprints

Generalization of Extended Baum-Welch Parameter Estimation for Discriminative Training and Decoding

Kanevsky, Dr Dimitri and Sainath, Dr Tara and Ramabhadran, Dr Bhuvana and Nahamoo, Dr David (2008) Generalization of Extended Baum-Welch Parameter Estimation for Discriminative Training and Decoding. [Preprint]

Full text available as:

[img]
Preview
PDF (We demonstrate the generalizability of the Extended Baum-Welch (EBW) algorithm not only for HMM parameter estimation but for decoding as well.)
177Kb

Abstract

We demonstrate the generalizability of the Extended Baum-Welch (EBW) algorithm not only for HMM parameter estimation but for decoding as well. We show that there can exist a general function associated with the objective function under EBW that reduces to the well-known auxiliary function used in the Baum-Welch algorithm for maximum likelihood estimates. We generalize representation for the updates of model parameters by making use of a differentiable function (such as arithmetic or geometric mean) on the updated and current model parameters and describe their effect on the learning rate during HMM parameter estimation. Improvements on speech recognition tasks are also presented here.

Item Type:Preprint
Keywords:Extended Baum-Welch, optimization, speech
Subjects:Computer Science > Speech
ID Code:6038
Deposited By: Kanevsky, Dr Dimitri
Deposited On:30 Apr 2008 18:34
Last Modified:11 Mar 2011 08:57

References in Article

Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in cogprints you will be forwarded to the paracite service. Poorly formated references will probably not work.

C.Liu and P.Liu and H.Jiang F.Soong and R. Wang, Constrained Line Search Optimization for Discriminative Training in Speech Recognition, ICASSP, 2007

C.Liu and P.Liu and H.Jiang F.Soong and R. Wang, Constrained Line Search Approach to General Discriminative HMM Training, ASRU 2007

P.S. Gopalakrishnan and D. Kanevsky and D. Nahamoo and A. Nadas, An Inequality for Rational Functions with Applications to Some Statistical Estimation Problems, IEEE Trans. Information Theory, 1991, v.37, n. 1, January

D. Kanevsky, Extended Baum Transformations for General Functions, Proc. ICASSP, 2004,

D. Kanevsky, Extended Baum Transformations For General Functions, II, Human Language Technologies, IBM, 2005, RC23645(W0506-120)

T. N. Sainath and D. Kanevsky and B. Ramabhadran, Gradient Steepness Metrics Using Extended Baum-Welch Transformations for Universal Pattern Recognition Tasks, Proc. ICASSP, April, 2008

T. N. Sainath and V. Zue and D. Kanevsky, Audio Classification using EBW Transformations, Proc. Interspeech, 2007

T. N. Sainath and D. Kanevsky and B. Ramabhadran, Broad Phoentic Recognition in a Hidden Markov Model Framework Using Extended Baum-Welch Transformations, Proc. ASRU, 2007

D. Povey and B.Kingsbury, Evaluation of Proposed Modifications to MPE for Large Scale Discriminative Training, Proc. ICASSP, 2007

D. Povey et. al, Boosted MMI for Model and Feature-space Discriminative Training, Proc. ICASSP, 2008

P.C. Woodland and D. Povey, Large Scale Discriminative Training of Hidden Markov Models for Speech Recognition, Computer Speech and Language, 2002, vol. 16, n. 1, pp. 25-47, January

Metadata

Repository Staff Only: item control page