Quick FMLLR for speaker adaptation in speech recognition

Balakrishnan Varadarajan; Daniel Povey; Stephen M. Chu

doi:10.1109/ICASSP.2008.4518605

ICASSP 2008

Conference paper

16 Sep 2008

Quick FMLLR for speaker adaptation in speech recognition

View publication

Abstract

Feature space Maximum Likelihood Linear Regression (fMLLR) is a widely used technique for speaker adaptation in HMM-based speech recognition. However, in extremely resource constrained systems the time required to perform the sufficient statistics accumulation for fMLLR adaptation can be considerable. In this paper we describe a novel method that can lead to significant reduction in the time taken for statistics accumulation while preserving the adaptation gains. The proposed Quick fMLLR (Q-fMLLR) algorithm is implemented in a state-of-the-art large-vocabulary continuous speech recognition system, and evaluated on a broadcast transcription task. We present results both in terms of the average likelihood after adaptation and the character error rate. It is shown that Q-fMLLR attains the performance of regular fMLLR with a fraction of the computation. ©2008 IEEE.

Paper