Scott Axelrod, Vaibhava Goel, et al.
INTERSPEECH - Eurospeech 2003
In this paper, we propose a new fast and flexible algorithm based on the maximum entropy (MAXENT) criterion to estimate stream weights in a state-synchronous multi-stream HMM. The technique is compared to the minimum classification error (MCE) criterion and to a brute-force, grid-search optimization of the WER on both a small and a large vocabulary audio-visual continuous speech recognition task. When estimating global stream weights, the MAXENT approach gives comparable results to the grid-search and the MCE. Estimation of state dependent weights is also considered: We observe significant improvements in both the MAXENT and MCE criteria, which, however, do not result in significant WER gains.
Scott Axelrod, Vaibhava Goel, et al.
INTERSPEECH - Eurospeech 2003
Jintao Jiang, Gerasimos Potamianos, et al.
ICASSP 2004
Gerasimos Potamianos, Juergen Luettin, et al.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Scott Axelrod, Ramesh Gopinath, et al.
ICSLP 2002