R. Donovan
ICASSP 2000
In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.
R. Donovan
ICASSP 2000
Jiri Navratil, Jan Kleindienst, et al.
ICASSP 2000
Mukund Padmanabhan, Lalit R. Bahl, et al.
IEEE Transactions on Speech and Audio Processing
Mukund Padmanabhan, Ken Martin
IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing