Michael Picheny, Zoltan Tuske, et al.
INTERSPEECH 2019
This paper describes the technical advances in IBM's conversational telephony submission to the DARPA-sponsored 2004 Rich Transcription evaluation (RT-04). These advances include a system architecture based on cross-adaptation; a new form of feature-based MPE training; the use of a full-scale discriminatively trained full covariance gaussian system; the use of septaphone cross-word acoustic context in static decoding graphs; and the incorporation of 2100 hours of training data in every system component. These advances reduced the error rate by approximately 21% relative, on the 2003 test set, over the best-performing system in last year's evaluation, and produced the best results on the RT-04 current and progress CTS data. © 2005 IEEE.
Michael Picheny, Zoltan Tuske, et al.
INTERSPEECH 2019
Hagen Soltau, Lidia Mangu, et al.
ASRU 2011
Mohamed Kamal Omar, Lidia Mangu
ICASSP 2007
Hagen Soltau, George Saon, et al.
IEEE Transactions on Audio, Speech and Language Processing