Iain Matthews, Gerasimos Potamianos, et al.
ICME 2001
In this paper, we introduce a non-linear enhancement technique called Audio-Visual Codebook Dependent Cepstral Normalization (AVCDCN) and we consider its use with both audio-only and audio-visual speech recognition. AVCDCN is inspired from CDCN [1] [2], an audio-only enhancement technique that approximates the non-linear effect of noise on speech with a piece-wise constant function. Our experiments show that the use of visual information in AVCDCN allows significant performance gains over CDCN.
Iain Matthews, Gerasimos Potamianos, et al.
ICME 2001
Sabine Deligne, Ellen Eide, et al.
INTERSPEECH - Eurospeech 2001
Jennifer C. Lai, Kwan Min Lee
ICSLP 2002
Sabine Deligne
ICSLP 2000