Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASRTakashi FukudaGakuto Kurataet al.2025ICASSP 2025
LLM based Text Generation for Improved Low-resource Speech Recognition ModelsTohru NaganoGakuto Kurataet al.2025ICASSP 2025
Beyond neuropsychological tests: AI speech analysis in PKUSusan WaisbrenKely Norelet al.2024J. Inherit. Metab. Dis.
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation ModelsYuchen HuChen Chenet al.2024NeurIPS 2024
Robust ASR Error Correction with Conservative Data FilteringTakuma UdagawaMasayuki Suzukiet al.2024EMNLP 2024
M2 ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective OptimizationA SaifLisha Chenet al.2024INTERSPEECH 2024
Exploring the limits of decoder-only models trained on public speech recognition corporaAnkit GuptaGeorge Saonet al.2024INTERSPEECH 2024
Low Bitrate High-Quality RVQGAN-based Discrete Speech TokenizerSlava ShechtmanAvihu Dekel2024INTERSPEECH 2024