Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. Xiaodong Cui, George Saon, et al. INTERSPEECH 2022.
Effective Training of RNN Transducer Models on Diverse Sources of Speech and Text Data. Takashi Fukuda, Samuel Thomas. ICASSP 2023.
Global RNN Transducer Models For Multi-dialect Speech Recognition. Takashi Fukuda, Samuel Thomas, et al. INTERSPEECH 2022.
Improving ASR Robustness in Noisy Condition Through VAD Integration. Sashi Novitasari, Takashi Fukuda, et al. INTERSPEECH 2022.
Knowledge distillation based training of universal ASR source models for cross-lingual transfer. Takashi Fukuda, Samuel Thomas. INTERSPEECH 2021.
Generalized Knowledge Distillation from An Ensemble of Specialized Teachers Leveraging Unsupervised Neural Clustering. Takashi Fukuda, Gakuto Kurata. ICASSP 2021.
Implicit transfer of privileged acoustic information in a generalized knowledge distillation framework. Takashi Fukuda, Samuel Thomas. INTERSPEECH 2020.
Mixed Bandwidth Acoustic Modeling Leveraging Knowledge Distillation. Takashi Fukuda, Samuel Thomas. ASRU 2019.
Data Augmentation Based on Vowel Stretch for Improving Children's Speech Recognition. Tohru Nagano, Takashi Fukuda, et al. ASRU 2019.
Direct neuron-wise fusion of cognate neural networks. Takashi Fukuda, Masayuki Suzuki, et al. INTERSPEECH 2019.