TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning. Han Cai, Chuang Gan, et al. NeurIPS 2020.
HAT: Hardware-Aware Transformers for Efficient Neural Machine Translation. Hanrui Wang, Zhanghao Wu, et al. ACL 2020.
Once for All: Train One Network and Specialize it for Efficient Deployment. Han Cai, Chuang Gan, et al. ICLR 2020.