CharED: Character-wise Ensemble Decoding for Large Language ModelsKevin GuEva Tueckeet al.2024ICML 2024
Improving Performance Prediction of Electrolyte Formulations with Transformer-based Molecular Representation ModelIndra Priyadarsini SVidushi Sharmaet al.2024ICML 2024
How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability?Hongkang LiMeng Wenget al.2024ICML 2024