Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language ModelsShengyun PengPin-Yu Chenet al.2024NeurIPS 2024
Dense Associative Memory Through the Lens of Random FeaturesBenjamin HooverDuen Horng Chauet al.2024NeurIPS 2024