Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMsMegh ThakkarQuentin Fournieret al.2025ACL 2025
Combining Domain and Alignment Vectors to Achieve Better Knowledge-Safety Trade-offs in LLMsMegh ThakkarYash Moreet al.2024NeurIPS 2024
A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment TechniquesMegh ThakkarQuentin Fournieret al.2024ACL 2024