Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs. Swanand Ravindra Kadhe, Farhan Ahmed, et al. ICML 2024.
Simulating Iterative Human-AI Interaction in Programming with LLMs. Hussein Mozannar, Valerie Chen, et al. NeurIPS 2023.
Ground-Truth, Whose Truth? - Examining the Challenges with Annotating Toxic Text Datasets. Kofi Arhin, Ioana Baldini Soares, et al. NeurIPS 2021.
Your Fairness May Vary: Pretrained Language Model Fairness in Toxic Text Classification. Ioana Baldini Soares, Dennis Wei, et al. ACL 2022.
Performance guarantees for adaptive estimation of sparse signals. Dennis Wei, Alfred O. Hero. IEEE Trans. Inf. Theory, 2015.
A Statistical Interpretation of the Maximum Subarray Problem. Dennis Wei, Dmitry Malioutov. ICASSP 2023.
Who Should Predict? Exact Algorithms For Learning to Defer to Humans. Hussein Mozannar, Hunter Lang, et al. AISTATS 2023.
Convex Bounds on the Softmax Function with Applications to Robustness Verification. Dennis Wei, Haoze Wu, et al. AISTATS 2023.
Heavy Sets with Applications to Interpretable Machine Learning Diagnostics. Dmitry Malioutov, Sanjeeb Dash, et al. AISTATS 2023.