Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMsSwanand Ravindra KadheFarhan Ahmedet al.2024ICML 2024
Humans Linguistically Align to their Conversational Partners, and Language Models Should TooRachel OstrandSara Berger2024ICML 2024
CharED: Character-wise Ensemble Decoding for Large Language ModelsKevin GuEva Tueckeet al.2024ICML 2024
Exploring Vulnerabilities in LLMs: A Red Teaming Approach to Evaluate Social BiasYuya Jeremy OngJay Pankaj Galaet al.2024IEEE CISOSE 2024
Quantifying Representation Reliability in Self-Supervised Learning ModelsYoung Jin ParkHao Wanget al.2024UAI 2024
Contextualizing Single-Cell Analyses: An AI Pipeline for Evidence Search from Literature and Gene DatabasesJoao Bettencourt-SilvaNatasha Mulliganet al.2024ISMB 2024
Effect of dataset partitioning strategies for evaluating out-of-distribution generalisation for predictive models in biochemistryRaúl Fernández DíazLam Thanh Hoanget al.2024ISMB 2024