Distilling Event Sequence Knowledge From Large Language ModelsSomin WadhwaOktie Hassanzadehet al.2024ISWC 2024
Exploring Vulnerabilities in LLMs: A Red Teaming Approach to Evaluate Social BiasYuya Jeremy OngJay Pankaj Galaet al.2024IEEE CISOSE 2024
AUTOLYCUS: Exploiting Explainable Artificial Intelligence (XAI) for Model Extraction Attacks against Interpretable ModelsAbdullah Caglar OksuzAnisa Halimiet al.2024PETS 2024
Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language ModelsYue ZhouYada Zhuet al.2024NAACL 2024
MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification ModelsGrace GuoLifu Denget al.2024FAccT 2024
Manifold-Aligned Counterfactual Explanations for Neural NetworksGeorgia PerakisWei Sunet al.2024AISTATS 2024
Learning Granger Causality from Instance-wise Self-attentive Hawkes ProcessesDongxia WuIde-San Ideet al.2024AISTATS 2024
PROMINET: Prototype-based Multi-View Network for Interpretable Email Response PredictionYuqing WangPrashanth Vijayaraghavanet al.2023EMNLP 2023