A Graph per Persona: Reasoning about Subjective Natural Language DescriptionsEunjeong HwangVered Shwartzet al.2024ACL 2024
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language ModelsShreyas BasavatiaKeerthiram Murugesanet al.2024ACL 2024
Data Contamination Report from the 2024 CONDA Shared TaskOscar SainzIker García-ferreroet al.2024ACL 2024
A Framework for Agents Guiding Foundation Models through Knowledge and ReasoningDebarun BhattacharjyaJunkyu Leeet al.2024IJCAI 2024
Trust Regions for Explanations via Black-Box Probabilistic CertificationAmit DhurandharSwagatam Haldaret al.2024ICML 2024
AUTOLYCUS: Exploiting Explainable Artificial Intelligence (XAI) for Model Extraction Attacks against Interpretable ModelsAbdullah Caglar OksuzAnisa Halimiet al.2024PETS 2024
Exploring Vulnerabilities in LLMs: A Red Teaming Approach to Evaluate Social BiasYuya Jeremy OngJay Pankaj Galaet al.2024IEEE CISOSE 2024
Effective In-Silico Gene Perturbation by Machine Learning Model Interpretation for ImmunotherapiesTanwi BiswasAkira Kosekiet al.2024ISMB 2024
Transformer Models with Explainability for IT Telemetry and Business EventsShiau Hong LimLaura Wynter2024SSE 2024