Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning SkillsChangsheng WangChongyu Fanet al.2025EMNLP 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssistElizabeth DalyErik Miehlinget al.2025EMNLP 2025
Optimistic Exploration for Risk-Averse Constrained Reinforcement LearningRadu MarinescuElizabeth Dalyet al.2025ECAI 2025
XABPs: Towards eXplainable Autonomous Business ProcessesPeter FettkeFabiana Fournieret al.2025ECAI 2025
Agentic Process Observability: Discovering Behavioral VariabilityFabiana FournierLior Limonadet al.2025ECAI 2025
Exposing AI Bias by Crowdsourcing: Democratizing Critique of Large Language ModelsHangzhi GuoPranav Venkitet al.2025AIES 2025