Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health MonitoringShuxin LinDhaval Patelet al.2025EMNLP 2025Paper
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025Paper
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssistElizabeth DalyErik Miehlinget al.2025EMNLP 2025Demo paper
An Automatically Improving Method for Generating Descriptions of Financial Data Quality Grading with LLMsYang ZhaoYohei Ikawaet al.2025EMNLP 2025Workshop paper
Exploring Cooperative Behavior in LLMs with Game TheoryAylin GunalBaihan Linet al.2025EMNLP 2025Workshop paper