A Systematic Benchmarking Methodology for Efficient LLM Inference EvaluationZhuoran LiuNelson Mimura Gonzalezet al.2025SC 2025
Training-Control-as-Code: Towards a declarative solution to control trainingPadmanabha Venkatagiri SeshadriHarikrishnan Balagopalet al.2025ASE 2025
Automatically Calculated Context-Sensitive Features of Connected Speech Improve Prediction of Impairment in Alzheimer's DiseaseGraham FlickRachel Ostrand2025J. Speech Lang. Hear. Res.
Declarative Techniques for NL Queries over Heterogeneous DataElham KhabiriJeff Kephartet al.2025EMNLP 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language ModelsDebarun BhattacharjyaBalaji Ganesanet al.2025EMNLP 2025
FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language ModelsRadu MarinescuDebarun Bhattacharjyaet al.2025EMNLP 2025
Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health MonitoringShuxin LinDhaval Patelet al.2025EMNLP 2025
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssistElizabeth DalyErik Miehlinget al.2025EMNLP 2025