Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model PerformanceAladin DjuheraSwanand Ravindra Kadheet al.2025NeurIPS 2025
Objective Soups: Multilingual Multi-Task Modeling for Speech ProcessingA SaifLisha Chenet al.2025NeurIPS 2025
Rollout Roulette: A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo MethodsIsha PuriShivchander Sudalairajet al.2025NeurIPS 2025
Structured Sparse Transition Matrices to Enable State Tracking in State-Space ModelsAleksandar TerzicNicolas Menetet al.2025NeurIPS 2025
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample ComplexityGholamali AminianAmir R Asadiet al.2025NeurIPS 2025
Latent Principle Discovery for Language Model Self-ImprovementKeshav RamjiTahira Naseemet al.2025NeurIPS 2025
Representation Similarity Reveals Implicit Layer Grouping in Neural NetworksTian GaoAmit Dhurandharet al.2025NeurIPS 2025
Language Model Enabled Structure Prediction from Infrared Spectra of MixturesMarvin AlbertsFilippo Ficarraet al.2025NeurIPS 2025
Geospatial Chain of Thought Reasoning for Enhanced Visual Question Answering on Satellite ImageryShambhavi ShankerManikandan Padmanabanet al.2025NeurIPS 2025