Task-driven Sensing with Coarse-to-Fine Glimpse-based Active Perception. Oleh Kolner, Thomas Bohnstingl, et al. 2025. NeurIPS 2025.
Representation Similarity Reveals Implicit Layer Grouping in Neural Networks. Tian Gao, Amit Dhurandhar, et al. 2025. NeurIPS 2025.
Licence to Scale: A Microservice Simulation Environment for Benchmarking Agentic AI. Christopher Lohse, Adrian Selk, et al. 2025. NeurIPS 2025.
Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation. Jung koo Kang. 2025. NeurIPS 2025.
Uncertainty-Aware Prediction of Climate Extremes Using Fine-Tuned Time-Series Foundation Models. Imran Nasim, Joao Lucas de Sousa Almeida. 2025. NeurIPS 2025.
SafeCOMM: Investigating Safety Degradation in Fine-Tuned Telecom Large Language Models. Aladin Djuhera, Swanand Ravindra Kadhe, et al. 2025. NeurIPS 2025.
MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation. Basel Shbita, Farhan Ahmed, et al. 2025. NeurIPS 2025.
FlowState: Sampling-Rate Invariant Time Series Foundation Model with Dynamic Forecasting Horizons. Lars Graf, Thomas Bohnstingl, et al. 2025. NeurIPS 2025.
Automatic Correction of AI Reports using Fact-Checking Model-guided LLMs. Razi Mahmood, Pingkun Yan, et al. 2025. NeurIPS 2025.