Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis. Orit Davidovich, Shimrit Shtern, et al. 2025. arXiv.
Evaluation of partitioning algorithms for trustworthy out-of-distribution evaluation of machine learning models in biochemistry. Raúl Fernández Díaz, Lam Thanh Hoang, et al. 2025. VIBE 2025.
Transfer Learning on Edge Using 14nm CMOS-compatible ReRAM Array and Analog In-memory Training Algorithm. Takashi Ando, Omobayode Fagbohungbe, et al. 2025. IEDM 2025.
Flick: Empowering Federated Learning with Commonsense Knowledge. Ran Zhu, Mingkun Yang, et al. 2025. NeurIPS 2025.
Latent Principle Discovery for Language Model Self-Improvement. Keshav Ramji, Tahira Naseem, et al. 2025. NeurIPS 2025.
Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance. Aladin Djuhera, Swanand Ravindra Kadhe, et al. 2025. NeurIPS 2025.
Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models. Aleksandar Terzic, Nicolas Menet, et al. 2025. NeurIPS 2025.
Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions. Zhaoxian Wu, Quan Xiao, et al. 2025. NeurIPS 2025.
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity. Gholamali Aminian, Amir R Asadi, et al. 2025. NeurIPS 2025.