Value Alignment from Unstructured TextInkit PadhiKarthikeyan Natesan Ramamurthyet al.2024NeurIPS 2024
Better Bias Benchmarking of Language Models via Multi-factor AnalysisHannah PowersIoana Baldini Soareset al.2024NeurIPS 2024
SocialStigmaQA Spanish and Japanese - Towards Multicultural Adaptation of Social Bias BenchmarksClara Higuera CabañesRyo Iwakiet al.2024NeurIPS 2024
Consistency-based Black-box Uncertainty Quantification for Text-to-SQLDebarun BhattacharjyaBalaji Ganesanet al.2024NeurIPS 2024
Enhancing Reasoning to Adapt Large Language Models for Domain-Specific ApplicationsBo WenXin Zhang2024NeurIPS 2024
MemReasoner: A Memory-augmented LLM Architecture for Multi-hop ReasoningIrene KoSihui Daiet al.2024NeurIPS 2024
Memorization to Generalization: The Emergence of Diffusion Models from Associative MemoryBao PhamGabriel Rayaet al.2024NeurIPS 2024
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAIAmbrish RawatStefan Schoepfet al.2024NeurIPS 2024
Advanced Physics-AI Models for Rain Enhancement in Arid RegionsLloyd TreinishMukul Tewariet al.2024AGU 2024