Shape it Up! Restoring LLM Safety during FinetuningShengyun PengPin-Yu Chenet al.2025NeurIPS 2025Conference paper
Dense Associative Memory Through the Lens of Random FeaturesBenjamin HooverDuen Horng Chauet al.2024NeurIPS 2024Conference paper
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language ModelsShengyun PengPin-Yu Chenet al.2024NeurIPS 2024Conference paper