Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!Xiangyu QiYi Zenget al.2024ICLR 2024
SalUn: Empowering Machine Unlearning via Gradient-Based Weight Saliency in Both Image Classification and GenerationChongyu FanJiancheng Liuet al.2024ICLR 2024
DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic ManipulationZilin SiGu Zhanget al.2024ICLR 2024
COVLM: COMPOSING VISUAL ENTITIES AND RELATIONSHIPS IN LARGE LANGUAGE MODELS VIA COMMUNICATIVE DECODINGJunyan LiDelin Chenet al.2024ICLR 2024
THE DEVIL IS IN THE NEURONS: INTERPRETING AND MITIGATING SOCIAL BIASES IN PRE-TRAINED LANGUAGE MODELSYan LiuYu Liuet al.2024ICLR 2024