SALMON: SELF-ALIGNMENT WITH INSTRUCTABLE REWARD MODELSZhiqing SunYikang Shenet al.2024ICLR 2024Conference paper
HAZARD CHALLENGE: EMBODIED DECISION MAKING IN DYNAMICALLY CHANGING ENVIRONMENTSQinhong ZhouSunli Chenet al.2024ICLR 2024Conference paper
Visual Chain-of-Thought Prompting for Knowledge-Based Visual ReasoningZhenfang ChenQinhong Zhouet al.2024AAAI 2024Conference paper