Generating Stable Materials with Large Language Model Reasoning and Reinforcement Learning. Zhang-Wei Hong, Nofit Segal, et al. 2025. NeurIPS 2025.