Christoph Hagleitner, Charles Johns, et al.
IEEE JVA Symposium 2023
Designing stable crystal structures is central to accelerating the discovery of new materials, yet most generative approaches remain limited to reproducing known patterns rather than exploring novel possibilities. We present a method that trains large language models with reinforcement learning guided by verifiable energy-based rewards, optimizing toward physically grounded stability objectives. Compared to supervised finetuning and base models, our reinforcement learning–trained model generates crystals with higher predicted stability and a greater proportion of previously unreported structures. These results suggest that combining verifiable energy rewards and reinforcement learning provides a powerful path toward automated discovery of novel, stable materials.
Christoph Hagleitner, Charles Johns, et al.
IEEE JVA Symposium 2023
Lukas Heuberger, Daniel Messmer, et al.
Advanced Science
Christodoulos Constantinides, Dhaval Patel, et al.
NeurIPS 2025
Lisa Hamada, Akihiro Kishimoto, et al.
NeurIPS 2025