Dmitri Maslov, Jin-Sung Kim, et al.
APS March Meeting 2021
Discovery and development of polymer materials is driven by experimental data acquisition. Experiments unfold under conditions of delayed rewards on incredibly rich landscapes shaped by multiple experimental degrees of freedom, including continuous (concentration, temperature, radiation, time) and categorical (monomers, catalysts, initiators, solvents) [1,2]. Deep reinforcement learning (RL) emerges as an appealing approach with a capability to interact with lab equipment, handle delayed rewards, and find non-trivial research strategies under realistic constraints of discovery/development projects. We report development of an end-to-end RL approach applied to preparation of spin-on-glasses (SOGs). The primary focus of the talk is meta-learning strategies [3] that ensure generalizability of the RL agent performance, and associated task of data augmentation at the training stage.