Safe Policy Optimization with Local Generalized Linear Function ApproximationsAkifumi WachiYunyue Weiet al.2021NeurIPS 2021Conference paper
Safe reinforcement learning in constrained markov decision processesAkifumi WachiYanan Sui2020ICML 2020Conference paper