A system for predicting health of an e-contract
Nishtha Madaan, Shashank Mujumdar, et al.
SCC 2018
Most real world applications of multi-agent systems, need to keep a balance between maximizing the rewards and minimizing the risks. In this work we consider a popular risk measure, variance of return (VOR), as a constraint in the agent's policy learning algorithm in the mixed cooperative and competitive environments. We present a multi-timescale actor critic method for risk sensitive Markov games where the risk is modeled as a VOR constraint. We also show that the risk-averse policies satisfy the desired risk constraint without compromising much on the overall reward for a popular task.
Nishtha Madaan, Shashank Mujumdar, et al.
SCC 2018
Raghuram Bharadwaj Diddigi, K. J. Prabuchandran, et al.
AAMAS 2019
Nanjangud Narendra, Karthikeyan Ponnalagu, et al.
ITSC 2015
Rohith Dwarakanath Vallam, Sarthak Ahuja, et al.
AAMAS 2019