News
This letter proposes a model-free safe RL algorithm that achieves near-zero constraint violations with high rewards. Our key idea is to jointly learn a policy and a neural barrier certificate under ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results