1
Action stable updating algorithm for policy gradient methods in continuous time
用于连续时间中策略梯度算法的动作稳定更新算法
No. 10, 2023 : 2928-2932,2944
doi:10.19734/j.issn.1001-3695.2023.02.0092