融合确定性策略梯度与可行策略迭代的PID参数优化方法
娄亚军, 刘一达, 郭聪聪, 宋鑫, 徐士铎
A PID Parameter Optimization Method Integrating Deterministic Policy Gradient and Feasible Policy Iteration
LOU Ya-jun, LIU Yi-da, GUO Cong-cong, SONG Xin, XU Shi-duo
制造业自动化
.
2026, (1): 145
-154
.
DOI: 10.3969/j.issn.1009-0134.2026.01.016