关键词:n阶机械臂单多智能体单智能体参考文档:1.《Proximal Policy Optimization Algorithms》2.《Asynchronous Methods for Deep Reinforcement Learning》3.《High-Dimensional Continuous Control Using Generalized Advantage Estimation》仿真平台:MATLAB、SIMULINK主要内容:利用MATLAB和强化学习控制机械臂到达目标点,现有二维代码需定制为三维。