基于深度强化学习的空间机械臂柔顺捕获控制方法研究

doi:10.3969/j.issn.1674 1579.2022.01.001

摘要/Abstract

摘要： 针对空间机械臂在轨捕获问题，提出了一种基于深度强化学习原理的柔顺捕获控制方法，采用深度确定性策略梯度算法设计了控制器.在仿真环境中使用6自由度机械臂对特定质量、初始速度的目标进行了大量抓捕训练，使得控制器能够根据机械臂状态输出合适的力矩，促使目标运动速度最终趋近于0并能够有效降低交互过程中的冲击力.同时，对于不同质量和初始速度的目标，该控制器同样具备良好的适应性并可实现柔顺捕获.与传统基于阻抗控制原理的柔顺控制方法相比，该方法能够减小碰撞过程的最大冲击力，实现不依赖模型的柔顺控制，经工程化改进后有望应用于空间智能捕获任务中.

关键词: 深度强化学习, 深度确定性策略梯度, 机械臂, 柔顺捕获

Abstract: A compliant capture control method based on the principle of deep reinforcement learning is proposed to solve the in orbit capture problem by a space manipulator. In the simulation environment the controller is trained to use a 6 DOF manipulator to capture the target, which has specified mass and initial velocity. The controller learns to output appropriate control forces according to the states of the manipulator and it can make the target speed eventually approach to 0 and effectively reduce the impact force. The controller also shows good compliance interaction performance for targets with different mass and initial velocity. Compared with the traditional compliant control method based on the principle of impedance control, this method can reduce the max impact force effectively and realize the model independent control. After improving for space application, it is expected to be used in an intelligence capture task.

Key words: deep reinforcement learning, deep deterministic policy gradient (DDPG), manipulator, compliant capture

中图分类号:

V448.2

文闻, 周元子, 周晓东, 陶东. 基于深度强化学习的空间机械臂柔顺捕获控制方法研究[J]. 空间控制技术与应用, 2022, 48(1): 1-8.

WEN Wen, ZHOU Yuanzi, ZHOU Xiaodong, TAO Dong. On Compliant Capture Control Method by Space Manipulator Based on Deep Reinforcement Learning[J]. Aerospace Contrd and Application, 2022, 48(1): 1-8.

0
/ 收藏文章 0 / 推荐

导出引用管理器 EndNote|Reference Manager|ProCite|BibTeX|RefWorks

链接本文: http://journal01.magtech.org.cn/Jwk3_kjkzjs/CN/10.3969/j.issn.1674 1579.2022.01.001

http://journal01.magtech.org.cn/Jwk3_kjkzjs/CN/Y2022/V48/I1/1

参考文献

Metrics

Viewed

Full text

110

HTML			PDF

Just accepted	Online first	Issue	Just accepted	Online first	Issue
0	0	0	0	0	110

From	Others	local

Times	2	108
Rate	2%	98%

Abstract

377

Just accepted	Online first	Issue

0	0	377

From	Others	local

Times	376	1
Rate	100%	0%

Cited

Web of Science	Crossref	ScienceDirect	Search for Citations in Google Scholar >>


This page requires you have already subscribed to WoS.

Shared

[1]	张月龄, 向国菲, 税懿, 佃松宜. 基于解耦双通道线性自抗扰控制的连续型机械臂轨迹跟踪策略[J]. 空间控制技术与应用, 2020, 46(5): 27-35.
[2]	张双, 赵涛, 佃松宜, 胡怡, 江浩. 机械臂的区间二型模糊超螺旋滑模控制[J]. 空间控制技术与应用, 2019, 45(3): 44-.
[3]	吴昊, 谭元, 郭小龙, 毛新涛. 一种基于模型预测控制的柔性关节空间机械臂的轨迹跟踪控制#br#[J]. 空间控制技术与应用, 2019, 45(2): 35-.
[4]	李焕, 王奉文, 徐世杰, 侯月阳, 卢山. 基于阻抗控制的机械臂末端工具的柔顺控制[J]. 空间控制技术与应用, 2019, 45(1): 20-.
[5]	吴昊, 郭小龙, 谭元, 毛新涛. 一种基于控制参数化方法的柔性关节机械臂的#br# 最优PID参数整定方法#br#[J]. 空间控制技术与应用, 2019, 45(1): 27-.
[6]	郭小龙, 郭敏华, 谭元, 曹函宇, 佃松宜, 李彬. 一种基于控制参数化的双连杆机械臂最优PID参数整定方法[J]. 空间控制技术与应用, 2018, 44(5): 70-75.
[7]	黎凯, 张尧, 陈余军. 柔性空间机械臂在轨操作仿真与分析[J]. 空间控制技术与应用, 2018, 44(5): 60-69.
[8]	邱志成, 李城. 基于加速度反馈的两杆柔性机械臂振动控制与实验研究[J]. 空间控制技术与应用, 2018, 44(5): 1-6.
[9]	倪风雷, 林鹏飞, 邹添. 基于六维加速度传感器的大型机械臂柔性关节振动抑制[J]. 空间控制技术与应用, 2018, 44(5): 7-13.
[10]	张玉翠, 刘峰, 张晖辉. 空间机械臂间隙影响分析[J]. 空间控制技术与应用, 2017, 43(6): 54-60.
[11]	张瀚文. 正交试验法在空间自由漂浮机械臂控制参数寻优中的应用[J]. 空间控制技术与应用, 2017, 43(6): 47-53.
[12]	夏新会, 冯骁, 贾英宏, 徐世杰. CMGs驱动空间机械臂的自适应终端滑模控制[J]. 空间控制技术与应用, 2017, 43(6): 32-39.
[13]	肖帅, 刘蕊, 饶卫东. 空间机械臂地面仿真与测试系统设计[J]. 空间控制技术与应用, 2017, 43(5): 73-78.
[14]	邓雅, 王泽国, 张锦江. 一种可变形桁架避障规划方法*[J]. 空间控制技术与应用, 2017, 43(5): 37-42.
[15]	王超, 江洁, 林森海, 张文辉, 陈荣昌. 基于神经网络的自由漂浮空间机械臂自适应鲁棒控制[J]. 空间控制技术与应用, 2017, 43(2): 7-12.