Fast Mission Plan Repair Method for Mars Rover Based on State Difference


doi:10.15982/j.issn.2096-9287.2021.20200075

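CHEN Chao^{1,2},
XU Rui^{1,2},
LI Zhaoyu^{1,2}

1. School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China
2. Key Laboratory of Autonomous Navigation and Control for Deep Space Exploration, Ministry of Industry and Information Technology, Beijing 100081, China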

Funding: National Key Research and Development Program of China (2019YFA0706500); National Natural Science Foundation of China (61976020)

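About the author:
CHEN Chao (1994– ), male, Ph.D. candidate. Main research interests: spacecraft mission planning and spacecraft mission replanning. Address: P.O. Box 22, School of Aerospace Engineering, Beijing Institute of Technology, No. 5 Zhongguancun South Street, Haidian District, Beijing 100081, China. E-mail: p_chenchao@126.com

Corresponding author:
XU Rui (1975– ), male, professor and doctoral supervisor. Main research interests: spacecraft mission planning, autonomous navigation, and intelligent control. Corresponding author of this paper. Address: P.O. Box 22, School of Aerospace Engineering, Beijing Institute of Technology, No. 5 Zhongguancun South Street, Haidian District, Beijing 100081, China. E-mail: xurui@bit.edu.cn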

CLC number: V419+.9


Abstract: The Martian environment is uncertain, and failures of electronic equipment are hard to predict. Both can seriously degrade the effectiveness of a rover's existing plan on the Martian surface and easily cause plan execution to fail. Given the long communication delay between a Mars rover and the ground station, a fast mission plan repair method based on state difference is proposed. Using the difference between the perceived state and the state necessary for action execution, the method extracts key information from the existing plan to construct partial states at different times, and a fast mission plan repair strategy for Mars rovers is built on these partial states. Considering the difference between the actual state and the partial state, a search-space pruning method based on state difference is designed, which generates and expands nodes purposefully and resolves conflicts in order to improve repair efficiency. Simulation results show that the method not only improves the efficiency of mission plan repair for Mars rovers but also preserves plan stability, and can provide technical support for spacecraft that must respond quickly and autonomously to unexpected events during execution.

Keywords: plan repair; Mars rover; partial state; state difference
Highlights

● Partial states are constructed by extracting key information from the existing plan, relying on the difference between the perceived state and the state necessary for action execution.
● A fast plan repair strategy for Mars rovers is presented based on the partial state.
● A search guiding method is proposed that generates search nodes selectively according to the differences between the partial state and the actual state.
● The speed of the method is evaluated, and the proposed method is found to maintain good plan stability.

Fig. 1  An example of an action model for rock sampling
Fig. 2  Partial state sequence of the plan ${\Pi _A}$
Fig. 3  Illustration of the mission plan repair strategy
Fig. 4  Pseudocode of the fast mission plan repair strategy for Mars rover
Fig. 5  Node generation method based on state difference
Fig. 6  Conflict between the partial state and the regressed state
Fig. 7  Simulation scenario and its planning results
Fig. 8  Comparison of the running time of different methods
Fig. 9  Comparison of the number of expanded nodes of different methods

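Table 1  Activities in the Mars rover model and their meanings

Activity	Meaning
navigate	Navigation
recharge	Recharging
sample_soil	Soil sampling and analysis
sample_rock	Rock sampling and analysis
drop	Discarding a sample
calibrate	Camera calibration
take_image	Imaging
communicate_soil_data	Transmitting soil analysis data to the lander
communicate_rock_data	Transmitting rock analysis data to the lander
communicate_image_data	Transmitting image data to the lander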

Table 2  Comparison of test results of different methods

Problem No.	Repair time/ms	Expanded nodes	Repair actions	Plan stability
1	0.198/0.561	2/260	1/1	1/1
2	0.213/0.514	3/260	1/1	1/1
3	0.243/0.572	5/260	2/2	1/1
4	0.458/2.055	16/260	4/4	0.8/0.8
5	0.517/1.316	9/369	1/1	0.9/0.9
6	0.587/1.236	8/122	3/3	0.833/0.833
7	0.461/1.189	5/155	2/2	0.833/0.833
8	0.549/1.457	8/122	3/3	0.833/0.833
9	0.386/1.190	5/155	2/2	0.833/0.833
10	0.482/—	5/—	1/—	1/—
11	0.727/0.927	14/305	2/2	1/1
12	0.595/0.821	14/305	2/2	1/1
13	0.590/0.825	14/305	2/2	1/1
14	0.474/0.725	2/172	2/1	1/1
Note: Values are given as RPDS/RP; for example, 0.198/0.561 means that RPDS took 0.198 ms and RP took 0.561 ms.
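As a reading aid for Table 2, the repair times can be turned into per-problem speedup ratios (RP time divided by RPDS time). The short Python sketch below does only that arithmetic on the values copied from the table; the names `TIMES_MS` and `speedups` are illustrative and do not come from the paper. Problem 10 is left out because the table lists no RP value for it.

```python
# Repair times from Table 2 as (problem, RPDS ms, RP ms).
# Problem 10 is omitted because Table 2 lists no RP value for it.
TIMES_MS = [
    (1, 0.198, 0.561), (2, 0.213, 0.514), (3, 0.243, 0.572),
    (4, 0.458, 2.055), (5, 0.517, 1.316), (6, 0.587, 1.236),
    (7, 0.461, 1.189), (8, 0.549, 1.457), (9, 0.386, 1.190),
    (11, 0.727, 0.927), (12, 0.595, 0.821), (13, 0.590, 0.825),
    (14, 0.474, 0.725),
]


def speedups(rows):
    """Return {problem: RP time / RPDS time} for each row."""
    return {problem: rp / rpds for problem, rpds, rp in rows}


ratios = speedups(TIMES_MS)
best = max(ratios, key=ratios.get)    # problem with the largest speedup
worst = min(ratios, key=ratios.get)   # problem with the smallest speedup

for problem in sorted(ratios):
    print(f"problem {problem:2d}: {ratios[problem]:.2f}x")
print(f"largest speedup: problem {best}; smallest: problem {worst}")
```

On these thirteen problems RPDS is faster than RP in every case, with the largest ratio on problem 4 and the smallest on problem 11.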

References

[1] MAIMONE M W, LEGER P C, BIESIADECKI J J. Overview of the Mars exploration rovers' autonomous mobility and vision capabilities[C]//IEEE International Conference on Robotics and Automation. Rome, Italy: Space Robotics Workshop, 2007: 1-8.
[2] LI Q Z, JIA Y, PENG S, et al. Top design and implementation of the lunar rover mission planning[J]. Journal of Deep Space Exploration, 2017, 4(1): 58-65. (in Chinese)
[3] PÉREZ-AYÚCAR M, ASHMAN M, ALMEIDA M, et al. The Rosetta science operations and planning implementation[J]. Acta Astronautica, 2018, 152: 163-174. doi:10.1016/j.actaastro.2018.07.049
[4] CHEN D X, XU R, CUI P Y. A temporal topological sort processing method for spacecraft resources constraints[J]. Journal of Astronautics, 2014, 35(6): 669-676. doi:10.3873/j.issn.1000-1328.2014.06.008 (in Chinese)
[5] JIN H, XU R, CUI P Y, et al. Heuristic search based on state transition graphs for deep space task planning[J]. Journal of Deep Space Exploration, 2019, 6(4): 364-368. (in Chinese)
[6] BRESINA J, DEARDEN R, MEULEAU N, et al. Planning under continuous time and resource uncertainty: a challenge for AI[C]//Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence. San Francisco, CA: Morgan Kaufmann Publishers Inc., 2002: 77-84.
[7] XU R, CHEN C, CUI P Y, et al. Research on spacecraft autonomous mission plan repair[J]. Journal of Astronautics, 2019, 40(7): 733-741. (in Chinese)
[8] NEBEL B, KOEHLER J. Plan reuse versus plan generation: a theoretical and empirical analysis[J]. Artificial Intelligence, 1995, 76(1): 427-454.
[9] CHIEN S, KNIGHT R, STECHERT A, et al. Using iterative repair to improve the responsiveness of planning and scheduling[C]//Proceedings of the Fifth International Conference on Artificial Intelligence Planning and Scheduling. Menlo Park, California: The AAAI Press, 2000.
[10] CHEN C, XU R, ZHU S Y, et al. RPRS: a reactive plan repair strategy for rapid response to plan failures of deep space missions[J]. Acta Astronautica, 2020, 175: 155-162. doi:10.1016/j.actaastro.2020.05.011
[11] FOX M, GEREVINI A, LONG D, et al. Plan stability: replanning versus plan repair[C]//ICAPS'06: Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling. Menlo Park, California: The AAAI Press, 2006: 212-221.
[12] GEREVINI A, SERINA I. Fast plan adaptation through planning graphs: local and systematic search techniques[C]//Proceedings of the Fifth International Conference on Artificial Intelligence Planning Systems. Breckenridge, CO: [s. n.], 2000.
[13] SCALA E, MICALIZIO R, TORASSO P. ReCon: an online task reconfiguration approach for robust plan execution[C]//The Sixth International Conference on Agents and Artificial Intelligence (ICAART). ESEO, Angers, France: [s. n.], 2014.
[14] GALLIEN M, INGRAND F, LEMAI S. Robot actions planning and execution control for autonomous exploration rovers[C]//International Workshop on Planning under Uncertainty for Autonomous Systems. Monterey, California: [s. n.], 2005.
[15] GUZMAN C, CASTEJON P, ONAINDIA E, et al. Reactive execution for solving plan failures in planning control applications[J]. Integrated Computer Aided Engineering, 2015, 22(4): 343-360. doi:10.3233/ICA-150493
[16] GHALLAB M, NAU D, TRAVERSO P. Automated planning: theory and practice[M]. Amsterdam, Boston: Elsevier/Morgan Kaufmann, 2004.
[17] FOX M, LONG D. PDDL2.1: an extension to PDDL for expressing temporal planning domains[J]. Journal of Artificial Intelligence Research, 2003, 20(20): 61-124.
[18] DO M B, KAMBHAMPATI S. Sapa: a multi-objective metric temporal planner[J]. Journal of Artificial Intelligence Research, 2003, 20(20): 155-194.



Introduction

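Because Mars is far from Earth, communication between a Mars probe and the ground station suffers a long delay of 3~20 min, which seriously affects mission success. A Mars probe therefore needs a high degree of autonomy, such as autonomous pointing of the high-gain antenna, attitude calibration using star sensors, online data storage and forwarding, and reasonable scheduling of activities under time and resource constraints^[1], in order to reduce dependence on the ground and shorten the mission cycle. Autonomous mission planning technology can achieve this goal^[2]. Existing work has studied autonomous mission planning from the perspectives of actions^[3], resources^[4], and timelines^[5]. However, human knowledge of Mars is still incomplete: changes in surface terrain, soil hardness, rock size, and similar factors remain highly uncertain. They affect how long the rover needs to perform a task, how much energy it consumes, and how much storage it uses, and they seriously constrain the successful execution of an established plan. For example, the Sojourner rover of the Pathfinder mission was idle for about 70% of the mission because these uncertainties were not fully considered during a priori planning^[6]. In addition, sudden events such as wind-blown sand and unexpected events such as electronic component failures pose a serious threat to successful execution. A Mars rover therefore needs not only mission planning capability, to manage its daily activities autonomously, but also mission plan repair capability, to cope autonomously with execution failures, so as to enhance its robustness and increase mission return.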

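There are two strategies for handling plan execution failures caused by unexpected events: complete replanning and plan repair^[7]. Complete replanning abandons the existing plan and decides on a new action sequence to achieve the mission goals, whereas plan repair patches the existing plan through operations such as adding, deleting, replacing, and moving actions. Although plan repair is not necessarily easier than complete replanning in theory^[8], extensive simulation experiments show that plan repair is more efficient^[9-12]. Scala et al.^[13] exploited the diversity of rover action modes and used constraint satisfaction problem (CSP) solving to repair a failed plan by reconfiguring action modes. However, this method changes neither the order nor the type of actions, so it cannot handle general plan execution failures. Gallien et al.^[14] designed the execution phase as a three-stage sense-repair-act loop in which a planner infers a solution, within a limited time, for errors that arise during execution. However, this method relies on partial-order planning and is inefficient. Guzman et al.^[15] proposed the reactive planning (Reactive Plan, RP) method, which equips the execution agent with a reactive planner that cooperates with the mission planner and uses a repair structure tree to speed up plan repair. However, the length and depth of the repair structure tree must be obtained through machine learning and are not universally applicable. Moreover, even when the repair structure tree contains only a few actions, it contains many useless nodes, which wastes storage space and search time.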

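To address plan execution failures caused by environmental uncertainty and on-board equipment faults while a rover operates on the Martian surface, this paper proposes a plan repair method based on state difference (State Difference Plan Repair, RPDS) that lets the rover recover quickly from failure. Unlike the reactive planning method RP, it does not build a repair structure tree in advance to assist plan repair. Instead, after an execution failure, it uses the differences between states to generate search nodes purposefully and filters the expanded nodes according to predicate conflicts, thereby pruning the search space and speeding up plan repair.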
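The idea of generating nodes from state differences and filtering them by predicate conflicts can be illustrated with a minimal Python sketch. It assumes a simplified STRIPS-style model in which a state is a set of ground predicates; the `Action` class, the function names, and the rover predicates are all hypothetical and are not taken from the paper's pseudocode (Fig. 4 and Fig. 5 are not reproduced here).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    name: str
    pre: frozenset                    # predicates required before execution
    add: frozenset                    # predicates made true by the action
    delete: frozenset = frozenset()   # predicates made false by the action


def state_difference(target, actual):
    """Predicates the repair target requires but the actual state lacks."""
    return frozenset(target) - frozenset(actual)


def candidate_actions(actions, target, actual):
    """Keep actions that supply a missing predicate and do not delete any
    predicate of the target (a simple predicate-conflict filter)."""
    missing = state_difference(target, actual)
    return [a for a in actions if a.add & missing and not (a.delete & target)]


def expand(action, state):
    """Successor state, or None if the action is not applicable in state."""
    if not action.pre <= state:
        return None
    return (state - action.delete) | action.add


# Illustrative scenario: the camera calibration was lost at waypoint w2.
ACTIONS = [
    Action("calibrate", frozenset({"at(w2)"}), frozenset({"calibrated(cam)"})),
    Action("navigate(w2,w3)", frozenset({"at(w2)"}), frozenset({"at(w3)"}),
           frozenset({"at(w2)"})),
    Action("recharge", frozenset({"at(w2)"}), frozenset({"charged"})),
]
target = frozenset({"at(w2)", "calibrated(cam)"})   # assumed repair target
actual = frozenset({"at(w2)"})                      # assumed perceived state

selected = candidate_actions(ACTIONS, target, actual)
successors = [expand(a, actual) for a in selected]
print([a.name for a in selected])
```

In this toy case only `calibrate` is expanded: `recharge` contributes nothing to the difference, and `navigate(w2,w3)` would delete a predicate that the target still needs. A blind forward search would have expanded all three actions.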

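The paper is organized as follows. Section 1 builds the Mars rover mission model and introduces the concept of the partial state, laying the foundation for the method description. Section 2 proposes the fast mission plan repair strategy for Mars rovers based on partial states, which provides the framework for Section 3, where the search-space pruning method based on the difference between the actual state and the partial state is described. Section 4 presents the simulation results and their analysis. The final section concludes the paper.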


5. Conclusion

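The random distribution of pits, bumps, slopes, and rocks on the Martian surface, complex terrain with soft soil of uneven thickness, and unexpected faults in on-board electronic equipment can all easily cause Mars rover task execution to fail. To address this problem, this paper proposed RPDS, a fast mission plan repair method based on state difference for Mars rovers, and completed the following work: (1) using the difference between the rover's perceived state and the state necessary for action execution, partial states at different times were constructed from the existing plan, providing optional repair targets for plan repair; (2) a fast plan repair strategy for Mars rovers was given based on the partial states; (3) using the difference between the partial state and the actual state, a search-space pruning method based on state difference was proposed, which generates and expands nodes purposefully and resolves conflicts. Simulation experiments verify that RPDS completes mission plan repair quickly and effectively while preserving plan stability, and that it can provide technical support for spacecraft that must respond autonomously and quickly to unexpected events during execution.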
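One way to picture a sequence of partial states that serve as optional repair targets is goal regression through the existing plan: each partial state holds only the predicates that the remaining actions still need. The Python sketch below is written under that assumption and is not the paper's construction. It ignores delete effects, durations, and resources, and the plan, predicates, and function names are hypothetical.

```python
def partial_states(plan, goal):
    """Regress the goal backward through the plan.

    plan is a list of (name, preconditions, add_effects) tuples.
    Returns states[0..n], where states[i] is the set of predicates that
    must hold before executing plan[i:] for the goal to be reached.
    """
    states = [frozenset(goal)]
    for _name, pre, add in reversed(plan):
        # Drop what the action provides, then require its preconditions.
        states.append((states[-1] - add) | pre)
    states.reverse()
    return states


def resume_index(states, actual):
    """Latest plan position whose partial state already holds in actual,
    or None if no partial state is satisfied."""
    for i in range(len(states) - 1, -1, -1):
        if states[i] <= actual:
            return i
    return None


# Illustrative three-step plan using activity names from Table 1.
PLAN = [
    ("navigate(w1,w2)", frozenset({"at(w1)"}), frozenset({"at(w2)"})),
    ("sample_rock(w2)", frozenset({"at(w2)", "store_empty"}),
     frozenset({"have_rock_analysis"})),
    ("communicate_rock_data", frozenset({"have_rock_analysis"}),
     frozenset({"communicated_rock_data"})),
]
GOAL = {"communicated_rock_data"}

states = partial_states(PLAN, GOAL)
observed = frozenset({"at(w2)", "store_empty", "camera_on"})
print(resume_index(states, observed))
```

Because a partial state lists only the predicates that matter for the rest of the plan, extra facts in the perceived state (here `camera_on`) do not prevent a match, and execution can resume from the matched position without repairing anything before it.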
