基于动态智能体交互图的深空探测器任务规划方法

姓名
邮箱
手机号码
标题
留言内容
验证码

doi:10.15982/j.issn.2096-9287.2021.20210020

赵宇庭^{1, 2,},
徐瑞^{1, 2,},
李朝玉^{1, 2},
朱圣英^{1, 2}

1.
bob手机在线登陆宇航学院, 北京 100081
2.
深空自主导航与控制工信部重点实验室, 北京 100081

基金项目:国家重点研发计划（2019YFA0706500）；国家自然科学基金（61976020，U2037602）

详细信息

作者简介:
赵宇庭（1994– ），女，博士生，主要研究方向：航天器自主任务规划，多智能体规划技术。通讯地址：bob手机在线登陆宇航学院22信箱（100081）电话：（010）68913550E-mail:zhaoyuting_bit@163.com
徐瑞（1975– ），男，教授，主要研究方向：航天器自主任务规划、自主导航、智能控制。通讯地址：bob手机在线登陆宇航学院22信箱（100081）电话：（010）68913550E-mail：xurui@bit.edu.cn

●　A distributed multi-agent plan-space planning method was developed for deep-space probe planning. ●　A dynamic agent interaction graph was proposed to coordinate the multi-agent interactions during the planning process. ●　Temporal constraints between agents were handled by time constraints propagation between temporal networks. ●　The proposed multi-agent planning method can plan for a probe correctly and save computing time.
中图分类号:TP18

Mission Planning Based on Dynamic Agent Interaction Graph for Deep Space Probes

ZHAO Yuting^{1, 2,},
XU Rui^{1, 2,},
LI Zhaoyu^{1, 2},
ZHU Shengying^{1, 2}

1.
School of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, China
2.
Key Laboratory of Autonomous Navigation and Control for Deep Space Exploration, Ministry of Industry and Information Technology, Beijing 100081, China

摘要:面对日益复杂的深空探测任务和动态多变的深空环境，深空探测器需要更高效的任务规划技术以快速生成规划方案。探测器内部各子系统具有分布并行的特征，可抽象为多智能体系统进行规划。而现有的多智能体规划方法无法直接应用于需要处理时间资源等数值约束的深空探测器任务规划中。针对上述问题，提出基于分布式求精搜索的多智能体规划空间规划方法，设计动态智能体交互图引导多智能体协同规划，将时间资源约束处理抽象为约束满足问题并采用基于图论的方法进行处理。实验表明，对于包含多子系统的深空探测器的任务规划问题，所提方法能够有效提高规划效率。
- 深空探测器/
- 多智能体规划空间规划/
- 数值约束
Abstract:Facing the increasingly complex deep-space exploration missions and the dynamic space environment, deep-space probes need efficient planning methods for the fast generation of plans. The distribution and concurrency of subsystems make a probe suitable to be modeled as a multi-agent system. Existing multi-agent planners, however, cannot be used directly in mission planning of deep space probes that involve handling numeric constraints such as time resources. To solve the above problem, a multi-agent mission plan-space planning method based on distributed refinement search was proposed. A dynamic agent interaction graph (DAIG) was designed to coordinate interactions between agents during planning. Temporal constraints and resource constraints were modeled as constraint satisfaction problems and were handled by graph theory methods. Experiments show that the method proposed in this paper can save computing time of mission planning problems for a probe with multiple subsystems.
- deep space probe/
- multi-agent plan-space planning/
- numeric constraints
Highlights

●　A distributed multi-agent plan-space planning method was developed for deep-space probe planning. ●　A dynamic agent interaction graph was proposed to coordinate the multi-agent interactions during the planning process. ●　Temporal constraints between agents were handled by time constraints propagation between temporal networks. ●　The proposed multi-agent planning method can plan for a probe correctly and save computing time.

下载: 全尺寸图片幻灯片

图 1规划智能体输入文件示意

Fig. 1Inputs of planning agents

下载: 全尺寸图片幻灯片

图 2动态智能体交互图

Fig. 2Illustration of dynamic agent interaction graph

下载: 全尺寸图片幻灯片

图 3多智能体时间约束网络解耦示意

Fig. 3Illustration of multi-agent temporal network decoupling

下载: 全尺寸图片幻灯片

图 4多智能体求精规划中每个智能体的规划流程

Fig. 4Flowchart of multi-agent refinement planning

下载: 全尺寸图片幻灯片

图 5规划用时结果曲线

Fig. 5Computational results of runtime curves

下载: 全尺寸图片幻灯片

图 6各系统规划结果甘特图

Fig. 6Gantt chart of subsystems

下载: 全尺寸图片幻灯片

图 7规划步数结果对比

Fig. 7Comparison of planning steps

下载: 全尺寸图片幻灯片

表 1火星环绕器各子系统功能

Table 1Functions of subsystems in Mars Orbiter

子系统	功能	约束中定义的公共状态数
姿态子系统	控制环绕器姿态	0
载荷相机	拍照	1
对地通信系统	与地球通信	1
对火通信系统	与巡视器通信	2
推进系统	推进轨道器	0
太阳帆板	获取太阳能	2
GNC模式调整	调整导航模式	3
轨道控制	调整轨道	1
综合电子系统	调整测控模式	1

下载: 导出CSV

表 2火星环绕器各子系统状态列表

Table 2States of subsystems in Mars Orbiter

子系统	状态列表
姿态子系统	指向、转动
载荷相机	关、打开、开、对准、工作、关闭
对地通信系统	关、打开、开、打开通信、通信、关闭
对火通信系统	关、打开、开、打开通信、通信、关闭
推进系统	关、打开、开、开始推进、推进、关闭
太阳帆板	保持、对日、锁定
GNC模式调整	默认模式、模式1、模式2
轨道控制	空闲、调整轨道
综合电子系统	默认模式、模式1、模式2

下载: 导出CSV

[1]	BARREIRO J, BOYCE M, DO M, et al. EUROPA: a platform for AI planning, scheduling, constraint programming, and optimization[Z]. ICAPS-12 International Competition on Knowledge Engineering for Planning and Scheduling (ICKEPS). [S. l. ]: ICKEPS, 2012.
[2]	FRANK J, JONSSON A. Constraint-based attribute and interval planning[J]. Journal of Constraints Special Issue on Constraints and Planning,2003(8):339-364.
[3]	MUSCETTOLA N, NAYAK P P, PELL B, et al. Remote agent: to boldly go where no AI system has gone before[J]. Artificial Intelligence,1998(103):5-47.
[4]	MUSCETTOLA N, FRY C, RAJAN K, et al. On-board planning for New Millennium Deep Space One autonomy[C]//IEEE Aerospace Conference. [S. l. ]: IEEE, 2002.
[5]	TRAN D, CHIEN S, SHERWOOD R, et al. The autonomous sciencecraft experiment onboard the eo-1 spacecraft[C]//Autonomous Agents and Multiagent Systems, 2004. [S. l. ]: IEEE, 2004.
[6]	CHIEN S, RABIDEAU G, KNIGHT R, et al. ASPEN - automating space mission operations using automated planning and scheduling[C]//In Proceedings of SpaceOps. Toulouse, France: [s. n. ], 2000.
[7]	KNIGHT R, RABIDEAU G, CHIEN S, et al. Casper: space exploration through continuous planning[J]. IEEE Intelligent Systems and Their Applications,2001,16(5):70-75.doi:10.1109/5254.956084
[8]	金颢, 徐瑞, 崔平远, 等. 基于状态转移图的启发式深空探测器任务规划方法[J]. 深空探测学报(中英文),2019,6(4):364-368. JIN H, XU R, CUI P Y, et al. Heuristic search based on state transition graphs for deep space task planning[J]. Journal of Deep Space Exploration,2019,6(4):364-368.
[9]	王晓晖, 李爽. 深空探测器约束简化与任务规划方法研究[J]. 宇航学报,2016,37(7):768-774.doi:10.3873/j.issn.1000-1328.2016.07.002 WANG X H, LI S. Research on constraint simplification and mission planning method for deep space explorer[J]. Journal of Astronautics,2016,37(7):768-774.doi:10.3873/j.issn.1000-1328.2016.07.002
[10]	BRAFMAN R I, DOMSHLAK C. From one to many: planning for loosely coupled multi-agent systems[C]// ICAPS. [S. l. ]: ICAPS, 2008.
[11]	ŠTOLBA M, KOMENDA A. The MADLA planner: multi-agent planning by combination of distributed and local heuristic search[J]. Artificial Intelligence,2017(252):175-210.
[12]	TORREÑO A, ONAINDIA E, SAPENA Ó. FMAP: distributed cooperative multi-agent planning[J]. Applied Intelligence,2015(41):606-626.
[13]	TORREÑO A, ONAINDIA E, KOMENDA A, et al. Cooperative multi-agent planning: a survey[J]. ACM Computing Surveys,2017(50):1-32.
[14]	XU R, CUI P Y, XU X. Realization of multi-agent planning system for autonomous spacecraft[J]. Advances in Engineering Software,2005(36):266-72.
[15]	DECHTER R, MEIRI I, PEARL J. Temporal constraint networks[J]. Artificial Intelligence,1991(49):61-95.
[16]	陈德相, 徐瑞, 崔平远. 航天器资源约束的时间拓扑排序处理方法[J]. 宇航学报,2014,35(6):669-676.doi:10.3873/j.issn.1000-1328.2014.06.008 CHEN D X, XU R, CUI P Y. A Temporal topological sort processing method for spacecraft resources constraints[J]. Journal of Astronautics,2014,35(6):669-676.doi:10.3873/j.issn.1000-1328.2014.06.008
[17]	IATAURO M. How does EUROPA solve problems?[EB/OL](2014-10-22)[2021-04-06].https://github.com/nasa/europa/wiki/Bak-Problem-Solving, (accessed 01.10.20).

[1]	柳景兴, 王彬, 毛维杨, 熊新.深空探测器任务规划认知图谱及多属性约束冲突检测. 深空探测学报(中英文）, 2023, 10(1): 88-96.doi:10.15982/j.issn.2096-9287.2023.20220064
[2]	毛维杨, 王彬, 柳景兴, 熊新.基于强化学习的深空探测器自主任务规划方法. 深空探测学报(中英文）, 2022, 9(0): 1-12.doi:10.15982/j.issn.2096-9287.2022.20220049
[3]	王鑫, 赵清杰, 徐瑞.基于知识图谱的深空探测器任务规划建模. 深空探测学报(中英文）, 2021, 8(3): 315-323.doi:10.15982/j.issn.2096-9287.2021.20210030
[4]	徐瑞, 李朝玉, 朱圣英, 王棒, 梁子璇, 尚海滨.深空探测器自主规划技术研究进展. 深空探测学报(中英文）, 2021, 8(2): 111-123.doi:10.15982/j.issn.2096-9287.2021.20210039
[5]	王卓, 徐瑞.基于多目标优化的深空探测器姿态组合规划方法. 深空探测学报(中英文）, 2021, 8(2): 147-153.doi:10.15982/j.issn.2096-9287.2021.20200069
[6]	刘志强, 赵晨, 曹彦, 陈建岳, 杨敏, 李天义.“嫦娥五号”轨道器供配电系统高比能设计. 深空探测学报(中英文）, 2021, 8(3): 237-243.doi:10.15982/j.issn.2096-9287.2021.20210007
[7]	徐浩, 裴福俊, 蒋宁.一种基于李群描述的深空探测器姿态估计方法. 深空探测学报(中英文）, 2020, 7(1): 102-108.doi:10.15982/j.issn.2095-7777.2020.20171117002
[8]	朱立颖, 叶志玲, 李玉庆, 付中梁, 徐勇.小天体探测自主绕飞智能规划建模. 深空探测学报(中英文）, 2019, 6(5): 463-469.doi:10.15982/j.issn.2095-7777.2019.05.007
[9]	马辛, 宁晓琳, 刘劲, 刘刚.一种平面约束辅助测量的深空探测器自主天文导航方法. 深空探测学报(中英文）, 2019, 6(3): 293-300.doi:10.15982/j.issn.2095-7777.2019.03.014
[10]	金颢, 徐瑞, 崔平远, 朱圣英.基于状态转移图的启发式深空探测器任务规划方法. 深空探测学报(中英文）, 2019, 6(4): 364-368.doi:10.15982/j.issn.2095-7777.2019.04.008
[11]	陈略, 谢剑锋, 韩松涛, 曹建峰, 平劲松.“嫦娥4号”中继星开环测速方案设计与试验验证. 深空探测学报(中英文）, 2019, 6(3): 236-240.doi:10.15982/j.issn.2095-7777.2019.03.006
[12]	姜啸, 徐瑞, 陈俐均.深空探测器动态约束规划中的外延约束过滤方法研究. 深空探测学报(中英文）, 2019, 6(6): 586-594.doi:10.15982/j.issn.2095-7777.2019.06.010
[13]	李春来, 刘建军, 严韦, 封剑青, 任鑫, 刘斌.小行星探测科学目标进展与展望. 深空探测学报(中英文）, 2019, 6(5): 424-436.doi:10.15982/j.issn.2095-7777.2019.05.003
[14]	王大轶, 符方舟, 孟林智, 李文博, 李茂登, 徐超, 葛东明.深空探测器自主控制技术综述. 深空探测学报(中英文）, 2019, 6(4): 317-327.doi:10.15982/j.issn.2095-7777.2019.04.002
[15]	金颢, 徐瑞, 崔平远, 朱圣英.基于扩展状态深空探测器任务规划方法. 深空探测学报(中英文）, 2018, 5(6): 569-574.doi:10.15982/j.issn.2095-7777.2018.06.010
[16]	姜啸, 徐瑞, 朱圣英.基于约束可满足的深空探测任务规划方法研究. 深空探测学报(中英文）, 2018, 5(3): 262-268.doi:10.15982/j.issn.2095-7777.2018.6.008
[17]	朱安文, 刘飞标, 杜辉, 马世俊.核动力深空探测器现状及发展研究. 深空探测学报(中英文）, 2017, 4(5): 405-416.doi:10.15982/j.issn.2095-7777.2017.05.002
[18]	于登云, 张玉花, 褚英志, 李昊, 王建炜, 杜冬.深空探测器模块化结构动力学研究. 深空探测学报(中英文）, 2016, 3(3): 268-274.doi:10.15982/j.issn.2095-7777.2016.03.011
[19]	陈德相, 徐文明, 杜智远, 徐瑞.航天器任务规划中资源约束的可分配处理方法. 深空探测学报(中英文）, 2015, 2(2): 180-185.doi:10.15982/j.issn.2095-7777.2015.02.013
[20]	武长青, 徐瑞, 朱圣英.基于对数势函数的深空探测器姿态规划与控制方法. 深空探测学报(中英文）, 2015, 2(4): 365-370.doi:10.15982/j.issn.2095-7777.2015.04.011

点击查看大图

图(8)/ 表 (2)

计量

文章访问数:142
HTML全文浏览量:43
PDF下载量:24
被引次数:0

注释:

●　A distributed multi-agent plan-space planning method was developed for deep-space probe planning.

●　A dynamic agent interaction graph was proposed to coordinate the multi-agent interactions during the planning process.

●　Temporal constraints between agents were handled by time constraints propagation between temporal networks.

●　The proposed multi-agent planning method can plan for a probe correctly and save computing time.

全文HTML

引　言

随着空间中航天器数量增长和航天任务的日益复杂，地面站的管控压力不断增大，从而产生了对航天器规划技术的需求。地面自动规划能快速生成规划方案，提高航天器的管控效率；在轨自主规划能赋予航天器自我管理能力，减轻地面管控压力。

国内外机构和学者已经对航天器规划技术展开研究。美国国家航空航天局（National Aeronautics and Space Administration）开发了可扩展标准化远程操作规划框架（Extensible Universal Remote Operations Planning Architecture，EUROPA）^[1-2]，该框架内的时间网络模块负责传播时间约束并保持规划解的时间一致性，资源管理模块负责处理存储能源等数值资源约束。EUROPA在多个航天器上得到了应用，包括哈勃望远镜的观测调度^[3]，“深空一号”（Deep Space 1）^[4]和“地球观测一号”（Earth Observing-1）的自主控制^[5]。

自主规划和调度环境（Automated Scheduling and Planning Environment,，ASPEN）^[6]是美国喷气推进实验室（JPL）的人工智能小组开发的一个可重构规划调度框架，采用基于冲突修复的规划方法。连续活动调度规划执行和重规划系统（Continuous Activity Scheduling Planning Execution and Replanning，CASPER）^[7]是ASPEN的软实时星上规划版本，能够在环境变化或目标变化后快速进行重新规划。

以上3个规划系统都采用状态时间线知识表示方法，适合描述探测器的时间资源约束知识，可通过在搜索算法中加入启发式提高规划效率。金颢等^[8]提出了基于状态转移图的启发式加速规划搜索。王晓晖等^[9]设计时间线启发式因子引导规划扩展过程，提高任务规划效率。

除了设计启发式，利用深空探测器子系统分布并行的特性，将子系统抽象为智能体，采用多智能体任务规划方法，也能够提高规划效率。在领域无关规划（domain-independent planning）的研究中，多智能体规划是一个重要分支。Brafman和Domshlak^[10]量化表示了MA-STRIPS（Multi-Agent Stanford Research Institute Problem Solver）问题中智能体之间的耦合程度，证明多智能体规划问题的复杂度主要与智能体间耦合程度相关。MADLA（Multi-Agent Distributed and Local Asynchronous）^[11]规划器针对MA-STRIPS问题提出了分布式状态空间前向链式多启发式搜索方法。FMAP（Forward Multi-Agent Planning）^[12]为了表示状态变量采用PDDL3.1描述领域知识，并提出一种启发式前向链式偏序规划方法。这些规划器虽然致力于解决多智能体规划问题，但缺乏对时间、资源等数值约束的表达和处理能力^[13]。徐瑞等^[14]提出了一个用于航天器的多智能体规划器，能够处理时间资源约束，采用有中心的多智能体分布式规划，设置管理智能体作为其它智能体通信和协作的中心。

深空探测器与地球距离遥远，传统的规划后再上传指令的控制方式会造成较长的时间延迟，因此需要研究自主规划技术以提高探测器的决策效率和任务收益。深空探测器任务规划存在领域知识复杂、时间资源约束复杂、子系统多、任务目标多等难点。

本文的研究目的是探索应用于复杂多系统探测器的高效规划方法，主要思想是结合多智能体规划高效率的优势和经典航天器规划器的数值处理能力，提出基于动态智能体交互图的深空探测器任务规划架构与方法。

本文建立基于状态时间线的多智能体规划问题模型，并为实现探测器各系统活动及数值约束知识的表示而设计了相应的多智能体规划建模语言。提出基于分布式求精搜索的多智能体规划空间规划方法，不设置管理智能体作为协调中心，而是设计动态智能体交互图引导多智能体协同规划，并采用基于图论的方法处理探测器的时间和资源约束。最后，通过仿真实验验证了方法的有效性。

子系统	功能	约束中定义的公共状态数
姿态子系统	控制环绕器姿态	0
载荷相机	拍照	1
对地通信系统	与地球通信	1
对火通信系统	与巡视器通信	2
推进系统	推进轨道器	0
太阳帆板	获取太阳能	2
GNC模式调整	调整导航模式	3
轨道控制	调整轨道	1
综合电子系统	调整测控模式	1

子系统	状态列表
姿态子系统	指向、转动
载荷相机	关、打开、开、对准、工作、关闭
对地通信系统	关、打开、开、打开通信、通信、关闭
对火通信系统	关、打开、开、打开通信、通信、关闭
推进系统	关、打开、开、开始推进、推进、关闭
太阳帆板	保持、对日、锁定
GNC模式调整	默认模式、模式1、模式2
轨道控制	空闲、调整轨道
综合电子系统	默认模式、模式1、模式2

4. 结　论

本研究实现了基于求精搜索的深空探测器多智能体规划空间规划的理论设计和仿真验证。结合数值约束处理技术和多智能体规划方法，即满足了深空探测器的时间资源约束处理需求，又实现了高效的多智能体协同规划。实验证明所设计的规划方法能够为深空探测器任务规划生成合理规划，并显著提高了规划效率，为未来深空探测任务规划提供了一种新的可行方案。

参考文献 (17)

留言板