☆ 4.6 Article

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS (2022)

期刊

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS

卷 -, 期 -, 页码 -

出版社

AMER INST AERONAUTICS ASTRONAUTICS

DOI: 10.2514/1.G006746

关键词

类别

Engineering, Aerospace Instruments & Instrumentation

资金

German Federal Ministry for Economic Affairs and Climate Action

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper addresses the decision-making problem of balancing distance coverage and exploiting thermal updrafts in cross-country soaring flight, and proposes a control strategy using reinforcement learning. The paper presents a model based on a Markov decision process and utilizes stochastic gradient ascent to solve the hierarchical reinforcement learning problem.

Solving the decision-making problem between pursuing the objective of covering distance and exploiting thermal updrafts is the central challenge in cross-country soaring flight. The need for trading short-term rewarding actions against actions that pay off in the long term makes for a hard-to-solve problem. Policies resulting from reinforcement learning offer the potential to handle long-term correlations between actions taken and rewards received. The paper presents a reinforcement learning setup, which results in a control strategy for the autonomous soaring sample application of GPS Triangle racing. First, we frame the problem in terms of a Markov decision process. In particular, we present a straightforward model for the three-degrees-of-freedom system dynamics of a glider aircraft that does not make any simplifying assumptions regarding the wind field or the relative aircraft velocity. The competition task is decomposed into subtasks, then. Stochastic gradient ascent solves the associated hierarchical reinforcement learning problem without the designer employing any further, potentially deficient heuristics. We present an implementation of the overall policy alongside an updraft estimator on embedded hardware aboard an unpiloted glider aircraft. Flight-test results validate the successful transfer of the hierarchical control policy trained in simulation to real-world autonomous cross-country soaring.

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

期刊

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS

出版社

AMER INST AERONAUTICS ASTRONAUTICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Hierarchical Reinforcement Learning Approach for Autonomous Cross-Country Soaring

期刊

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS

出版社

AMER INST AERONAUTICS ASTRONAUTICS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文