☆ 4.7 Article

Dynamic Positioning using Deep Reinforcement Learning

OCEAN ENGINEERING (2021)

Journal

OCEAN ENGINEERING

Volume 235, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.oceaneng.2021.109433

Keywords

Dynamic Positioning; Deep Reinforcement Learning; Proximal policy optimization; Reward shaping

Categories

Engineering, Marine Engineering, Civil Engineering, Ocean Oceanography

Funding

Research Council of Norway through the Centre of Excellence funding scheme [223254]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper presents the implementation and performance testing of a Deep Reinforcement Learning based control scheme for Dynamic Positioning of a marine surface vessel, demonstrating good positioning performance and energy efficiency through simulations and model scale sea trials.

This paper demonstrates the implementation and performance testing of a Deep Reinforcement Learning based control scheme used for Dynamic Positioning of a marine surface vessel. The control scheme encapsulated motion control and control allocation by using a neural network, which was trained on a digital twin without having any prior knowledge of the system dynamics, using the Proximal Policy Optimization learning algorithm. By using a multivariate Gaussian reward function for rewarding small errors between the vessel and the various setpoints, while encouraging small actuator outputs, the proposed Deep Reinforcement Learning based control scheme showed good positioning performance while being energy efficient. Both simulations and model scale sea trials were carried out to demonstrate performance compared to traditional methods, and to evaluate the ability of neural networks trained in simulation to perform on real life systems.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Within the scope of prediction: Shaping intrinsic rewards via evaluating uncertainty

Xiaoshu Zhou, Fei Zhu, Peiyao Zhao

Summary: The method of prediction based on uncertainty exploration (SPE) improves the quality of exploration and reduces noise interference in deep reinforcement learning, leading to significant improvements in simulated environments.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Add to Collection

Article Computer Science, Information Systems

UCAV Air Combat Maneuver Decisions Based on a Proximal Policy Optimization Algorithm with Situation Reward Shaping

Kaibiao Yang, Wenhan Dong, Ming Cai, Shengde Jia, Ri Liu

Summary: This paper proposes a method for unmanned combat air vehicle air combat maneuver decision based on the proximal policy optimization algorithm. The method is validated through a simulation experiment, demonstrating its effectiveness.

ELECTRONICS (2022)

Add to Collection

Article Computer Science, Information Systems

Learning Potential in Subgoal-Based Reward Shaping

Takato Okudo, Seiji Yamada

Summary: Human knowledge can reduce the number of iterations required in reinforcement learning, and the subgoal-based reward shaping method shows promise in certain domains. By learning the potential function through parameterization of a hyperparameter, we are able to accelerate value learning and obtain more effective results compared to baseline algorithms.

IEEE ACCESS (2023)

Add to Collection

Article Energy & Fuels

An AGC Dynamic Optimization Method Based on Proximal Policy Optimization

Zhao Liu, Jiateng Li, Pei Zhang, Zhenhuan Ding, Yanshun Zhao

Summary: This article proposes a novel framework based on the PPO reinforcement learning algorithm for AGC dynamic optimization, which aims to handle fluctuations and uncertainties in power systems and improve the frequency characteristic to meet control performance standards.

FRONTIERS IN ENERGY RESEARCH (2022)

Add to Collection

Article Automation & Control Systems

Proximal policy optimization-based controller for chaotic systems

Her-Terng Yau, Ping-Huan Kuo, Po-Chien Luan, Yung-Ruen Tseng

Summary: This article presents a DRL-based control method for nonlinear chaotic systems without prior knowledge of the system's equations. Experimental results demonstrate that the PPO algorithm is the most efficient and effective for controlling chaotic systems.

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Image captioning via proximal policy optimization

Le Zhang, Yanshuo Zhang, Xin Zhao, Zexiao Zou

Summary: Image captioning involves generating captions for images using natural language. By applying the PPO algorithm to a state-of-the-art architecture like X-Transformer, improvements in system performance can be achieved. Experimental results suggest that combining PPO with dropout regularization may decrease performance, possibly due to the KL-divergence of RL policies. Using word-level baseline estimation instead of sentence-level baseline in the policy gradient estimator can lead to better results.

IMAGE AND VISION COMPUTING (2021)

Add to Collection

Article Computer Science, Artificial Intelligence

An agile and intelligent dynamic economic emission dispatcher based on multi-objective proximal policy optimization

Zhuang Shao, Fengqi Si, Huaijiang Wu, Xiaozhong Tong

Summary: The research defines a novel dynamic economic emission dispatcher problem and learning framework, transferring the optimization tasks offline and using multi-objective proximal policy optimization to significantly improve the speed and performance of the neural network dispatcher, showcasing its generalization capabilities.

APPLIED SOFT COMPUTING (2021)

Add to Collection

Article Computer Science, Information Systems

Proximal policy optimization via enhanced exploration efficiency

Junwei Zhang, Zhenghao Zhang, Shuai Han, Shuai Lue

Summary: This paper discusses the exploration issue in the PPO algorithm and proposes an exploration enhancement mechanism based on uncertainty estimation. By applying the exploration enhancement theory to the PPO algorithm, the IEM-PPO algorithm is proposed, and it is evaluated in experiments using the MuJoCo physical simulator. The results show that the IEM-PPO algorithm outperforms PPO in terms of sample efficiency and cumulative reward.

INFORMATION SCIENCES (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Dynamic Weights and Prior Reward in Policy Fusion for Compound Agent Learning

Meng Xu, Yechao She, Yang Jin, Jianping Wang

Summary: We propose a new method for policy fusion in deep reinforcement learning, which dynamically selects sub-tasks and reduces fusion bias. Experimental results show significant improvements in task duration, episode reward, and score difference.

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Learning Humanoid Robot Running Motions with Symmetry Incentive through Proximal Policy Optimization

Luckeciano C. Melo, Dicksiano C. Melo, Marcos R. O. A. Maximo

Summary: This study introduces a methodology based on deep reinforcement learning to improve running skills in a humanoid robot, achieving remarkable results with Proximal Policy Optimization. The approach outperforms existing technologies by approximately 50% in terms of sprint speed in the RoboCup 3D Soccer Simulation competition. Evaluation of training procedures and controllers in terms of speed, reliability, and human similarity were conducted, with a focus on encouraging symmetry in movements for top speed running policies. Key factors leading to surpassing previous results and suggestions for future research are discussed.

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS (2021)

Add to Collection

Article Computer Science, Artificial Intelligence

Differentiable Logic Policy for Interpretable Deep Reinforcement Learning: A Study From an Optimization Perspective

Xin Li, Haojie Lei, Li Zhang, Mingzhong Wang

Summary: This paper explores interpretable Deep Reinforcement Learning (DRL) by representing policy using Differentiable Inductive Logic Programming (DILP). The research focuses on the optimization perspective of DILP-based policy learning and proposes using Mirror Descent for policy optimization. The theoretical and empirical studies verify the effectiveness of the proposed approach.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Add to Collection

Article Computer Science, Information Systems

An Object Recognition Grasping Approach Using Proximal Policy Optimization With YOLOv5

Qingchun Zheng, Zhi Peng, Peihao Zhu, Yangyang Zhao, Ran Zhai, Wenpeng Ma

Summary: This paper proposes an object recognition grasping approach using Proximal Policy Optimization (PPO) with You Only Look Once v5 (YOLOv5) to overcome the problems of traditional grasping methods for mobile manipulators. The approach combines a vision recognition algorithm with a deep reinforcement learning algorithm to achieve object recognition grasping. Experimental results show that the proposed method outperforms the original YOLOv4 model in terms of object recognition speed and achieves higher detection precision and lower hardware requirements. The proposed method also outperforms the SAC and TRPO algorithms in object grasping, with the average reward of the PPO algorithm improved significantly compared to the other algorithms.

IEEE ACCESS (2023)

Add to Collection

Article Management

Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management

Bram J. De Moor, Joren Gijsbrechts, Robert N. Boute

Summary: This study demonstrates the feasibility of applying transfer learning to deep reinforcement learning for improving performance and training stability in inventory management. Additionally, potential-based reward shaping is implemented to manage inventory control efficiently.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2022)

Add to Collection

Article Computer Science, Information Systems

An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization

Rousslan Fernand Julien Dossa, Shengyi Huang, Santiago Ontanon, Takashi Matsubara

Summary: This paper investigates the impact of the optimization technique called "early stopping" on the performance of the PPO algorithm. The results show that PPO's performance is sensitive to the number of update iterations per epoch, and early stopping optimizations can dynamically adjust the update iterations, serving as a convenient alternative to tuning on K.

IEEE ACCESS (2021)

Add to Collection

Article Robotics

A pretrained proximal policy optimization algorithm with reward shaping for aircraft guidance to a moving destination in three-dimensional continuous space

Zhuang Wang, Hui Li, Zhaoxin Wu, Haolin Wu

Summary: A pretrained PPO algorithm is proposed to solve the guidance problem of manned aircraft and unmanned aerial vehicles, with continuous action reward function and position reward function to increase training speed and trajectory performance.

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS (2021)

Add to Collection

Article Engineering, Marine

Fault tolerant position-mooring control for offshore vessels

Mogens Blanke, Dong T. Nguyen

OCEAN ENGINEERING (2018)

Add to Collection

Article Engineering, Environmental

Position-moored drilling vessel in level ice by control of riser end angles

Dat H. Nguyen, Dong T. Nguyen, Ser T. Quek, Asgeir J. Sorensen

COLD REGIONS SCIENCE AND TECHNOLOGY (2011)

Add to Collection

Article Automation & Control Systems

Switching control for thruster-assisted position mooring

Dong T. Nguyen, Asgeir J. Sorensen

CONTROL ENGINEERING PRACTICE (2009)

Add to Collection

Article Automation & Control Systems

Control of marine riser end angles by position mooring

Dat H. Nguyen, Dong T. Nguyen, Ser T. Quek, Asgeir J. Sorensen

CONTROL ENGINEERING PRACTICE (2010)

Add to Collection

Article Engineering, Civil

Setpoint Chasing for Thruster-Assisted Position Mooring

Dong Trong Nguyen, Asgeir J. Sorensen

IEEE JOURNAL OF OCEANIC ENGINEERING (2009)

Add to Collection

Article Automation & Control Systems

Multi-operational controller structure for station keeping and transit operations of marine vessels

Trong Dong Nguyen, Asgeir J. Sorensen, Ser Tong Quek

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY (2008)

Add to Collection

Article Engineering, Ocean

Reliability of Switched Model-Based Controller for Vessel Dynamic Positioning With Switching Under Estimated Motion Frequency

Dong T. Nguyen, Ser Tong Quek

JOURNAL OF OFFSHORE MECHANICS AND ARCTIC ENGINEERING-TRANSACTIONS OF THE ASME (2010)

Add to Collection

Article Engineering, Marine

Verification of collision avoidance algorithms in open sea and full visibility using fuzzy logic

Dong Trong Nguyen, Marius Trodahl, Tom Arne Pedersen, Azzeddine Bakdi

Summary: This paper proposes a fuzzy logic-based method for evaluating the compliance of Collision-Avoidance Systems (CAS) in Autonomous Surface Vehicles (ASV). The evaluation systems were verified on simulated scenarios and found to provide variables that would be challenging or impossible to obtain by visual assessment.

OCEAN ENGINEERING (2023)

Add to Collection

Proceedings Paper Automation & Control Systems

Combining Supervised Learning and Digital Twin for Autonomous Path-planning

Chanjei Vasanthan, Dong T. Nguyen

Summary: This paper presents the development of an autonomous path planner based on supervised learning, addressing concerns about uncertainties introduced by deep learning models. Through thorough research and parameter tuning, the authors identified the most suitable model and utilized large-scale training data to enhance performance.

IFAC PAPERSONLINE (2021)

Add to Collection

Article Computer Science, Information Systems

Finite-Time Backstepping of a Nonlinear System in Strict-Feedback Form: Proved by Bernoulli Inequality

Zhengru Ren, Bo Zhao, Dong Trong Nguyen

IEEE ACCESS (2020)

Add to Collection

Proceedings Paper Engineering, Ocean

FULL-SCALE VALIDATION OF A VESSEL'S STATION-KEEPING CAPABILITY WITH DYNCAP

Luca Pivano, Dong Nguyen, Olyvind Smogeli

PROCEEDINGS OF THE ASME 36TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2017, VOL 9 (2017)

Add to Collection

Article Automation & Control Systems

Design of hybrid controller for dynamic positioning from calm to extreme sea conditions

Trong Dong Nguyen, Asgeir J. Sorensen, Ser Tong Quek

AUTOMATICA (2007)

Add to Collection

Article Engineering, Marine

HySwash: A hybrid model for nearshore wave processes

Alba Ricondo, Laura Cagigal, Beatriz Perez-Diaz, Fernando J. Mendez

Summary: This research presents a site-specific metamodel based on the SWASH numerical model simulations, which can predict coastal hydrodynamic variables in a fast and efficient manner. The metamodel uses downscaled and dimensionality reduced synthetic database to accurately reproduce wave setup, wave heights associated with different frequency bands, and wave runup. This method has great potential in coastal risk assessments, early warning systems, and climate change projections.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Experimental study on the mechanical behavior and energy absorption capacity of coral sand at high strain rates

Xiao Yu, Wangjun Ren, Bukui Zhou, Li Chen, Xiangyun Xu, Genmao Ren

Summary: This study investigated and compared the compression responses and energy absorption capacities of coral sand and silica sand at a strain rate of approximately 1000 s-1. The results showed that coral sand had significantly higher energy absorption capacity than silica sand due to its higher compressibility. The study findings suggest that using poorly graded coral sand can improve its energy absorption capacity.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Cooperative model predictive control for ship formation tracking with communication delays

Jingxi Zhang, Junmin Mou, Linying Chen, Pengfei Chen, Mengxia Li

Summary: This paper proposes a cooperative control scheme for ship formation tracking based on Model Predictive Control. A predictive observer is designed to estimate the current motion states of the leader ship using delayed motion information. Comparative simulations demonstrate the effectiveness and robustness of the proposed controller.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

A numerical investigation of the 2DH wave characteristics across a fringing reef profile with reef-flat excavation pit

Yu Yao, Danni Zhong, Qijia Shi, Ji Wu, Jiangxia Li

Summary: This study proposes a 2DH numerical model based on Boussinesq equations to investigate the impact of dredging reef-flat sand on wave characteristics and wave-driven current. The model is verified through wave flume experiments and wave basin experiments, and the influences of incident wave conditions and pit morphological features on wave characteristics are examined.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Double-averaged turbulence statistics of wave current flow over rough bed with staggered arrangement of hemispherical blocks

Jayanta Shounda, Krishnendu Barman, Koustuv Debnath

Summary: This study investigates the double-average turbulence characteristics of combined wave-current flow over a rough bed with different spacing arrangements. The results show that a spacing ratio of p/r=4 offers the highest resistance to the flow, and the double-average Reynolds stress decreases throughout the flow depth. The advection of momentum-flux of normal stress shows an increase at the outer layer and a decrease near the bed region after wave imposition. Maximum turbulence kinetic energy production and diffusion occur at different layers. The turbulence structure is strongly anisotropic at the bottom region and near the outer layer, with a decrease in anisotropy observed with an increase in roughness spacing.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

A monitoring method of hull structural bending and torsional moment

Meng Zhang, Lianghui Sun, Yaoguo Xie

Summary: The research proposes a method for online identification of wave bending and torsional moment in hull structures. For structures without large openings, the method optimizes sensor positions and establishes a mathematical model to improve accuracy. For structures with large openings, a joint dual-section monitoring method is proposed to simultaneously identify bending and torsional moments in multiple key cross sections.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Study on the dynamic characteristics of pile wharves subjected to underwater explosion

Longming Chen, Shutao Li, Yeqing Chen, Dong Guo, Wanli Wei, Qiushi Yan

Summary: This study investigated the dynamic response characteristics and damage modes of pile wharves subjected to underwater explosions. The results showed that the main damaged components of the pile wharf were the piles, and inclined piles had a higher probability of moderate or more significant damage compared to vertical piles. The study also suggested that replacing inclined piles with alternative optimized structures benefits the blast resistance of pile wharves.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

A real-time wave prediction in directional wave fields: Strategies for accurate continuous prediction in time

I. -C Kim, G. Ducrozet, V. Leroy, F. Bonnefoy, Y. Perignon, S. Bourguignon

Summary: Previous research focused on the accuracy and efficiency of short-term wave fields in specific prediction zones, while we developed algorithms for continuous wave prediction based on the practical prediction zone and discussed important time factors and strategies to reduce computational costs.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Experimental study on the slamming pressure distribution of a 3D stern model entering water with pitch angles

Hang Xie, Xianglin Dai, Fang Liu, Xinyu Liu

Summary: This study investigates the load characteristics of a three-dimensional stern model with pitch angle through a drop test, and reveals complex characteristics of pressure distribution near the stern shaft. The study also shows that the vibration characteristics of the load are influenced by the drop height and pitch angle, with the drop height having a greater effect on the high-frequency components.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Influence of blocking ratio on hydrodynamic force on deep-water pier under earthquake

Hangyuan Zhang, Wanli Yang, Dewen Liu, Xiaokun Geng, Wangyu Dai, Yuzhi Zhang

Summary: The deep-water bridge is more vulnerable to earthquake damage than the bridge standing in air. The larger blocking ratio has a significant impact on the added mass coefficient, which requires further comprehensive study. The generation mechanism of block effect is analyzed using numerical simulation software ANSYS Fluent. The results show that the recirculation zone with focus reduces the pressure on the back surface of the cylinder, resulting in the peak value of in-line force not occurring synchronously with the peak value of acceleration. The change in position and intensity of the recirculation zone with focus, as well as the change in water flow around the cylinder surface, are identified as the generation mechanism of the block effect, which has a 10% influence on the hydrodynamic force. The changing rule of the added mass coefficient with blocking ratio is discussed in detail, and a modification approach to the current added mass coefficient calculation method is suggested. Physical experiments are conducted to validate the modification approach, and the results show that it is accurate and can be used in further study and real practice.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Flow past rotating cylinders using deterministic vortex method

Golnesa Karimi-Zindashti, Ozgur Kurc

Summary: This study examines the performance of an in-house code utilizing a deterministic vortex method on the rotation of circular and square cylinders. The results show that rotational motion reduces drag forces, suppresses fluctuating forces, and increases lift forces. The code accurately predicts vortex shedding suppression and identifies the emergence of near-field wakes in the flow over rotating square cylinders.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

A dynamic simulation tool for ship's response during damage-generated compartment flooding

George Dafermos, George Zaraphonitis

Summary: The survivability of damaged ships is of great importance and the regulatory framework is constantly updated. The introduction of the probabilistic damage stability framework has rationalized the assessment procedure. Flooding simulation tools can be used to investigate the dynamic response of damaged ships.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

A real-time drilling parameters optimization method for offshore large-scale cluster extended reach drilling based on intelligent optimization algorithm and machine learning

Xuyue Chen, Xu Du, Chengkai Weng, Jin Yang, Deli Gao, Dongyu Su, Gan Wang

Summary: This paper proposes a real-time drilling parameters optimization method for offshore large-scale cluster extended reach drilling based on intelligent optimization algorithm and machine learning. By establishing a ROP model with long short-term memory neurons, and combining genetic algorithm, differential evolution algorithm, and particle swarm algorithm, the method achieves real-time optimization of drilling parameters and significantly improves the ROP.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Dynamics of a moored submerged floating tunnel under tsunami waves

Sung-Jae Kim, Chungkuk Jin, MooHyun Kim

Summary: This study investigates the dynamic behavior of a moored submerged floating tunnel (SFT) under tsunami-like waves through numerical simulations and sensitivity tests. The results show that design parameters significantly affect the dynamics of the SFT system and mooring tensions, with shorter-duration and higher-elevation tsunamis having a greater impact.

OCEAN ENGINEERING (2024)

Add to Collection

Article Engineering, Marine

Environmental contours of sea states by the I-FORM approach derived with the Burr-Lognormal statistical model

G. Clarindo, C. Guedes Soares

Summary: Environmental contours are constructed using the Inverse-First Order Reliability Method based on return periods. The paper proposes the use of the Burr distribution to model the marginal distribution of long-term significant wave heights. The newly implemented scheme results in different environmental contours compared to the reference approach.

OCEAN ENGINEERING (2024)

Add to Collection

© Peeref 2019-2024. All rights reserved.