4.8 Article

Wind-Farm Power Tracking Via Preview-Based Robust Reinforcement Learning

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
Volume 18, Issue 3, Pages 1706-1715

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2021.3093300

Keywords

Model-free control; reinforcement learning (RL); renewable energy; wind-farm control; wind power

Funding

  1. U.K. Engineering and Physical Sciences Research Council [EP/S001905/1]
  2. EPSRC [EP/S001905/1] Funding Source: UKRI

Ask authors/readers for more resources

This article presents a novel preview-based robust deep reinforcement learning method for wind-farm power tracking problem, which can handle tasks subject to uncertain environmental conditions and strong aerodynamic interactions among wind turbines. The control problem is transformed into a zero-sum game to quantify the influence of unknown wind conditions and future reference signals.
This article aims to address the wind-farm power tracking problem, which requires the farm's total power generation to track time-varying power references and, therefore, allows the wind farm to participate in ancillary services such as frequency regulation. A novel preview-based robust deep reinforcement learning (PRDRL) method is proposed to handle such tasks which are subject to uncertain environmental conditions and strong aerodynamic interactions among wind turbines. To our knowledge, this is for the first time that a data-driven model-free solution is developed for wind-farm power tracking. Particularly, reference signals are treated as preview information and embedded in the system as specially designed augmented states. The control problem is then transformed into a zero-sum game to quantify the influence of unknown wind conditions and future reference signals. Built upon the H-infinity control theory, the proposed PR-DRL method can successfully approximate the resulting zero-sum game's solution and achieve wind-farm power tracking. Time-series measurements and long short-term memory networks are employed in our DRL structure to handle the non-Markovian property induced by the time-delayed feature of aerodynamic interactions. Tests based on a dynamic wind-farm simulator demonstrate the effectiveness of the proposed PR-DRL wind-farm control strategy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available