15
Reinforcement Learning for Optimal Frequency Control: A Lyapunov Approach ICML Workshop Wenqi Cui and Baosen Zhang 2021. 06.

RL for Optimal Frequency Control

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: RL for Optimal Frequency Control

Reinforcement Learning for Optimal Frequency Control: A Lyapunov Approach

ICML Workshop

Wenqi Cui and Baosen Zhang

2021. 06.

Page 2: RL for Optimal Frequency Control

1. Background

2[1] Tamrakar, Ujjwol, et al. "Virtual inertia: Current trends and future directions." Applied Sciences 7.7 (2017): 654.

Page 3: RL for Optimal Frequency Control

1. Background

- Frequency deviation

reflects the demand –

supply mismatch

- In frequency control

problem, we adjust the

active power from

generators to reduce

the frequency

deviation. Increased renewable

energy sources

3[2] https://www.mitsubishielectric.com/eig/energysystems/ictpowersystem/solutions/Large-capacityStorageBattery.html

Page 4: RL for Optimal Frequency Control

1. Background

4

Linear droop control

[3] Denholm, Paul, et al. Inertia and the power grid: A guide without the spin. National Renewable Energy Laboratory, 2020.

Synchronous generators follow negative linear feedback from frequency deviation

Frequency

Po

wer

Generator-+

[3]

Page 5: RL for Optimal Frequency Control

1. Background - Challenge

5

Linear control may not be optimal✓ Stabilizing

✓ Frequency deviation

✓ Control cost

Frequency

Po

wer

Frequency

Po

wer

?

Inverter-based resources can implement almost arbitrary control law

Page 6: RL for Optimal Frequency Control

1. Background – Our approach

Reinforcement learning (RL) for optimal

frequency control

- Parameterize the controllers with neural

network and RL is used to train them

- Obtain structure property of stabilizing

controller using Lyapunov function

- RNN-based framework for efficient

training

6

Page 7: RL for Optimal Frequency Control

2. Problem Formulation - Model

Angle Frequency State Variables

The dynamics of the power system are represented by the swing equation

Inertia

constant

The susceptance

of the line (i, j)

Static controller that change active

power for primary frequency regulation

7

Net power

injection

Damping

coefficient

Page 8: RL for Optimal Frequency Control

2. Problem Formulation – Optimization Objective

8

Inverter

Power System

t

0

t0

u

ത𝑢

Cost on frequency

deviation

Cost on control effort

+

Stability

𝜔𝑡

𝑢𝑡

𝝎 ∞

Page 9: RL for Optimal Frequency Control

2. Problem Formulation – Hard Constraint on Stability

(b) Dynamics of 𝜔(left) and 𝛿 (right) for RNN-Wo-Lyapunov

(c) Dynamics of 𝜔(left) and 𝛿 (right) for RNN-Lyapunov 9

(a) Average batch loss along episodes

- Necessity to consider Stability

Page 10: RL for Optimal Frequency Control

3. Lyapunov Approach for a Stabilizing ControllerA local Lyapunov function 𝑉(𝛿, 𝜔) for the dynamic system is

The total derivative of the Lyapunov function with respect to t is

10

Page 11: RL for Optimal Frequency Control

3. Lyapunov Approach for a Stabilizing Controller

According to Lyapunov stability theory, we design the neural networks to have

the following structures such that the controller will be locally exponentially

stabilizing

1) 𝑢𝜃𝑖 𝜔𝑖 has the same sign as 𝜔𝑖

2) 𝑢𝜃𝑖 𝜔𝑖 is monotonically increasing

3) 𝑢𝑖 ≤ 𝑢𝜃𝑖 𝜔𝑖 ≤ 𝑢𝑖

11

Page 12: RL for Optimal Frequency Control

4. RNN for Efficient TrainingIntegrate state transition dynamics in recurrent neural network (RNN)

- Define the cell states to be 𝛿𝑖 and 𝜔𝑖

- Operation of cell unit follows the swing equation

12

Compared with the general reinforcement learning structure, the proposed RNN based structure reduces computational time by approximate 74.32%

Page 13: RL for Optimal Frequency Control

5. Case studyCase studies are conducted on the IEEE New England 10-machine 39-bus (NE39) power network

- Benchmark: Linear droop control with optimal linear coefficient

- The proposed approach learns a non-linear control law Control Action u obtained by different approaches

13

Page 14: RL for Optimal Frequency Control

5. Case study

- Start from the same initial states at t=0, loss of load at bus 2 at t=2s

- Compared with the linear droop control, RNN-Lyapunov achieve similar frequency deviation with much smaller control effort.

(a) Dynamics of 𝜔(left) and 𝑢 (right) for RNN-Lyapunov

(b) Dynamics of 𝜔(left) and 𝑢 (right) for linear droop control 14

Page 15: RL for Optimal Frequency Control

Thank you!

15