Carnegie Mellon Electricity Industry Center Carnegie ...nsf-itr/Conference Papers...Sarosh Talukdar,...

Can cascading failures be eliminated?

Sarosh Talukdar, Paul Hines, Jay Apt

Carnegie Mellon Electricity Industry CenterCarnegie Mellon University

What is a cascading failure?

A cascading failure is a succession of automatic switching operations

These switching operations are the responses of the equipment-protection-network to certain random disturbances.

Each switching operation de-energizes some equipment (causes failures)

Fault Here

A much simplified illustration...

The protection network de-energizes the faulted line

The first outage is followed by a second set of outages. The second set is followed by a third set, and so on.

A way to protect equipment and interrupt the sequence of failures:

Shed 96 MW Gen

Shed 151 MW Gen

Shed 125 MW Load

Shed 117 MW Load

The N-1 Criterion

NERC recommends that power systems be operated so that no single transmission or generation failure can compromise the quality of service to any customer.

Consequence: cascading failures are caused by bizarre (multiple) failures

• a succession of failures that happens too fast for human intervention to re-establish the N-1 criterion

• a single failure in the presence of long standing but hidden failures

How cascading failures happen in grids with (N-1) security: A Bizarre Outage Excessive Stresses Outages Excessive Stresses More Outages and so on

In the past, human operators have not been effective in limiting the spread of cascading failures.

But cascading failures tend to be self-limiting

Source: Defense Meteorological Satellite Progra

Cascading failures cause blackouts that cost from 10 to 50 billion dollars per year.

(The number depends on the measure used)

Cascading failures are expensive...

“SECURITY OF SUPPLY IS TOP GLOBAL CONCERN FOR UTILITIES

Blackouts on both sides of the Atlantic have propelled security of energy supply to become the top concern for utilities companies across the world, according to the sixth annual PricewaterhouseCoopers report 'Supply Essentials: Utilities Global Survey 2004'. The report, which presents the views of 148 leading companies across 47 countries throughout Europe, the Americas, Asia Pacific, Africa and the Middle East, indicates that securing power supply has risen from the fourth concern only twelve months ago to the highest ranking issue of 2004.”

Besides being large, the societal costs of cascading failures are complex...

Failure Statistics 1984-2000Compiled by the North American Electric Reliability Council

• 533 transmission or generation events

• 324 (1 every 19 days) had power losses > 1 MW

• 46 of these (3 per year) were > 1000 MW

Cascading failures happen often enough to give the distribution of blackouts a fat tail:

11/9/65 Northeast 30 million people6/5/67 PA-NJ-MD 4 million5/17/77 Miami 1 million7/13/77 NYC 9 million1/1/81 Idaho-Utah-Wyoming 1.5 million3/27/82 West 1 million12/14/94 West 2 million7/2/96 West 2 million8/10/96 West 7.5 millionFeb-Apr 9 Auckland 1.3 million12/8/98 San Francisco ½ million8/14/03 Great Lakes-NYC 50 million8/30/03 London ½ million9/23/03 Denmark & Sweden 4 million9/28/03 Italy 57 million11/7/03 Most of Chile 15 million7/12/04 Athens 3 million

Log of Size

Log

of P

roba

bilit

y

We are not winning the battle against cascading failures

Number of blackouts 10 MW or larger

0

10

20

30

40

50

60

70

80

1984

1985

1986

1987

1988

1989

1990

1991

1992

1993

1994

1995

1996

1997

1998

1999

2000

2001

2002

2003

2004

10,001+ MW1,001-10,000 MW101-1,000 MW10-100 MW

ISO NE opens

CAISO opens

0.8 1 1.2 1.4 1.60

0.2

0.4

0.6

0.8

1

x

Pr (c|x∧m)

The probability, Pr, of a cascading failure plotted against a measure of stress, x.c is a cascading failure of 20 or more transmission lines; m is a random multiple contingency. Each point on the plot is the result of 10,000 simulations of a network with 3357 nodes. The points were generated by Huaiwei Liao.

Blackout probability seems to be critically dependent on network stress—at a critical stress there is a sharp increase in blackout-probability. Is the underlying mechanism a phase transition? a relaxation? ...?

• Cascading failures in electric grids happen quite frequently• Their distribution has a fat tail• They appear to be difficult to control, and have resisted elimination • There are other phenomena with fat-tailed distributions that are

difficult to control--forest fires, for instance.

Critical phenomena

Why are cascading failures difficult to control?

Some of the events that initiate forest fires --lightning strokes, for instance--are beyond human control. Therefore, as long as there are inflammable forests, there will be forest fires.

Some of the events that initiate cascading failures are difficult, if not impossible, to control:

• lightning strokes and other random, natural events• hidden failures• deliberate attacks

Many solutions to the problem of cascading failures have been proposed. Some have been implemented. Examples:

1. add transmission capacity2. improve regulations 3. improve coordination 4. better training for human operators5. better automatic control systems6. more data collection7. more data processing8. load management9. more conservation10. more chainsaws11. RAS’s (Remedial Action Schemes)12. SPS’s (Special Protection Schemes)

But what exactly is the problem?

Of course, it is critical to get the problem right.

After all, the best possible solution to the wrong problem is still the wrong solution

Let:

X be a space of control schemes and strategies, such as

• use PMU’s to collect data• shed load at bus-37 if the voltage drops 10%

C be the space of all possible operating conditions of the system, such as

• a short circuit on line-25 while it is carrying 1000 MW

• a short circuit on line-25 while it is carrying 1003 MW, and the state-estimator is down

gk(x, c) be the stress on device-k as a result of condition-c and strategy-x

f(x, c) be the societal cost of condition-c and strategy-x

Notation

At present, P (protection) is treated as a constraint satisfaction problem...

Find a strategy, x, such that:

gk(x, c) ≤ Tk for all k and all c ∈ C

Stress-Threshold beyond which device-k could be damaged

The existing strategy is to de-energize each and every device that approaches its threshold.

The beauty of this strategy is that it is unaffected by C, except for hidden failures and controller mis-operations.

The disadvantage is that makes no attempt to limit the number of devices that are de-energized

P+CFC (protection + cascading failure control), can be formulated as a constraint satisfaction or optimization problem. The constraint satisfaction formulation is:

Find a strategy, x, such that:

f(x, c) ≤ $

gk(x, c) ≤ Tk

for all k and all c ∈ C

Maximum allowable cost per incident

Societal cost Stress on device-k

Stress-threshold

C is so large that finding solutions to this problem is difficult, and verifying solutions is impossible.

The solution to a problem is only as good as the tests by which it is verified.

For P+CFC, the tests can never be complete or absolute. (At least, not until there is a breakthrough in testing technology.)

At best, one can verify only a small sub-set of the possible operating conditions, say (N-2) or (N-3).

The Aug. 14, 2003 blackout was an (N-k) event, where k was about 25. (See Fig 6.1 in the Final Report On The August 14, 2003 Blackout In The United States And Canada.)

Verification

The complete verification problem:

Given a control strategy, x*, show that

f(x*, c) ≤ $

gk(x*, c) ≤ Tk

for all k and all c ∈ C(x*)

This problem is impossible to solve because C, the space of all possible operating conditions, is very large. And C depends on x*, the control strategy to be verified. C grows dramatically with the complexity of x*.

At best, one can verify only a small sub-set of the operating conditions, C, and with only approximations to stress and cost.

Given x*, a control strategy, show that

F(x*, c) ≤ $

Gk(x*, c) ≤ Tk

for all k , and all c ∈ C’

where F and Gk are approximations to f and gk, and C’ is a small sub-set of C

A partial verification problem:

The dangers of partial verification

1. Untested conditions can, and invariably will, occur

2. There is an unavoidable tendency to optimize the system for the conditions to be tested. This could make the system more vulnerable to the untested conditions.

A measure of solution-quality with partial verification

Solution-x1 is better than solution-x2 if x1 is less costly than x2, that is, if:

∑c F(x1, c) < ∑c F(x2, c)

Gk(x1, c) ≤ Tk, Gk(x2, c) ≤ Tk

for c ∈ C’

Our approach to solving the P+CFC problem

Cascading failures are the sub-optimal reactions of relays to bizarre conditions. (These reactions are good solutions to problem-P, but poor solutions to problem-P+CFC.)

Existing relays are autonomous agents of very limited intelligence.

Our approach is to give the relays additional intelligence, and make them use:

• distributed model predictive control,• cooperation (each relay with its neighbors),• and eventually, learning

to better solve the P+CFC problem

0 2 4 6 8 100

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Time

Max

mim

um b

ranc

h cu

rrent

vio

latio

n(fr

actio

n ab

ove

limit)

Control goal

Agents acting unilaterally

Agents acting cooperatively

Branch outages 8,40rl=2, re=10

Typical results

0 2 4 6 8 100

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Time

Max

mim

um b

ranc

h cu

rrent

vio

latio

n(fr

actio

n ab

ove

limit)

Control goal

Agents acting unilaterally

Agents acting cooperatively

Control error

Control error…

1 2 3 4 5 60

1

2

3

4

5

6

7

Quantity of Communication (r)

Con

trol E

rror

Unilateral, 95 percentile

Unilateral, mean

Cooperative, 95 percentile Cooperative, mean

Local neighborhood radius

Control error vs. communication

Conclusions

1. Cascading failures are the sub-optimal reactions of relays to bizarre conditions. (The reactions are optimal for problem-P, but sub-optimal for P+CFC.)

2. We can design new and better solutions to P+CFC3. But a solution is only as good as the tests (and conditions) by

which it is verified.4. We can only test a small sub-set of the possible conditions.5. Until a breakthrough in verification technology occurs, most

conditions will remain untested. These untested conditions will inevitably be a source of cascading failures.

Failure Statistics 1984-2000�Compiled by the North American Electric Reliability Council Cascading failures happen often enough to give the distribution of � blackouts a fat tail:

Carnegie Mellon Electricity Industry Center Carnegie ...nsf-itr/Conference Papers...Sarosh Talukdar,...

Documents

Overview of Research in the Carnegie Mellon Electricity ...electriconf/2008/PDFs/4-3 Apt...Research in the Carnegie Mellon Electricity Industry Center Jay Apt Tepper School of Business

Carnegie Mellon Electricity Industry Center Working Paper

Paul Luo Li (Carnegie Mellon University) James Herbsleb (Carnegie Mellon University)

Carnegie MellonCarnegie MellonCarnegie Mellon ...coursecatalog.web.cmu.edu/previous/0406catalog.pdf1 Carnegie Mellon University does not discriminate and Carnegie Mellon University

Ensuring Reliable Electricity Supplies Using Distributed Generation Gregory Tress Carnegie Mellon University

Bio-based electricity: How biomass resources can support renewable electricity policies Matt Kocoloski and Paulina Jaramillo Carnegie Mellon University

Carnegie Mellon University Notice · Carnegie Mellon University Notice This video and all related information and materials (“materials”) are owned by Carnegie Mellon ... MELLON

Yudi Chen, Carnegie Mellon University Catherine Groschner, Carnegie Mellon University Brent Heard, Carnegie Mellon University Avisha Shah, University of

Carnegie Mellon Facts 1993 · Carnegie Mellon Facts 1993 Volume 7 Patrick J. Roach, Editor University Planning end Budget Carnegie Mellon University March, 1993. ... CARNEGIE MELLON

Carnegie Mellon University

Carnegie Mellon - cs.cmu.edu

Electricity Generation Mix by US Industrial Sectors Disaggregating Electricity Generation and Modeling Interstate Transfers Joe Marriott, Carnegie Mellon

Carnegie Mellon Agenda

Carnegie Mellon Electricity Impacts of Data Networks Carnegie Mellon University Green Design Initiative Francis McMichael - H. Scott Matthews Chris Hendrickson

Carnegie Mellon

Electricity Industry Center Carnegie Mellon University Understanding the Effects of Restructuring Jay Apt Carnegie Mellon University

Ryan ’Donnell Carnegie Mellon University O. Ryan ’Donnell Carnegie Mellon University

Carnegie Mellon Universitycschafer/cmspbs.pdf · 2009. 2. 26. · Carnegie Mellon University

Carnegie Mellon Paper

Carnegie mellon google_ad_study