Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Geography, Mobilization, and the Outcome of Insurgencies
Sebastian Schutte
(Paper prepared to be presented at the 6th ECPR General Conference, 25th - 27th Aug. 2011, Reykjavik,
Iceland)
Nations are born, grow, �ght, conquer, are conquered, become empires, and rise and fall on thegreat stage of physical geography and human passions and knowledge, and not on the homogeneouswhite planes on which we draw our diagrams.Kenneth Boulding
Abstract
Insurgency has been identi�ed as the predominant type of warfare in civil wars. While the bulk of
quantitative con�ict research is focused on war onset, less is known about the processes and dynamics
that drive insurgency once it is underway. This study provides a theoretical model of how insurgency
develops as a function of geography and insurgent mobilization. The model suggests that geographic
conditions crucially determine the accuracy of incumbent force and thereby its public perception which in
turn determines the rebels' ability to mobilize. These expectations are operationalized in an Agent-Based
Model and tested with large-N data on the outcome of insurgencies. The empirical results con�rm the
theoretical expectations and even contribute to improving the prediction of insurgent civil war outcomes.
1
1 Introduction
The most frequent form of large scale political violence in the post-1945 era are civil wars (Harbom and
Wallensteen, 2005). De�ned as an armed con�ict between a government and another political actor from
within the same country, many civil wars are fought based on irregular tactics. In a typical scenario, the
regular armed forces of a government face a public insurgency employing guerrilla warfare (Fearon and Laitin,
2003, Kalyvas, 2005).1 The majority of quantitative research on civil war is focused on war onset and initial
motivation of the rebels. Ancient hatred (Geertz, 1963, Huntington, 1993), economic reward (Collier, 2000,
Collier and Hoe�er, 2004), and ethnic nationalism (Cederman, 2008, Weidmann, 2009) have been proposed
as explanations of why civil wars start. But civil wars can display puzzling attributes once they are underway,
including speci�c spatial distributions of violence (Buhaug and Gates, 2002, Raleigh and Hegre, 2009, Kalyvas
and Kocher, 2009, O'Loughlin and Witmer, 2010) and counter-intuitive dynamics of mobilization: insurgent
movements can start with a very small number of initial combatants and grow in size despite heavy losses
and military inferiority to the incumbent (Fall, 1965, Macaulay, 1978, Herbst, 2004). It is therefore obvious
that a more complete understanding of civil wars must pay attention to their internal processes and drivers,
as it has been done is a small number of recent publications (Lyall and Wilson III, 2009, Buhaug, 2010,
Weidmann and Salehyan, 2011).
This paper attempts to contribute to this line of research by answering the following research question:
What is the role of geographic factors in determining the outcome of insurgencies? The relevance of this
project is threefold. First, a theoretical dispute on the e�ects of force is made explicit and resolved the-
oretically. Second, the inclusion of Agent-Based Simulation techniques allows for illustrating the complex
mechanism of reactive mobilization in insurgencies. Third, the implications of this research are policy-relevant
in that they help to understand the counter-intuitive e�ects of military force in insurgent civil wars.
The paper proceeds as follows: a thorough discussion of the existing literature in the next section paves
the way for a subsequent theoretical discussion. After that, a theoretical model of reactive mobilization under
varying geographic conditions is speci�ed and implemented as an Agent-Based Model. Several hypotheses are
derived and tested in a large-N con�ict-level analysis. Finally, the empirically results and their implications
will be discussed.
1To illustrate the importance of insurgency consider the following numbers: Fearon and Laitin (2003) introduced a datasetwith 127 cases of civil war since 1945 that lead to at least 1000 battle death. Drawing on a subset of this data, Connableand Libicki (2010) identi�ed 89 clear cases of insurgency. Kalyvas and Balcells (2010) report a declining importance of insur-gency in the post Cold War era in comparison to symmetric confrontations in civil wars. However, they identify �symmetricunconventional� wars as a new category that shares many aspects of insurgencies.
2
2 Existing Literature
Insurgency and irregular warfare have received major attention throughout the twentieth century, starting
with the works of Lawrence ([1927]1998) on the British-Arab guerrilla campaigns during Word War I. After
World War II and the successful partisan activities against the Axis powers in Europe and South-East Asia,
the topic received major attention in the works of communist theorists, such as Mao ([1938]1967) and Guevara
(1961). The French defeat in Vietnam led to a growing American involvement and counterinsurgency doctrine
moved into the focus of Western military advisers and scholars. While IR literature was majorly concerned
with interstate war throughout much of the 1970s and 80s, a turn toward civil wars has brought the notion
of insurgency back on the agenda in the last two decades in the context of contemporary civil war literature.
There are two recurrent themes in all of these scattered schools. First and most importantly, every serious
student and practitioner of insurgency has stressed its population-centric nature. In Lawrence's case, the
orientation toward the population is a more abstract commitment to the Pan-Arab cause and a vision of
liberation from foreign rule. In his famous metaphor, Mao ([1937]1961, 93) compared the revolutionary more
directly to the �sh swimming among the peasants as if those were the sea. He also stressed the necessity for
a large population basis for guerrilla war to succeed (Mao [1938]1967, 11; 66). Guevara (1961, 2) echos this
basic doctrine: �The guerrilla is supported by the peasant and worker masses of the region and of the whole
territory in which it acts. Without these prerequisites, guerrilla warfare is not possible�. Consequently, the
counterinsurgency school has been equally focused on the problem of public support. Military adviser John
Paul Vann was one of the �rst outspoken critics of military tactics that alienated the civilian population in
Vietnam (Sheehan, 1988, 106). Vietnam would become both the cradle for counterinsurgency theory as well
as a testbed for its application. In order to secure the loyalties of the local population, the �strategic hamlet
program� was intended to separate the `�shes' from the `sea'. Resettling a signi�cant number of peasants
into forti�ed compounds was meant to deny the insurgents access to the civilian population (Greiner, 2009,
34). Awareness to the problem of civilian loyalties developed up to the highest levels of command. The
expression of �winning hearts and minds� of the Vietnamese was made famous through its ritualized use
by President Johnson. The comprehensive failure of the American struggle for South Vietnam temporarily
lowered its military focus on counterinsurgency. With the ongoing wars in Iraq and Afghanistan, the interest
has reemerged and the emphasis on population centric warfare is an essential part of today's doctrine (DOD,
2007, 1-27).
The question of why civilians participate in ongoing insurgency has also been approached in the con-
3
text of con�ict research. Rational choice models portray the problem in terms of straight-forward utility
maximization: rebels engage in rebellion if the prospects of economic revenue exceed the individual risks.
Following this logic, Grossman (1999, 269) stated that in many insurgencies the rebels are indistinguishable
from bandits or pirates and Collier (2000) developed a model in which �rebellion is a special case of crime,
with all the di�erences from conventional crime arising as a consequence of the particular loot sought by
rebels� (page 841). Clearly, the solution to stopping an insurgency within this model is simple: increase the
risk for the rebels through the increased application of military force. Certain military leaders have put their
money on this horse. As Kissinger put it: �I refuse to believe that a third-class power such as Vietnam does
not have a breaking point� (Greiner, 2009, 22). Greiner coins the term �tonnage ideology� referring to the
assumption that a su�cient amount of attrition will translate into enemy surrender through a breakdown of
either will or risk-reward calculus (Greiner, 2009, 23). Another source of this kind of reasoning is Air Power
Theory starting with Douhet (1921). Douhet perceived the airplane as an inherently o�ensive weapons and
suggested that future wars would be decided mainly by the strategic bombing capacity of the adversaries. A
breakdown of the will of the civilian population was the essential mechanism in his reasoning. A prominent
World War II advocate of this logic was British Air Chief Marshal Arthur Harris: �There are a lot of people
who say that bombing cannot win the war. My answer to that is that it has never been tried yet.� (Lowe,
2007, 56). Pape (1996) o�ers a scienti�c study on the abilities and limitations of coercion through air power.
A central element in his model is also the balancing costs and bene�ts (page 16). With regard to the policy
debate on how to defeat the Afghan insurgency, this logic is far from dead. Former US ambassador to India
and deputy national security adviser Robert Blackwill makes no secret of his intentions to partition the
country to then unleash the Air Force:
The sky over Pashtun Afghanistan would be dark with manned and unmanned coalition air-craft � targeting not only terrorists but, as necessary, the new Taliban government in all itsdimensions. Taliban civil o�cials � like governors, mayors, judges and tax collectors � wouldwake up every morning not knowing if they would survive the day in their o�ces, while involvedin daily activities or at home at night (Blackwill, 2010).
While the rationalist school and advocates of tonnage ideology do not necessarily agree on all aspects of
counterinsurgency, they share a �rm belief in the basic mechanism: the quantity of violent force by the
incumbent is proportional to the probability of victory, because the use of force translates into increased risks
for the rebels. According to the classic collective action problem (Olsen, 1965), this should provide incentives
for free-riding within the insurgency and therefore a weakening of its military capabilities.
Translated into an observable implication this approach suggests a positive association of the number of
4
killed insurgents with the probability of incumbent victory. The strong emphasis on the �body count� as a
measurement of military success in the Vietnam war is a prime example for this logic in action (Greiner,
2009, 105).
A series of scholarly and military contributions strongly disagrees with this theory and its implications.
Especially the use of indiscriminate violence is assumed to have massively negative e�ects for the perpetrator
(Kalyvas, 2006, 150). The killing of innocent civilians and the destruction of property can alienate the
population from the user of force. Even if violence is targeted against combatants, their killing might help
the opponent to recruit additional combatants from their kin. This problem also surfaced in the strategic
discussion on Afghanistan. General Stanley McChrystal refers to this e�ect as �insurgent math� (Hastings,
2010). In an exceptional contribution to the problem of civilian alienation, Ellsberg (1970, 6) introduces
a interesting metaphor. The rebels sometimes provoke military action by the state which in turn alienates
the civilian population and aids the rebels. According to Ellsberg, this mechanism resembles the use of the
opponent's weight in Judo: instead of creating a comparative advantage for the opponent it is essentially
used against them bringing them to fall the harder and faster the more weight they put behind their attack.
Kilcullen (2009) coins the term �Accidental Guerrilla� referring to elements of the local population that
are drawn into the �ght instead of being a-priory adversaries of the incumbent. Clearly, this accidental
process is closely linked to incumbent behavior in the �eld: as civilian casualties mount and destruction of
property continues, more locals might be willing to retaliate independent of antecedent strategic loyalties.
The US Army Counterinsurgency Handbook also stresses the relevance of avoiding unnecessary destruction
and violence and explicitly rejects the body count indicator (DOD, 2007, 5-27). While all of these accounts
approach the problem from slightly di�erent angles, the basic mechanism hereafter referred to as �reactive
mobilization� is obvious: instead of increasing the risks for the rebels and weakening the insurgency, state
violence can have the opposite e�ect. More civilians are repelled by the government's actions and join the
rebellion. Fearon and Laitin's widely cited study opened the �oor for incorporating the alienating e�ects of
violence into their global analysis, reducing economic variables to proxies for more complex social processes:
We hypothesize that �nancially, organizationally, and politically weak central governments renderinsurgency more feasible and attractive due to weak local policing or inept and corrupt counterin-surgency practices. These often include a propensity for brutal and indiscriminate retaliation thathelps drive noncombatant locals into rebel forces (Fearon and Laitin, 2003, 3).
The central role of civilians in civil wars has been underlined repeatedly in the literature: �The �ght is
conducted through the people [...] Its like a man who has to hit the opponent through the body of the
5
referee� (Durell, 1957, 224 in Kalyvas 2006, 91). In his 2006 book, Kalyvas laid out a theoretically thorough
explanation of how violence against civilians is applied in civil wars. A central piece in this theory is
the distinction between selective and indiscriminate violence. The former is applied more accurately against
combatants, o�cers, and informants of the adversary and the latter is applied less accurately against civilians.
Drawing on a stylized spatial model that consists of �ve zones, Kalyvas (2006) concluded that violence is used
most indiscriminately in zones of predominant, but incomplete control. Assuming that zone 5 is associated
with full insurgent control and zone 1 is associated with full incumbent control, violence is most likely to erupt
in zones 2 and 4. With regard to how violence feeds back into the dynamics of control, the theory makes the
case for both coercive and alienating e�ects, but ultimately assumes both violence and civilian collaboration
to be endogenous to military control (Kalyvas, 2006, 12;118-132). In a recent study explicitly devoted to
explaining the outcome of insurgencies, Lyall and Wilson III (2009) argue that modern combat tactics have
undermined the incumbent's ability to win over the civilian population: increasingly mechanized armies
and long-range supply systems make contact with the civilian population less frequent and less important.
However, the authors argue that this type of interaction with the civilian population is essential to secure
loyalties and to learn to tell apart friend and foe. Their study provides an especially interesting point of
departure since it is also devoted to the problem of understanding the outcomes of insurgent civil wars.
Therefore, their empirical study is an important benchmark for outcome prediction and provides essential
control variables for the subsequent analysis.
Before turning to the empirical analysis, however, there is a second recurrent theme in many studies
on insurgency to be taken into account. This theme is geography and the line starts again with Lawrence.
The superior ability of irregular forces to travel the desert informed his strategic decisions (Lawrence, 1998,
157). Mao (1967, 7) assumed guerrilla war to be most feasible when employed in large countries (such as
China) where the conventional forces of the incumbent or the invader tend to overstretch their lines of supply.
Guevara put a higher emphasis on escaping the state's reach through the utilization of di�cult terrain:
As we have already said, guerrilla �ghting will not always take place in the country most favorableto the employment of its tactics; but when it does, that is, when the guerrilla band is locatedin zones di�cult to reach, either because of dense forests, steep mountains, impassable desertsor marshes, the general tactics, based on the fundamental postulates of guerrilla warfare, mustalways be the same (Guevara, 1961, 10).
Consequently, the counterinsurgency community has also identi�ed geography as a decisive factor. John Paul
Vann was troubled by the low degree of urbanization in South Vietnam in the early 1960s. With an estimated
85% of the population living in the countryside, guerrilla recruitment could take place largely unnoticed by
6
Figure 1: Boulding's original LSG concept: the states A and B have unequal military powers. Although A'spower declines over distance, it is still superior to B's even in B's capital.
the central government (Sheehan, 1988, 50). In the context of con�ict research, geographic indicators have
been also taken into account. McColl (1969) o�ered a geographic model that identi�ed suitable territorial
bases for guerrilla movements.2 A classic model on the role of distance in interstate wars was presented by
Boulding (1962). Boulding assumed that a state's ability to project power toward an enemy was dependent on
both its military strength and the distance that separated the adversaries. His notion of a �Loss of Strength
Gradient� (LSG) assumes that for every unit of distance, a certain number of personnel had to be subtracted
from the �ghting forces and added to the supply troops. In theory, putting this relationship into numbers
allows for the exact calculation of the geographic limits in power projection. While still employed in the
context of interstate con�ict research (for example Gartzke, 2010), the model has also been modi�ed to serve
in the context of civil war research (Herbst, 1992, Buhaug and Gates, 2002, Herbst, 2004, Cederman, 2008,
Buhaug et al., 2008, Buhaug, 2010). Figure 1 illustrates Boulding's model.
Apart from distance, mountainous terrain has been identi�ed as a determinant of war onset. Fearon and
Laitin (1999) theorized that especially rough terrain allows �for a guerrilla movement to subsist and thrive�
(page 21). Although the percentage of mountainous terrain in a given country is fairly basic indicator, Hegre
and Sambanis (2006) found this measurement to be a robust predictor for civil war onset. With more large
and accurate event data sets being available to the research community, the question of where �ghting takes
place in armed con�ict has emerged as an independent research topic. The spatial footprint of insurgent
warfare has been analyzed for Liberia (Raleigh and Hegre, 2009) and Vietnam (Kalyvas and Kocher, 2009,
Kongsgard, 2010). Moreover, the settlement patters have been linked to both the likelihood of civil con�ict
2In the study of interstate war, geography has been identi�ed early on as an essential variable (von Clausewitz, 1873, BookV, Chapter 17). Apart from the direct military aspect of terrain utilization, the spatial arrangement of political actors shapesand constraints the opportunity structure in armed con�ict. The conceptual distinction between opportunities and motivesfor war places a high emphasis on this aspect of geography (Starr, 1978). Research on interstate war has also advanced theunderstanding of �relevant neighborhoods�: the set of states within military reach for any potential aggressor (Lemke, 1995).
7
as well as the occurrence of con�ict events (Weidmann, 2009, Weidmann and Ward, 2010).
With a strong emphasis on geography and civilian population, the existing literature has laid out many
of the conceptual tools for this research. However, several theoretical problems remain unsolved. Although
the signi�cance of civilian population and geography in insurgent warfare has been widely acknowledged
they are usually treated as unrelated topics and their exact roles remain disputed. A theoretical approach
that integrates the two is missing so far. Moreover, some of the open theoretical questions deserve explicit
attention:
First, and most importantly, there is the unresolved question of how state reach can explain attributes of
modern insurgencies: The LSG provides an elegant and simple formalism for why rebellion is more likely and
more successful in a state's periphery. However, the two most prominent advocates of state reach explanations
stress the mostly historical relevance of their research:
The airplane and now the missile have brought about a revolution of quite unprecedented dimen-sions, as we have suggested earlier. For the air-born carrier or weapon, the world is an almostfeatureless globe: coasts, mountains, deserts, and forests hardly exist as long as there are landingstrips on the other side of them. The intricate geographical structure of national power, therefore,which rests on the combination of sea-power with a very low LSG and land-power with a muchhigher LSG, has largely been swept away as far as air-power is concerned. Everywhere now isaccessible to everybody; there are no nooks, corners, or retreats left, and no snugly protectedcenters of national power (Boulding, 1962, 272).
Similarly, Scott (2009, xii) remarked that his own theory of statehood's inability to permanently integrate the
South-East Asian highlands is largely irrelevant for the post Word War II period. Scott assumed that terrain
friction accounts for a large variety of political attributes of the region, including the geographic shapes of
states, the spatial distribution of literacy, and much of its early modern history of political violence. However,
with the infrastructural and logistical advancements of the last sixty years, terrain friction has lost much of
its signi�cance according to Scott.
In Boulding's case, the question of how a geographic model that was deemed outdated �fty years ago by its
own creator can deliver reasonable predictions for the present springs to mind. The mechanism he proposed
assumes supply lines to be hold and maintained at all times by soldiers that are permanently occupied with
this task. It is agnostic to mechanized forces or air lifting capabilities as potential game changers. With few
exceptions, modern states can execute violence within their territory. Any air force today has the technical
ability project its force within the territorial boundaries of its home state. Moreover, successful insurgencies
were won against superpowers with global strike capabilities, such as the United States and the Soviet Union
in Vietnam and Afghanistan. This �rst remaining question can be summarized like this: if the problem of
8
state reach has been largely solved for the post World War II period how can it then explain aspects of
modern insurgencies?
Another unresolved question is the systematic e�ect of violence. Qualitative studies, military accounts,
and anecdotal evidence suggest the existence of reactive mobilization as a mechanism in insurgency: indis-
criminate force creates a backlash in popular reactions and leads to accelerated mobilization for the opponent.
However, this logic is in direct contradiction to the the assumption that increased force leads to victory. Vi-
olence should increase the risk for the insurgent and thereby increase the incentives for free-riding. Instead,
violence seems to sometimes have the opposite e�ect. Despite heavy losses and dim life expectancies for
the individual combatant, some insurgencies were recruiting at peak capacity. The Loss Exchange Ratio in
Vietnam was usually estimated to be around 1:10 against the insurgents. Palestinian Intifada movements
grew strong despite the smallest chance of victory. Afghan Mujaheddin are known for suicide tactics that
completely deny worldly reward to the attacker. Any yet, for many insurgencies the number of available small
arms presents the limiting factor in mobilization and not the lack of volunteers.3 The second open question
is therefore the following: under which conditions does application of violence by the incumbent strengthen
or weaken the insurgency?
A third open question is why counterinsurgency e�orts that explicitly emphasize the problem of alienating
e�ects of violence still fail to reliably secure civilian loyalties. A focus on the selective application of force
is central to counterinsurgency doctrine. Awareness to the problem of civilian alienation can be traced back
at least to World War II. However, a considerable number of insurgencies end with either a rebel victory
or a negotiated settlement (Connable and Libicki, 2010, 27). Interestingly, the advancement of weapons
technology, long- range communication, precision munition, and military doctrine does not appear to have
a�ected this pattern in the post-WWII era. Factors beyond the development of arms technology seem to be
at work in determining the success and failure of insurgencies. The third remaining puzzle can be put like
this: which factor outside the belligerents' control determine success and failure of insurgencies.
3 Theoretical Synthesis
Drawing on a distance-decay mechanism largely inspired by Boulding (1962) and Kalyvas' (2006) typology of
violence, a theoretical model is developed in this section. Most importantly, a strong link between geographic
3Ho Chi Minh was con�dent that more American small arms for Saigon would translate into more captured arms for the VietMinh and therefore a stronger rebellion (Sheehan, 1988, 100). Many Eastern European partisan groups as well as the Cubanmovement of the 26th of July demanded recruits to bring their own ri�es as a condition for joining the group. Therefore, thepopular will to join the lethal �ght can sometimes even exceed the rebels' ability to supply new combatants.
9
conditions, types of applied violence, and resulting e�ects of incumbent control are incorporated into the
model.
3.1 Loss of Accuracy and Rise of Resentment
Boulding's LSG might be suitable to estimate a state's potential reach, but it does not explain the devel-
opment of insurgencies over time. It is static in that it predicts con�ict locations and outcome based on
a preexisting distribution of military capability and geography. The theory also fails to account for the
endogenous dynamics in insurgent mobilization. Moreover, a series of cases where actors with global strike
capabilities lost in insurgent warfare point to possible limitations of the paradigm.
Kalyvas typology of violence provides some of the building blocks for a more advanced explanation. His
theory is concerned with the explanation of types of violence in a stylized, one dimensional space based on
the assumption that violence and collaboration are largely endogenous to military control (Kalyvas, 2006,
12; 118-132). The theory is as such not suitable for explaining where military control and therefore patterns
of violence arise geographically. It is also not focused on the question how alienation or coercion feeds back
into the dynamics of con�ict. Moreover, the underlying stylized model does not allow for the identi�cation
of geographic regions that are more or less likely to be controlled by either military actor and more or less
prone to di�erent types of violence.
However, a theoretical synthesis between these models is possible based on additional assumptions. Most
importantly, the application of violence and the tactics of their application are severely restricted by tech-
nological, cognitive, and historical factors. Let us consider technological aspects �rst. A nuclear bomb will
never be able to apply violence selectively. It is inherently indiscriminate. Similar in this regard, but less
devastating is conventional, strategic bombing: dropping tons of explosive from the stratosphere has a greater
potential for indiscriminate destruction than direct small arms �re. No one has expressed this logic more
clearly than John Paul Vann:
This is a political war and it calls for discrimination in killing. The best weapon for killing wouldbe a knife, but I'm afraid we can't do it that way. The worst is an airplane. The next worst isartillery. Barring a knife, the best is a ri�e � you know who you're killing (Sheehan, 1988, 317).
Technological constraints are somewhat symmetrical for incumbent and insurgent. For a rural insurgent
movement, the available forms of violence might be ambushes and hit-and-run attacks within its area of
control, as well as terrorist attacks outside this area. Insurgents going global are rarely capable of assas-
sinating leading decision makers or military commanders. Instead, they resort to indiscriminate, terrorist
10
methods such as using improvised explosives against civilians. A similar picture arises for the incumbent:
patrolling controlled and slightly contested areas in searching for insurgents is an option. Outside these areas,
the use of ground troops becomes increasingly costly. Aerial bombardment and the use of heavy arms on
the ground are the remaining options for the state. Aerial bombardment and international terrorism are
prototypical examples of indiscriminate violence and both tactics can be applied globally. Individual arrests
and targeted assassinations are prototypical examples of selective violence, but their application is restricted
to the controlled or contested areas. We can therefore assume that long-range attacks are �on average� less
discriminate than short range attacks and we would expect civilian alienation to be a function of distance as
well. Thereby, it is not necessarily the quantity of force that diminishes over distance as assumed by Boulding,
but its quality : arms branches with a low Loss of Strength Gradient tend to apply force indiscriminately. In
Vann's words, a knife should be the weapon of choice in a political war. But a knife combines the highest
accuracy with the shortest range. An airplane combines the lowest accuracy with the highest range. in
between the two, there are ri�es and artillery that follow the same trade-o�.
Cognitive accuracy also declines as a function of distance. As language and customs are known, the
identi�cation of loyalties is comparatively easy. Disagreement with the government can be inferred from
subtle cues. Sub-tones and connotations can give away collaborators. Clothing, decoration, and dialect allow
inferences on ethnic and religious ties. Knowledge of local history allows for a better understanding of local
grievances. These insights are part of modern counterinsurgency theory:
Learn about the people, topography, economy, history, religion, and culture of the area of opera-tions (AOs). Know every village, road, �eld, population group, tribal leader and ancient grievance.Become the expert on these topics (DOD, 2007, Appendix A, Paragraph A2).
This demand is reasonable in theory but increasingly di�cult to achieve in practice as distance to the own
cultural circle grows. Europeans and Americans in Algeria and Vietnam found a variety of ethnic, linguistic,
and cultural aspects that are mind boggling to the outsider and hard to generalize from. Many anthropologists
would �nd this task extremely challenging. Mental access to foreign regions is hard to acquire and hard to
communicate. Therefore, the cognitive ability to identify loyalties also diminishes as a function of distance.
The absence of cognitive access to local customs can provide incentives for brutal and indiscriminate behavior.
Incumbent violence in many counterinsurgencies involves torture as a means to acquire information.4 Third
party actors in counterinsurgency operations often struggle with the problem of understanding local societies.
4In an attempt to circumvent this problem, incumbent forces have sometimes tried to use local informants to acquire accurateinformation on political loyalties. This is problematic in that it creates opportunities for civilians to utilize military force forsettling private vendettas. The CIA-led �Phoenix Program� during the later stages of the Vietnam War relied on civilianinformants and su�ered from exactly this problem (MacPherson, 1984, 625).
11
Figure 2: The �Loss of Accuracy� Gradient is graphically illustrated in this �gure. As distance to the powercenter increases, accuracy declines. This leads to an increase in resentment toward the user of force.
But even governments �ghting insurgencies within their own territorial boundaries face this problem. While
insurgents often mobilize local combatants for local operations, draft-based regular armies employ soldiers
usually in areas unknown to them.
Several empirical implications arise from this discussion. First, unlike Boulding's model, the presented
theory assumes that the quality (not primarily the quantity) of force is a function of distance in modern
warfare. As distance between the incumbent and the insurgent grows, the accuracy of their violent action
decreases. Compensating for the lack in accuracy with increased lethality only ampli�es the problem: more
innocent bystanders fall victim to the inaccurate use of force. Second, indiscriminate force usually creates
a backlash in popular opinions. The alienating e�ect of inaccurate violence vastly out-weights its deterrent
e�ect, simply because inaccuracy is not conditional on speci�c behavior: even supporters of the user of force
can fall victim to their measures. This logic is illustrated in Figure 2.
Revolutionary movements have sometimes tried to harness the backlash of indiscriminate violence into
bootstrapping insurgent movements: a state that can be bated into overreaction in �ghting a small guerrilla
army alienates the population and thereby helps the insurgents recruit combatants. Combined with an active
e�ort to build up a movement on the �y around a vanguard core, this mechanism forms the basis of Guevara's
�foco theory� of revolution.5
5Even the unsuccessful �Red Army Fraction� occasionally reechoed this logic in their writings, asserting that the West-Germanstate resorted to totalitarian measures in �ghting them, thereby alienating the population (RAF, 1982).
12
Accounting for this systematic e�ect of declining accuracy and rising resentment o�ers a theoretical
solution to the unresolved questions mentioned above. This theory rejects limited state reach as a suitable
explanation for why insurgencies succeed. Instead, limited accuracy in the application of power is put
forward as an explanation. The assumption of economic self-interest seems to make false predictions about
increased risks leading to declining mobilization. Acknowledging the existence of non-material motives for
joining insurgencies resolves this problem. Victims of violence and their kin might simply seek revenge and
strive to �ght injustice. Translated into the terminology of the collective action problem, inaccurate force
provides revenge as a selective incentive to the victims and their kin. Such motives are elements of most
national liberation narratives from the German Landwehr to the Boston Tea Party. On an individual level,
reaction to violence might be an even stronger motivator than the prospects of economic reward. Reactive
mobilization as a systematic e�ect arising from the use of indiscriminate force is taken into account in this
theoretical extension. A remaining question is why counterinsurgency fails as a military doctrine in many
instances. The presented theory does assume that mostly static, geographic conditions account for the e�ect
of incumbent violence in insurgencies. This explains why insurgencies can still succeed even as the problem
of reactive mobilization is widely discussed. Having contrasted the Loss of Accuracy with both Kalyvas and
Boulding, it is time to simulated the proposed mechanism to further illustrate the theory and derive empirical
implications.
3.2 Agent-Based Model
The full set of proposed mechanisms is demonstrated in an Agent-Based Model. There are two speci�c reasons
for preferring this methodology over others in the context of this research. The proposed mechanism describes
a complex interaction between three political actors under varying geographic conditions. While Game Theory
would certainly be an elegant mathematical approach to coping with endogenous strategic interactions, it is
usually restricted to two player games. More critically, Game Theory is usually used in non-spatial model
that assume players decide on their moves under perfect information. Restricted information and spatial
interaction are core elements of the proposed theory. In accounting for those, a simulation approach is
preferred over analytic solutions.
3.2.1 Description
The stylized Agent-Based Model di�ers from attempts to model insurgencies more generally such as �Rescape�
(Bhavnani et al., 2008) and �RebeLand�(Cio�-Revilla and Rouleau, 2010). The mentioned works aim at
13
explaining how complex interactions of factors can bring about civil war. The model presented below is used
to explain the outcome of insurgencies based on the introduced theory. In its focus on reactive mobilization
the model is similar to Bennett (2008) simulation approach which focused on the early stages of insurgencies.
Bennett's model (like the analytic model by Ellsberg (1970)) introduced three types of agents and set out to
simulate casualties and mobilization as a function of violence. As such it has been an important guideline
for the model described below, while lacking the geographic focus so central to the presented theory.
The model distinguishes among three types of agents: insurgent forces, incumbent forces, and civilians.
Each civilian holds a value for government Legitimacy ∈ R{0, ..., 1}. This value determines if the civilians
remain neutral (0.2 - 0.8) or join one of the military actors. In every time step each military actor gets a
chance to use lethal force against another agent. Basis for this decision is the fraction of enemy agents in
the agent's neighborhood. Moreover, a local accuracy threshold for the military actors de�nes which fraction
of adversaries in their neighborhood is su�cient for triggering an attack. The decline in accuracy for the
military actors is expressed in two intersecting gradients. The slope of these gradients is de�ned by only
one global constant: the Territorial Balance. As theorized above, numerous factors might in�uence the
actors' ability to apply force accurately. However, the simulation model uses a stylized aggregate of these
cognitive, technological, and cultural factors. If this balance favors the insurgents, their accuracy declines
less as a function of distance from the upper right corner of the simulated space. Should the balance favor the
incumbent, however, the insurgents' accuracy declines rapidly. To control this essential TerritorialBalance
for the adversaries, the location of the accuracy parity at the 0.5 level can be de�ned as a fraction of the
maximal distance from the upper right corner of the simulated space.
In every time-step the military actors get a chance to attack. Should they choose to do so, an agent from
their neighborhood is removed at random. If this agent happens to be a civilian all other civilians in its
neighborhood are alienated from the user of force. Their value for government legitimacy is changed toward
to military opponent of the user of force. Civilians can also be assimilated by either side: If no violent
action takes place over a certain amount of time, civilians change their government legitimacy values toward
the average value in their neighborhood. This mechanism is intended to represent assimilation of civilians
towards the providers of security. All agents conduct a random walk across the simulated space to allow
for repeated interactions between the actors. The logic of the main loop of the simulation is expressed in
Algorithm 1.
14
Algorithm 1 Pseudo-code for the main loop of the Agent-Based Model
For all military agents{if (accuracy of location < fraction of civilians in the neighborhood) �> attackif (civilian hit) �> civilians in neighborhood alienated}
For all civilian agents{if (no attacked witnessed in the last k rounds)�> change legitimacy level toward average of the neighborhood}
For all agents{move one step in a random direction}
If (only military agents of one side left) �> stop
Figure 3: Screen-shot from the Agent-Based Model under varying Territorial Balance. Light gray areasindicate superior accuracy for the incumbent. The frontier of the colored areas indicates where the accuracyof the actors drops below 0.5, that is a 50% probability of hitting a civilian. The plot on the left hand sidealso shows a close-up of the simulation. Civilians are represented as smileys according to their happinesswith the government. Military agents are visible and represented as stick �gures.
3.2.2 Simulated Results
Figure 4 shows the results of the model runs for the risk parameters of insurgent and incumbent. For
each parameter combination, 20 runs were executed with varying random seeds. As the plot indicates, the
territorial balance crucially determines the outcome of the simulated insurgency. Several parameters that
15
potentially also in�uence the outcome were varied in the simulation runs. Most importantly, the initial
loyalties and the civilian solidarity constant were varied across runs. As Figure 4 (left) indicates, balance
settings below 0.4 and above 0.6 reliably bring about defeat or victory. Only settings in between see a
larger variance on the outcome. The Territorial Balance remains a decisive factor in Figure 4 (right) were
biased initial loyalties factor in as well although the outcomes become less certain. Generally, the introduced
territorial balance out-weights initial loyalties as determinants of outcome. The model therefore serves as an
initial demonstration of the proposed mechanism. One additional observation is that the winning side always
su�ers more losses than the losing one, directly contradicting the Vietnam-era focus on a high �body count�.
0.2 0.4 0.6 0.8
0.0
0.2
0.4
0.6
0.8
1.0
Insurgent Victory as a Function of Territorial Balance
Territorial Balance
Pro
babi
lity
of In
surg
ent V
icto
ry
Average Outcome: Loyalties = 0.5 Solidarity = 0.5
95% Conf. Int.
0.2 0.4 0.6 0.8
0.0
0.2
0.4
0.6
0.8
1.0
Insurgent Victory as a Function of Territorial Balance
Territorial Balance
Pro
babi
lity
of In
surg
ent V
icto
ry
Average Outcome: Loyalties = [0.4,0.6] Solidarity = [0.3,0.7]
95% Conf. Int.
Figure 4: The probability of insurgent victory as a function of the territorial balance in the agent-basedmodel. Each parameter combination was run 20 times with varying random seeds. The �balance� variablere�ects how much of the territory under more accurate control for either actor. A balance setting of 0.5indicates a neutral setting. Lower settings favor the insurgents, higher ones the incumbent. The lines aroundthe outcome probability indicate 95% con�dence intervals for all runs. Please note that all �controls� werehold constant for the simulation runs on the left hand side, leaving only varying random seeds to generatevariable outcomes. On the right, biased initial loyalties and various levels of civilian solidarity were tested,substantially widening the error bars. Such factors obviously in�uence the proposed mechanism and waterdown the e�ect of �territorial balance�.
16
3.3 Hypotheses
The main hypothesis of this study directly results from the theory and its simulation: Whoever exercised the
more accurate control over the bulk of the population is more likely to win the civil war. Since the center of
state power is usually associated with the capital city while the most remote areas of the country are easiest
to be controlled by the rebels, a more formal expression of this idea would be:
� H1: Average distance to the population from the capital has a positive e�ect on insurgent victory
Other conditions that a�ect accuracy have also been touched upon in the literature. Most importantly, John
Vann's concerns over the low degree of urbanization in South Vietnam has been picked up in modern con�ict
research (Kalyvas, 2006, 12). Data on the local population, a highly concentrated police force, and advanced
means of communication make urban environments easier to control for the state than vast countrysides.
More importantly, the state's ability to gather information and rely on informants allows them to single
out insurgent cells and detain enemy combatants without having to rely on large-scale destruction. At the
same time, insurgents are deprived of the ability to selectively attack incumbent forces. Usually, the cities as
centers of state control are also the sites of terrorist tactics employed by insurgent forces. And while delayed
explosions of concealed bombs allow terrorists to wage war without facing the superior forces of the state
directly in the cities, they also cause large scale su�ering of the civilian population and shift loyalties toward
the state. One would therefore expect the degree of urbanization to in�uence the outcome of insurgencies:
� H2a: The percentage of urban population negatively a�ects the probability of insurgent victory.
Along these lines, natural land cover can allow insurgents to retreat after hit-and-run attacks, or to build up
forces outside the state's reach. Open terrain should have the opposite e�ect: The state can exercise control
with fewer forces and more e�cient reconnaissance. The absence of accurate information might again create
incentives for the application of inaccurate violence and the resulting backlash in public opinion. Therefore,
natural land-cover should have an e�ect on the outcome of insurgencies:
� H2b: The percentage of population living in open terrain should have a negative e�ect on insurgent
victory
� H2c: The percentage of population living in forested terrain should have a positive e�ect on insurgent
victory
The section below presented empirical tests of these hypotheses followed by a general discussion.
17
4 Empirical Analysis
With Territorial Balance being the key explanatory concept, suitable geographic measurements have to be
de�ned to operationalize it. Drawing on a dataset by Lyall and Wilson III (2009) that provides information
on outcome of insurgent civil wars and many important control variables, this measurement can then be
tested in large-N. This section will give a detailed description of the corresponding indicators followed by a
discussion of the empirical results.
4.1 Territorial Balance Indicator
Both theory and model in the previous sections have put a large emphasis on distance as a crucial determinant
of accuracy in the application of violence. Several equi-�nal mechanisms seem to fuel the decay of accuracy
as a function of distance such as the range-accuracy trade-o� in modern weapons technology and the decline
of situation awareness in foreign human and physical geographies. Incumbent distance to the bulk of the
population therefore has to be operationalized to test for these proposed e�ects.
Data on the spatial distribution of population (CIESIN, 2005) and the location of capital cities and
international boundaries (Weidmann et al., 2010) were combined to form a single indicator for �Territorial
Balance�. Intuitively speaking, the Territorial Balance Indicator (TBI ) measures what fraction of the total
population the government has in reach for what fraction of the sum of all distances to its inhabited territory.
For any given distance D from the capital, the fraction of the population pd<Dwithin that distance can be
calculated and plotted, according to tbD =∑
pd<D∑p ,where p is the total population of a country and d the
distance from the capital to the most remote inhabited spot, normalized to [0, 1]. With growing distance tbD
values form a curve eventually connecting the points (0,0) and (1,1). Figure 4.1 shows the resulting curve for
the United States, based on the 1990 spatial distribution of population.
How can we interpret the exact shape of the curve in Figure 4.1? The curve simply expresses the fraction
of the population that can be reached for any given distance from Washington. At �rst the TBI curve climbs
rapidly as the densely populated east cost is covered. Around a cumulative population value of 0.5, the
curve �attens out as the distance covers the less densely populated Midwest. It �attens out even further as
the Rocky Mountains are reached and then suddenly spikes around a distance value of 0.7 as the densely
populated west coast is integrated. Finally, the sparsely populated wilderness of Alaska appears as an almost
�at section on the right end of the curve.
For the statistical analysis, the described indicator has to be collapsed into a single number. Analogous
18
Figure 5: TBI curve for the United States. The curve on the right describes what fraction of the Americanpopulation is within distance from Washington. The TBI expresses the area under the curve divided bythe whole area of the plot. With the United States' population balance in the east, the resulting TBI iscomparatively high. On the left, a visualization of the GPW dataset (CIESIN, 2005) for the region is visible.
to a ROC curve that plots the rate of true positive predictions against the rate of false positives in statistical
prediction, the area under the curve (AUC) is chosen as a suitable indicator. Formally: TBI = aa+b , where
a is the area under the curve and b the area above it. A caveat of this approach is that global data on
population numbers only dates back to the year 1990. Therefore, the post-WWII period that is frequently
in the focus of civil war research cannot be analyzed entirely based on this indicator.
4.2 Population and Accessibility
Apart from distance, other aspects of geography could in�uence accuracy in the application of force. To
capture these e�ects, a global land-cover dataset (Hansen et al., 2000) was acquired. This data distinguishes
among 12 types of terrain ranging from rain forest to open cropland and urban areas. To keep things simple,
land-cover values were divided into two categories: open and covered terrain. The high resolution land-cover
data was re-sampled to correspond to the roughly 5km² cell size of the population dataset. Each cell-centroid
in the population dataset was then associated with the nearest cell-centroid in the land-cover dataset. This
approach allows for calculating the fraction of the population that is either living in forested (covered) or
open terrain. Data on total population and the percentage of urban population was obtained from the the
United Nations Population Division. The great advantage of this data source over the described static GIS
19
approach is that it provides time-variant data. In the empirical analysis, these variables will be referred to
as �%URBAN�, �%OPEN�, and �%FOREST�.
4.3 Empirical Results
Possible outcomes in insurgent civil wars are incumbent victory (1), settlement of the hostilities without
decisive victory of either side (2), or insurgent victory (3). These outcomes can be ordered from victory
to conclusive defeat for either side. To preserve the ordinal information in the dependent variable, several
Ordered Logistic Regression Models were used to assess the relevance of the proposed variables. The time
period under investigation ranges from 1970 to 2010. The reason for limiting the time period to this range lies
in the static nature of the main independent variables. A population-related variables in their current form
rely on geo-referenced data that can only be obtained from 1990 onwards. Since these aggregate measure on
the country level are not subject to drastic annual changes, 1990± 20 years was chosen as a suitable time
frame for the analysis. Limiting the time-frame to 1990-2010 was also considered, but the resulting number
of observations would have been too small for substantial inferences.
Mounting methodological objections against a sole reliance on p-values (Ward et al., 2010, Schrodt, 2010)
require additional assessments of the predictive capabilities of newly introduced variables. Therefore, both in-
sample and out-of-sample predictive performance are included in the empirical analysis. Moreover, stepwise
variable selection allows for an identi�cation of the central variables.
To create a baseline for the statistical analysis, Model 1 replicates the base model from the Lyall and
Wilson III (2009) study. The main independent variable of their study, modern combat tactics, had to
be omitted in this analysis. Modern combat tactics according to Biddle's (2006) de�nition were generally
introduced after WWI. Therefore, there is simply no variance on this indicator for post-1970 insurgencies.
However, Lyall and Wilson III (2009) still provide a formidable point of departure both theoretically and
empirically. Model 2 includes all introduced variables, including the population-accessibility measurements.
In order to test the main hypothesis of my contribution, the distance facilitated loss of accuracy, the �Terri-
torial Balance Indicator� (TBI ) was added to the base model in Model 3. Model 4 includes the base model
plus two additional variables from the Lyall and Wilson III (2009) study, the degree of mechanization in the
state's armed forces (MECH.) and the availability of helicopters for the state (HELI.). The TBI value is still
highly signi�cant, even as one controls for these additional two variables. Model 5 contains a subset of the
variables in the full model which were chosen based on a step-wise model selection algorithm that minimizes
the AIC value.
20
Model 1 Model 2 Model 3 Model 4 Model 5(Intercept 1—2) −0.6256 −0.7973 −1.8755 −0.9357 −0.8381
(0.7063) (0.0453)∗∗∗(0.4922)∗∗∗(0.9113) (0.5842)(Intercept 2—3) 2.1735 2.25 0.9558 2.0938 2.1728
(0.7623)∗∗ (0.5168)∗∗∗(0.6181)∗ (1.0447)∗ (0.7841)∗
REGIME 0.1116 0.1263 0.1085 0.1307 0.1449(0.0464)∗ (0.0524)∗ (0.0462)∗ (0.0507)∗ (0.0477)∗
SUPPORT 0.7813 0.9748 0.8073 0.9785 1.0532(0.3422)∗ (0.3708)∗ (0.3427)∗ (0.3746)∗ (0.355)∗
POWER −0.2501 −0.−0.5731−0.276 −0.6208 −0.6233(0.1951) (0.2526) (0.1972) (0.2606)∗ (0.2569)∗
ENERGY −0.2807 −0.3408 −0.2645 −0.2718 −0.265(0.1818) (0.2183) (0.1825) (0.1913) (0.1849)
OCCUPATION 1.291 1.109 1.308 1.0513(1.0982) (1.3186) (1.1049) (0.0001)
ELEVATION −0.0001 −0.0001 −0.0001 0.0001(0.0037) (0.0004) (0.0004) (0.0001)
DISTANCE 0.0004 0.0005 0.0004 0.0005 0.0005(0.0004) (0.0004) (0.0004) (0.0003) (0.0004)
COLDWAR −1.387 −1.318 −1.384 −1.3645 −1.304(0.555)∗ (0.5827)∗ (0.5547)∗ (0.5866)∗ (0.556)∗
MECH. −0.0246 −0.0525(0.2929) (0.291)
HELI. 1.7541 1.8482 1.897(0.8848). (0.8536). (0.8173)∗
%URBAN 0.0083(0.0145)
log(POPULATION) 0.0074(0.302)
TBI −2.5398 −2.017 −2.2861 −2.3788(0.1788)∗∗∗(0.3298)∗∗∗(0.5562)∗∗∗(0.393)∗∗∗
%FOREST 0.0051(0.0137)
%OPEN −0.0077(0.0163)
N 64 64 64 64 64AIC 129.52 136.93 130.94 129.64 124.44logL −54.76 −51.47 −54.47 −51.82 −52.22Ord. deviation in-sample 33 25 26 27 29Ord. deviation out-of-sample 38 45 38 40 38Standard errors in parentheses
. indicates significance at p < 0.1; * at p < 0.05; ** at p < 0.01; *** at p < 0.001
Figure 6: Results from the empirical analysis. Five Ordered Logistic Regression Models were estimated toget a clear picture of the relevance of the proposed measurements.
21
4.3.1 Control Variables
As it is visible at �rst glance, several control variables are signi�cantly associated with the ordinal coding of
rebel success. Regime type (REGIME ) and outside support to the rebels (SUPPORT ) are signi�cantly and
positively associated with rebel victory in Models 1 to 3. The positive sign for the REGIME variable seems
to con�rm the wide spread suspicion that democratic societies are more likely to pull out of insurgent civil
wars due to their lower tolerance to casualties. Domestic peace movements, for example, can make themselves
heard and put signi�cant pressure on the incumbent in liberal and democratic societies. Autocratic systems
on the other hand can keep �ghting counterinsurgencies without having to take into account the interests of
a broad electorate.
It should be mentioned, however, that this �nding seems to be driven by the restricted sample size and
does not hold for the entire post-WWII period, as reported by Lyall and Wilson III (2009). Outside support
to the rebels is also reliably and positively associated with rebel victory. Many successful insurgencies greatly
bene�ted from outside support. Foreign �ghters willing to joining the uprising, material support, and assisting
intelligence services can obviously strengthen the rebel forces. The state capability indicator (POWER) does
not reach statistical signi�cance. It is nevertheless negatively associated with insurgent victory which again
meets the intuition that stronger states are better at �ghting o� domestic challengers. The logged energy
use of the incumbent as a share of the energy use of the whole population (ENERGY ) also has a negative
estimate, but fails to reach statistical signi�cance. An indicator for whether the country under investigation
is occupied (OCCUPATION) does not turn out signi�cant, but has a large positive coe�cient. Distance from
the counter-insurgent's home country to the host country of the investigation (DISTANCE ), and the mean
elevation of the country (ELEVATION ) do not appear to in�uence outcome. Clearly, distance should play a
role according to Boulding's classic LSG model.
A more reliable indicator is the cold war dummy (COLDWAR). In all four models, COLDWAR is nega-
tively associated with insurgent victory implying that the probability of insurgent success was signi�cantly
lower during the cold war. African countries that received military aid from the superpowers might drive
this result: the Democratic Republic of the Congo, Liberia, Rwanda all witnessed successful uprisings since
the end of the cold war. Mao's assumption that a large population basis is aids insurgent victory and John
Vann's concern that insurgent mobilization can take place easiest in rural societies were tested empirically.
Both the percentage of the population living in cities as well as the natural log of the total population seem
to have no signi�cant e�ect.
22
4.3.2 Main Independent Variable
The territorial balance indicator (TBI ) seems to be strongly and robustly associated with rebel victory. As
explained above, TBI values close to 1 indicate that the state has all the population in the vicinity of the
capital city. Values close to 0 correspond to situations were the state has to reach out into its most remote
corners to reach the bulk of the population. Therefore, the large, negative estimate and the small standard
error strongly support the theoretical expectation of this research. This result holds also in Model 2-5 as
control variables for land-cover, urbanization, and changing weapon's technology are introduced. The TBI
remains highly signi�cant with the expected negative sign throughout the analysis. The relevance of the
variable is also re�ected in increase in in-sample predictive capabilities. The Ordered Logistic Regression
can be used to predict probabilities for each possible outcome of a given war. Taking the maximum of these
probabilities as the model's prediction allows for a comparison of this prediction with the empirical record.
The absolute value of the mismatch between prediction and empirical record provides a simple distance
metric that still pays tribute to the ordinal scale of the dependent variable. This in-sample deviation can be
formalized as dev =∑ |Y − Y |. The TBI indicator lowers the predictive ordinal deviation from 33 in the
baseline Model 1 to 26 in Model 3. Apart from improving the model �t, the TBI also mildly re�ects the
stylized expectation from the Agent-Based model. Figure 7 on the following page shows predicted probabilities
for Model 3, assuming average values for the remaining explanatory variables. TBI values ranging from 0 to
1 have a profound e�ect on the predicted probabilities for insurgent victory.
A more thorough analysis of the predictive capabilities of the model drawing on the �Receiver Operating
Characteristic� and out-of-sample cross validation techniques will be presented below in the next section.
After that summary and discussion will conclude this study.
4.3.3 Predictive Performance
Inferential statistics provides great tools for testing whether or not a proposed variable is actually associated
with phenomenon of interest. Once a robust relationship is established, however, additional metrics are
necessary in order to �nd out how reliable the proposed e�ect really is. The question if and to what extend
we can rely on the TBI in order to predict the outcomes of insurgent civil wars cannot be answered based
t-statistics alone. One frequently used assessment of predictive performance is the ROC curve: a plot of the
rate of false positive predictions versus the true positives for varying probabilistic thresholds. The ROC is
an easily interpretable statistic allowing for an intuitive visual representation of model performances. The
23
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Predicted Probabilities for the Average Country (Model 3)
TBI Values
Pro
babi
lity
of In
surg
ent V
icto
ry
Figure 7: Predicted Probabilities for the average country from Model 3. Not that these probabilities resemblethe simulated results, although the e�ect of Territorial Balance is not as strong as the simulation suggests.The empirical model predicts a more shallow decline of the probabilities of insurgent victory. Moreover, atheoretical TBI of 0 would only raise probabilities of insurgent success to about 0.5 and not to 1 as thesimulation suggests.
baseline of all predictions is a straight 45° line from the lower left to the upper right of the plot. Such a line
could be produced on the basis of pure guesswork: For every positive case predicted correctly one negative
case would be predicted as positive. Since the ROC curve requires binary prediction techniques each possible
outcome has to be analyzed separately. Three ROC curves are represented in Figure 8 on page 26 each
corresponding to a possible outcome.
Apparently, �Rebel Victory� seems to be the easiest outcome to predict for all four models. Model 1
already predicts around 80% of these outcomes for a little more than 10% of false positive predictions. The
area under curve as a fraction of the total area (AUC) is possible numerical representation of the model
performance. The baseline model comes close to an AUC of 0.88. Model 3 includes the TBI indicator and
slightly increases the predictive performance in-sample. An AUC value of 0.89 and an increased performance
at the top of the curve can be attribute directly to this variable. Model 3 has the largest AUC value of all
�ve models. Model 4 preforms worst in in-sample prediction of insurgent victory.
The second outcome scenario, a peaceful settlement of insurgent con�icts without decisive defeat of either
24
side seems to be hardest to predict based on the introduced variables. The corresponding ROC curves drag
more or less along the random baseline. Model 2 including all variables performs best, although it only
achieves an AUC of 0.62. This overall humbling predictive performance might be due to the lack of relevant
indicators, such as UN intervention attempts or the existence of a legal political wings of the rebel movement
as discussed by Cunningham et al. (2009). The entire presented theory has been agnostic to the possibilities
and the determinants of outside interventions or negotiated settlements. The poor predictive performance of
inconclusive outcomes very much re�ects this fact.
A slightly better predictive performance of all models can be observed for the prediction of government
victory. The variables MECH. and HELI. in models 4 and 5 re�ect military capabilities of the incumbent
and naturally help to predict incumbent victory. It is again Model 3 that predicts best with an AUC of 0.77.
This overall performance of the models is better than for the case of a political settlement of the con�ict.
25
ROC Curve / Predicting Rebel Victory
False positive rate
True
pos
itive
rat
e
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Model 1: 0.87985
Model 2: 0.87985
Model 3: 0.89111
Model 4: 0.85982
Model 5: 0.86358
ROC Curve / Predicting Settlement
False positive rate
True
pos
itive
rat
e0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Model 1: 0.58725
Model 2: 0.62157
Model 3: 0.57157
Model 4: 0.59216
Model 5: 0.59216
ROC Curve / Predicting Government Victory
False positive rate
True
pos
itive
rat
e
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
Model 1: 0.73792
Model 2: 0.82971
Model 3: 0.74517
Model 4: 0.80556
Model 5: 0.81884
Figure 8: ROC curves for the three possible outcomes.
The discussion of predictive performance has so far been restricted to within the sample. In order to
get a more realistic insight into the models' performance in out-of-sample predictions, leave-one-out cross
26
validation was also performed. The cross validation score was again calculated based on ordinal deviation as
introduced above. Cross validation simply chops the empirical sample into a training and a test set. Leave-
one-out cross validation �ts the statistical model on n-1 observation and then predicts observation n. This
process is repeated n times so that all observations are predicted once out-of-sample. The ordinal deviations
out-of-sample are consistently larger than those in-sample. While the TBI indicator does not worsen the
out-of-sample prediction, it also does not improve it in comparison to the base model. Interestingly, the full
model performs worst in the cross validation scheme. This might be due to the fact that additional variables
might very well decrease predictive success. Every estimated coe�cient is random variable in the frequentist
view. Every one of these estimates contributes random variability to the model predictions. Therefore,
a suitable metric is necessary that expresses the bene�t of adding suitable indicators against the costs of
adding random variability. Such a metric is the �Akaike Information Criterion� or AIC for short (Akaike,
1974). Intuitively speaking, the AIC rewards goodness of �t while punishing the inclusion of additional
variables. Lower AIC values indicate that a high goodness-of-�t was achieved based on only a small number
of explanatory variables. More formally, the AIC = 2k − 2ln(L), where k is the number of covariates and L
the maximum Likelihood. The regression table 6 on page 21 indicates the achieved out-of-sample scores and
the AIC values for each of these models. While out-of-sample tests do not belong to the standard procedures
in political science, they can help to put established �ndings into perspective. The TBI indicator that so
prominently dominates the regression table with its low p-value does not improve out-of-sample prediction
compared to the base model 1. In-sample prediction nevertheless bene�ts from the inclusion of this variable.
Model 5 that only includes signi�cant variables shows the lowest AIC value for all models. The additional
complexity and random variability that results from including multiple covariates is well balanced in this
model with increased goodness-of-�t. Model 3 including the TBI and the variables from the base model
preforms best in the prediction task with an ordinal deviation of 27 for the in-sample prediction and an
ordinal deviation of 38 for the out-of-sample case. In conclusion it is fair to say that the TBI is a good albeit
not perfect predictor for insurgent victory. The next section will sum up both the theoretical as well as the
empirical results and outline possible follow-up projects.
5 Discussion and Conclusion
The empirical analysis has con�rmed the theoretical expectation (Hypothesis 1) that capital-population dis-
tance is associated with outcome of insurgent civil wars. The Agent-Based Model has simulated a possible
27
micro-level explanation for this fact. Its central mechanism �reactive mobilization as a function of geograph-
ically determined accuracy� is derived from a broad theoretical literature and goes beyond a simple decline
of �repower as a function of distance.
The proposed e�ect of the Territorial Balance Indicator has been found in the empirical analysis and
shown to be a robust factor from both an inferential and predictive point of view. However, the Lyall and
Wilson III (2009) study has helped to put the proposed mechanism into perspective. Using their dataset
also allowed for the question of insurgent civil war outcomes to be treated in a cumulative manner, directly
building on established knowledge. In conclusion, instead of a strict geographic determinism that follows from
the theoretical discussion, a robust geographic tendency was found in the empirical record. Institutional and
military factors are also robustly related to outcomes in insurgent civil wars, especially to incumbent victory.
With regard to other possible geographic factors in�uencing accuracy, no conclusive support was found. The
percentage of the urban population is not signi�cantly related to outcome, thereby rejecting Hypothesis 2a.
Similarly, for Hypotheses 2b and 2c only small coe�cients were estimated, although they display the expected
signs. Possibly, over-aggregation to the country level and the lack of time variant information might be the
cause of this problem, but only future research can con�rm or reject this assumption.
Although the presented theory has passed these empirical tests, a number of problems need to be addressed
in the future. Most importantly, the gap between the individualist theoretical explanation and the country
level analysis needs to be �lled with more �ne grained data. Investigations on the level of ethnic groups
could provide this missing link. Con�ict events from recent insurgent civil wars could also be aggregated
to test for the proposed reactive patterns. Future Agent-Based Simulations could be adapted to work with
geo-referenced data and be compared more directly to the empirical record.
From a theoretical point of view, the study has generated a new synthesis between (counter-)insurgency
and con�ict literature and between geography and population as decisive factors in insurgent civil wars.
Boulding's (1962) classic distance-decay model has dominated the literature for decades. However, upon
close theoretical inspection, its proposed central mechanism fails to deliver a suitable explanation of how
insurgent civil wars end. A general decrease of the quantity of violence as a function of distance fails
to convince in the age of global strike capabilities and mechanized forces. Moreover, a sole focus on the
quantity of applicable violence falls short of explaining the problem of reactive mobilization in response to
indiscriminate violence. This study has proposed a new explanation involving a distance-decay of the quality
of violence. Highly selective violence deteriorates into largely indiscriminate violence as the distance to the
military actor's centers of control grows. A simulation model has helped to communicate this theory and
28
generated stylized insights. The Territorial Balance in the Agent-Based Simulation was operationalized as an
empirical indicator and successfully tested in large-N. If future research establishes the validity of the theory,
strong policy recommendations could be derived: a weak local incumbent, for example, might be far better at
beating an insurgency than a distant central government. Contrary to Boulding's implications, strengthening
this central government would not imply greater success on the battle�eld, but primarily more �erce resistance
and pointless bloodshed. Contrary to Kalyvas' (2006,12;118-132) expectations, civilian collaboration could be
strongly, but not entirely endogenous to military control. Instead, uncontrollable zones for either incumbent
or insurgent could emerge wherever geographic conditions prevent the actors from using force accurately,
leaving only homemade explosives or indiscriminate shelling as instruments of terror and fuel for resistance.
The expected deterrent e�ect of inaccurate violence that is central to classic air power theory might be
outweighed in many instances by reactive mobilization. If future research shows this to be the case, death
and destruction in�icted on an insurgent force should not be generally seen as military progress in �ghting
a rebel force, but simply as a actions that will trigger a violent reactions. According to this new perception,
a challenged state would have to face an insurgent force with the most even handed approach, defending
not primarily its territory or resources, but its reputation. This conclusion cannot be reached based on the
presented evidence alone, although it follows from the presented theoretical discussion. Future research will
have to complete the picture that has been outlined so far.
References
H. Akaike. A new look at the statistical model identi�cation. IEEE Transactions on Automatic Control, 19
(6):716�723, 1974.
D. Bennett, Scott. Governments, civilians, and the evolution of insurgency: Modeling the early dynamics of
insurgencies. Journal of Arti�cial Societies and Social Simulation, 11(47), 2008.
R. Bhavnani, D. Miodownik, and J. Nart. Rescape: an agent-based framework for modeling resources,
ethnicity, and con�ict. Journal of Arti�cial Societies and Social Simulation, 11(2-7), 2008.
S. Biddle. Military Power. Princeton University Press, 2006.
R. D. Blackwill. A de facto partition for afghanistan. Available online at
http://dyn.politico.com/printstory.cfm?uuid=AACEE164-18FE-70B2-A8E30566E50DFB3A, 2010.
29
K. Boulding. Con�ict and Defense: A General Theory. Harper, 1962.
H. Buhaug and S. Gates. The geography of civil war. Journal of Peace Research, 39:417�433, 2002.
H. Buhaug, L.-E. Cederman, and J.-K. Rød. Disaggregating ethno-nationalist civil wars: A dyadic test of
exclusion theory. International Organization, 62(3):531�551, 2008.
H. Buhaug. Dude, where's my con�ict? lsg, relative strength, and the location of civil war. Con�ict
Management and Peace Science, 27(2), 2010.
L.-E. Cederman. Articulating the geo-cultural logic of nationalist insurgency. In S. Kalyvas, I. Shapiro, and
T. Masoud, editors, Order, Con�ict and Violence. Cambridge University Press, Cambridge, 2008.
CIESIN. Gridded population of the world, version 3 (gpwv3). Columbia University; and Centro Internacional
de Agricultura Tropical (CIAT). 2005. Gridded Population of the World Version 3 (GPWv3): Population
Grids. Palisades, NY: Socioeconomic Data and Applications Center (SEDAC), Columbia University. Avail-
able at http://sedac.ciesin.columbia.edu/gpw, 2005.
C. Cio�-Revilla and M. Rouleau. Mason rebeland: An agent-based model of politics, environment, and
insurgency. International Studies Review, 12(1):31�52, 2010.
P. Collier and A. Hoe�er. Greed and grievance in civil war. Oxford Economic Papers, 56:563�595, 2004.
P. Collier. Rebellion as a quasi-criminal activity. Journal of Con�ict Resolution, 44(6):839�853, 2000.
B. Connable and M. Libicki. How Insurgencies End. National Defense Research Insitute, 2010.
D. E. Cunningham, K. S. Gleditsch, and I. Salehyan. It takes two: A dyadic analysis of civil war duration
and outcome. Journal of Con�ict Resolution, 53(4):570�597, 2009.
DOD. U.S. Army Counterinsurgency Handbook. Skyhorse Publishing, 2007.
G. Douhet. Command of The Air. Air Force History and Museums Program, 1921.
L. Durell. Bitter Lemmons. Marlowe, 1957.
D. Ellsberg. Revolutionary judo. Available online at http://www.rand.org/pubs/documents/2006/D19807.pdf,
1970.
30
B. Fall, Bernard. The theory and practice of insurgency and counterinsurgency. Naval War College Review,
1965.
J. D. Fearon and D. D. Laitin. Weak states, rough terrain, and large-scale ethnic violence. Paper presented
at the 1999 Annual Meetings of the American Political Science Association, 1999.
J. D. Fearon and D. D. Laitin. Ethnicity, insurgency and civil war. American Political Science Review, 97
(1):75�90, 2003.
E. Gartzke. The relevance of power in international relations. Available online at
http://dss.ucsd.edu/ egartzke/papers/relevance_082010.pdf, 2010.
C. Geertz. The integrative revolution: Primordial sentiments and civil politics in the new states. In C. Geertz,
editor, Old Societies and New States. 1963.
B. Greiner. War without Fronts � The USA in Vietnam. The Bodely Head, 2009.
H. Grossman. Kleptocracy and revolutions. Oxford Economic Papers, 51(2):267�83, 1999.
E. C. Guevara. Guerrilla Warfare: Authorized Edition. Ocean Press, 1961.
M. Hansen, R. DeFries, J. Townshend, and R. Sohlberg. Global land cover classi�cation at 1km resolution
using a decision tree classi�er. International Journal of Remote Sensing, 21:1331�1365, 2000.
L. Harbom and P. Wallensteen. Armed con�ict and its international dimensions, 1946-2004. Journal of Peace
Research, 42(5):623�635, 2005.
M. Hastings. The runaway general. Rolling Stone Magazine, (RS 1108/1109), 2010.
H. Hegre and N. Sambanis. Sensitivity analysis of empirical results on civil war onset. Journal of Con�ict
Resolution, 50(4):508�535, 2006.
J. Herbst. States and Power in Africa - Comparative Lessons in Authority and Control. Princeton University
Press, 1992.
J. Herbst. African militaries and rebellion: The political economy of thread and combat e�ectiveness. Journal
of Peace Research, 41(3):357�369, 2004.
S. Huntington. The clash of civilizations? Foreign A�airs, 72(3):22�49, 1993.
31
S. Kalyvas. Warfare in civil wars. In D. I. and J. Angstrom, editors, Rethinking the Nature of War, pages
88�108. Abingdton, 2005.
S. Kalyvas. The Logic of Violence in Civil Wars. Cambridge University Press, 2006.
S. Kalyvas and M. A. Kocher. The dynamics of violence in the vietnam war: An analysis of the hamlet
evaluation system (hes). Journal of Peace Research, 46(3):335�355, 2009.
S. Kalyvas and L. Balcells. International system and technology of rebellion: How the end of the cold war
shaped internal con�ict. American Political Science Review, 104(3):415�429, 2010.
D. Kilcullen. The Accidental Guerrilla: Fighting Small Wars in the Midst of a Big One. Oxford University
Press, 2009.
H. W. Kongsgard. Geography matters, but people are the key; the location of viet cong initiated incidents,
2010.
T. E. Lawrence. Revolt in the Desert. Combined Publishing, 1998.
D. Lemke. The tyranny of distance: Rede�ning relevant dyads. International Interactions, 21(1):23�38, 1995.
K. Lowe. Inferno: The Fiery Destruction of Hamburg. Scribner, 2007.
J. Lyall and I. Wilson III. Rage against the machines: Explaining outcomes in countrinsurgency wars.
International Organization, 63(1):67�106, 2009.
N. Macaulay. The cuban rebel army: A numerical survey. The Hispanic American Historical Review, 58(2),
1978.
M. MacPherson. Long Time Passing: Vietnam and the Haunted Generation. Signet, 1984.
T.-t. Mao. On Guerrilla Warfare. University Of Illinois, 1961.
T.-t. Mao. On Protracted War. Foreign Language Press, 1967.
R. McColl. The insurgent state: Territorial bases of revolution. Annals of the Association of American
Geographers, 59(4):613�631, 1969.
J. O'Loughlin and F. Witmer. The localized geographies of violence in the north caucasus of russia, 1999-2007.
Annals. Association of American Geographers, 100(3), 2010.
32
M. Olsen. The Logic of Collective Action: Public Goods and the Theory of Groups. Harvard Economic
Studies, 1965.
R. A. Pape. Bombing to Win. Cornell University Press, 1996.
RAF. Guerilla, widerstand und antiimperialistische front. Available online at
http://www.ra�nfo.de/archiv/raf/raf-5-82.php, 1982.
C. Raleigh and H. Hegre. Population size, concentration, and civil war. a geographically disaggregated
analysis. Political Geography, 28(4):224�238, 2009.
P. A. Schrodt. Seven deadly sins of contemporary quantitative political analysis. Paper prepared for the
theme panel 'A Sea Change in Political Methodology?' at the Annual Meeting of the American Political
Science Association, Washington, 2 - 5 September 2010, 2010.
C. J. Scott. The Art of Not being Governed. Yale University Press, 2009.
N. Sheehan. A Bright Shining Lie: John Paul Vann and the USA in Vietnam. Jonathan Cape, 1988.
H. Starr. 'opportunity' and 'willingness' as ordering concepts in the study of war. International Interactions,
4(4):363�387, 1978.
C. von Clausewitz. On War. Trübner, 1873. URL http://www.clausewitz.com/readings/OnWar1873/TOC.htm.
M. D. Ward, B. D. Greenhill, and K. Bakke. The perils of policy by p-value: Predicting civil con�icts.
Journal of Peace Research, 47(4):1�13, 2010.
N. B. Weidmann. Geography as motivation and opportunity: Group concentration and ethnic con�ict.
Journal of Con�ict Resolution, 53(4):526�543, 2009.
N. B. Weidmann and I. Salehyan. Violence and ethnic segregation: A computational model applied to
baghdad. International Studies Quarterly, forthcoming, 2011.
N. B. Weidmann and M. Ward. Predicting con�ict in space and time. Journal of Con�ict Resolution,
Forthcoming, 2010.
N. B. Weidmann, D. Kuse, and K. S. Gleditsch. The geography of the international system:the cshapes
dataset. International Interactions, 36(1):86�106, 2010.
33