A NOVEL AUTOMATED OPTICAL TECHNIQUE TO ...personal.kent.edu/~shurley/Shawn Hurley HTC Thesis FINAL...biaxial mineral grains in thin sections has been developed. The system takes images

A NOVEL AUTOMATED OPTICAL TECHNIQUE TO DETERMINE

THE ORIENTATION OF BIAXIAL MINERALS IN A THIN SECTION

A Thesis

Presented to

The Honors Tutorial College

Ohio University

In Partial Fulfillment

of the Requirements for Graduation

from the Honors Tutorial College

with the degree of

Bachelor of Science in Engineering Physics

by

Shawn Patrick Hurley

June 2005

ii

This thesis has been approved by

The Honors Tutorial College and the Department of Physics and Astronomy

______________________________

Dr. Larry Wilen

Professor, Physics and Astronomy

______________________________

Dr. David Ingram

Professor, Physics and Astronomy

______________________________

Dr. Ann Fidler

Dean, Honors Tutorial College

iii

ACKNOWLEDGEMENTS

I would like to thank my advisor, Ohio University (OU) Physics and

Astronomy professor Dr. Larry Wilen, for presenting me with the initial idea for this

thesis. Without his guidance I wouldn’t have been able to develop it into what it is

today. Much of the conversation I have had with him has spurred my own creativity

and allowed me to think independently. Above all, Dr. Wilen enabled me to be

intimately and responsibly involved in research, which has been the most exciting and

irreplaceable portion of my education.

Funding for this project was provided by the National Science Foundation

Office of Polar Programs.

I would like to thank two of Dr. Wilen’s previous researchers, Nathan and Dirk

Hanson, both of whom created computer programs I examined to guide my own

programming. I also would like to thank Phil Skemer and Zhenting Jiang at Yale

University Department of Geology and Geophysics for providing the olivine sample

and scanning electron microscope data used in this thesis. Furthermore, I thank Roger

Smith and Randy Mulford at the machine shop at Clippinger Laboratory for all of their

assistance.

I thank Carlos Di Prinzio, a post doctoral researcher at Ohio University, for his

friendship and company in a windowless Clippinger lab. I have gotten to know Carlos

very well and I appreciate him sharing both his enthusiastic school spirit and his

coffee with me! I also thank him for allowing me to accompany him in traveling to

the 2003 American Geophysical Union Meeting in San Francisco, CA.

iv

I would like to thank all the faculty and administration at the Honors Tutorial

College (HTC). I also thank my director of studies in HTC, OU Physics and

Astronomy professor Dr. David Ingram, for his advice and support.

I would also like to thank all my friends for encouraging me to succeed. I

especially thank my dear friend Tracey Hanna for reviewing my work and showing

her love and understanding.

Finally, I thank my parents for the love and encouragement I received from

them since birth. They truly are my biggest fans.

v

Table of Contents

Title..................................................................................................................................i Approval ....................................................................................................................... ii Acknowledgements ..................................................................................................... iii Table of Contents ..........................................................................................................v List of Tables and Figures...........................................................................................vi Abstract...................................................................................................................... viii

1.0 INTRODUCTION...................................................................................................1

2.0 BACKGROUND .....................................................................................................5 2.1 CRYSTALLOGRAPHY BACKGROUND.......................................................................5 2.2 OPTICS BACKGROUND..........................................................................................12 2.3 OPTICAL MINERALOGY........................................................................................18

3.0 EXPERIMENTAL METHODS ..........................................................................29 3.1 EXPERIMENTAL SYSTEM ......................................................................................29 3.2 DATA COLLECTION AND PRE-ANALYSIS..............................................................36 3.3 THEORETICAL ANALYSIS METHOD ......................................................................39 3.4 ANALYSIS PROCEDURE ........................................................................................49

4.0 RESULTS AND DISCUSSION ...........................................................................62

5.0 CONCLUSIONS AND APPLICATIONS TO FUTURE RESEARCH...........83

6.0 REFERENCES......................................................................................................85

APPENDIX A..............................................................................................................88

APPENDIX B ..............................................................................................................89

APPENDIX C..............................................................................................................92

vi

List of Tables and Figures Table 1. The crystal systems and their properties...........................................................7 Table 2. The nine sample orientations. .........................................................................33 Table 3. Memory usage statistics for floating point vs. two integer bytes. ..................60 Table 4. The Results .....................................................................................................67 Table 5. Index of refraction data for olivine and OPX. ................................................76 Table 6. Final optical parameters: a, b, and c used to analyze olivine and OPX..........81 Figure 1. Defining parameters of the unit cell. ...............................................................7 Figure 2. Illustration of the grain growth process...........................................................8 Figure 3. A photo of the olivine sample that was studied.............................................10 Figure 4. Color image of the olivine sample.................................................................11 Figure 5. The electromagnetic wave as depicted by Abramowitz. ...............................13 Figure 6. Photographic demonstration of Snell’s Law. ................................................14 Figure 7. Drawing that depicts refraction at the air-glass boundary.............................15 Figure 8. Drawing showing the electric field directions in polarized and unpolarized

light. ......................................................................................................................16 Figure 9. Crossed polarizers. ........................................................................................17 Figure 10. The isotropic indicatrix. Image by Nelson. .................................................20 Figure 11. The uniaxial indicatrix. Image by Nelson. ..................................................22 Figure 12. The uniaxial negative and positive indicatrix. Image by Nelson. ...............23 Figure 13. Different cross sections in a uniaxial indicatrix. Image by Nelson.............24 Figure 14. The biaxial indicatrix. Image adapted from Nelson. ...................................25 Figure 15. Different cross sections in a biaxial indicatrix. Image by Nelson...............26 Figure 16. The Euler angle rotations.............................................................................28 Figure 17. Drawing of experimental setup. ..................................................................30 Figure 18. Photo of experimental setup. .......................................................................30 Figure 19. Color images of the nine sample orientations. ............................................33 Figure 20. Image used to calibrate the center of sample rotation. ................................35 Figure 21. Extinction curve...........................................................................................38 Figure 22. Extinction angles and contrast.....................................................................39 Figure 23. Coordinate axes convention used in the experimental setup.......................40 Figure 24. Diagram explaining rotation convention used.............................................42 Figure 25. Cross sectional ellipse showing the theoretical extinction angle. ...............47 Figure 26. Graph of ( )xxf 2sin)( 2= . ..........................................................................51 Figure 27. Depiction of 3-D Euler angle space.............................................................54 Figure 28. File structure for the Simple Grid Method. .................................................58 Figure 29. Image showing each analyzed grain’s location. Pixel resolution = 640 x

480.........................................................................................................................65 Figure 30. Euler angle plot of data from the optical method. .......................................70

vii

Figure 31. Euler angle plot of data from both methods side-by-side. SEM data has been adjusted to compensate for a sample mounting difference. .........................71

Figure 32. R2 vs. all Euler angles for one grain. ...........................................................73 Figure 33. R2 vs. the Euler angles theta and psi for one grain......................................74 Figure 34. R2 vs. the Euler angles theta and psi for one grain......................................74 Figure 35. Relationship between the crystal axes and the indicatrix axes in olivine....77 Figure 36. Optic sign vs. composition for olivine. .......................................................79

viii

ABSTRACT

A new computer automated optical method to determine the orientation of

biaxial mineral grains in thin sections has been developed. The system takes images

of a thin section sample between rotating crossed polarizers. The intensity of light

transmitted through each grain is used to fit a curve yielding the angle of extinction.

The procedure is repeated for nine sample positions. To specify the orientation of a

grain by the Euler angles, phi, theta, and psi, the theoretical extinction angles

corresponding to a grid of possible grain orientations is calculated and matched with

the experimental extinction angles for the nine sample positions. For comparison, a

sample of the orthorhombic mineral Olivine was analyzed with a conventional

scanning electron microscope (SEM) method. The new optical method produces

results in good agreement with the SEM method for a fraction of the cost in equipment

and software.

1

1.0 INTRODUCTION

The study of the mineral composition of rocks is a common aspect of geology,

yielding insight into the structural evolution of the earth. An important component of

this research is the study of the individual crystal orientations of mineral crystals.

Geologists and geophysicists who study the transmission of seismic (earthquake)

waves are interested in determining how the orientation of minerals in the earth’s

upper mantle affects seismic wave transmission direction and rate of travel. These

minerals are anisotropic, meaning they exhibit physical properties that depend on

direction. Because of this fact, the orientation of the mineral crystals in the upper

mantle will have an effect on the propagation of earthquake waves. The collective

“preferred” orientation of these mineral crystals, called “texture” or “fabric,” is an

average orientation of all the crystal grains over a large area. Because of the

dependence of seismic propagation upon texture (“seismic anisotropy”),

measurements of the texture of rock samples is important for interpreting the

information derived from seismic wave analysis.

In addition to seismological studies, the texture of minerals provides important

clues into the deformation and formation of rocks and rock formation. An American

Geophysical Union VHS tape that discusses properties of the earth’s mantle goes into

detail about the significance of studying the texture of minerals, as quoted here from

their web site:

These findings are of significant interest for seismologists, geodynamicists and structural geologists

2

who recognize the importance of anisotropy. The link between the crystal properties and structural features of a geological formation is evident: anisotropy in the mechanical properties of rocks influences their deformations, and the deformations themselves induce the texture that is the source of anisotropy in the properties (Wenk).

The mineral olivine is the most abundant mineral in the earth’s upper mantle.

Understanding what causes texture to develop in rocks in general and specifically in

olivine is an important area of research. Karato describes how the process of

dislocation deformation reorients olivine crystals, “...the shape of a crystal can change

only in a certain manner. Thus, by rotating its orientation, a crystal tries to match the

boundary condition imposed by its surrounding materials” (109). He also adds,

“Simply put, crystals tend to align to match the microscopic deformation by

dislocation motion with the imposed macroscopic deformation geometry” (109-110).

Therefore, anisotropy is sometimes influenced by the deformation processes in the

mantle. Olivine has the strongest elastic anisotropy, so its texture has the largest effect

on the anisotropic structure of the upper mantle (Karato 110). This makes the study of

the texture of olivine important to researchers investigating seismic waves as well.

A prime example of one scientist’s study of seismic anisotropy is Dr. Furlong

at Penn State University who researched the mantle under Tibet. Dr. Furlong’s

research, as documented on a Penn State web site, aimed to study “the role of plate

tectonics in the evolution of continents.” His research looked at the recordings of

seismic events in one region of Tibet and he found a systematic pattern of mantle

deformation. Dr. Furlong further stated, “Theoretically, the olivine fabric should

3

relate to the forces that existed during collision and the actual deformation of the

mantle rock.” He says that better models of continental collisions can be made once

more is known about the deformations in the mantle (Furlong).

Because of the importance of studying the deformation processes in the

mantle, it becomes necessary to design experimental techniques for determining the

texture of olivine and other biaxial minerals. One tool already available for studying

texture is the scanning electron microscope (SEM). This proven method finds the

orientations of the grains in a polycrystalline olivine sample by using electron

backscattering. However, there is a need for a less expensive, yet still reliable

technique to find the orientation of biaxial minerals. Although manual optical

techniques exist for this type of analysis, they are typically laborious and time

consuming to perform (Wilen). Therefore, this thesis introduces an automated optical

method to satisfy the need. My research verified the validity of the optical method by

obtaining the orientations of grains in a sample of olivine. The accuracy of the optical

method was verified by a direct comparison of the data with results obtained using a

SEM.

The optical technique has its advantages over the SEM method. The optical

method can be setup for 1/100th the cost of a SEM. Also, the new optical method uses

images of the grains taken with cross polarized visible light. These images show sharp

contrast along grain boundaries due to differing orientations between grains. This

enables the researcher that uses the optical method to calculate statistics concerning

grain size, deformation, and nearest neighbor correlations using existing software.

4

The SEM method is not as well suited to see individual grain boundaries, which limits

the automation of such additional analyses (Wilen).

Over the next several chapters, I will describe the details of the optical

technique I developed. To the best of my knowledge, this is a unique method because

it uses polarized light to the find the orientation of biaxial crystal grains in a mineral

thin section.

5

2.0 BACKGROUND

2.1 Crystallography Background

“A crystal is a solid body bounded by natural planar surfaces, generally called

crystal faces, that are the external expression of a regular internal arrangement of its

constituent atoms or ions” (Berry 10). Designing experiments involving biaxial

crystals requires an understanding of the basics of crystallography, optics, and optical

mineralogy. I will use Berry’s definition of the crystal as a starting point in this

chapter, which is dedicated to introducing those principles necessary to understand my

research.

Crystallography is the study of the characteristics and evolution of rocks as

well as the internal structure of all crystalline substances. Crystalline substances are

those materials that have a regular three dimensional arrangement of constituent

atoms, where each atom is bonded to its nearest-neighbor, but may or may not be

bounded by crystal faces. Therefore, minerals are crystalline solids by definition

(Berry 10-12, Callister 32).

Crystallography is a practically important area of study. This is because

crystal structure, along with chemical composition, determines all the physical

properties of crystalline materials. These, in turn, determine the uses of crystalline

materials (Berry 14).

6

Crystal Systems

All minerals are classified into crystal systems according to their crystal

structure. Crystal structure is “the manner in which atoms, ions, or molecules are

spatially arranged” (Callister 32). Each crystal system is based upon the symmetry of

the smallest repeatable arrangement of atoms in the crystal structure, called the unit

cell. The unit cell is the basic building block defining the crystal structure through its

geometry and the position of the atoms within it. Each crystal system’s unit cell is

modeled by a unique crystal lattice, a “three-dimensional array of points coinciding

with atom positions” (Callister 33).

The specific geometry of each crystal system is defined using an x, y, and z

coordinate system (not necessarily orthonormal) such that the origin is situated in the

corner of the unit cell and each axis coincides with one of the edges of the unit cell

extending from this corner. Each crystal system is defined by 6 lattice parameters,

which are a, b, c, α, β, and γ. The parameters a, b and c are the lengths of the edges

along the x, y, and z axes respectively, where α, β, and γ are the interaxial angles as

shown in Figure 1 below adapted from Callister (39).

7

Figure 1. Defining parameters of the unit cell.

Table 1 lists the seven possible crystal systems and some characteristics

associated with each, which will be explained later.

Table 1. The crystal systems and their properties. Crystal System Axial

Relationships Interaxial Angles Effect on

Light Optical

Classification Indicatrix

Shape Isometric (Cubic) a = b = c o90=== γβα Isotropic Isotropic Sphere

Hexagonal a = b ≠ c

o

o

120

,90

=

==

γ

βα

Anisotropic Uniaxial Spheroid

Tetragonal a = b ≠ c o90=== γβα Anisotropic Uniaxial Spheroid

Rhombohedral a = b = c o90≠== γβα Anisotropic Uniaxial Spheroid

Orthorhomic a ≠ b ≠ c o90=== γβα Anisotropic Biaxial Ellipsoid

Monoclinic a ≠ b ≠ c βγα ≠== o90 Anisotropic Biaxial Ellipsoid

Triclinic a ≠ b ≠ c o90≠≠≠ γβα Anisotropic Biaxial Ellipsoid

Adapted from Callister (40). More than 50% of minerals exist in the orthorhombic and monoclinic systems. This is

because the most abundant elements in the earth’s crust: silicon (Si), oxygen (O),

magnesium (Mg), iron (Fe), aluminum (Al), and calcium (Ca) usually form

compounds in those crystal systems (Sen 39). For example, the orthorhombic mineral

z

a

b

c y

x

α β γ

8

olivine consists of two iron or magnesium atoms bonded to a silicate compound of a

silicon atom and four oxygen atoms.

Polycrystalline Thin Section The experimental setup for this research utilizes polished thin sections. The

thin sections that are typically studied are made from natural polycrystalline rock

samples. This means the crystalline solid is composed of many small crystal grains,

each with their own specific crystallographic orientation within the sample. The grain

growth process begins with the solidification of small crystals, which are typically

randomly oriented. As they grow to completion, irregular grain shapes form from the

atomic mismatch at grain boundaries. Sometimes there develops a preferred

crystallographic direction, meaning a tendency for crystals to align their orientations

in one direction. The case of preferred orientation among crystal grains is called

texture (Callister 54-56).

A diagram demonstrating the grain growth process as adapted from Callister

(55) is depicted in Figure 2, below.

Figure 2. Illustration of the grain growth process.

9

It is important to note that the grains themselves are three dimensional, unlike the two

dimensional depiction above. The thin section, however, gives the researcher a two

dimensional cross section of the grains in the sample.

I now will briefly describe how a thin section is typically prepared using five

main steps. First, the sample is cut using a diamond saw. Then, the sample is

mounted on the glass slide. Then, the surface is ground flat by using a carborundum

grit and water until the section is ~30 µm thick. The thickness is estimated by

comparing the observed colors of the grains to the colors expected from minerals in

the section. Then the surface is polished, usually with diamond grit plus oil as a

lubricant. Finally, the surface is buffed by using a gamma alumina powder and water

for lubrication (Gribble 32-33).

The two images shown in Figure 3 and Figure 4 are of the thin section of

olivine that was used in the experiment. The first is a photo of olivine mounted in the

sample stage of the system. Note how it looks in ordinary unpolarized light (the

principle of polarization will be described later). The second image is of olivine seen

between crossed polarizers through the camera of the experimental setup. Although

the camera is black and white, the color image was created by combining 3 images

taken under red, green and blue filtered light.

10

Olivine Sample

Figure 3. A photo of the olivine sample that was studied.

11

Figure 4. Color image of the olivine sample.

1 mm

12

2.2 Optics Background

The study of the behavior of light is called optics. I used optical procedures

and principles to guide the design of the experimental setup and analysis. In order to

explain the apparatus used in data collection and the theory behind the data analysis, it

is necessary to describe the most relevant concepts in optics and how they pertain to

this experiment.

Light as an Electromagnetic Wave In the context of the experiment, I will treat light classically as a form of

electromagnetic radiation which can be described by the propagation of a wave

carrying the energy of the radiation. This radiation is a wave that consists of two

perpendicular components, the electric field vector ( Er

) and the magnetic field vector

( Br

). The Er

and Br

vibrate in a sinusoidal manner perpendicular to the direction the

wave propagates (Figure 5Figure 5. The electromagnetic wave as depicted by

Abramowitz). This defines light as a transverse wave, because the fields vibrate

perpendicular to the direction of travel. Ultimately, it is the interaction of the electric

field component of light with the electrically sensitive atoms in the crystal lattice of a

mineral that affects the behavior of light (Nesse 1). It is this basic phenomenon that

makes the experiment possible.

13

Figure 5. The electromagnetic wave as depicted by Abramowitz.

Electromagnetic radiation is characterized by its wavelength as shown in

Figure 5. The wavelength is the distance between successive peaks or crests of the

wave and for visible light it is best measured in nanometers. The frequency of light

describes the number of sinusoidal cycles of the wave that pass a point in space per

second and is measured in Hertz (Hz). Wavelength and frequency are characteristics

that are directly connected by the velocity of the wave. The speed of light, c,

illustrates this relationship as:

fc λ= (1) where lambda is the wavelength and f is the frequency. This relationship describing

the velocity of waves was derived by Newton in the year 1687 (Hecht 16).

So far, I have described the characteristics of light using a picture of single

wave. In practice, light can be depicted as a multitude of waves traveling as a single

unit. The wave front is defined as the plane connecting similar portions of the waves,

such as the crests. The wave normal is a vector that is perpendicular to the wave front

and it points in the direction the wave front is traveling (Nesse 3).

14

Refraction, Index of Refraction, and Snell’s Law As light encounters a boundary between two media it will become reflected,

refracted, or absorbed in some combination: such as partially, selectively, or totally.

Light will always reflect at an angle equal to the angle of incidence, due to the law of

reflection. In addition, if light transmits into another medium, it will undergo

refraction. Refraction is a “bending” of the light ray that occurs because the light

changes its speed. The change in speed causes the ray to change direction at the

boundary between the media (Sen 70). A familiar example of refraction is the pencil

sitting in a glass of water, as seen in Figure 6.

Figure 6. Photographic demonstration of Snell’s Law.

The speed of light in a vacuum, c, is a constant value of approximately 3 x 108

m/s. The refractive index, n, is defined as the ratio of the speed of light in vacuum to

the speed in the medium:

vcn = (2)

15

The refractive index is also simply related to the change in direction that occurs at the

boundary in conjunction with the angle of incidence and the angle of transmittance.

This relationship is known as Snell’s Law, named after its discoverer:

ttniin θθ sinsin = (3)

The refractive index of air can be taken to be approximately 1.00 and many common

minerals have values that range from 1.43 to 3.22 (Shelley 21-22).

Figure 7 is a pictorial demonstration of reflection and refraction with all the

components of the equation for Snell’s Law labeled.

Figure 7. Drawing that depicts refraction at the air-glass boundary.

Polarization Using the model of the electromagnetic wave previously described, I will

explain the nature of polarized light. I am limiting the discussion to only include

linearly polarized light since that is the only type of polarized light relevant to the

experiment. As an example, sunlight is naturally unpolarized, meaning the electric

θt

air ni

glass nt

θi θi

16

field component is randomly oriented in any direction perpendicular to the direction of

propagation. Linearly polarized light has only a single direction in which the electric

field vibrates, (Figure 8).

Ordinary Unpolarized Light Plane Polarized Light

Figure 8. Drawing showing the electric field directions in polarized and unpolarized light. A polarizer is a substance used to create polarized light from unpolarized light.

My experiments use a very common type of polarizer composed of a stretched film of

polyvinyl alcohol impregnated with iodine. This type of polarizer is called Polaroid

and was invented by Edwin Herbert Land (Hecht 335). Polaroid works through

selective absorption, thereby only transmitting light whose electric field is vibrating in

the preferred direction (called the transmission axis).

Another common process to produce linearly polarized light is reflection. At a

special angle of incidence, Brewster’s angle, only the electric field component

polarized parallel to the surface will be reflected. This angle is related to refractive

indices by Brewster’s Law:

i

tp n

n=θtan (4)

Er

Propagation Direction

Er

Propagation Direction

17

named after Sir David Brewster (1781-1868) (Hecht 348). This principle creates a

polarized glare from reflecting surfaces, which can be preferentially reduced by

Polaroid sunglasses. I use Brewster’s Law to calibrate the transmission axis of the

polarizer for the experiment and this procedure is described in detail in the

experimental methods section.

Cross Polarized Light When two linear polarizers are superimposed on each other, as shown in

Figure 9, such that their transmission axes are perpendicular, they are considered to be

crossed. If no polarizing material exists between them, then extinction occurs and no

light will be able to pass through. Therefore, the two crossed polarizers in Figure 9

would appear extinct (black) to the observer in the overlapping region and show some

lowered intensity otherwise (Shelley 20).

Figure 9. Crossed polarizers.

When a mineral sample is placed between crossed polarizers it is said to be

viewed under cross polarized light. The appearance of the mineral depends upon its

crystal symmetry and orientation. A useful theoretical tool called the optical indicatrix

18

describes the way light acts within a given material. Understanding the indicatrix and

how it relates to the experiment was a critical step in my thesis.

2.3 Optical Mineralogy The Optical Indicatrix According to Nesse, the optical indicatrix is “a geometric figure that shows the

index of refraction and vibration direction for light passing in any direction though a

material” (34). It is constructed by drawing vectors whose lengths are proportional to

the index of refraction of light that is vibrating parallel to the vector (Nesse 35). In

general, this geometrical representation is an ellipsoid. The indicatrix is the

theoretical foundation upon which the experimental procedures and calculations of

this experiment are based.

Once the indicatrix is constructed and aligned with the material, it completely

determines how light will behave when incident upon any transparent material

according to the following rules:

1. A plane perpendicular to a ray of light traveling in an arbitrary direction will

intersect the indicatrix ellipsoid (through its center) in an ellipse. The light ray

then splits into 2 rays, one with a polarization aligned parallel to the semi-

major axis of the cross sectional ellipse and the other with a polarization

aligned parallel to the semi-minor axis of the cross sectional ellipse.

2. The ray polarized along the semi-major ellipse axis travels with a velocity of

1nc , where 1n is the length of the semi-major axis. The ray polarized along the

19

semi-minor ellipse axis travels with a velocity of 2n

c , where 2n is the length of

the semi-major axis.

3. Because the two light rays have different indices of refraction, if the light ray is

incident on the material with an angle ≠ 0, then the two rays will physically

split.

4. If the intersection of the indicatrix is a circle, then the polarization of the

incident light is not affected. The direction normal to the circular section in the

indicatrix is called the optic axis.

5. The difference between the maximum and minimum indices of refraction used

to construct the indicatrix define the birefringence of the material.

6. If light incident on the material is already polarized along the direction of the

semi-major or semi-minor axes of the ellipse, then its polarization is unaffected

by the material.

I compiled these rules from the Neese (35), Hecht (336-337), and Shelley (21-23).

According to Table 1, there are three types of optical classifications in minerals and

each one has a different indicatrix geometry. These are based upon the crystal

structure of the atoms, as defined and classified by the crystal system to which the

mineral belongs. The indicatrix makes a progression of increasing complexity from

isotropic to uniaxial to biaxial. Therefore, I will introduce each specific indicatrix in

this order to further aid the understanding of its role in optical mineralogy and the

theoretical framework of this experiment.

20

Isotropic Minerals The light passing through an isotropic mineral of any random orientation will

have the same velocity. On an atomic scale, this is due to a highly symmetric atomic

arrangement, only possible in the isometric (cubic) crystal structure. Other non-

crystalline materials that are isotropic include gases, liquids, and glasses (Nesse 34).

Because the velocity remains constant for all directions, the index of refraction also

remains constant in all directions. Therefore, the isotropic indicatrix is the simplest

indicatrix, a sphere (Figure 10).

Figure 10. The isotropic indicatrix. Image by Nelson.

In the example in Figure 10, the index of refraction is 1.540 for all crystal orientations

because the intersection with the indicatrix is always a circle. Also, the polarization

state of light is not affected, so when any isotropic mineral is viewed between crossed

polarizers, it will appear extinct (Nesse 34-35).

Optic Axis

As previously defined, the optic axis is the direction through any crystal where

the intersection with the indicatrix is a circle. This means that rays entering parallel to

21

the optic axis will not incur double refraction (being split into two rays) or have their

polarization state altered. Therefore, a crystal viewed along its optic axis between

crossed polarizers will show extinction for all rotations of the polarizers, as long as the

polarizers remain crossed. For isotropic minerals, every direction in the crystal is an

optic axis (Nesse 53).

Uniaxial Minerals

The description of uniaxial minerals will begin with a historical introduction to

double refraction. Anisotropic minerals were observed empirically to refract light

with two different indices of refraction. One such mineral, calcite, is commonly used

to illustrate the phenomenon of double refraction. If a calcite rhomb is placed over a

black dot on a piece of paper, then two images of the dot are observed. When the

calcite is rotated, one dot stays fixed while the other dot rotates with the crystal. The

image of the stationary dot behaves as would be expected from an isotropic mineral,

which is according to Snell’s Law, so therefore it was termed the ordinary ray (ω).

Because the image of the rotating dot behaves so differently and does not appear to

obey Snell’s Law, it was called the extraordinary ray (ε) (Nesse 53).

Minerals in the hexagonal and tetragonal crystal systems have only one optic

axis and are called uniaxial minerals. In uniaxial minerals, the relationship between

the indicatrix and the crystallographic axes is such that the optic axis in the indicatrix

and the c crystallographic axis (c-axis) coincide. This makes the indicatrix very useful

in relating the optical and physical properties of uniaxial minerals. Figure 11 is a

representation of the uniaxial indicatrix.

22

Figure 11. The uniaxial indicatrix. Image by Nelson.

As previously defined, the birefringence, δ, is a measure of the difference

between the (higher) index of refraction of the slow ray and the (lower) index of

refraction of the fast ray. Because changing the orientation of an anisotropic crystal

usually changes the birefringence, it is commonly listed as the maximum value for a

given mineral (Nesse 40).

In uniaxial minerals, the ω ray has a fixed index of refraction, whereas the

ε ray’s index of refraction can vary according to the orientation of the mineral.

Because it must always be higher or lower than the index of the ω ray within the same

mineral, uniaxial minerals are classified as optically positive or negative. In positive

minerals, the ε ray is slower because its index of refraction is higher. In negative

minerals, the ε ray is faster because its index of refraction is lower (Nesse 53). Figure

12 shows both classifications of uniaxial minerals (remember that for uniaxial

minerals, the optic axis is the same as the c-axis).

23

Figure 12. The uniaxial negative and positive indicatrix. Image by Nelson.

In addition to having different indices of refraction, the ω ray and ε ray were

shown to be polarized at right angles to each other by Fresnel and Arago in the year

1811 consistent with rule number 1. The ω ray always vibrates in a plane at right

angles to the ω ray path and the c-axis of the mineral. The ε ray always vibrates

within this plane containing the c-axis (Bloss 71).

Three intersections of the uniaxial indicatrix are shown in Figure 13. A section

containing the optic axis is called a principal section and light transmitting through at

this orientation will show maximum birefringence. The permitted vibration directions

are aligned with the semi-major and semi-minor axes of the principal section ellipse.

Light propagating parallel to the c-axis will make a circular section in the indicatrix.

In this situation, the mineral will behave as though it were traveling in an isotropic

mineral because there is zero birefringence. For light propagating in a random

direction, the section will be elliptical and again the permitted vibration directions are

aligned with the semi-major and semi-minor axes of the cross sectional ellipse (Neese

56).

24

Figure 13. Different cross sections in a uniaxial indicatrix. Image by Nelson.

Biaxial Minerals

Biaxial minerals have two optic axes and are classified in the orthorhombic,

monoclinic, and triclinic crystal systems. Biaxial crystals have the least symmetry

since they vary in crystal structure and chemical bonding in all directions. Therefore,

the relationship between the indicatrix axes and the crystallographic axes varies from

mineral to mineral. However, if this relationship is known it makes the indicatrix very

useful in relating the optical and physical properties of biaxial minerals. Figure 14 is a

representation of the biaxial indicatrix.

25

Figure 14. The biaxial indicatrix. Image adapted from Nelson.

As shown in Figure 14, three principal indices of refraction are needed to

construct the indicatrix. These values are α, β, and γ, which are not related or to be

confused with the interaxial angles in crystallography. (From now on in this text,

α, β, and γ, will refer to the indices of refraction in biaxial minerals). These indices of

refraction are defined such that γ > β > α is always true (Nesse 76-77).

The defining features of the biaxial indicatrix are depicted in Figure 15. The

biaxial indicatrix has two circular sections as depicted in Figure 15. The two lines

perpendicular to one or the other circular sections are the optic axes. The angle that

separates the two optic axes is called 2Vα or 2Vγ according to which principal

refractive index bisects it. 2Vα + 2Vγ = 180°, so unless both angles are 90° they will

not be equal. It is interesting to note, in the limits that either 2V angle becomes 0° or

180°, the two circular sections and optic axes converge into one and β = α or β = γ.

Therefore, a uniaxial crystal can be thought of as a special case of the biaxial crystal

(Nesse 76-78).

26

Figure 15. Different cross sections in a biaxial indicatrix. Image by Nelson.

As in uniaxial minerals, the permitted vibration directions in a biaxial crystal

are aligned with the semi-major and semi-minor axes of the elliptical cross section

from the incident wavefront. Light propagating parallel to either optic axis will

“slice” through a circular section in the indicatrix and will behave as though it were

traveling in an isotropic mineral because there is zero birefringence. To avoid

confusion, light in a biaxial mineral is still split into two rays despite needing three

indices of refraction to construct the indicatrix. Of these two rays, one is the fast ray

and one is the slow ray; however, both of these rays are now extraordinary. Biaxial

minerals are also classified as either biaxial positive or biaxial negative. An optically

positive mineral means that 2Vγ < 90° and an optically negative mineral means that

2Vγ > 90°. If the 2Vγ = 180°, then the mineral is optically neutral (Nesse 76-78).

Extinction

Throughout the course of this paper, the term extinction is used to mean that no

light is transmitted when observing a mineral under cross polarized light. Consider a

27

mineral sample observed between crossed polarizers that are rotated through 360

degrees. When an isotropic mineral is viewed in this manner, it will always be extinct.

As discussed above, the polarization state of the light wave is unaltered by the

mineral, regardless of the mineral’s orientation. So, the light is polarized by the first

polarizer and extinguished by the second (perpendicular to first) polarizer. If the

mineral viewed is anisotropic, it will generally show extinction four times as the

polarizers are rotated 360 degrees. The angles at which this extinction occurs are

called the extinction angles and they are spaced ninety degrees apart. The extinction

occurs because each of these four positions correspond to an alignment of the

transmission axis of the polarizer and the characteristic polarization of one of the light

rays transmitted through the mineral (Shelley 23).

A geometric condition for extinction is established if one knows the orientation

of the indicatrix with respect to the light propagation direction. The angle of

extinction can be determined by constructing the ellipse of intersection between the

indicatrix and the plane perpendicular to the propagation direction. If the incident

light is polarized parallel to the semi-major and semi-minor axes of this ellipse, it will

propagate with no change in polarization. Therefore, if the crystal is viewed between

crossed polarizers it will appear extinct if either of the polarizers is aligned with either

ellipse axis (Wilen).

The Orientation of Biaxial Minerals The orientation of biaxial minerals is commonly specified using three angles of

rotation. In my experiment, the three angles used are the Euler orientation angles, phi

28

(φ), theta (θ), and psi (ψ). These angles of rotation are expressed explicitly as rotation

matrices in the section of this paper that outlines the theoretical method. For

simplicity, they are visually depicted in Figure 16.

Figure 16. The Euler angle rotations.

I adopted the Euler angle convention described in a classical dynamics

textbook (Marion 431-433) because it was convenient to express the orientation in the

same three Euler angles as the data obtained from other methods, such as the SEM at

Yale University. This later facilitated a direct comparison of the two methods of

finding the orientation of biaxial minerals, which is detailed in the results section of

this paper.

φ

y

θ

ψ x

z z”

x’

29

3.0 EXPERIMENTAL METHODS

3.1 Experimental System The experimental system for taking images of crystals under thin section was

designed by Dr. Larry Wilen. During data collection and analysis, the system was

kept at Clippinger Laboratory, room 169, where I used it to collect data for my thesis.

My work on the system included writing a software program to collect data and

performing a system calibration. In this section, I will describe the parts of the system

and their functions, define the sample orientation, and explain some of the calibration

procedures.

Components The system consists of the following components mounted on an optical

bench: (A) a black and white CCD (charge coupled device) video camera, (B) a

microscope lens assembly with adjustable zoom and focus, (C) two crossed linear

polarizers mounted on rotation stages, (D) a sample rotation stage, (E) the table

rotation stage, and (F) a diffuse white light source. The system also includes a stage

controller unit and a lab computer (DELL Optiplex GX240, Pentium 4 - 1.7Ghz,

1024MB of RAM) equipped with a video capture card and a computer program that

automates the data collection procedure. Figure 17 is a drawing showing each part’s

location on the optical bench and Figure 18 is a photo of the experimental setup.

30

Figure 17. Drawing of experimental setup.

Figure 18. Photo of experimental setup.

31

I will now explain the details of each part in the system. Light from a white

light source is used to image the crystals. It travels through the first polarizer, then

through the sample, then through the second polarizer, and finally into the camera.

The light source has an adjustable intensity to obtain maximum contrast in the image.

If the intensity is too high, the sample washes out and some light bleeds across grain

boundaries. If the intensity is too low, there will not be enough contrast. This in turn

may cause errors in the experimental extinction angle.

Each of the two polarizers is set in a rotation stage. The stages are set to rotate

in the same direction. The first polarizer, between the light source and the sample, is

set with its transmission axis horizontal and the second polarizer, between the sample

and the camera, is set with its transmission axis vertical. When the sample is not

present in the system, no light will reach the camera because the polarizers are

crossed.

The system is used to image crystal samples in thin sections. Because the

sample is so thin (~30 µm), it is possible to consider the crystal section as a 2-D

sample. There are two rotation stages that control the sample’s position. The table

stage is mounted horizontally so its rotation is akin to a record player. In other words

the rotation occurs in the x-z plane and its rotation angle is called ξ (xi). Mounted on

the table stage is the sample rotation stage. Its rotation is in the x-y plane and its angle

is called φinner (phi inner). The origin of the coordinate system is at the intersection of

the axes of rotation of the sample and table stages. In the home position, both the

table stage and the sample stage are set to zero degrees. This puts the sample stage in

32

a plane parallel to the two polarizers, meaning its axis of rotation is co-linear with the

axis of rotation of the two polarizer stages.

The black and white CCD camera is used to view the sample. It has an

adjustable zoom and focus. These must be adjusted by hand and are set according to

the needs of the experiment. The camera is secured to the optical bench with the axis

of the optical lens system aligned with the axes of rotation of the two polarization

stages. Also, the center of the camera image is aligned with the center of rotation of

the sample as best as possible. Although the camera feeds video to the computer, I use

single image samples of that feed, also known as snapping an image. The image

resolution is 640 by 480 pixels in 8-bit black and white. This gives 256 levels of

intensity. However, during sample installation and testing, it is helpful to watch the

live video feed to be able to make adjusting the image easier and faster.

There is a cover placed over the system to prevent ambient light from spoiling

the images and to protect against excessive dust buildup. A servo controller by

Newport, model MM3000, is used to move the stages with a one thousandth of a

degree precision. The controller can be used to move the stages manually via the front

panel controls, but it is interfaced to a computer via a serial connection to be run

autonomously.

Sample Orientation

There are a total of nine sample orientation sequences, indexed by n. A

sequence of images is taken at each sample orientation. Each sequence is denoted in

the form ξsφinner, or (xi) s (phi inner), where the letter “s” separates the two sample

33

orientation angles in degrees. Table 2 shows the nine sequences used. Figure 19

shows the color images of the 9 sample orientations.

Table 2. The nine sample orientations. n Sequence Xi Phi Inner 1 0s0 0 02 45s0 45 03 45s45 45 454 45s90 45 905 45s135 45 1356 45s180 45 1807 45s225 45 2258 45s270 45 2709 45s315 45 315

Figure 19. Color images of the nine sample orientations.

34

Calibration As with any experimental system, some calibration procedures must be made

to evaluate the consistency and correctness of the data. In this system, the first

polarizer has its transmission axis aligned horizontally by using the principle of

Brewster’s Law, which I described earlier. The procedure involves a laser, a glass

slide, an adjustable linear polarizer, a mirror, a photodetector, and a voltmeter. The

parts are setup on another optical bench in the lab. Brewster’s Law enables me to

polarizer the laser light horizontally using the glass slide. The polarized laser then

shines through the polarizer to be calibrated, is reflected off the mirror, and is detected

by the photodetector. This is hooked up to a voltmeter which displays a voltage

corresponding to the light intensity it detects. With the room lights off, I iterated small

adjustments of the polarizer and the incident angle on the glass slide. When

Brewster’s angle is reached at the glass slide, the light will be completely polarized

horizontally. The laser beam can be extinguished by setting the adjustable polarizer to

transmit vertically. (The mirror is used to keep the laser beam aimed directly into the

photodetector since it would become misaligned as the incident angle on the glass

slide changes).

The next step is to place this vertically calibrated polarizer into the system with

no sample present. Once it is installed into the system, a computer program rotates the

two polarizers of the experimental system independently until maximum transmission

intensity is achieved. Then they are all aligned vertically. Now polarizer #1 is rotated

until the light intensity is as close to zero as possible. This will set polarizer #1 to be

35

transmitting horizontally (parallel to the x-axis of the lab reference frame). Then I

remove the calibrated polarizer I just added and make sure the second (system)

polarizer crosses with the first, meaning it is aligned vertically.

The pre-analysis routine, to be described later, needs the location in the image

(pixel coordinates) of the center of rotation of the stage corresponding to φinner. After

camera and polarizer alignment adjustments are complete, but before taking data, a

calibration must be made to determine this center of rotation. First, an opaque disc

with a centered pin hole is placed in the sample stage and the polarizers are uncrossed

by rotating the second polarizer to horizontal. Then, φinner is rotated through 360

degrees while many images of the pin spot are taken. One image is created by

summing the individual images and the centroid of the spot of light in the image is

found. The pixel location of this centroid is used as the center of rotation for all

calculations done in the pre-analysis routine. Figure 20 is one such image taken on

July 3, 2003.

Figure 20. Image used to calibrate the center of sample rotation.

36

Image pixels are indexed with the origin (0, 0) located in the upper left hand corner.

The x pixel location increases left to right from 0 to 639 and the y pixel location

increases from top to bottom 0 to 479. The center of the image is therefore (319.5,

239.5). For the image above, the center of rotation is calculated to be at x = 340.65

and y = 250.88. The major difference between the center of the image and the actual

center of rotation demonstrates the importance of this calibration.

3.2 Data Collection and Pre-Analysis During experimental data collection, the images from all nine sample

orientations are compiled. I have written a computer program in Labview to automate

steps 4 through 8 in the data collection procedure. The steps are as follows:

1. Remove the protective cover and install a sample into the sample stage. 2. Adjust the focus, zoom, and light intensity if necessary while monitoring the

camera’s view. 3. Replace the protective cover. 4. Set the sample to the home position, 0s0. 5. Rotate the polarizers through 95 degrees in 5 degree increments. An image is

taken at each step, which is an average of 5 video frames. These are grouped and saved as one file consisting of all 20 images.

6. Rotate the sample to the next orientation. 7. Repeat steps 5 and 6 until all 9 sequences are finished. 8. Return the sample and the polarizers to the home position.

Once a sample is in place it takes about twenty minutes for an entire data set to be

taken. A data file is saved for each sequence in a folder named: “(time of data

collection) (name of sample)” which is saved in a folder named: “(date of data

collection)” within the main directory “C:\Geo Data”. In this manner, different runs of

37

the same sample are separated by the date and time. Here is an example of the

directory structure of one of the data folders for olivine:

“C:\Geo Data\2003-11-04-Tue\12.38 Olivine MG”.

Pre-Analysis

The pre-analysis routine is an existing algorithm that is used for the uniaxial

analysis of ice crystals. It was developed and programmed by Dirk Hansen and Larry

Wilen. I have kept this routine essentially intact, implementing only a few

modifications for faster processing. The pre-analysis routine calculates the nine

experimental extinction angles, nexpε , of each grain being tested in a sample. The

procedure to find the nexpε completes the following steps: The intensity of a three by

three region of pixels in the center of the grain is averaged for each image in a

sequence. This intensity is a number from 0 to 255, because the image has 8-bit

grayscale resolution. The 20 data points from a given sequence are used to plot a

curve of intensity vs. polarization angle (with respect to the x-axis). An analytic

function is fit to these data points. The polarizer angle corresponding to the minimum

of this function is the experimental extinction angle, nexpε . The process is repeated for

each of the sequences n = 1 to 9 (and for each grain specified). At the end of pre-

analysis, the results are saved in a file within a folder named after the date and time

the results were calculated. This folder is placed is a directory within “C:\Geo

Results” that includes folders for each set of data analyzed. This organizes the results

first by the data it analyzed and then by the time it was analyzed. Here is an example

38

of the directory structure of one of the results folders for olivine: “C:\Geo

Results\2003-11-04-Tue\12.38 Olivine MG\2005-04-06-Wed, 16.45”.

The pre-analysis routine must be able to determine each grain’s position in the

image after the sample changes orientation. The software uses a mapping algorithm to

ensure it continues to analyze the same three by three region of pixels in the specified

grain. One critical parameter for this mapping algorithm is the center of rotation of

the sample stage, which was determined during the calibration procedure previously

described.

The following images show an example of the pre-analysis results from an

actual olivine grain that was analyzed. The first image, Figure 21, depicts the

extinction curve for the 0s0 sequence. The data points are graphed in white and the

fitted curve is graphed in green. It produces a reliable extinction angle of 30.8

degrees. Note that it has a high contrast, which is the maximum intensity minus the

minimum intensity of the grain.

Figure 21. Extinction curve.

The extinction angles and contrast for each sequence for one particular grain are

indicated in Figure 22.

39

Figure 22. Extinction angles and contrast.

Sequences with contrast lower than 45 may lead to some error in the extinction angle

that is calculated from the fit. Therefore, there is an option to exclude those extinction

angles from the full analysis.

3.3 Theoretical Analysis Method

After the images of a sample have been taken and each grain’s nine extinction

angles are determined experimentally during pre-analysis, the known optical

parameters found in the literature are used to determine the nine theoretical extinction

angles of each grain for many possible combinations of Euler angles. Then, these

values can be compared to the experimental values to find a close match for some

unique set of Euler angles which specify the orientation of the crystal. The details of

this procedure are discussed next.

First, I will describe how to get the nthε (theoretical extinction angle) from a

grain’s crystal orientation φ, θ, and ψ (Euler angles) and a given sample orientation,

(indexed by n). I use a coordinate system where the z-axis is positive towards the

40

camera, the x-axis is positive towards due east, and the y-axis is positive towards due

north, as shown in Figure 23.

Figure 23. Coordinate axes convention used in the experimental setup.

I begin by assuming that the sample is in its home position (0s0) and the

indicatrix that corresponds to the grain under examination is aligned with each of the

three coordinate axes. This is mathematically represented by the general equation for

an ellipsoid centered at the origin:

12

2

2

2

2

2

=++cz

by

ax (5)

Now I describe a procedure that will produce an equation for the ellipsoid

having an arbitrary orientation specified by the 3 Euler angles and also a sample

orientation specified by ξ and φinner. The left side of the equation for an ellipsoid can

be decomposed into the multiplication of the following matrices:

[ ]

=

==zyx

c

b

azyx srq ,

100

010

001

,

2

2

2

(6)

When multiplied together as follows, the left side of equation (5) results:

x-axis

y-axis

z-axis out of page

41

[ ] 2

2

2

2

2

2

2

2

2

100

010

001

cz

by

ax

zyx

c

b

azyx ++=

=qrs (7)

The next step is to rotate the indicatrix through three Euler angles φ, θ, and ψ.

This rotation scheme was first published in 1776 by the famous mathematician

Leonard Euler (1707-1783) where each rotation can be described by a rotation matrix

that transforms one coordinate system into another. The first rotation is

counterclockwise through an angle φ about the z-axis. The rotation matrix is:

−=

1000cossin0sincos

φφφφ

φλ (8)

The next rotation is counterclockwise through an angle θ about the newly created x-

axis. The rotation matrix is:

−=

θθθθθ

cossin0sincos0

001λ (9)

The final rotation is counterclockwise through an angle ψ about the newly created z-

axis. The rotation matrix is:

−=

1000cossin0sincos

ψψψψ

ψλ (10)

The complete transformation is given by the matrix λ:

42

−+−−−

+−==

θφθφθθψψφθφψψφθφψθψψφθφψψφθφψ

φθψ

coscossinsinsinsincoscoscoscossinsincossincoscossinsinsinsincoscossincossinsincoscoscos

λλλλ (11)

The Euler angle derivation and equations are from Marion (431-433).

“When discussing a rotation, there are two possible conventions: rotation of

the axes, and rotation of the object relative to fixed axes” (Weisstein). The Euler

method is a coordinate transformation of the axes. In this study, it is more useful to

leave the lab coordinate reference frame unchanged and instead rotate the ellipsoid in

space. This is a rotation of an object relative to fixed axes. All positive angles are

taken to be counterclockwise about the given axis of rotation. Next, I will describe

how the rotation matrices are altered to accommodate this convention.

Consider an object of rotation, vector x = [1 0 0]. Now consider (A) a rotation

of the coordinate axes equal to φ = 45 degrees. This has the same relative effect as

(B) a rotation of the object equal to φ = −45 degrees. This is easily demonstrated

pictorially as shown in Figure 24.

Figure 24. Diagram explaining rotation convention used.

axis & object object

axis

θ = 45 axis

θ = −45

object

Initially A B

43

A quick and easy solution to change the Euler angle matrices from a coordinate axis

rotation to an object rotation is to substitute the opposite value for each angle.

However, this might cause confusion. Instead, I converted each matrix to accept the

angle of actual rotation in space so that a rotation of a positive angle corresponds to a

positive angle entered into the calculations. This conversion used the following

property of the sine and cosine functions:

)(cos)(cos

)(sin)(sinxx

xx=−

−=− (12)

This happens to be equivalent to the inverse Euler matrices, 111 ,, −−−

ψθφ λλλ :

−=−

1000cossin0sincos

1 φφφφ

φλ (13)

−=−

θθθθθ

cossin0sincos0001

1λ (14)

−=−

1000cossin0sincos

1 ψψψψ

ψλ (15)

φθψ ρρρρ

ρλ==−

havewe,Setting 1

ii (16)

To find the theoretical extinction angle of a grain with a known orientation, I first

rotate its indicatrix through the three Euler angles describing its orientation. This is

done using the inverse Euler matrices, equations (13-15). Remember that rotating the

body in a positive angle is the same as the coordinates rotating a negative angle. Since

44

the second rotation is defined about the newly created x-axis, angle θ will rotate about

the newly created “x-axis” of the object. Therefore, I must rotate the θ rotation matrix

by φ. The same process must be repeated for the ψ rotation, as it must be adjusted by

the two previous rotations. This is equivalent to applying the rotation matrices in

reverse order so that they act upon themselves. This mathematical process is shown

below and a full derivation is completed in Appendix A.

ψθφφφθφφθφψφθφ ρρρρρρρρρρρρρρρ == −−−− )()()( 1111corrected (17)

If the applied rotations are now inserted into equation (7) as follows: srρqρ 1

correctedcorrected− (18)

then I can determine the equation for the ellipsoid at the home position: orientation

0s0. It is also necessary to determine the ellipsoid for each of the other sequences.

This is done by inserting two more rotation matrices for ξ (xi) and φinner (phi inner):

−=

ξξ

ξξ

ξ

cos0sin010

sin0cosλ (19)

−=

1000cossin0sincos

innerinner

innerinner

inner φφφφ

φλ (20)

These matrices are also adjusted to rotate the object so that the inverse rotation matrix

is used. So by using equation (16) I have:

−==−

ξξ

ξξ

ξξ

cos0sin010

sin0cos1 ρλ (21)

45

−==−

1000cossin0sincos

1innerinner

innerinner

innerinner φφφφ

φφ ρλ (22)

Then equation (17) is then modified so that all rotations are applied as shown in

equation (23):

ψθφφξ ρρρρρρ inner=corrected (23)

Remember that the angle ξ causes the incident light to enter the sample at an angle

different from normal incidence and the speed of light is slower in the sample than in

air. Therefore, refraction occurs inside the sample. This effectively reduces the angle

ξ in the sample from the rotation angle specified during data collection. The

adjustment is a simple correction made using Snell’s Law, described earlier by

equation (3). Because the index of refraction of air is approximately 1, the corrected

angle is:

= −

sample of refractionofindexaverage)(sin

sin duncorrecte1 ξξ (24)

As an example, when ξuncorrected = 45o and the average index of refraction is 1.67, then

ξ = 25.05o.

I now have the equation of the ellipsoid, after all necessary rotations, as

referenced in the fixed coordinate axes of the lab reference frame. This is given by the

following equation:

46

[ ] 1)(

100

010

001

1

2

2

2

1correctedcorrected

=

=

−

−

zyx

c

b

azyx innerinner ψθφφξψθφφξ ρρρρρρρρρρ

srρqρ

(25)

To summarize, I now have the equation of an ellipsoid, originally aligned in

the home position along the lab coordinate reference frame, that has been rotated

through 3 Euler orientation angles and 2 sample orientation angles.

Next, I find the extinction angle from the ellipsoid equation using the criterion

for extinction mentioned earlier. The criterion for extinction is that one of the

polarizers must be aligned with the semi-major (or semi-minor) axis of the ellipse that

results from the intersection of the indicatrix ellipsoid with the plane defining the

wavefront of incident light. According to the coordinate convention of the

experimental system, this plane is the z = 0 plane. The ellipse of intersection is found

by taking z = 0 in equation (25). In practice, I take z = 0 at the outset of the derivation

and then equation (20) gives the appropriate equation for the ellipse directly. The

following generalized equation for an ellipse centered at the origin results:

122 =++ CyBxyAx (26)

(where A,B and C are constants) This is the general equation for an ellipse that may have any orientation within the xy-

plane. The semi-major and semi-minor axes correspond to the index of refraction of

the slow ray and fast ray through the crystal lattice, respectively. (If one of the optic

47

axes of the indicatrix is normal to the xy-plane, then the cross section will be a circle

and the grain will be extinct at all angles).

The theoretical extinction angle, each nthε , is defined as the angle between the

semi-major or the semi-minor axis of the ellipse and the x-axis of the lab reference

frame, whichever ellipse axis makes a positive angle (Figure 25). It will then naturally

fall within the range 0-90 degrees, which I choose as my convention. (Recall that the

extinction angles have a periodicity of 90 degrees, so this procedure just picks out the

extinction angle that is in the first quadrant.)

Figure 25. Cross sectional ellipse showing the theoretical extinction angle.

The constants A, B, and C are used to calculate the theoretical extinction angle

according to:

−

= −

CAB1n

th tan21180

πε

o

(27)

48

This equation is derived for the interested reader in Appendix B. To avoid computer

errors, when A = C, I set nthε = 45 degrees. Also, the computer’s inverse tangent

function has a domain of -90 to 90 degrees. I map the result of the inverse tangent to

the domain 0 to 180 degrees so that the extinction angle will fall in the first quadrant,

according to my convention. Both of these corrections can be summarized by:

nth

nth

nth

else

45 then , if

εε

ε

=

== oCA (28)

nth

nth

nth

nth

nth

else

90 then ,0 if

εε

εεε

=

+=< o

(29)

Now I have developed the procedure to determine the theoretical extinction

angle for a given indicatrix orientation and sample orientation. In the next section, I

will describe how I use this information to find a unique Euler angle orientation for a

particular grain.

I used the Maple mathematics program to setup and test the rotation matrices

to make sure the definitions and sign (+,-) conventions were correct. It also allowed

me to graph the ellipsoid cross section to visually verify the results of the rotation

matrices. Also using Maple, I was able to perform all of the rotations symbolically and

arrive at explicit (albeit complicated) expressions for A, B, and C (the coefficients of

the cross sectional ellipse) in terms of the rotation angles φ, θ, ψ, ξ and φinner and the

values of a, b, and c. Therefore, the matrix multiplication need not be carried out

during the analysis. In order to save space, these equations will be deferred to

Appendix C. The rest of the analysis was carried out using Labview, which is the same

49

development software used in all other aspects of the experiment, such as data

collection.

3.4 Analysis Procedure Definition of a “Match” using R2

The analysis routine I have devised will determine the Euler angle orientation,

(φ, θ, ψ), for each grain being analyzed within a thin section sample using the known

optical parameters and the nine experimental extinction angles. The method for

determining the solution, (φ, θ, ψ), involves working backwards by finding the

theoretical extinction angle for many combinations of φ, θ, ψ, ξ, and φinner. Then, the

set of nine extinction angles determined experimentally is compared with all the sets

of nine extinction angles determined theoretically and the combination with the closest

match overall is going to reveal the Euler angles of the grain under evaluation.

The function describing the closeness of a match between a set of experimental

extinction angles and a set of theoretical extinction angles is a least squares fit called R

squared (R2). The R2 value is calculated according to the following formula:

∑=

−⋅=9

1expth

22 }]),,({2[sin),,(n

nnR εψθφεψθφ (30)

Note that R2 values are unitless. If there is a perfect match in all nine sequences, then

the summation will add to zero, because 00sin o2 = . If all of the angles are the worst

match possible, i.e. ± 45o apart, then because 190sin o2 = , the summation will be

equal to the number of sequences used in the analysis. A value for R2 of <.01 total

50

over nine sequences is what I consider a close match. This would correspond to a

difference < ± .955o per sequence’s extinction angle if the error were evenly distributed

among the sequences, which is comparable to the error in the experimental

measurement. If there were eight sequences with a perfect match and one with a bad

match, then an R2 of <.01 would allow a maximum difference of < ± 2.87o for the bad

sequence. This nonlinearity of the R2 function means one bad experimental extinction

angle can make finding a close match in absolute terms rather difficult.

The reason for using the sine squared of the difference, instead of just the

difference squared, is due to the 90 degree periodicity of the extinction angle. When

the difference between theory and experiment is close to 90 degrees, then the function

should report a close match. For example, suppose I am comparing the two extinction

angles 89° and 1°. If I used the difference squared, ( nthε - n

expε )2, it would consider

these two angles a bad match. In reality, they are only 2° apart. So I use the sine

squared function with a factor of 2 multiplied to the difference, ( nthε - n

expε ), to change

the periodicity from 180° to 90°. The sine function is squared to always return a

positive value. To demonstrate, observe the graph in Figure 28 below of f(x), which is

R2 for one sequence where the variable x represents the theoretical extinction angle

when the experimental extinction angle is zero degrees.

51

Figure 26. Graph of ( )xxf 2sin)( 2= .

Two Conceptually Similar Methods of Minimizing R2

Here I will explain the two conceptually similar methods I devised to find the

unique Euler angle orientation φ, θ, ψ that minimizes R2. Later I will describe some

of the programming tricks I employed to get the most performance out of the two

methods. The building block for either method is to divide the continuous 3-D space

of Euler angles into a discrete grid and then find the minimum R2 value over all the

points on the grid. The total number of sample points depends on the range and grid

spacing. For example, suppose the grid covers the full range of the three Euler angles

(0-180 degrees for each angle) and the grid spacing is 2 degrees. I represent this

minimization with the notation [ ] [ ] [ ]))2,1800 ,2,1800 ,2,1800(min( 2 →→→ ψθφR .

As a second example, suppose the grid focuses on a smaller region with high detail

such as, [ ] [ ] [ ]))01.0,100,01.0,8575,01.0,5040(min( 2 →→→ ψθφR . Note that the

building block used in both methods requires the theoretical extinction angle for each

sequence to be calculated for every point on the grid.

52

Simple Grid Method

The first method is called the Simple Grid Method because it involves only one

iteration of the basic building block above. In fact, it is exactly described by the first

example above: [ ] [ ] [ ]))2,1800,2,1800,2,1800(min( 2 →→→ ψθφR . However, the

calculation is optimized as follows: The number of repetitions of the calculation of a

theoretical extinction angle is specified by equation (32) described later in this chapter.

To save calculation time, I take advantage of the redundancies in calculating all

possible values of nthε over the range of the grid for different grains in a thin section.

Even though a sample may contain as many as 500 to 1000 grains, as long as each one

is composed of the same mineral composition, I can use the same set of nthε for each

grain. Rather than recalculate them each time, it makes sense to save that work in a

file to be used during subsequent runs of the analysis program (for different grains of

the same mineral). Such a file is written for each sequence and it lists the nthε for each

Euler angle specified on the grid. The optimized simple grid method opens up the

entire list and finds the unique Euler angle orientation φ, θ, ψ that minimizes R2 for

that grain.

In practice, the optimized simple grid method by itself had some shortcomings.

First, it takes approximately 20 minutes to write a set of nine sequence files at a 2

degree resolution on the lab computer. Since this is done once per mineral

composition, it is a tolerable wait. However, since all Euler angles are checked when

calculating R2, it still takes about 2 minutes per grain at this 2 degree Euler angle

53

resolution. For a typical sample containing hundreds of grains, this results in many

hours of calculation. Plus, I am left with poor precision (approximately two degrees).

Later, I show that the number of calculations (and hence the calculation time) is a

nonlinear function of the precision desired. This means that simply increasing the grid

resolution by a factor of one half does not simply double the calculation time, but

rather it increases the calculation time by nearly a factor of 8. The dilemma of how to

improve the precision without further increasing the calculation time led me to create

another method, which I will describe in the next section.

Adapted Newton Method

The second method is called the Adapted Newton Method. It was first devised

as a method to find the precise Euler angle match to .01 degrees, given the best match

from the two degree resolution of the Simple Grid Method as a starting point. The

volume of the Euler angle space is like a 3D cube 180 degrees on a side (Figure 27).

54

Figure 27. Depiction of 3-D Euler angle space.

The Simple Grid Method divided that cube into smaller 2 x 2 cubes. Unless the

minimum of the Simple Grid Method is far away from the global minimum, the lowest

minimum of R2 lies within a volume of 8 of these smaller cubes surrounding the best

match from the Simple Grid Method. A naïve approach is to consider using the Grid

Method again on this smaller domain with a grid spacing of 0.01 degree. However,

100 samples per degree in the smaller domain still requires calculating R2 over

64,481,201 Euler angle combinations. This is more than 85 times the number of

samples used by the Simple Grid Method in the first place! This strategy was

successful, but too time-consuming.

Rethinking the problem, I returned to picturing how the function describing R2

varies. It can be likened to a function that describes the temperature of a room given

the point in space (x,y,z). If you were asked to find the lowest temperature in this

room, as quickly as possible, you might employ the hot-cold method. This entails

Phi

0

180 Theta

180

180

Psi

55

moving around the room towards the lowest temperature reading you find. You might

initially just check the temperature at a point on the left and right sides of the room,

and then move to the colder side and then check a few points around there, etc. This

led to the development of the Adapted Newton Method. The premise of the Adapted

Newton Method is to start with a very coarse grid and iterate the grid method

building block, continually shrinking the grid while centering it on the minimum of the

previous grid.

Detailed Methodology of the Adapted Newton Method

The Adapted Newton Method begins with a coarse grid over the entire region

of interest. The region of interest could be two degrees above and below each Euler

angle of the best match from the Simple Grid Method. The region of interest could

also be the entire span of Euler angles used in the Simple Grid Method, 0-180 degrees.

The building block grid of the Adapted Newton Method includes a central point, two

endpoints, and two midpoints for each Euler angle. Using the notation developed

earlier while covering all Euler angles, it would be expressed as:

[ ] [ ] [ ]))45,1800,45,1800,45,1800(min( 2 →→→ ψθφR . There are 5 sampling points

for each Euler angle giving 53 = 125 combinations. This number of combinations is

used in each stage (i.e. iteration of the building block) of the process. During each

successive stage, a new building block grid is created with a resolution that is one half

the resolution used in the previous iteration. In addition, the new building block grid

is centered about the Euler angle combination with the lowest R2 from the previous

iteration.

56

As an example, observe the sampling angles for the first iteration in this case.

Over the entire range there is 0-180 degrees. The center point is 90 degrees, the

endpoints are 0 and 180 degrees and the midpoints are 45 and 135 degrees. The

Adapted Newton Method takes all 125 combinations and minimizes R2. As an

example, if the method finds the minimum at (90,135,90) for (φ, θ, ψ), then the new

coarse grid is centered there. The endpoints are now set to be the nearest points of the

previous iterations grid, which were 45 and 135 degrees for φ and ψ and 90 and 180

degrees for θ. Then the new midpoints are set at 67.5 and 112.5 for φ and ψ and 112.5

and 157.5 degrees for θ. Once a minimum R2 is found, the process continues. After

13 iterations, the resolution in each Euler angle reaches:

oo

13

o

iterations 00549.819245

245

2resolutioninitialresolutionfinal ≈=== (31)

This method is called the Adapted Newton Method because of its similarity to

Sir Isaac Newton’s method of finding the x-intercept of a function.

Comparison of the Number of Repetitions

Even though it includes hundreds of sine and cosine functions, the calculation

of a single extinction angle takes less than one millisecond. However, when the

analysis is done, the calculation of an extinction angle must be made over and over.

When using the Simple Grid Method, in order to test all possible Euler angle

orientations, a range of each angle and a sampling resolution must be defined. The

repetitions of this program are calculated according to the following equation:

57

sequencesnresolution

ofrangeresolution

ofrangeresolution

ofrangesrepetitiongrid ⋅

+

+

+= 111

ψψ

θθ

φφ (32)

The range of 0-180 degrees covers all orientations and was chosen as the convention

for this experiment. Here is a sample calculation of the number of repetitions using 9

sequences and a 2 degree sampling resolution:

139,782,69571,753991912

18012

18012

180 3 =⋅=⋅=⋅

+

+

+=srepetitiongrid (33)

The program needs 19.5 seconds to iterate 100,000 calculations on the lab computer.

In the case of the example above, before any comparisons are made, (like R2), the

program would require 22 minutes per grain. To make things worse, that still only

reveals a solution within two degrees! It is important to note that the number of

calculations is proportional to the inverse cube of the grid spacing.

In the case of the Adapted Newton Method, the total number of repetitions of

the standard calculation is far fewer. To achieve a .00549 degree resolution, 13

iterations are made of the Adapted Newton Method yielding:

14625913125125 =⋅⋅=⋅⋅= sequencesniterationssrepetitionnewton (34)

Therefore, in order to reduce the amount of calculations necessary, the Adapted

Newton Method is the method of choice. However, one must also verify that this (or

any) method yields the true global minimum of the R2 function. Later in the results

section, I will discuss the success rate in practice of all the methods.

58

Detailed Methodology of the Simple Grid Method

Each file is classified in an easy to understand file management system

describing its contents. The data directory is located at C:\Geo Orientation Angle

Data. This contains folders organized by the resolution of each Euler angle. Each of

these folders contains sub-folders organized by the ellipsoid axes a, b, and c specified

to 4 digits of precision. It also contains a file that tells the analysis program how long

each data file is in bytes (more on that later). Finally, each of these folders contains a

file for each sequence, i.e. 0s0 Extinction Angles.dat. The default is a two degree

resolution from 0 to 180 degrees. Figure 28 is an example of the file structure for the

Simple Grid Method.

Figure 28. File structure for the Simple Grid Method.

The theoretical extinction angle is originally found in radians because the

computer makes all sine and cosine calculations in radian mode. However, it is easier

and more intuitive when using degrees. Therefore, I save the extinction angle data

files as degrees by shifting all the radian values by the constant multiplier 180/pi. As

an added bonus, I am able to maintain a higher resolution in terms of digits needed

59

after the decimal point. For example, a .01 degree change is equivalent to a change of

30001745.0 radians. What takes only 2 place values after the decimal in degrees will

consume many more place values in radians.

Next, I made some careful comparisons to determine the best method to save

the files. If each extinction angle were saved as a floating point number it would

consume 8 bytes of hard disk space. If each extinction angle were saved as an integer

number it would consume 1 byte of hard disk space, but I would lose all the

information after the decimal point as each value is rounded to a whole number. I

found this information from the “help section” included with Labview, as indicated

below.

• Double-precision (DBL) - Double-precision floating-point numbers have

64-bit IEEE double-precision format. Double-precision is the default format

for numeric objects.

• Byte (I8) - Byte integer numbers have 8 bits of storage.

To save space and retain all precision, I have employed a creative solution that

converts each extinction angle to two integers and writes them to disk in a continuous

fashion. The first integer byte is equal to the number before the decimal point and the

next integer byte is equal to the 2 significant figures after the decimal point. Since

each extinction angle is calculated between 0 and 90 degrees, the integer’s range from

-128 to 127 is more than enough. Even though this requires a conversion to and from

the integer format, hard drive space is saved and overall execution time is reduced

60

since it will write to disk faster. Table 3 shows the amount of hard drive space saved

by this method which (theoretically) writes to disk four times faster.

Table 3. Memory usage statistics for floating point vs. two integer bytes.

Resolution Combinations Total Memory Used

Memory Used

Memory Saved

Memory Saved

in Degrees Per Angle Combinations Floating Point

2 Integer Bytes Per Sequence

Over 9 Sequences

10 19 6859 0.05 MB 0.01 MB 0.04 MB 0.35 MB

5 37 50653 0.39 MB 0.10 MB 0.29 MB 2.61 MB

2 91 753571 5.75 MB 1.44 MB 4.31 MB 38.81 MB

1 181 5929741 45.24 MB 11.31 MB 33.93 MB 305.37 MB

0.5 361 47045881 358.93 MB 89.73 MB 269.20 MB 2422.79 MB

Each data file is written according to the same standardization, which is a systematic

method similar to how one would count all combinations of three items, given a set

number of items, if order matters. First, φ and θ are held constant while ψ varies

incrementally according to the sampling resolution. When ψ reaches a maximum, θ is

incremented once, and the process repeats. When θ reaches a maximum, φ is

incremented once, and the process repeats. Finally, φ, reaches a maximum when all

nthε have been calculated. Each n

thε is written to the file in the order in which it is

calculated. This effectively “numbers” them according to their position in the file.

The analysis routine employs no advanced selection criteria to eliminate bad

comparisons, rather it compares all extinction angles calculated. The procedure

includes: open all the files in the data directory, read the first entry in each one,

assemble it into a floating point number, convert it to radians, calculate the R2

(summed over all sequences), then do the same for each subsequent entry. This is why

61

the program needs to know how long the files are in bytes. After that task in

complete, I have an array of R2 values ordered according to their Euler angles. I find

the lowest R2 value and its place in the array indicates what Euler angles are the

solution!

62

4.0 RESULTS AND DISCUSSION

The olivine sample that I analyzed using the new optical technique was

acquired from a research group at Yale University. I wanted to compare the results of

my experiment with the known orientations of the minerals in order to see if the

optical method would yield the correct results. The research group at Yale, with the

help of my advisor, Dr. Larry Wilen, used a scanning electron microscope (SEM) to

find the orientation of 21 grains in the sample. The details of the SEM procedure and

the side-by-side comparison between both methods will be explained in the next few

paragraphs. I will also discuss the corrections I made to the analysis.

In the typical application of a scanning electron microscope (SEM), a beam of

electrons is focused onto the surface of a sample. The reflected electrons are used to

map out an image of the surface to be displayed on a monitor. The surface must be

electrically conductive for this procedure to work. In my case, the olivine sample was

covered with a carbon film for conduction. A SEM can display magnifications of

50,000 times or higher, which is much higher magnifications than an optical

microscope can obtain. However, the grains in the olivine sample are visible to the

naked eye, so a high level of magnification was not necessary (Callister 84-85).

Electron backscattered diffraction (EBSD) is the technique utilizing a SEM

that is used by mineralogists and other materials scientists to find the orientations of

mineral grains. The EBSD technique I am about to describe is the same system by

which the Yale University researchers found the orientations in the olivine sample.

63

Since its development in 1954, EBSD has proved to be a fast and user-friendly method

for this type of experimentation. The procedure examines a polished sample, such as a

thin section, that is tilted to a relatively high angle of ~70°. Below is an excerpt from

the HKL Technologies website detailing the EBSD procedure:

The electron beam is... directed at the point of interest on the sample surface: initial elastic scattering of the incident beam causes the electrons to diverge from a point just below the sample surface and to impinge upon crystal planes in all directions. Wherever the Bragg condition for diffraction is satisfied by a family of atomic lattice planes in the crystal, 2 cones of diffracted electrons are produced. These cones are produced for each family of lattice planes (HKL Technology: EBSD Basics). The website continues by stating how the cones can be detected by a phosphor

screen detector attached to a digital CCD camera. The cones produce bands on the

screen, which is close to the sample to collect as many bands as possible. Each band

corresponds to a crystal lattice plane family and the bands are called “Kikuchi bands”.

There are many Kikuchi bands which are then matched with theoretical data to

calculate the 3-D crystallographic orientation in less than 0.02 seconds (HKL

Technology: EBSD Basics). In the case of the Yale University SEM, this output is in

the form of the Euler angles.

Dr. Wilen and the researchers at Yale University analyzed a total of 23 grains

from the olivine sample. Of these, 14 were olivine and 9 were another phase called

orthopyroxene (OPX). However, when analyzing the grains using the optical setup,

one of each phase from the original 23 grains was too close to the edge of the camera

view to be seen in every sequence and could not be analyzed. The remaining 21

64

analyzable grains were numbered 0-20 and tested using the most recent version of the

Adapted Newton Method analysis routine. After some additional theoretical issues

were taken into account, the analysis routine found each grain’s Euler angles in good

agreement with the results from the SEM method.

It was important to make sure I analyzed the exact same grains using both

methods to be able to compare the data effectively. Dr. Wilen worked with the Yale

group on the olivine sample. He told me that when viewing the sample with the SEM,

it was difficult to discern the grain boundaries. Without the cross polarized light, the

only clue available to determine which grain was being analyzing was the cracks in

the sample. He used the cracks as a roadmap to hit as many unique grains as possible

from looking at a printout of an image of the sample taken with a microscope under

crossed polarizers. Since the SEM analyzed a specific region of each grain, he

indicated this region on the printout so I could use the optical technique to analyze the

same region of the grain. This ensured we were both hitting the same grain and it also

minimized any differences that might have arisen from a variation in grain orientation

according to the region of the grain that was analyzed (Wilen).

Figure 29 is an image showing the location of each analysis point for both

methods and its corresponding number.

65

Figure 29. Image showing each analyzed grain’s location. Pixel resolution = 640 x 480.

The SEM made two different runs on the sample to collect Euler angle data.

Therefore, the first 15 grains were analyzed before the final 8 were analyzed. The

overlap between the two runs resulted in 18 unique grains included in the 21 analysis

points. Analysis numbers 3 and 8 are the same grain, 7 and 15 are the same grain, and

12 and 20 are the same grain. It was initially discovered visually that the same grain

was being analyzed in each case. Then both methods verified this by producing

similar results for the Euler orientation angles for each pair of analysis points on the

same grain.

1 mm

66

It is interesting to note that analysis points 14 and 16 had a similar orientation.

These points might be on one grain that has a discontinuous orientation resulting from

stress or strain. Or these two points are on separate grains that split apart from one

grain, also giving them similar orientations. Another interesting pair of analysis points

are point 5 and point 18. They too have similar orientations. However, they are not

anywhere near each other in the sample meaning there are many grains in between

them. In addition, these two grains are not even the same phase, one is olivine and the

other is OPX. Therefore, I attribute the close alignment of these grains to be a

coincidence.

Table 4 shows a comparison of the results from the two methods. These

results were obtained using the final version of the analysis algorithm, which I will

discuss later.

67

Table 4. The Results

Grain Orientations Using Two Methods Grain Euler

Angles Yale Univ.

SEM Ohio Univ.

Optical Phi 62.79 69.83 Theta 94.12 92.43 Psi 68.00 67.67

Olivine 0

R2 0.00445 Phi 134.31 140.84 Theta 62.66 61.82 Psi 90.93 90.85

Olivine 1

R2 0.00871 Phi 63.01 72.19 Theta 52.00 53.88 Psi 122.35 119.24

Olivine 2

R2 0.00244 Phi 81.19 89.60 Theta 55.74 55.01 Psi 140.36 138.69

Olivine 3

R2 0.00160 Phi 43.07 45.61 Theta 19.89 20.04 Psi 0.73 4.59

Olivine 4

R2 0.00010 Phi 169.01 175.13 Theta 18.17 17.35 Psi 101.10 101.15

Olivine 5

R2 0.00053 Phi 34.77 42.38 Theta 102.48 101.09 Psi 149.96 149.56

Olivine 6

R2 0.00181 Phi 57.68 64.30 Theta 104.40 103.37 Psi 83.51 83.20

Olivine 7

R2 0.00156 Phi 80.26 88.38 Theta 55.17 54.81 Psi 141.63 139.94

Olivine 8*

R2 0.00172 Phi 102.16 107.89 Theta 112.01 112.03 Psi 63.11 61.64

Olivine 9

R2 0.01670

68

Grain Orientations Using Two Methods Grain Euler

Angles Yale Univ.

SEM Ohio Univ.

Optical

Phi 40.46 47.52Theta 43.91 45.02Psi 174.93 174.04

Olivine 10

R2 0.00148Phi 75.85 82.20Theta 90.25 90.04Psi 16.50 15.86

OPX 11

R2 0.00139Phi 69.55 75.85Theta 52.40 51.41Psi 32.54 31.86

OPX 12

R2 0.00007Phi 138.56 149.59Theta 57.26 55.66Psi 56.19 52.52

Olivine 13

R2 0.00343Phi 86.40 93.80Theta 87.25 86.33Psi 14.75 16.04

OPX 14

R2 0.00103Phi 58.19 64.36Theta 102.01 103.23Psi 82.85 83.16

Olivine 15*

R2 0.00150Phi 89.93 94.64Theta 86.39 86.36Psi 8.96 9.93

OPX 16

R2 0.00097Phi 72.23 77.09Theta 118.49 119.31Psi 178.71 178.48

OPX 17

R2 0.00175Phi 168.05 173.62Theta 18.44 16.46Psi 91.40 90.67

OPX 18

R2 0.00344Phi 30.85 36.85Theta 50.86 51.03Psi 24.55 23.64

OPX 19

R2 0.00035

69

Grain Orientations Using Two Methods

Grain Euler Angles

Yale Univ. SEM

Ohio Univ. Optical

Phi 71.04 75.80 Theta 51.23 51.76 Psi 32.47 31.06

OPX 20*

R2 0.00052 *Repeat Grain

70

Figure 30 is a graph depicting the orientation of all 21 grains. The x

coordinate is the angle phi, the y coordinate is the angle theta, and the angle that each

line makes with horizontal is psi. If the line is horizontal, then phi is 0° and vertical is

90°. There isn’t a need to make the line a vector pointing in all directions, because a

180° rotation in psi is a rotation equivalent to 0° because of symmetry in the

indicatrix.

Figure 30. Euler angle plot of data from the optical method.

71

Figure 31 is a graph depicting the orientation of all 21 grains alongside the

SEM results. The data from the SEM has been adjusted to account for the difference

in mounting between the SEM setup and the optical setup, which was apparent by a

constant shift in the angle phi.

Red = Optical; Blue = SEM

Figure 31. Euler angle plot of data from both methods side-by-side. SEM data has been adjusted to compensate for a sample mounting difference.

72

Characteristics of Olivine and the Solution to an Initial Error of Psi ± 90° This section will describe the development process by which I arrived at the

final version of the analysis routine. Initially, both the analysis routines derived for

this experiment (discussed in section 3.4) were used to find Euler angles for each of

the 21 analyzable grains in the sample. The first was the Simple Grid Method (with

two degree resolution) followed by the Adapted Newton Method (using 13 iterations

of a 2° to 2° starting grid) to achieve a 0.01 degree resolution. The second was the

Adapted Newton Method by itself (13 iterations of a 0° to 180° starting grid) to

achieve a 0.01 degree resolution. The Euler angles from the two analysis methods

were then compared with each other and with the SEM results.

Neither the Adapted Newton Method alone, nor the Grid Method + Adapted

Newton Method gave results in complete agreement with the SEM data. Two issues

with the results became apparent. The first problem was that sometimes one method

or the other found an answer with a lower value of R2. This means that neither

method found the global minimum every time. The second problem was that for the

best solutions (those with the lowest R2 among the two methods), there was good

agreement with the SEM results only in the angles phi and theta. For the olivine

grains, the best solution (independent of the analysis technique from which it was

derived) differed from the SEM results in the angle psi by 90°. For the OPX grains,

the angle psi was in agreement between the two methods.

The first step in diagnosing the inconsistency between the two analysis

methods involved a close examination of the data for R2 using the grid method. This

73

was the best way to get an overview of the entire 3-D Euler angle space. As a

reminder, the grid method calculates R2 in the following order: first, φ and θ are held

constant while ψ varies incrementally according to the sampling resolution. When

ψ reaches a maximum, θ is incremented once, and the process repeats. When

θ reaches a maximum, φ is incremented once, and the process repeats. Finally,

φ, reaches a maximum and all calculations are complete with the results “numbered”

according to the order in which they were calculated. The images below, (Figure 32),

(Figure 33), and (Figure 34), show three plots of R2 vs. Euler angle taken from the

results of the Grid Method for olivine.

The first plot was created by plotting the R2 value at a two degree resolution in

each Euler angle and shows 753571 data points for

[ ] [ ] [ ])2,1800,2,1800,2,1800(2 →→→ ψθφR .

Figure 32. R2 vs. all Euler angles for one grain. The second plot was created by plotting the R2 value at a two degree resolution in two

Euler angles and shows 8281 data points for

[ ] [ ] [ ])2,1800,2,1800,constant(2 →→ ψθφR .

74

Figure 33. R2 vs. the Euler angles theta and psi for one grain. The third plot was created by plotting the R2 value at a two degree resolution in one

Euler angle and shows 91 data points for [ ] [ ] [ ])2,1800,constant,constant(2 →ψθφR .

Figure 34. R2 vs. the Euler angles theta and psi for one grain.

Each successive plot has the effect of zooming in on the apparent minimum

from the plot before it. From examining these plots, I discovered that there was a near

degeneracy with respect to the angle psi. R2 appears to reach a local minimum at a

periodic 90° interval in psi. So, I concluded that the main reason that one or the other

analysis method was sometimes finding the wrong global minimum was because there

75

were many close local minima over the entire 3-D space. This was corroborated by the

fact that when the results from the two analysis methods were different, the difference

was usually just ± 90° in psi. To check this idea out, I re-minimized the solution from

the Simple Grid + Adapted Newton Method for each grain about the point (phi, theta,

psi + 90°) and chose the lowest R2 between them. This always gave the correct global

minimum (i.e. the one that agreed with the best choice among the two methods).

However, recall that the Simple Grid method is very slow; later, I discuss a more

efficient method to get the global minimum.

At this point, I was still left with the second problem, namely that all the

olivine results differ from the SEM results by 90° in psi and that the OPX grains were

in agreement. Dr. Wilen had earlier suggested an inconsistency between the angle

conventions as the cause for discrepancy. However, this bothered me because it didn’t

happen with all of the grains; the OPX grains seemed to be immune from the problem.

I spent about 6 months trying to discover what might be causing the

discrepancy. The quest for the answer revealed a physical property of olivine to be the

cause - a mineral characteristic called solid solution. More specifically, the psi ± 90o

near degeneracy (and hence the discrepancy) was due to how olivine’s solid solution

affected the values for the refractive indices: α, β, and γ. To better describe this

relationship, I will first define solid solution as a phenomenon whereby one or more

ions can be substituted for other ions in the mineral’s atomic structure. This must be

done without distorting the structure of the crystal or introducing a chemical inequality

that may cause instability. Therefore, the best substitutions are within 15% of the size

76

of the ion replaced and of equal electronic charge. There are different forms of olivine

that depend on the chemical composition, which is written as: (Mg,Fe)2SiO4. In

olivine, the cations (positive ions) Mg2+ and Fe2+ can substitute for each other. This

causes what is called a solid solution series, which is a range of olivine minerals

varied by percentage composition of the pure end members. These pure end members

are Mg2SiO4 (called forsterite and abbreviated as Fo) and Fe2SiO4 (called fayalite and

abbreviated as Fa). The in between compositions are denoted by the relative

percentages of each end member (Sen 53-54). Since olivine’s chemical composition varies from end member to end member,

the values for the refractive indices also vary. These values are found in a book that

compiles experimental and observational data on different minerals, such as An

Introduction to the Rock-Forming Minerals by Deer, Howie and Zussman. The

refractive indices listed in that book for the end members of olivine and OPX are in

Table 5.

Table 5. Index of refraction data for olivine and OPX.

Olivine OPX forsterite,

Mg2SiO4 fayalite, Fe2SiO4

enstatite, MgSiO3

ferrosilite, FeSiO3

α 1.635 1.827 y 1.650 1.768 y β 1.651 1.869 z 1.653 1.770 x γ 1.670 1.879 x 1.658 1.788 z 2Vγ 82o 134o 55-127-55o

(Deer 3, 155)

Besides listing the refractive indices, the book also shows how the

crystallographic axes align with the axes of the optical indicatrix. For olivine, the γ

refractive index is aligned along the x-axis, the α refractive index is aligned along the

77

y-axis and the β refractive index is aligned along the z-axis. This means if the grain

had an Euler angle orientation of zero degrees in each angle, then maximum

birefringence would be seen in my experimental setup (which views the grain parallel

to the z-axis). Figure 35 illustrates the relationship between the crystal axes and the

indicatrix axes (Deer 3).

Figure 35. Relationship between the crystal axes and the indicatrix axes in olivine.

The values I used for α, β, and γ were calculated assuming the most common

chemical composition for olivine: a magnesium rich 90% Mg and 10% Fe, indicated

as 10% fayalite: Fa10. The calculation was done by creating a linear variation between

the end members and interpolating each refractive index at 90% Mg and 10% Fe. This

is where I found the biggest clue to what was causing the near degeneracy in the Euler

angle psi. It is related to the angle 2Vγ, which is the angle between the two optic axes

as measured so that the γ axis in the indicatrix bisects the angle. (Incidentally 2Vα is

measured across the α axis, but I choose to stay consistent with 2Vγ). The relationship

z = β

y = α

x = γ

78

between the three indices of refraction and the angle 2Vγ is as shown below from

Neese (78).

)()(Vcos 222

222

γ2

αγββγα

−−

= (35)

(where the angle Vγ is half of 2Vγ)

The four values are linked by an equation, so this means 2Vγ can be calculated from

known indices of refraction as opposed to a direct measurement. (My experimental

setup is not equipped to measure 2Vγ or the indices of refraction). In fact, if any three

of the four variables in the equation are known, then the fourth variable can be

calculated or found using a plot called a nomogram (Bloss 159). Even though the

angle 2Vγ is not used in the calculation of the theoretical extinction angle or R2, it

proved to be an important indicator of the correct geometry of the indicatrix ellipsoid.

When the angle 2Vγ surpasses 90o, olivine changes optic sign from positive to neutral

(at exactly 90o) to negative. (Neese 78). Deer said about olivine, “The optic axial

angle likewise varies systematically from 2Vγ = 82o for Mg2SiO4 to 2Vγ = 134o for

Fe2SiO4, the sign changing at about Fa13” (10). The problem is, when using

α, β, and γ as linearly interpolated values, the change in optic sign occurs between Fa6

and Fa7. At Fa10, olivine should be biaxial positive. Therefore, I was using

interpolated values of α, β, and γ that made olivine appear biaxial negative when in

fact it was biaxial positive!

The 2V angle can be measured experimentally with a higher degree of

precision than it can be calculated from the measured refractive indices. In other

79

words, a small variation in the ratio among β and the other two refractive indices

causes a large variation in the angle 2V. As a matter of fact, I calculated 2Vγ from the

indices of refraction listed for the end points and discovered that instead of 82o and

134o, I got 86o and 128.9o respectively. So I devised a new method for calculating

α, β, and γ for use in the analysis routine taking optic sign vs. composition into

account. It still applies a linear interpolation for α and γ, but now it linearly

interpolates the measured data for 2Vγ using the end values in the literature (82o and

134o). This causes a change of sign around Fa13 and gives a value of 87.2o at Fa10.

Then, using the interpolated value of 2Vγ, Ι calculate β from the equation relating all

four variables. The graph in Figure 36 illustrates the difference between the two

methods.

0 10 20 30 40 50 60 70 80 90 100

Percent Fayalite

Beta Calculated

Beta Interpolated

Optic Sign vs. Composition For OlivineBiaxial Positive

Biaxial Negative

Figure 36. Optic sign vs. composition for olivine.

This showed that the relationship among α, β, and γ as compared to each other

is most important in calculating the Euler orientation angles. So how did

misinterpreting the optic sign cause the angle psi to be off by almost exactly 90°?

This has to do with the alignment of the crystal axes to the indicatrix axes and how the

Euler angle rotations are defined (all stated previously). Both optic axes of olivine

80

initially lie in the x-y plane. So, when the 2V angle is very close to 90°, an extra

rotation of psi by 90° will produce theoretical extinction angles as if the mineral were

the opposite optical sign and had not undergone the extra rotation.

The OPX in the sample was not off by psi ± 90° because there was not a

problem with the interpolated values of the indices of refraction not matching the optic

sign of OPX. OPX is also biaxial positive at a 90% Mg and 10% Fe composition and

if 2Vγ is used to find the β index of refraction it agrees with the optic sign. Therefore,

I did not need adjust the indices for OPX. I just used a straight linear interpolation

like I had originally used for olivine. Incidentally, the 2Vγ angle in OPX does not vary

linearly with composition over the entire range.

As an interesting note, had olivine’s indices of refraction been aligned with the

crystallographic axes differently, then the consistency of the discrepancy might not

have been detected as readily. This is because misinterpreted optic sign would not

have exclusively affected the angle psi. In other words there would have still been a

90° rotational difference from the switch in optic sign, but that would have been in a

different plane from the psi rotation. This means it would have changed all three

Euler angles making the consistency of the discrepancy more difficult to detect.

Table 6 below shows the final parameters used to run the analysis routine. As

a reminder, the values of a, b, and c are the input variables into the equation for the

indicatrix ellipsoid.

81

Table 6. Final optical parameters: a, b, and c used to analyze olivine and OPX.

Final Parameters Used Olivine OPX a 1.6909 1.6647 b 1.6542 1.6609 c 1.6714 1.6701

With these parameters, the analysis method (using one of the versions that

finds the global minimum, as discussed above) finds a solution consistent with the

SEM results.

The next task was to create one optimized theoretical analysis technique that

finds this global minimum for each grain. Ironically, the near degeneracy of the angle

psi which alerted me to the inconsistency in the first place causes the two theoretical

methods (Newton and Grid) to have difficulty in finding the global minimum for each

grain (but at least this global minimum is now correct!).

The adaptation that fixed this problem most efficiently involved dropping the

Simple Grid Method completely (because it is slow) and increasing the number of

repetitions of the Adapted Newton Method by decreasing the starting grid. The

Adapted Newton Method is run eight times and the lowest R2 of these eight is checked

against psi + 90° to find the final lowest R2 solution. Each of the 8 runs of the

Adapted Newton Method is one equal sub-section of the entire 3-D Euler angle space

that was shown in Figure 27. Instead of one grid using points spaced 45° apart, each

of the 8 grids uses points spaced 22.5° apart. As an example, the first grid uses

endpoints of 0° and 90°, a center point of 45°, and midpoints of 22.5° and 67.5°. On

the lab computer, the new Adapted Newton Method (eight times with a check at psi +

90°) takes between 20 and 25 seconds of calculation time per grain.

82

After this programming adjustment was made, the new Adapted Newton

Method became the best minimization method. It found results consistent with the

SEM results for each and every grain as displayed in Table 4. In addition, there was

no longer a need to write memory consuming files and the calculation time was

reasonably short.

83

5.0 CONCLUSIONS AND APPLICATIONS TO FUTURE RESEARCH

In conclusion, the optical experimental technique I developed has been shown

to be able to accurately determine the orientation of the biaxial minerals olivine and

OPX in a thin section. The experimental system images a thin section between

rotating crossed polarizers and determines an experimental extinction angle for nine

sample orientations. By calculating the theoretical extinction angles for a coarse grid

of possible Euler angle orientations and continually narrowing the grid down to the

desired resolution, the best matching Euler angles can be found.

The optical method I developed costs much less than a comparably accurate

scanning electron microscope that uses electron backscattered diffraction. The cost

savings is augmented by the ability to run other analyses from the image data such as

grain size, elongation, and nearest neighbor correlations. Because of the relatively

small size of the setup, it is conceivable that the system could be designed to operate

using a laptop and battery powered rotation stages. This would enable the setup to be

taken into the field, which is currently not possible with the SEM.

Future considerations for this procedure would be an analysis routine that

would be able to decipher the mineral and/or composition of a grain in a sample, given

the known Euler angle orientation determined by some other method (such as the

SEM). The routine would use the known Euler angles to calculate all nine theoretical

extinction angles for many combinations of α, β, and γ. Then, by finding the best

84

matches between the theoretical extinction angles and the experimental extinction

angles, the ratios among the three indices can be determined. A database of known

indices of refraction for all possible minerals and/or compositions would then be

consulted to determine what satisfies the determined ratios of α, β, and γ.

The success of this new application depends on factors such as whether or not

nine sequences are enough to uniquely distinguish two minerals and/or compositions.

However, one way to increase the data is to analyze a polycrystalline sample that

consists of many grains of the same unknown mineral composition with many random

(but still known) orientations. By combining the data it might be possible to constrain

the indices of refraction enough to find unique numbers for α, β, and γ, instead of just

ratios, making a unique solution possible.

As the procedure stands now, it is immediately applicable to geologists and

geophysicists in search of a low cost experimental tool for finding the texture of

biaxial minerals. The only needed parameters outside of those measured by the

system are: the three indices of refraction α, β, and γ, their relationship to the crystal

axes, and a verification that β is consistent with the optical sign. If there is a problem

with β, then it can be adjusted according to the measured value of 2V.

85

6.0 REFERENCES Abramowitz, Mortimer, Matthew J. Parry-Hill, and Michael W. Davidson. Molecular

Expressions Microscopy Primer: Light and Color - Electromagnetic Radiation:

Interactive Java Tutorial. 2003. Olympus America Inc. and The Florida State

University.

30 Mar. 2005 <http://micro.magnet.fsu.edu/primer/java/electromagnetic/>.

Berry, L.G., Brian Mason and R.V. Dietrich. Mineralogy - Concepts, Descriptions,

Determinations. New York: W.H. Freeman & Co., 1983.

Bloss, Donald F. An Introduction to the Methods of Optical Crystallography. New

York: Holt, Rinehart and Winston, 1961.

Callister, William D. Jr. Materials Science and Engineering An Introduction. New

York: John Wiley & Sons, Inc., 2003.

Deer, W.A., R.A. Howie and J. Zussman. An Introduction to the Rock-Forming

Minerals. Harlow, England: Pearson Education Limited, 1992.

Furlong, Kevin. “Earthquakes Illuminate Mantle Under Tibet.” 1996. Earth &

Environmental Systems Institute, Penn State University. 17 May 2005

<http://www.psu.edu/ur/archives/SCIENCETECH/Tibet.html>.

Gribble, Colin D. and Allan J. Hall. Optical Mineralogy Principle and Practice. New

York: Chapman & Hall, 1992.

Hecht, Eugene. Optics. San Francisco: Addison Wesley, 2002.

86

“HKL Technology: EBSD Basics.” 2005 HKL Technology. 27 May 2005

<http://www.hkltechnology.com/?pid=4,18&articles>.

Karato, Shun-ichiro. The Dynamic Structure of the Deep Earth. New Jersey: Princeton

University Press, 2003.

Marion, Jerry B. and Stephen T. Thorton. Classical Dynamics of Particles and

Systems. Fort Worth: Harcourt College Publishers, 1995.

Neese, William D. Introduction to Optical Mineralogy. New York: Oxford University

Press, 2004.

Nelson, Stephen A. “Biaxial Minerals.” Mineralogy. Course Website. 2002.

Tulane University. 30 Mar. 2005 <http://www.tulane.edu/~sanelson/geol211/

biaxial.htm>.

---. “The Isotropic Indicatrix, Isotropic Minerals, and the Immersion Method.”

Mineralogy. Course Website. 2002. Tulane University. 30 Mar. 2005

<http://www.tulane.edu/~sanelson/geol211/isotropic_minerals.htm>.

---. “Uniaxial Minerals, Uniaxial Indicatrix, Optic Sign, & Ray Path.”

Mineralogy. Course Website. 2002. Tulane University. 30 Mar. 2005

<http://www.tulane.edu/~sanelson/geol211/uniaxial_minerals.htm >.

Sen, Gautam. Earth’s Materials: Minerals and Rocks. New Jersey: Prentice Hall, 2001.

Shelley, David. Manual of Optical Mineralogy. New York: Elsevier, 1975.

Weisstein, Eric W. “Rotation Matrix.” MathWorld - A Wolfram Web Resource. 1999.

Wolfram Research Inc. 30 Mar. 2005

<http://mathworld.wolfram.com/RotationMatrix.html>.

87

Wenk, H.-R., Dawson, P.R., et al. “AGU Mantle Video.” Texturing of Rocks in the

Earth’s Mantle. 1999. UC Berkeley Texture Laboratory. 17 May 2005

<http://eps.berkeley.edu/~wenk/TexturePage/AGU-Mantle-Video.htm>.

Wilen, Larry. Private communication. 2003 - 2005.

88

Appendix A A derivation of the corrected rotation matrix. Given the matrix identity relationships: IAA =−1 1111)( −−−− = ABCABC Since mathematically the rotations of phi and psi are in the same plane when θ = 0,

then:

ψ

φψφ

ψψψψ

φφφφ

ψφφψψφφψφψψφψφφψ

φφφφ

ψψψψ

φφφφ

ρ

ρρρ

=

−=

−

+−−+

=

−

−

−=

−

1000cossin0sincos

1000cossin0sincos

1000sinsincoscossincossincos0sincossincossinsincoscos

1000cossin0sincos

1000cossin0sincos

1000cossin0sincos

1

So therefore,

ψθφ

φψφθφ

φφθφφθφψφθφ

φφθφφθφψφθφ

ρρρρρρρρ

ρρρρρρρρρρρ

ρρρρρρρρρρρρ

=

=

=

=

−

−−−−

−−−−

1

1111

1111corrected )()()(

89

Appendix B Derivation of the formula for the angle between the x-axis and the semi-major axis or

semi-minor axis of an ellipse.

Starting with the equation for an ellipse centered at the origin:

12

2

2

2

=+by

ax

Then rotate the body of the ellipse by an angle θ:

[ ]

[ ]

22

2cos2

2sin2cossin2

2cossin22

2

2sin2

2cos

22

2cos2

2sin2cossin

2cossin

2cossin

2cossin2

2

2sin2

2cos

2

2cos2

2cossin

2

2sin2

2cossin

2cossin

2

2sin2

2cossin

2

2cos2

2

2cos2

cossin

2

2sin2cossin

2cossin

2

2sin2cossin

2

2cos

cossinsincos

2cos

2sin

2sin

2cos

cossinsincos

21

0

021

cossinsincos

cossinsincos

21

0

021

cossinsincos

yba

xyba

xba

yba

xybaba

xba

b

y

b

xy

a

y

a

xy

b

xy

b

x

a

xy

a

x

yx

b

y

b

x

a

y

a

x

b

y

b

x

a

y

a

x

yx

b

y

b

x

a

y

a

x

yx

b

ayxyx

yx

b

ayx

++−++

=++−+−++

=+−

+++−++

=

+−

++

−++

=−

+−

+

=−

+−+

=−

−

θθθθθθθθ

θθθθθθθθθθθθ

θθθ

θθθθθθθθθ

θθθ

θθθ

θθθθθθ

θθθθθθθθ

θθθθ

θθθθ

θθθθ

θθθθ

90

Define A = coefficient of x2, B = coefficient of xy, and C = coefficient of y2

+=

−=

+= 2

2

2

2

222

2

2

2 cossincossin2cossin2sincosba

Cba

Bba

A θθθθθθθθ

Let’s try to determine what θ is. The following identities from trigonometry are

employed extensively:

θθθθ

θθθθ

θθ

2cos1sin2cossin2cos1cos2sincos

1sincos

222

222

22

−=−=−

=−=−

=+

A good place to start is to simplify things later by combining terms:

( )

+=

++=

++

+=

++

+=+

22

2222

222

222

2

2

2

2

2

2

2

2

11

11sincos

11sin11cos

cossinsincos

ba

ba

baba

babaCA

θθ

θθ

θθθθ

Now let’s try subtraction:

91

( ) ( )

( )

( )

( )θ

θθ

θθθ

θθθ

θθθθ

θθ

θθθθ

2cos11

2cos2cos

1sincos2cos

1cossin2cos

1cossin1sincos

11sin11cos

cossinsincos

22

22

222

2

222

2

222

222

222

222

2

2

2

2

2

2

2

2

−=

−=

−−=

−+=

−+−=

−+

−=

+−

+=−

ba

ba

ba

ba

ba

abba

babaCA

After taking another look at B, it appears I have another simplification that will

eliminate all dependence of A, B, and C on “a” or “b”:

( )

θθθ

θθθ

θ

θθ

2tan2cos2sin

2coscossin2

2cos11

cossin211

22

22

=

=

=

−

−

=−

ba

baCA

B

Now solving for θ is trivial:

−

= −

CAB1tan

21θ

Converting to degrees will add an extra term:

−

= −

CAB1tan

21180

πθ

o

92

Appendix C The coefficients of the 2D ellipse centered at the origin in terms of the rotation angles

φ, θ, ψ, ξ and φinner and the values of a, b, and c.

Equation of an ellipse using primes: 122 =′+′+′ yCxyBxA

Please Note: 222

1,1,1c

Cb

Ba

A ===

** = “to the power of”

A′ =B*cos(xi)**2*cos(psi)**2*cos(phi_in)**2+C*cos(theta)**2-2*cos(phi_in)* cos(theta)*B*cos(psi)*sin(psi)*cos(xi)**2*sin(phi_in)-1*C*cos(xi)**2* cos(phi_in)**2*cos(theta)**2-1*A*cos(psi)**2*cos(xi)**2*cos(theta)**2 +2*cos(xi)*sin(phi_in)*cos(phi)*C*sin(theta)*sin(xi)*cos(theta)+B*cos(xi)**2+A-1*B*cos(psi)**2*cos(theta)**2-1*B*cos(xi)**2*cos(phi_in)**2+C*cos(phi)**2 *cos(xi)**2+C*cos(xi)**2*cos(phi_in)**2+2*A*cos(psi)**2*cos(xi)**2+A*cos(xi)**2*cos(theta)**2+A*cos(psi)**2*cos(theta)**2-2*B*cos(xi)**2*cos(psi)**2-1*B* cos(xi)**2*cos(phi)**2+2*cos(phi_in)*sin(phi)*C*cos(phi)*cos(xi)**2*sin(phi_in)-2*cos(phi_in)*sin(phi)*C*cos(phi)*cos(xi)**2*sin(phi_in)*cos(theta)**2+2*A*cos (psi)**2*cos(phi)**2*cos(xi)**2*cos(phi_in)**2-4*cos(phi)*A*cos(psi)*sin(psi) *cos(theta)*sin(phi)*cos(xi)**2*cos(phi_in)**2-2*cos(xi)*cos(phi_in)*cos(phi)* B*sin(psi)*cos(psi)*sin(xi)*sin(theta)+2*sin(phi)*A*cos(psi)*sin(psi)*cos(theta) *cos(phi)*cos(xi)**2-2*cos(phi_in)*sin(phi)*A*cos(theta)**2*cos(phi)*cos(xi)**2 *sin(phi_in)*cos(psi)**2+2*cos(phi_in)*cos(phi)*B*sin(phi)*cos(xi)**2*sin(phi_in)*cos(psi)**2-2*cos(phi_in)*cos(phi)*B*sin(phi)*cos(xi)**2*sin(phi_in)-2*cos(xi)*sin(phi_in)*cos(phi)*cos(theta)*A*sin(xi)*sin(theta)+2*cos(xi)*sin(phi_in)*cos(phi)*cos(theta)*A*sin(xi)*sin(theta)*cos(psi)**2+B*cos(psi)**2*cos(xi)**2* cos(theta)**2-2*cos(xi)*cos(phi_in)*sin(phi)*cos(theta)*A*sin(xi)*sin(theta)+ 2*cos(phi_in)*cos(theta)*A*sin(psi)*cos(psi)*cos(xi)**2*sin(phi_in)-1*A*cos(psi)**2*cos(xi)**2*cos(phi_in)**2-2*B*cos(psi)**2*cos(theta)**2 *cos(phi)**2*cos(xi)**2*cos(phi_in)**2-2*C*cos(phi)**2*cos(xi)**2 *cos(phi_in)**2-1*C*cos(phi)**2*cos(xi)**2*cos(theta)**2-1*A*cos(theta)**2+2*A*cos(theta)**2*cos(phi)**2*cos(xi)**2*cos(psi)**2

93

*cos(phi_in)**2-2*sin(phi)*B*sin(psi)*cos(psi)*cos(theta)*cos(phi)*cos(xi)**2 +2*cos(xi)*cos(phi_in)*sin(phi)*cos(theta)*A*sin(xi)*sin(theta)*cos(psi)**2+A *cos(theta)**2*cos(phi)**2*cos(xi)**2-2*B*cos(xi)**2*cos(psi)**2*cos(phi)**2 *cos(phi_in)**2+B*cos(psi)**2-4*cos(phi_in)*A*cos(psi)*sin(psi)* cos(theta)*cos(phi)**2*cos(xi)**2*sin(phi_in)-1*A*cos(psi)**2-2*cos(xi)*sin(phi_in)*cos(phi)*cos(theta)*B*cos(psi)**2*sin(xi)*sin(theta)-2*cos(phi_in)*cos(phi)*A*cos(psi)**2*sin(phi)*cos(xi)**2*sin(phi_in)+2*cos(phi_in)*sin(phi)*B*cos(psi)**2*cos(theta)**2*cos(phi)*cos(xi)**2*sin(phi_in)+2*B*cos(xi)**2*cos(phi)**2*cos(phi_in)**2-2*cos(xi)*cos(phi_in)*sin(phi)*cos(theta)*B *cos(psi)**2*sin(xi)*sin(theta)+4*cos(phi)*B*sin(psi)*cos(psi)*cos(theta)*sin(phi) *cos(xi)**2*cos(phi_in)**2+2*cos(xi)*sin(phi_in)*sin(phi)*B*sin(psi)*cos(psi)*sin(xi)*sin(theta)-2*cos(xi)*sin(phi_in)*sin(phi)*A*cos(psi)*sin(psi)*sin(xi)*sin(theta) +4*cos(phi_in)*B*sin(psi)*cos(psi)*cos(theta)*cos(phi)**2*cos(xi)**2 *sin(phi_in)-1*C*cos(theta)**2*cos(xi)**2+2*cos(xi)*cos(phi_in)*sin(phi)*C *sin(theta)*sin(xi)*cos(theta)+2*cos(xi)*cos(phi_in)*cos(phi)*A*cos(psi)*sin(psi)* sin(xi)*sin(theta)+B*cos(xi)**2*cos(psi)**2*cos(phi)**2+2*C*cos(phi)**2*cos(xi)**2*cos(theta)**2*cos(phi_in)**2+2*cos(phi_in)*sin(phi)*A*cos(theta)**2*cos(phi)*cos(xi)**2*sin(phi_in)+B*cos(psi)**2*cos(theta)**2*cos(xi)**2*cos(phi_in)**2+A*cos(theta)**2*cos(xi)**2*cos(phi_in)**2-1*A*cos(theta)**2*cos(xi)**2* cos(phi_in)**2*cos(psi)**2-2*A*cos(theta)**2*cos(phi)**2*cos(xi)**2* cos(phi_in)**2-1*A*cos(theta)**2*cos(phi)**2*cos(xi)**2*cos(psi)**2-1*A*cos(psi)**2*cos(xi)**2*cos(phi)**2+B*cos(psi)**2*cos(theta)**2*cos(phi)**2*cos(xi)**2-1*A*cos(xi)**2; B′ =-2*cos(xi)*cos(phi_in)*C*sin(phi_in)*cos(theta)**2-4*cos(xi)*sin(phi_in)*B* cos(psi)**2*cos(theta)**2*cos(phi)**2*cos(phi_in)+4*cos(xi)*cos(phi_in)*A *cos(psi)**2*cos(phi)**2*sin(phi_in)-8*cos(xi)*B*sin(psi)*cos(psi)*cos(theta)* cos(phi)**2*cos(phi_in)**2+4*cos(xi)*cos(phi)*A*cos(psi)**2*sin(phi) *cos(phi_in)**2+2*cos(xi)*cos(phi)*C*sin(phi)-2*cos(xi)*B*sin(psi)*cos(psi) *cos(theta)+4*cos(xi)*B*sin(psi)*cos(psi)*cos(theta)*cos(phi_in)**2-2*sin(xi)*sin(theta)*B*cos(psi)*sin(psi)*cos(phi)*sin(phi_in)+2*cos(xi)*cos(phi)*B *cos(psi)**2*cos(theta)**2*sin(phi)+8*cos(xi)*cos(phi_in)*cos(phi)*B*sin(psi)* cos(psi)*cos(theta)*sin(phi)*sin(phi_in)+4*cos(xi)*cos(phi)*C*sin(phi)*cos(theta)**2*cos(phi_in)**2-4*cos(xi)*sin(phi)*B*cos(phi)*cos(psi)**2*cos(phi_in)**2+ 4*cos(xi)*sin(phi_in)*A*cos(theta)**2*cos(phi)**2*cos(phi_in)*cos(psi)**2-4*cos(xi)*cos(phi)*C*sin(phi)*cos(phi_in)**2-2*sin(xi)*C*cos(theta)*sin(theta) *cos(phi)*cos(phi_in)+2*sin(xi)*sin(theta)*B*cos(psi)**2*cos(theta)*cos(phi)* cos(phi_in)+2*sin(xi)*C*cos(theta)*sin(theta)*sin(phi)*sin(phi_in)-2*sin(xi) *sin(theta)*B*cos(psi)*sin(psi)*sin(phi)*cos(phi_in)-2*sin(xi)*sin(theta)* B*cos(psi)**2*cos(theta)*sin(phi)*sin(phi_in)+2*sin(xi)*sin(theta)*A*cos(theta) *cos(phi)*cos(phi_in)-2*sin(xi)*sin(theta)*A*cos(theta)*cos(phi)*cos(phi_in) *cos(psi)**2-4*cos(xi)*sin(phi)*B*cos(psi)**2*cos(theta)**2*cos(phi) *cos(phi_in)**2+2*sin(xi)*sin(theta)*A*sin(psi)*cos(psi)*cos(phi)*sin(phi_in)

94

+2*sin(xi)*sin(theta)*A*sin(psi)*cos(psi)*sin(phi)*cos(phi_in)-4*cos(xi) *cos(phi_in)*B*cos(phi)**2*sin(phi_in)*cos(psi)**2+2*cos(xi)*sin(phi_in)*B*cos(phi_in)*cos(psi)**2+2*cos(xi)*cos(phi_in)*B*cos(psi)**2*cos(theta)**2*sin(phi_in)+2*cos(xi)*cos(phi)*A*cos(theta)**2*sin(phi)-4*cos(xi)*cos(phi)*A* cos(theta)**2*sin(phi)*cos(phi_in)**2-8*cos(xi)*cos(phi_in)* cos(phi)*A*cos(psi)*sin(psi)*cos(theta)*sin(phi)*sin(phi_in)-4*cos(xi)*sin(phi_in)*A*cos(theta)**2*cos(phi)**2*cos(phi_in)-2*cos(xi)*sin(phi)*B*cos(phi)+4*cos(xi)*sin(phi)*B*cos(phi)* cos(phi_in)**2+2*cos(xi)*sin(phi)*B*cos(phi)*cos(psi)**2+4*cos(xi)* cos(phi_in)*B*cos(phi)**2*sin(phi_in)+2*sin(xi)*sin(theta)*A*cos(theta)*sin(phi)*sin(phi_in)*cos(psi)**2-2*cos(xi)*cos(phi)*C*sin(phi)*cos(theta)**2+ 4*cos(xi)*cos(theta)*B*cos(psi)*sin(psi)*cos(phi)**2-4*cos(xi)* cos(theta)*A*sin(psi)*cos(psi)*cos(phi)**2-2*sin(xi)*sin(theta)* A*cos(theta)*sin(phi)*sin(phi_in)+2*cos(xi)*A*cos(psi)*sin(psi) *cos(theta)-4*cos(xi)*sin(phi_in)*C*cos(phi)**2*cos(phi_in)+4*cos(xi)*sin(phi_in) *C*cos(phi)**2*cos(phi_in)*cos(theta)**2-4*cos(xi)*A*cos(psi)*sin(psi)*cos(theta) *cos(phi_in)**2-2*cos(xi)*cos(phi_in)*A*cos(theta)**2*sin(phi_in)*cos(psi)**2-2*cos(xi)*cos(phi)*A*cos(theta)**2*sin(phi)*cos(psi)**2+4*cos(xi)*cos(phi)*A *cos(theta)**2*sin(phi)*cos(psi)**2*cos(phi_in)**2+2*cos(xi)*cos(phi_in)*A*cos(theta)**2*sin(phi_in)-2*cos(xi)*sin(phi_in)*B*cos(phi_in)+8 *cos(xi)*A*cos(psi)*sin(psi)*cos(theta)*cos(phi)**2*cos(phi_in)**2-2*cos(xi)*sin(phi_in)*A*cos(psi)**2*cos(phi_in)+2*cos(xi)*cos(phi_in)*C*sin(phi_in)-2*cos(xi)*sin(phi)*A*cos(psi)**2*cos(phi); C ′ =-1*B*cos(phi_in)**2*cos(psi)**2-1*C*cos(theta)**2-2*A*cos(theta)**2 *cos(psi)**2*cos(phi)**2*cos(phi_in)**2+A*cos(psi)**2*cos(phi_in)**2+2*C *cos(phi)**2*cos(phi_in)**2+C*cos(theta)**2*cos(phi)**2+C*cos(theta)**2*cos(phi_in)**2-1*B*cos(phi)**2*cos(psi)**2-1*A*cos(theta)**2*cos(phi_in)**2-2*B*cos(phi_in)**2*cos(phi)**2-1*C*cos(phi)**2+2*cos(phi)*B*sin(psi) *cos(psi)*cos(theta)*sin(phi)+B*cos(phi_in)**2+B*cos(phi)**2-1*C* cos(phi_in)**2+B*cos(psi)**2*cos(theta)**2-2*sin(phi)*A*cos(psi)*sin(psi) *cos(theta)*cos(phi)-1*A*cos(psi)**2*cos(theta)**2+A*cos(psi)**2 *cos(phi)**2-1*B*cos(psi)**2*cos(theta)**2*cos(phi)**2+A*cos(theta)**2-1*A* cos(theta)**2*cos(phi)**2-2*sin(phi_in)*cos(theta)*A*sin(psi)*cos(psi) *cos(phi_in)+4*sin(phi_in)*A*cos(psi)*sin(psi)*cos(theta)*cos(phi)**2*cos(phi_in)-2*sin(phi_in)*sin(phi)*A*cos(theta)**2*cos(phi)*cos(phi_in)+2* sin(phi_in)*cos(theta)*B*cos(psi)*sin(psi)*cos(phi_in)+2*sin(phi_in)*sin(phi)*C*cos(phi)*cos(phi_in)*cos(theta)**2-2*sin(phi_in)*cos(phi)*B*sin(phi) *cos(phi_in)*cos(psi)**2+4*sin(phi)*A*cos(psi)*sin(psi)*cos(theta) *cos(phi)*cos(phi_in)**2-2*sin(phi_in)*sin(phi)*B*cos(psi)**2 *cos(theta)**2*cos(phi)*cos(phi_in)-2*C*cos(phi)**2*cos(phi_in)**2 *cos(theta)**2+2*B*cos(phi_in)**2*cos(psi)**2*cos(phi)**2-2 *A*cos(psi)**2*cos(phi_in)**2*cos(phi)**2+A

95

*cos(theta)**2*cos(psi)**2*cos(phi)**2+A*cos(theta)**2*cos(psi)**2*cos(phi_in)**2+C-2*sin(phi_in)*sin(phi)*C*cos(phi)*cos(phi_in)+2*sin(phi_in)*cos(phi)*B *sin(phi)*cos(phi_in)+2*A*cos(theta)**2*cos(phi)**2*cos(phi_in)**2+2*sin(phi_in)*cos(phi)*A*cos(psi)**2*sin(phi)*cos(phi_in)-1*B*cos(psi)**2*cos(theta)**2* cos(phi_in)**2+2*B*cos(psi)**2*cos(theta)**2*cos(phi)**2*cos(phi_in)**2 -4*sin(phi_in)*B*sin(psi)*cos(psi)*cos(theta)*cos(phi)**2*cos(phi_in)+2 *sin(phi_in)*sin(phi)*A*cos(theta)**2*cos(phi)*cos(phi_in)* cos(psi)**2-4*sin(phi)*B*sin(psi)*cos(psi)*cos(theta)*cos(phi)*cos(phi_in)**2;

Documents

A NOVEL AUTOMATED OPTICAL TECHNIQUE TO ...personal.kent.edu/~shurley/Shawn Hurley HTC Thesis FINAL...biaxial mineral grains in thin sections has been developed. The system takes images