Changes in Forest Communities of the Eastern United States

Jonathan Knott, Trenton Ford, Chathurangi Pathiravasan

Purdue University, University of Notre Dame, Southern Illinois University

[email protected], [email protected], [email protected]

May 25, 2018

Forest Team (CSoI) Workshop: Introduction to Data Science May 25, 2018 1 / 19

Motivation

Imagine you’re walking through a forest...

Research Goals

Identify the main forest communities of the Eastern U.S.

Assess how they have changed at two scales:

1 Species level (reasons: species loss/local extinction, species gain/invasion, and economic value)

2 Community level (reasons: ecosystem functioning, loss of forests/habitat types, and species interactions)

Latent Dirichlet Allocation (LDA)

In the Latent Dirichlet Allocation (LDA) topic model, the frequency and co-occurrence of words in text segments define concepts [Blei et al., 2003].

LDA has recently been used to define communities from the frequency and co-occurrence of species in sampling units [Valle et al., 2014].
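
To make the analogy concrete: when LDA is applied to species data, each sampling unit plays the role of a document, each community the role of a topic, and each species the role of a word. The sketch below only illustrates that mixture structure; it does not fit a model. The community names, species shares, and mixing proportions are hypothetical values chosen for illustration.

```python
# Hypothetical community-by-species distributions (phi); each row sums to 1.
phi = {
    "boreal": {"White Spruce": 0.5, "Paper Birch": 0.4, "White Oak": 0.1},
    "oak":    {"White Spruce": 0.0, "Paper Birch": 0.2, "White Oak": 0.8},
}

# Hypothetical community proportions (theta) for one sampling unit; sums to 1.
theta = {"boreal": 0.25, "oak": 0.75}


def expected_composition(theta, phi):
    """Mix the community distributions by the unit's community proportions."""
    species = sorted({s for dist in phi.values() for s in dist})
    return {
        s: sum(theta[k] * phi[k].get(s, 0.0) for k in theta)
        for s in species
    }


composition = expected_composition(theta, phi)
for name, share in composition.items():
    print(f"{name}: {share:.3f}")
```

Fitting LDA runs this logic in reverse: given observed species abundances per sampling unit, it estimates both the community proportions and the species distribution of each community.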

BigCLAM Clustering Algorithm

Cluster Affiliation Model for Big Networks (BigCLAM), available in the Stanford Network Analysis Project (SNAP) [Yang and Leskovec, 2013]

It is a popular graph-mining algorithm capable of finding overlapping communities in networks containing millions of nodes and edges.

Squares = nodes = species

Circles = clusters = communities

Lines = cluster/community membership
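
As a rough illustration of how overlapping membership works in BigCLAM: each node carries a nonnegative affiliation strength for every community, and two nodes are linked with probability 1 - exp(-F_u · F_v), so nodes that share strong affiliations are more likely to be connected [Yang and Leskovec, 2013]. The sketch below evaluates that probability only; the affiliation values are hypothetical, and the fitting step that BigCLAM performs to estimate them is not shown.

```python
import math

# Hypothetical nonnegative affiliation strengths of species (nodes)
# to two communities; real values would be estimated by BigCLAM.
affiliation = {
    "Paper Birch":   [1.2, 0.0],
    "Quaking Aspen": [0.9, 0.4],
    "White Oak":     [0.0, 1.5],
}


def edge_probability(f_u, f_v):
    """Link probability under the BigCLAM model: 1 - exp(-F_u . F_v)."""
    dot = sum(a * b for a, b in zip(f_u, f_v))
    return 1.0 - math.exp(-dot)


p_birch_aspen = edge_probability(affiliation["Paper Birch"],
                                 affiliation["Quaking Aspen"])
p_birch_oak = edge_probability(affiliation["Paper Birch"],
                               affiliation["White Oak"])
print(p_birch_aspen, p_birch_oak)
```

Species with no shared community have a zero dot product and therefore zero link probability, while a species with nonzero strength in several communities belongs to all of them at once.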

Forest Inventory and Analysis (FIA)

Approx. 80,000 plots in the eastern U.S.

  Collected by U.S. Forest Service

  ≥200 species; 79 selected for this project

Compiled for two time periods (varies by state)

  T1: 1980-1993

  T2: 2013-2015

  Date range for complete coverage

Aggregated to a hexagon sample unit (∼2400)

  Reduces sampling bias

  Accounts for fuzzed and swapped Lat/Lon from USFS

Figure: FIA plots (blue dots) and hexagon sample units
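
The slides do not describe how plots were aggregated, so the following is only a minimal sketch of one plausible approach: sum stem counts and basal area per species within each hexagon, then express each species as a share of the hexagon total. The record layout, hexagon IDs, and numbers are all hypothetical.

```python
from collections import defaultdict

# Hypothetical plot-level records: (hexagon id, species, stem count, basal area).
plots = [
    ("hex_001", "Paper Birch", 30, 2.0),
    ("hex_001", "Quaking Aspen", 10, 1.0),
    ("hex_001", "Paper Birch", 20, 1.0),
    ("hex_002", "White Oak", 15, 4.0),
]


def aggregate_by_hexagon(records):
    """Sum stems and basal area per (hexagon, species)."""
    totals = defaultdict(lambda: defaultdict(lambda: [0.0, 0.0]))
    for hex_id, species, stems, basal_area in records:
        totals[hex_id][species][0] += stems
        totals[hex_id][species][1] += basal_area
    return totals


def relative_abundance(species_totals):
    """Return {species: (relative stem density, relative basal area)} for one hexagon."""
    stem_sum = sum(v[0] for v in species_totals.values())
    ba_sum = sum(v[1] for v in species_totals.values())
    return {sp: (v[0] / stem_sum, v[1] / ba_sum)
            for sp, v in species_totals.items()}


hexagons = aggregate_by_hexagon(plots)
rel_hex_001 = relative_abundance(hexagons["hex_001"])
print(rel_hex_001)
```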

Abundance measures - For T1 and T2

LDA with Importance Value

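\[ \text{Importance Value (IV)} = \frac{\text{rel. stem density} + \text{rel. basal area}}{2} \]

LDA with Species Dominance Index [Costanza et al., 2017]

\[ \text{Species Dominance Index} = \frac{\text{IV} + \dfrac{1}{\text{no. species in hex}} + \text{THC}}{3} \]

\[ \text{THC (the tendency toward high cover)} = \begin{cases} 1 & \text{for IV} \geq 0.25 \text{ and IV} = \max(\text{IV}) \text{ in the hexagon} \\ 0 & \text{otherwise} \end{cases} \]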

BigCLAM with edge list

List of species overlap in each hexagon
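
The three inputs on this slide can be sketched as follows, under stated assumptions. IV is taken as the mean of relative stem density and relative basal area, and SDI as the mean of IV, the inverse of the number of species in the hexagon, and THC, where THC is 1 only for a species whose IV is at least 0.25 and is the largest in the hexagon. The edge-list construction (one edge per pair of species sharing a hexagon) is an assumed reading of "species overlap", and the hexagon data are hypothetical.

```python
from itertools import combinations


def importance_value(rel_density, rel_basal_area):
    """IV = (relative stem density + relative basal area) / 2."""
    return (rel_density + rel_basal_area) / 2


def species_dominance_index(iv_by_species):
    """SDI per species = (IV + 1 / number of species in hexagon + THC) / 3."""
    n_species = len(iv_by_species)
    max_iv = max(iv_by_species.values())
    sdi = {}
    for species, iv in iv_by_species.items():
        # THC is 1 only for the top-IV species, and only if its IV >= 0.25.
        thc = 1 if (iv >= 0.25 and iv == max_iv) else 0
        sdi[species] = (iv + 1 / n_species + thc) / 3
    return sdi


def cooccurrence_edges(species_by_hexagon):
    """One edge per pair of species found together in at least one hexagon."""
    edges = set()
    for species in species_by_hexagon.values():
        edges.update(combinations(sorted(species), 2))
    return sorted(edges)


# Hypothetical hexagon: species -> (relative stem density, relative basal area).
rel = {
    "Paper Birch": (0.6, 0.5),
    "Quaking Aspen": (0.3, 0.3),
    "Tamarack": (0.1, 0.2),
}
iv = {sp: importance_value(d, b) for sp, (d, b) in rel.items()}
sdi = species_dominance_index(iv)
edges = cooccurrence_edges({
    "hex_001": set(rel),
    "hex_002": {"Tamarack", "White Spruce"},
})
print(iv, sdi, edges, sep="\n")
```

Under this sketch, species tied for the maximum IV would all receive THC = 1; whether the original analysis handled ties that way is not stated.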

Methodology

Results - Communities

Results - Community Location

Results - IV vs. SDI at T1

Results - T1 vs. T2 (IV)

Results

Largest Overlapping Communities with Exclusive Species

Table: T1 — LDA

Species         Communities
Balsam Poplar   2
Paper Birch     2
Quaking Aspen   2
Tamarack        2
White Spruce    2

Table: T2 — LDA

Species         Communities
Balsam Poplar   2
Black Ash       2
Paper Birch     2
Quaking Aspen   2
Tamarack        2
White Spruce    2
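
Treating the species in the two tables as sets makes the difference between time periods explicit. This is a small illustrative comparison using only the species names listed for T1 and T2; the variable names are arbitrary.

```python
# Species listed in the T1 and T2 LDA tables.
t1_species = {"Balsam Poplar", "Paper Birch", "Quaking Aspen",
              "Tamarack", "White Spruce"}
t2_species = {"Balsam Poplar", "Black Ash", "Paper Birch",
              "Quaking Aspen", "Tamarack", "White Spruce"}

only_in_t2 = sorted(t2_species - t1_species)
only_in_t1 = sorted(t1_species - t2_species)

print("Only in T2:", only_in_t2)  # ['Black Ash']
print("Only in T1:", only_in_t1)  # []
```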

Results - Black Ash

Conclusions

High concordance among the LDA model with IV, the LDA model with SDI, and the BigCLAM model

Close (but not perfect) relationship between T1 and T2: evidence of forest community change

Possible evidence of community response to Emerald Ash Borer invasion

Future Directions

Determine the best number of communities to describe the data set using bootstrapping methods (currently k = 16, selected by AIC).

Assess "goodness-of-fit" for LDA and BigCLAM by incorporating silhouette or other measures to validate consistency within clusters.

Interpret results (such as the Black Ash reduction) in an ecological context.

Predict forest changes using improved clustering methods (hierarchical/k-means clustering) [Costanza et al., 2017].

Investigate factors that affect communities (climate change, land use change, management practices, etc.).
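
For reference, the silhouette measure mentioned above can be sketched as follows. For each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to any other cluster, and the silhouette width is (b - a) / max(a, b); values near 1 indicate tight, well-separated clusters. This is the standard definition for hard cluster labels with Euclidean distance. How it would be adapted to the soft memberships produced by LDA or BigCLAM is left open here, and the points are hypothetical.

```python
import math


def silhouette(points, labels):
    """Mean silhouette width; assumes >= 2 clusters, each with >= 2 points."""
    def mean_dist(i, members):
        return sum(math.dist(points[i], points[j]) for j in members) / len(members)

    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)

    scores = []
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        a = mean_dist(i, own)  # cohesion within own cluster
        b = min(mean_dist(i, members)  # separation from nearest other cluster
                for other, members in clusters.items() if other != lab)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)


# Hypothetical 2-D points forming two well-separated groups.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.0)]
score_good = silhouette(points, [0, 0, 1, 1])
score_bad = silhouette(points, [0, 1, 0, 1])
print(score_good, score_bad)
```

Comparing the two labelings shows the intended use: the grouping that matches the spatial structure scores close to 1, while the mismatched grouping scores below 0.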

Chathurangi Pathiravasan, Southern Illinois University

[email protected]

Trenton Ford, University of Notre Dame

[email protected]

Jonathan Knott, Purdue University

[email protected]

References

Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3(Jan):993–1022.

Costanza, J. K., Coulston, J. W., and Wear, D. N. (2017). An empirical, hierarchical typology of tree species assemblages for assessing forest dynamics under global change scenarios. PLoS ONE, 12(9):e0184062.

Valle, D., Baiser, B., Woodall, C. W., and Chazdon, R. (2014). Decomposing biodiversity data using the latent Dirichlet allocation model, a probabilistic multivariate statistical method. Ecology Letters, 17(12):1591–1601.

Yang, J. and Leskovec, J. (2013). Overlapping community detection at scale: a nonnegative matrix factorization approach. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, pages 587–596. ACM.

Questions?
