12
Growth rate and acceleration analysis of the COVID-19 pandemic reveals the effect of public health measures in real time Yuri Tani Utsunomiya 1,2* , Adam Taiti Harth Utsunomiya 2 , Rafaela Beatriz Pintor Torrecilha 2 , Silvana de Cássia Paula 2 , Marco Milanesi 2 , José Fernando Garcia 1,2,3 1 Department of Support, Production and Animal Health, School of Veterinary Medicine of Araçatuba, São Paulo State University (Unesp), 16050-680 R. Clovis Pestana 793 - Dona Amelia, Araçatuba/SP Brazil; 2 International Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and Bioinformatics, 16050-680 R. Clovis Pestana 793 - Dona Amelia, Araçatuba/SP, Brazil; 3 Department of Preventive Veterinary Medicine and Animal Reproduction, School of Agricultural and Veterinarian Sciences, São Paulo State University (Unesp), 14884-900 Via de Acesso Prof. Paulo Donato Castellane s/n - Jaboticabal/SP, Brazil; *corresponding author: [email protected] ABSTRACT Background: Ending the COVID-19 pandemic is arguably one of the most prominent challenges in recent human history. Following closely the growth dynamics of the disease is one of the pillars towards achieving that goal. Objective: We aimed at developing a simple framework to facilitate the analysis of the growth rate (cases/day) and growth acceleration (cases/day²) of COVID-19 cases in real-time. Methods: The framework was built using the Moving Regression (MR) technique and a Hidden Markov Model (HMM). The dynamics of the pandemic was initially modeled via combinations of four different growth stages: lagging (beginning of the outbreak), exponential (rapid growth), deceleration (growth decay) and stationary (near zero growth). A fifth growth behavior, namely linear growth (constant growth above zero), was further introduced to add more flexibility to the framework. An R Shiny application was developed, which can be accessed at http://www.theguarani.com.br/covid-19 or downloaded from https://github.com/adamtaiti/SARS- CoV-2 . The framework was applied to data from the European Center for Disease Prevention and Control (ECDC), which comprised 853,200 cases reported worldwide as of April 2 nd 2020. Results: We found that the impact of public health measures on the prevalence of COVID-19 could be perceived in seemingly real-time by monitoring growth acceleration curves. Restriction to human mobility produced detectable decline in growth acceleration within few days, deceleration within ~2 weeks and near-stationary growth within ~6 weeks. Countries exhibiting different permutations of the five growth stages indicated that the evolution of COVID-19 prevalence is more complex and dynamic than previously appreciated. Conclusions: These results corroborate that mass social isolation is a highly effective measure against the dissemination of SARS-CoV-2, as previously suggested. Apart from the analysis of prevalence partitioned by country, the proposed framework is easily applicable to city, state, region and arbitrary territory data, serving as an asset to monitor the local behavior of COVID-19 cases. Keywords: Coronavirus, Severe Acute Respiratory Syndrome, Growth Curve Analysis, Mathematical Modeling, Moving Regression, Hidden Markov Model. INTRODUCTION The World Health Organization (WHO) officially declared Coronavirus Disease (COVID-19) a global pandemic on March 11 th 2020 (1). The disease is caused by the novel Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) (2,3), which seems to have first emerged in Wuhan, China on December 12 th 2019 (4,5). Worldwide dissemination has been extremely rapid, and by the time this study was completed (April 2 nd 2020) a total of 928,437 cases and 46,891 deaths had been reported across 204 countries and territories according to data from the European Center for Disease Prevention and Control (ECDC) (6). Approximately 86% of all cases are 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688 doi: medRxiv preprint NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.

Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

Growth rate and acceleration analysis of the COVID-19 pandemic reveals theeffect of public health measures in real time

Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2,Silvana de Cássia Paula2, Marco Milanesi2, José Fernando Garcia1,2,3

1Department of Support, Production and Animal Health, School of Veterinary Medicine of Araçatuba, São Paulo StateUniversity (Unesp), 16050-680 R. Clovis Pestana 793 - Dona Amelia, Araçatuba/SP Brazil; 2International AtomicEnergy Agency (IAEA) Collaborating Centre on Animal Genomics and Bioinformatics, 16050-680 R. Clovis Pestana793 - Dona Amelia, Araçatuba/SP, Brazil; 3Department of Preventive Veterinary Medicine and Animal Reproduction,School of Agricultural and Veterinarian Sciences, São Paulo State University (Unesp), 14884-900 Via de Acesso Prof.Paulo Donato Castellane s/n - Jaboticabal/SP, Brazil; *corresponding author: [email protected]

ABSTRACT

Background: Ending the COVID-19 pandemic is arguably one of the most prominent challenges inrecent human history. Following closely the growth dynamics of the disease is one of the pillarstowards achieving that goal.Objective: We aimed at developing a simple framework to facilitate the analysis of the growth rate(cases/day) and growth acceleration (cases/day²) of COVID-19 cases in real-time.Methods: The framework was built using the Moving Regression (MR) technique and a HiddenMarkov Model (HMM). The dynamics of the pandemic was initially modeled via combinations offour different growth stages: lagging (beginning of the outbreak), exponential (rapid growth),deceleration (growth decay) and stationary (near zero growth). A fifth growth behavior, namelylinear growth (constant growth above zero), was further introduced to add more flexibility to theframework. An R Shiny application was developed, which can be accessed athttp://www.theguarani.com.br/covid-19 or downloaded from https://github.com/adamtaiti/SARS-CoV-2. The framework was applied to data from the European Center for Disease Prevention andControl (ECDC), which comprised 853,200 cases reported worldwide as of April 2nd 2020.Results: We found that the impact of public health measures on the prevalence of COVID-19 couldbe perceived in seemingly real-time by monitoring growth acceleration curves. Restriction tohuman mobility produced detectable decline in growth acceleration within few days, decelerationwithin ~2 weeks and near-stationary growth within ~6 weeks. Countries exhibiting differentpermutations of the five growth stages indicated that the evolution of COVID-19 prevalence is morecomplex and dynamic than previously appreciated.Conclusions: These results corroborate that mass social isolation is a highly effective measureagainst the dissemination of SARS-CoV-2, as previously suggested. Apart from the analysis ofprevalence partitioned by country, the proposed framework is easily applicable to city, state, regionand arbitrary territory data, serving as an asset to monitor the local behavior of COVID-19 cases.

Keywords: Coronavirus, Severe Acute Respiratory Syndrome, Growth Curve Analysis,Mathematical Modeling, Moving Regression, Hidden Markov Model.

INTRODUCTION

The World Health Organization (WHO) officially declared Coronavirus Disease (COVID-19) aglobal pandemic on March 11th 2020 (1). The disease is caused by the novel Severe AcuteRespiratory Syndrome Coronavirus 2 (SARS-CoV-2) (2,3), which seems to have first emerged inWuhan, China on December 12th 2019 (4,5). Worldwide dissemination has been extremely rapid,and by the time this study was completed (April 2nd 2020) a total of 928,437 cases and 46,891deaths had been reported across 204 countries and territories according to data from the EuropeanCenter for Disease Prevention and Control (ECDC) (6). Approximately 86% of all cases are

1

2

3456789

10111213

14

15161718192021222324252627282930313233343536373839404142

43

44

4546474849505152

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.

Page 2: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

estimated to have been undocumented prior to the cordon sanitaire in China (7), which suggests thatthe disease might be also substantially under-reported in other countries. Nevertheless, partialCOVID-19 prevalence data are still an invaluable resource to help monitoring and controlling thedisease. In particular, extracting daily estimates of growth rate (cases/day) and acceleration(cases/day²) in disease dissemination from real-time case reports can be decisive for an effectiveand promptly action to restrain further contagion. Here we report the development of a simpleframework dedicated to the real-time analysis of COVID-19 prevalence. This framework was builtusing a combination of Moving Regression (MR) and Hidden Markov Model (HMM), and wasdeployed as a Shiny (8) application in R (9). Here we show the utility of that framework in theanalysis of publicly available COVID-19 case reports that are updated daily by the ECDC.

RESULTS AND DISCUSSION

For simplicity, assume that the cumulative number of COVID-19 cases over time (i.e., the growthcurve) in a specific country or territory follows an unknown sigmoidal function (Figure 1a). Suchassumption is common in the analysis of growth data and has been applied to a wide range ofproblems, from tumor (10) to bacterial (11) growth. Although empirical data from China (Figure1b) and South Korea (Figure 1c) seemed to support it well, that assumption will be substantiallyrelaxed later in our framework to accommodate complex dynamics in the evolution of COVID-19prevalence. We define growth rate and growth acceleration as the first and second order derivatives,respectively, of the prevalence of COVID-19 in respect to time. In our framework, we selected MRto approximate these derivatives over competing models that are frequently used to describe thebehavior of sigmoidal growth curves, such as the Gompertz model (12,13), because: (i) it isdependent on a single free parameter, the “smooth factor”, which represents the number ofneighboring days used in local regression; (ii) growth rate and acceleration estimates areapproximated by ordinary least squares equations, which are computationally inexpensive; (iii) weperformed extensive simulations of growth curves and found that it produces reasonably accurateestimates of growth rate (median R2 = 0.99 with smooth factor of 3) and acceleration (median R2 =0.92 with smooth factor of 3) (Figure 2); (iv) it is very robust to departures from sigmoidal curves;and (v) it does not rely on observations of the whole curve to produce instantaneous growth rate andacceleration estimates, and thus can produce such estimates in near real time. Argument (v) isespecially relevant to the analysis of COVID-19 data since the pandemic is ongoing and eachcountry will be at a different stage of the growth curve as time passes. A clear disadvantage of MRis that it may over-fit the growth curve to the data, especially if the selected smooth factor is small(say < 3), in which case accurate prediction of new cases of COVID-19 is limited to very few daysin the future. Still, even single-day predictions can be of great use during a pandemic if reasonablyaccurate. In the ECDC data set, a forward validation showed that single-day predictions weresufficiently accurate (R² ~ 0.95) (Figure 3).

Sigmoidal growth curves can be partitioned into four easily distinguishable stages (Figure 1a): (a)the lagging stage, which corresponds to the beginning of the outbreak or disease importation, wherethe number of cases are low and increase only marginally every day; (b) the exponential stage,when growth starts accelerating and the number of new cases increase rapidly day-by-day; (c) thedeceleration stage, where the number of new cases reduces daily and tends to asymptote; and (d) thestationary stage, characterized by stagnation of the prevalence with sporadic new cases occurringeach day. The growth rate graph is approximately bell-shaped, with its peak corresponding to theinflection of the exponential stage. This inflection point signals the beginning of a decline in thegrowth rate. The growth acceleration graph usually consists of a combination of two bell-shapedcurves: the first one with a peak and the second with a valley. The peak indicates the point whereacceleration starts descending towards zero. The moment when acceleration is exactly zerocoincides with the inflection of the exponential stage, which marks the beginning of growth

5354555657585960616263

64

65

66676869707172737475767778798081828384858687888990919293949596979899

100101102103

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 3: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

deceleration (i.e., negative acceleration). The latter corresponds to the entire concave section of thecurve, but the very bottom of the valley indicates that the prevalence is moving towards stagnation.

In spite of sigmoidal curves following the four above described stages sequentially, we anticipatedthat the growth of COVID-19 cases may not necessarily obey this sequence in practice, since thedynamics of the disease is likely complex and highly responsive to the implementation or relaxationof public health measures. This implies that a country that has already reached a stationary stagecould resume exponential growth, for example by seeding a new outbreak via importation.Likewise, decelerating countries could as well regain acceleration by relaxing prevention measures.Furthermore, some countries may face multiple cycles of acceleration and deceleration prior toreaching a stationary growth. These scenarios could produce more complex growth curves thatdeviate from the sigmoidal shape by mounting different arrangements of exponential, decelerationand stationary stages. Of note, MR has sufficient flexibility to model these complex scenarios andcan easily accommodate curves exhibiting arbitrary permutations of these four stages. In addition,the near-zero acceleration that is intimately related to the stationary stage in sigmoidal curves couldalso arise from a non-zero constant growth rate in practice. In such cases, the growth curve wouldexhibit a linear pattern, which can be interpreted as a fifth growth stage that is not observed inclassic sigmoidal functions. Such linear pattern may appear if the deceleration stage does not forman enough deep valley prior to acceleration rising up again towards zero. Again, MR is capable ofmodeling these anomalous behaviors. In this study we sought to ascertain whether these five stagesof growth curves could have direct implications in understanding the dynamics of COVID-19prevalence both globally and locally. We further developed a Hidden Markov Model (HMM) toautomate the detection of transitions between stages in the growth curve using acceleration andgrowth rate data obtained with MR as input (see Material and Methods).

Using MR and HMM on ECDC data frozen on April 2nd 2020, we evaluated the utility of theframework in identifying countries reaching deceleration or stationary growth. We also looked forcountries presenting complex arrangements of the five growth stages. The countries found to havereached a near-stationary stage at some point were China and South Korea (Figure 1b-c). Byprojecting official government announcements against the fitted curves of these countries, weobserved that decline in growth acceleration occurred shortly after the implementation of measuresthat drastically reduced human movement. Deceleration of growth was achieved within 2 weeks andthe prevalence plateaued within 6 weeks. These results indicated that: (i) the effect of public healthmeasures on SARS-CoV-2 prevention can be detected in seemingly real time by monitoring thebehavior of acceleration curves; and (ii) restriction to human mobility is very effective incontrolling the spread of the disease, but takes several weeks to produce a stationary growth. Thesefindings are in line with a recent study showing that human mobility explained early growth anddecline of new cases of COVID-19 in China (14). As discussed before, one should not immediatelyassume that a country in stationary growth will remain in that stage, since acceleration could takeoff again if new cases are imported or preventive measures are relaxed. In fact, our HMM classifiercategorized the apparent stationary growth stage of China and South Korea as a linear growth.Indeed, both countries have not reached a perfect asymptote and their COVID-19 prevalences areinstead growing in a linear pattern.

To illustrate the utility of the framework in detecting acceleration decay, we decided to look moreclosely to data from three countries: United States of America (USA), Brazil and Italy (Figure 4).The latter has been severely impacted with the disease, and by the time we completed our study thecountry had recorded 110,574 cases and 13,157 deaths. On March 10 th 2020 Italy implemented astrict quarantine, and five days later the country reached its maximum acceleration and started tomove towards an inflection of the exponential growth. On March 25th Italy further implemented acomplete shut down of its borders, and our analysis showed that the country started to decelerate onMarch 26th. Brazil and USA both showed brief declines in acceleration between March 20th and 25th,

104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 4: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

but were still found in exponential growth as of April 2nd. In fact, Brazil resumed accelerationincline on March 26th. Still, since acceleration response to effective measures seems rapid, bothBrazil and USA could begin a deceleration process within few weeks if effective measures areimplemented are rigorously followed. To date, Brazil has accumulated 6,836 cases and 241 deaths,whereas USA has recorded 216,721 cases and 5,138 deaths.

To exemplify the ability of our framework in detecting complex arrangements of growth stages thatsubstantially deviate from the sigmoidal model, we selected Denmark, Estonia and Qatar (Figure4). All three countries experienced deceleration phases that did not culminate in stationary growth.Instead, a brief linear growth was formed after deceleration, which was followed by a newexponential growth stage. Estonia has further entered a new deceleration phase on March 27 th.These observations indicate that the growth dynamics of COVID-19 cases is more complex thanpreviously appreciated. Therefore, analyzing the raw growth curve alone, dissociated from itsderivatives, is very limiting for inference and may hamper the understanding of the pandemicevolution. In part, the lack of combined analysis of growth rate and acceleration in this pandemic isto be blamed on scarce availability of tailor made, user-friendly software. To aid to the analysis ofgrowth rate and acceleration of COVID-19 cases, we built a web application using R (9) and Shiny(8). This application automatically loads the latest ECDC case reports and applies MR to extractgrowth rate and acceleration from real-time data. The app also performs automated classification ofgrowth stages with HMM (albeit free parameters should be manually tuned for improved results).Users are not limited to case reports from ECDC, since the app allows for the upload of custom data(e.g., city, region, province or state), which can be used to monitor the growth behavior of COVID-19 locally. Upon closing of the COVID-19 pandemic, this tool could be further used in the analysisof future outbreaks and epidemics, or even of historical disease data. A limiting factor however isthat the proposed framework relies on updated case reports, such that sub-notification, delayedcommunication and the elapsed time between sample collection, diagnostic results and reportingmay impact the real-time inference of growth dynamics in disease transmission and consequentlyjeopardize the timely detection of transitions in the growth curve. In spite of that limitation, thepresented tool remains highly useful to monitor the growth behavior of epidemics.

CONCLUSIONS

We deployed a simple framework for the real-time analysis of COVID-19 prevalence. We were ableto demonstrate that the real-time decomposition of growth curves of COVID-19 cases into growthrate and acceleration can be a powerful tool to monitor the impact of public health measures on thespread of the disease. We also showed that restrictions to human mobility can significantlydecelerate the incidence of new cases within weeks. Furthermore, we found that the prevalence ofthe disease is more complex and dynamic than previously appreciated. This observation will haveimportant implications to assumptions adopted in mathematical models to predict the evolution ofthe pandemic.

MATERIAL AND METHODS

Moving regression (MR) modelThe MR technique adopted here aimed at fitting a smooth growth curve to the COVID-19prevalence data, such that the resulting curve could describe the cumulative number of cases as afunction of time. For n recorded days in a given country or territory, let x be a n-dimensionalcolumn vector of days since the first case report and y the reciprocal column vector with elementscorresponding to the cumulative number of cases. Relative to day d, we define yd and xd as k-sizedsubset vectors of y and x, respectively, where k = 1 + 2s and s is a free parameter representing thenumber of offset days before and after day d. Hereafter, we refer to s as the “smooth factor”, since itcontrols the compromise between over-smoothing (large s) and over-fitting (small s) the curve to

156157158159160161162163164165166167168169170171172173174175176177178179180181182183184

185

186

187188189190191192193194195196

197

198199200201202203204205206207

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 5: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

the data. Finally, we define Xd = [1k xd], where 1k is a k-dimensional column vector with allelements equal to one. The local growth rate was estimated by ordinary least squares regression:

[md gd ]T = (XdTXd)–1Xd

Tyd (1)

where md is an intercept and gd is the estimated growth rate (cases/day) at day d. In practice, gd

corresponds to an estimate of the instantaneous rate of change in the number of cases at day d,which in turn is an approximation to the first order derivative of the unknown growth functionevaluated at time d. The smoothed growth curve was obtained by calculating fitted values as:

ŷd = Xd[md gd ]T (2)

After fitting equation (1) to all n records, we define g is a vector of size n containing all estimatedlocal growth rates and gd as a k-sized subset vector of g. The local growth acceleration at day d wasthen obtained by adapting equation (1):

[md ad ]T = (XdTXd)–1Xd

Tgd (3)

where ad is the estimated growth acceleration (cases/day²) at day d. Now ad is an estimate of theinstantaneous rate of change of the growth rate at day d, which consequently approximates thesecond order derivative of the unknown growth function evaluated at time d.

Hidden Markov Model (HMM) for growth stage classificationIn order to automate the process of growth stage classification, we built a HMM that usesacceleration data obtained from MR as input. Considering a as the n-dimensional vector ofestimated growth accelerations across n recorded days, we first compute z = sign(a), where sign(.)is a modified sign function which retrieves -1 for a < -c, +1 for a > c and 0 otherwise. Scalar c isdefined as an acceleration cutoff, which is treated here as a free parameter. The objective of theHMM was to generate a sequence of states K = (k1, k2, …, kn) where each element ki takes one of thefollowing values: “lagging”, “exponential”, “deceleration” or “stationary”. The initial probabilitiesfor these hidden states were set to 1, 0, 0 and 0, respectively, assuming that all growth curves startfrom a lagging stage. Now let T be a 4 x 4 matrix of transition probabilities between hidden statesand E be a 4 x 3 matrix of emission probabilities that models the probability of each hidden stateproducing a z value of -1, 0 or +1. We adopted:

T=[0.8 0.2 0.0 0.00.0 0.8 0.2 0.00.0 0.0 0.8 0.20.0 0.2 0.0 0.8

]E=[0.25 0.40 0.250.00 0.10 0.500.50 0.10 0.000.25 0.40 0.25

]The selected values in T only permitted transitions lagging → exponential, exponential→deceleration or deceleration → stationary. Values in E made z = 0 more likely to be produced byeither the lagging or stationary stages, z = +1 more likely to be produced by the acceleration stageand z = -1 more likely to be produced by the deceleration stage. For the atypical transitiondeceleration → exponential, the described model would generate a short and intermediate stationarystep between these two stages. In these cases, the spurious stationary step was replaced by anexponential classification after the HMM has been fitted to the data. The Viterbi algorithmimplemented in the HMM v1.0 package (15) in R (9) was used to estimate the sequence K. Afterprediction of growth stages, stationary classifications were confronted against growth rates. If agiven stationary stage presented a median growth rate greater than the maximum growth rate of thelagging phase, it was re-classified as a “linear” stage.

208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242

243

244245246247248249250251252253254255256

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 6: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

Simulation studyTo test the performance of MR in approximating growth curves and their rate of change andacceleration in scenarios where these curves have been observed only partially (i.e., real-time casereport), we selected a widely used sigmoidal mathematical function, namely the Gompertz model,to generate 50,000 simulated growth curves. We used a parameterization of the Gompertz modelthat is dependent on four parameters:

f(t) = a*exp(-exp(-k(t – d))) (4)

where t is a time point, a is the asymptote (i.e., number of cases at the stationary stage), exp is theexponential function, k is a growth coefficient and d is the time at inflection of the exponentialstage (i.e., time when the growth rate reaches its maximum value and acceleration transitions frompositive to negative). All simulations were performed considering a 100-days period, withparameters sampled as follows: a ~ Uniform(500, 10000), k ~ Uniform(0.05, 0.95) and d ~Uniform(5, 95). Completely stationary curves were discarded. The accuracy of growth rate andacceleration estimates produced by MR with smooth factor ranging from s = 3 to s = 10 were thenevaluated by taking the coefficient of determination (R²) of the regression of true values ontoestimates.

Analysis of COVID-19 case reportsWe analyzed case reports that have been updated daily by the European Center for DiseasePrevention and Control (ECDC). The framework was applied to that data using smooth factorsranging from s = 3 to s = 10. The acceleration curves were clipped at observation n – s to avoid poorgrowth acceleration estimates at the end of the curve. Likewise, the last s days had their growthrates estimated by compounding rates from n – s to n using the acceleration estimated for day n – s.Finally, next-day predictions of COVID-19 prevalence were obtained by summing the last observedprevalence with its estimated growth rate. In order to measure the accuracy of these predictions, weperformed a step-wise simulation by censoring observations ahead of each day, fitting MR to theremaining data and then comparing predicted and true next-day prevalence. Accuracy of predictionswere again measured by linear regression.

Analysis and visualization toolsAll analyses presented in this paper were performed using R version 3.4.4 (9). To visualize thegrowth rate and acceleration of COVID-19 pandemic, we implemented a simple Shiny (8)dashboard application, which offers an intuitive web interface and allow us to be updated on newcases and the prevalence of COVID-19 worldwide. The application automatically loads the latestcase reports from ECDC. Alternatively, users can upload their own data to visualize the growth rateand acceleration of COVID-19 of specific states, provinces, cities or aggregate data from arbitraryterritory definitions. For the implementation we used the following packages: shiny v1.4.0 (16),shinydashboard v0.7.1 (17), shinydashboardPlus v0.7.0 (18), readxl v1.3.1 (19), shinyalert v1.0(20), httr v1.4.1 (21) and plotly v4.9.2 (22), all available on CRAN (Comprehensive R ArchiveNetwork, https://cran.r-project.org/). The application can be downloaded from our GitHubrepository at https://github.com/adamtaiti/SARS-CoV-2. A live instance of the app will bemaintained until the end of the pandemic at https://www.theguarani.com.br/covid-19 .

REFERENCES

1. World Health Organization, Coronavirus disease (COVID-2019) situation reports;https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/ [accessed 25March 2020].

257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303

304

305306307308

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 7: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

2. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The speciesSevere acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming itSARS-CoV-2. Nature Microbiology (2020). 5:536–544 . doi: 10.1038/s41564-020-0695-z.

3. Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF. The proximal origin of SARS-CoV-2. Nature Medicine (2020). doi: 0.1038/s41591-020-0820-9.

4. Zhou P, Yang XL, Wang XG, Hu B, Zhang L, Zhang W, Si HR, Zhu Y, Li B, Huang CL, ChenHD, Chen J, Luo Y, Guo H, Jiang RD, Liu MQ, Chen Y, Shen XR, Wang X, Zheng XS, Zhao K,Chen QJ, Deng F, Liu LL, Yan B, Zhan FX, Wang YY, Xiao GF, Shi ZL. A pneumonia outbreakassociated with a new coronavirus of probable bat origin. Nature (2020). 579:270–273 . doi:10.1038/s41586-020-2012-7.

5. Chen S, Yang J, Yang W, Wang C, Bärnighausen T. COVID-19 control in China during masspopulation movements at New Year. Lancet (2020). 395:764-766. doi: 10.1016/S0140-6736(20)30421-9.

6. European Center for Disease Prevention and Control (ECDC), COVID-2019;https://www.ecdc.europa.eu/en/novel-coronavirus-china [accessed 25 March 2020].

7. Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, Shaman J. Substantial undocumented infectionfacilitates the rapid dissemination of novel coronavirus (SARS-CoV2). Science (2020). doi:10.1126/science.abb3221.

8. RStudio Inc. Shiny: Easy web applications in R. http://shiny.rstudio.com/ [accessed 25 March2020].

9. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation forStatistical Computing, Vienna, Austria. https://www.r-project.org/ [accessed 25 March 2020].

10. Henscheid N, Clarkson E, Myers KJ, Barrett HH. Physiological random processes in precisioncancer therapy. PloS One (2018). 13:e0199823. doi: 10.1371/journal.pone.0199823.

11. Zwietering MH, Jongenburger I, Rombouts FM, van 't Riet K. Modeling of the bacterial growthcurve. Appl Environ Microbiol (1990). 56:1875-81.

12. Winsor CP. The Gompertz Curve as a Growth Curve. Proc Natl Acad Sci U S A (1932). 18:1-8.doi: 10.1073/pnas.18.1.1.

13. Tjørve KMC, Tjørve E. The use of Gompertz models in growth analyses, and new Gompertz-model approach: An addition to the Unified-Richards family. PLoS One (2017). 12:e0178691. doi:10.1371/journal.pone.0178691.

14. Kraemer MUG, Yang CH, Gutierrez B, Wu CH, Klein B, Pigott DM, Open COVID-19 DataWorking Group, du Plessis L, Faria NR, Li R, Hanage WP, Brownstein JS, Layan M, Vespignani A,Tian H, Dye C, Pybus OG, Scarpino SV. The effect of human mobility and control measures on theCOVID-19 epidemic in China. Science (2020). doi: 10.1126/science.abb4218.

15. Himmelmann l. HMM - Hidden Markov Models. R package version 1.0. https://CRAN.R-project.org/package=HMM [acessed 25 March 2020].

309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 8: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

16. Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J. shiny: Web Application Framework for R. Rpackage version 1.4.0. (2019) https://CRAN.R-project.org/package=shiny [accessed 25 March2020]

17. Chang W, Ribeiro BB. shinydashboard: Create Dashboards with 'Shiny'. R package version0.7.1. (2018) https://CRAN.R-project.org/package=shinydashboard [accessed 25 March 2020]

18. Granjon D. shinydashboardPlus: Add More 'AdminLTE2' Components to 'shinydashboard'. Rpackage version 0.7.0. (2019) https://CRAN.R-project.org/package=shinydashboardPlus [accessed25 March 2020]

19. Wickham H, Bryan J. readxl: Read Excel Files. R package version 1.3.1. (2019)https://CRAN.R-project.org/package=readxl [accessed 25 March 2020]

20. Attali D, Edwards T. shinyalert: Easily Create Pretty Popup Messages (Modals) in 'Shiny'. Rpackage version 1.0. (2018) https://CRAN.R-project.org/package=shinyalert [accessed 25 March2020]

21. Wickham H. httr: Tools for Working with URLs and HTTP. R package version 1.4.1. (2019)https://CRAN.R-project.org/package=httr [accessed 25 March 2020]

22. Sievert C. plotly for R. (2018) https://plotly-r.com [accessed 25 March 2020]

ACKNOWLEDGMENTS

We would like to express our highest gratitude to all health agents and individuals around the globewho were involved in reporting cases and making COVID-19 prevalence data available to thepublic. This study did not receive financial support and was conducted during voluntary socialisolation.

AUTHOR CONTRIBUTIONS

Y.T.U. conceived the study, performed simulations, coordinated the data analysis and wrote themanuscript. A.T.H.U. built R code for data analysis and programmed the Shiny App Dashboard.R.B.P.T., S.C.P., M.M. and J.F.G. revised growth curves for all countries/territories and pinpointeddates of measures taken by them to reduce human mobility. All authors revised and agreed with thecontents of the manuscript.

COMPETING INTERESTS

The authors declare no competing interests.

DATA AVAILABILITY

The data in this study were obtained from the European Center for Disease Prevention and Control(ECDC) and are publicly available at https://opendata.ecdc.europa.eu/covid19/casedistribution/csv(accessed on April 2nd 2020). The source code for the R Shiny application used for data analysis isfound in our GitHub repository: https://github.com/adamtaiti/SARS-CoV-2. A live instance of theapp can be accessed at http://www.theguarani.com.br/covid-19.

361362363364365366367368369370371372373374375376377378379380381382383384

385

386387388389390391

392

393394395396397398399

400

401402403

404

405406407408409410

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 9: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

FIGURE 1 | Growth rate and acceleration in China and South Korea. (A) Theoretical modelexemplified by simulated data using a four-parameters Gompertz model with an asymptote at80,000, growth coefficient of 0.15, inflection time at 35, and time ranging from 1 to 80. (B) Fittedcurves for China between December 31st 2019 and April 2nd 2020. The first red dot marks themidpoint between January 23rd and 24th 2020, when a strict cordon sanitaire was imposed to Wuhan,Shanghai, Jiangsu and Hainan. The second red dot pinpoints February 4th 2020, when the cordonwas extended to a larger portion of the eastern part of China. (c) Fitted curves for South Koreabetween January 20th and April 2nd 2020. The red dot is placed between February 20 th and 21st 2020,when a collection of restrictions to human mobility was imposed, including lockdown of Daegucity, suspension of flights, cancellation of mass gatherings and lockdown of all South Koreanmilitary bases. The apparent stationary stage of both countries is in reality classified as lineargrowth by our framework, since the approximately constant growth rate in this period is larger thanthe maximum growth rate observed during the lagging stage.

412413414415416417418419420421422423424

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 10: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

FIGURE 2 | Accuracy (R2) of moving regression estimates of growth rate and growthacceleration from 50,000 simulated Gompertz growth curves.

FIGURE 3 | Accuracy (R2) of moving regression predictions of next-day COVID-19prevalence.

426427428429

431432

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 11: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

FIGURE 4 | Growth rate and acceleration in the United States of America (USA), Brazil andItaly. By the time the study was completed, the governments of the United States of America (a)and Brazil (b) had not announced severe strict measures to restrain human movement, such as alockdown. As a measure to reduce movement, these two countries had primarily focused onpreventing mass gatherings, besides of closing schools, nurseries, universities and other places thatfacilitate agglomeration, such as shopping malls. This is in contrast with Italy (c), which imposedstrict quarantine on March 10th 2020 (first red dot) and closure of borders on March 25th 2020(second red dot). The country is currently in growth deceleration.

434435436437438439440441

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint

Page 12: Growth rate and acceleration analysis of the COVID-19 ...€¦ · 30-03-2020  · Yuri Tani Utsunomiya1,2*, Adam Taiti Harth Utsunomiya2, Rafaela Beatriz Pintor Torrecilha2, Silvana

FIGURE 5 | Growth rate and acceleration in Denmark, Estonia and Qatar . These threecountries exhibited complex arrangements of growth stages. (a) On March 11th 2020 (red dot),Denmark became the second European country to establish a lockdown. The country starteddecelerating new cases of COVID-19 two days after. (b) On March 13th 2020 (red dot) a state ofemergency was declared by the Estonian government, which imposed significant restrictions totravel and mass gatherings. Decline of growth rate was observed one day after, and decelerationstarted within 4 days. (c) Our analysis indicated that Qatar reached its peaking acceleration onMarch 9th 2020 (red dot), the same day when Qatari officials had announced the closure of schools,nurseries, and universities, in addition to strong restrictions to traveling. One day after, the countryregistered a sudden spike in the number of cases (arrow). The days that followed were marked by aclear decline in acceleration and eventually deceleration of growth, which coincided with asuccession of measures by the Qatari government that eventually led to significant restrictions tohuman movement. Although stationary growth was expected after deceleration, these threecountries passed through a short linear phase and then regained acceleration, which initiated a newexponential phase. Estonia has further initiated a second deceleration stage on March 27th 2020.

443444445446447448449450451452453454455456457458

. CC-BY-NC-ND 4.0 International licenseIt is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprintthis version posted April 10, 2020. ; https://doi.org/10.1101/2020.03.30.20047688doi: medRxiv preprint