27
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927 doi: medRxiv preprint NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.

2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Neutralising antibodies drive Spike mediated SARS-CoV-2 evasion 1

2

Kemp SA1*, Collier DA1,2,3*, Datir R1,2,3, Gayed S4, Jahun A5, Hosmillo M5, Ferreira IATM2,3, Rees-3

Spear C1, Mlcochova P2,3, Ines Ushiro Lumb6, David Roberts6, Anita Chandra2,3, Temperton N7 , 4

The CITIID-NIHR BioResource COVID-19 Collaboration8, The COVID-19 Genomics UK (COG-UK) 5

Consortium9, Sharrocks K4, Blane E3, Briggs JAG10, van Gils MJ11, Smith KGC 2,3, Bradley JR3,12, 6

Smith C14, Goldstein RA1, Goodfellow IG5, Smielewska A5,13, Skittrall JP 4,14,15, Gouliouris T4, 7

Gkrania-Klotsas E4, Illingworth CJR14,16, McCoy LE1, Gupta RK2,3,16,17 8

9 1Division of Infection and Immunity, University College London, London, UK. 10 2 Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Cambridge, UK. 11 3Department of Medicine, University of Cambridge, Cambridge, UK. 12 4Department of Infectious Diseases, Cambridge University NHS Hospitals Foundation Trust, 13

Cambridge, UK. 14 5Department of Pathology, University of Cambridge, Cambridge 15 6NHS Blood and Transplant, London, UK 16 7Viral Pseudotype Unit, Medway School of Pharmacy, University of Kent, UK 17 8The CITIID-NIHR BioResource COVID-19 Collaboration, see appendix for author list 18 9The COVID-19 Genomics UK (COG-UK) Consortium, https://www.cogconsortium.uk. Full list of 19

consortium names and affiliations are in the appendix 20 10Medical Research Council Laboratory of Molecular Biology, Cambridge, UK. 21 11Department of Medical Microbiology, Academic Medical Center, University of Amsterdam, 22

Amsterdam Institute for Infection and Immunity, Amsterdam, Netherlands 23 12 NIHR Cambridge Clinical Research Facility, Cambridge, UK. 24 13Department of Virology, Cambridge University NHS Hospitals Foundation Trust 25 14Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK 26 15Clinical Microbiology and Public Health Laboratory, Addenbrookes’ Hospital, Cambridge, UK 27 16 MRC Biostatistics Unit, University of Cambridge, Cambridge, UK 28 17Africa Health Research Institute, Durban, South Africa 29

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.

Page 2: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

*equal contribution 30

31

Address for correspondence: 32

Ravindra K. Gupta 33

Cambridge Institute for Therapeutic Immunology and Infectious Diseases 34

Jeffrey Cheah Biomedical Centre 35

Puddicombe Way 36

Cambridge CB2 0AW, UK 37

Tel: +44 1223 331491 38

[email protected] 39

40

Key words: SARS-CoV-2; COVID-19; antibody escape, Convalescent plasma; neutralising 41

antibodies; mutation; evasion; resistance; immune suppression 42

43

Abstract 44

SARS-CoV-2 Spike protein is critical for virus infection via engagement of ACE2, and amino acid 45

variation in Spike is increasingly appreciated. Given both vaccines and therapeutics are 46

designed around Wuhan-1 Spike, this raises the theoretical possibility of virus escape, 47

particularly in immunocompromised individuals where prolonged viral replication occurs. Here 48

we report fatal SARS-CoV-2 escape from neutralising antibodies in an immune suppressed 49

individual treated with convalescent plasma, generating whole genome ultradeep sequences by 50

both short and long read technologies over 23 time points spanning 101 days. Little 51

evolutionary change was observed in the viral population over the first 65 days despite two 52

courses of remdesivir. However, following convalescent plasma we observed dynamic virus 53

population shifts, with the emergence of a dominant viral strain bearing D796H in S2 and 54

�H69/�V70 in the S1 NTD of the Spike protein. As serum neutralisation waned, viruses with the 55

escape genotype diminished in frequency, before returning during a final, unsuccessful course 56

of convalescent plasma. In vitro, the Spike escape variant conferred decreased sensitivity to 57

multiple units of convalescent plasma/sera from different recovered patients, whilst 58

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 3: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

maintaining infectivity similar to wild type. These data reveal strong positive selection on SARS-59

CoV-2 during convalescent plasma therapy and identify the combination of Spike mutations 60

D796H and �H69/�V70 as a broad antibody resistance mechanism against commonly occurring 61

antibody responses to SARS-CoV-2. 62

63

Introduction 64

SARS-CoV-2 is an RNA betacoronavirus, with closely related viruses identified in pangolins and 65

bats1,2. RNA viruses have inherently higher rates of mutation than DNA viruses such as 66

Herpesviridae3. The capacity for successful adaptation is exemplified by the Spike D614G 67

mutation, that arose in China and rapidly spread worldwide4, now accounting for more than 68

90% of infections. The mutation appears to increase infectivity and transmissibility in animal 69

models5. Although the SARS-CoV-2 Spike protein is critical for virus infection via engagement of 70

ACE2, substantial Spike amino acid variation is being observed in circulating viruses6. Logically, 71

mutations in the receptor binding domain (RBD) of Spike are of particular concern due to the 72

RBD being targeted by neutralising antibodies and therapeutic monoclonal antibodies. 73

74

Deletions in the N-terminal domain (NTD) of Spike S1 are also being increasingly recognised, 75

both within hosts7 and across individuals8. The evolutionary basis for the emergence of 76

deletions is unclear at present but could be related to escape from immunity or to enhanced 77

fitness/transmission. The most notable deletion in terms of frequency is �H69/�V70. This 78

double deletion has been detected in multiple unrelated lineages, including the recent ‘Cluster 79

5’ mink related strain in the North Jutland region of Denmark (https://files.ssi.dk/Mink-cluster-80

5-short-report_AFO2). There it was associated with the RBD mutation Y453F in almost 200 81

individuals. Another European cluster in GISAID includes �H69/�V70 along with the RBD 82

mutation N439K. 83

84

Although �H69/�V70 has been detected multiple times, within-host emergence remains 85

undocumented and the reasons for its selection are unknown. Here we document real time 86

SARS-CoV-2 emergence of �H69/�V70 in response to convalescent plasma therapy in an 87

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 4: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

immunocompromised human host, demonstrating broad antibody escape in combination with 88

the S2 mutation D796H. 89

90

Results 91

Clinical case history of SARS-CoV-2 infection in setting of immune-compromised host 92

Please contact authors for clinical details. Some indices are given in Supplementary table 1 and 93

Supplementary Figure 1 and Supplementary Figure 2). Figure 1 summarises the timing of 94

treatment. 95

96

Negative SARS-CoV-2 specific antibodies and requirement for convalescent plasma (CP) 97

Given the history of B cell depletion therapy and hypogammaglobulinemia we measured serum 98

SARS-CoV-2 specific antibodies over the course of the admission. Total serum antibodies to 99

SARS-CoV-2 were tested at days 44 and 50 by S protein immunoassay (Siemens). Results were 100

negative. Three units (200mL each) of convalescent plasma (CP) from three independent 101

donors were obtained from the NHS Blood and Transfusion Service and administered on 102

compassionate use basis. These had been assayed for antibody titres using the validated 103

Euroimmun assay (Supplementary figure 3). Patient serum was subsequently positive for SARS-104

CoV-2 specific antibodies by S protein immunoassay (Siemens) in the hospital diagnostic 105

laboratory on days 68, 90 and 101. 106

107

Virus genomic comparative analysis of 23 sequential respiratory samples over 101 days 108

The majority of samples were respiratory samples from nose and throat or endotracheal 109

aspirates during the period of intubation. Ct values ranged from 16-34 and all 23 respiratory 110

samples were successfully sequenced by standard long read approach as per the ARTIC protocol 111

implemented by COG-UK; of these 20 additionally underwent short-read deep sequencing using 112

the Illumina platform. There was generally good agreement between the methods, though as 113

expected nanopore demonstrated greater error at low variant frequencies (<10%) 114

(Supplementary Figure 4). We detected no evidence of recombination, based on two 115

independent methods. 116

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 5: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

117

Maximum likelihood analysis of patient-derived whole genome consensus sequences 118

demonstrated clustering with other local sequences from the same region (Figure 2A). The 119

infecting strain was assigned to lineage 20B bearing the D614G Spike variant. Environmental 120

sampling showed evidence of virus on surfaces such as telephone and call bell but not in air on 121

days 59, 92 and 101. Sequencing of these surface viruses showed clustering with those derived 122

from the respiratory tract (Figure 2B). All samples were consistent with having arisen from a 123

single viral population. In our phylogenetic analysis, we included sequential sequences from 124

three other local patients identified with persistent viral RNA shedding over a period of 4 weeks 125

or more (Supplementary Table 2). Viruses from these individuals showed very little divergence 126

in comparison to the case patient (Figure 2B) and none showed amino acid changes in Spike 127

over time. We additionally inferred a maximum likelihood phylogeny comparing sequences 128

from these three local individuals and two long term immunosuppressed SARS-CoV-2 ‘shedders’ 129

recently reported7,9, (Figure 2B). While the sequences from Avanzato et al showed a pattern of 130

evolution more similar to two of the three other local patients, the case patient showed 131

significant diversification with a mutation rate of 30 per year (Supplementary table 2). 132

133

Further investigation of the sequence data suggested the existence of an underlying structure 134

to the viral population in our patient, with samples collected at days 93 and 95 being rooted 135

within, but significantly divergent from the original population (Figure 2B and 3A). The 136

relationship of the divergent samples to those at earlier time points rules out the possibility of 137

superinfection. The increased divergence of sequences does not necessarily indicate selection; 138

a spatially compartmentalised subset of viruses, smaller in number than the main viral 139

population, would be expected to evolve more quickly than the main population due to the 140

increased effect of genetic drift 10,11. 141

142

Virus population structure changes following convalescent plasma and remdesivir 143

All samples tested positive by RT-PCR and there was no sustained change in Ct values 144

throughout the 101 days following the first two courses of remdesivir (days 41 and 54), or the 145

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 6: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

first two units of convalescent plasma (days 63 and 65). According to nanopore data, no 146

polymorphisms occurred over the first 60 days at consensus level (Figure 3A). However, short 147

read deep sequence Illumina data revealed minority polymorphisms in the viral population 148

during this period (Figure 3B). For example, T39I in ORF7a reached a majority frequency of 77% 149

on day 44, arising de novo and increased in frequency during the first period of the infection 150

(Supplementary Figure 6). 151

152

In contrast to the early period of infection, between days 66 and 82 a dramatic shift in virus 153

population structure was observed, with the near-fixation of D796H in S2 along with 154

�H69/�V70 in the S1 N-terminal domain (NTD) at day 82. This was identified in a nose and 155

throat swab sample with high viral load as indicated by Ct of 23 (Figure 4). The deletion was 156

not detected at any point prior to the day 82 sample, even as minority variants by short read 157

deep sequencing. 158

159

On Days 86 and 89, viruses collected were characterised by the Spike mutations Y200H and 160

T240I, with the deletion/mutation pair observed on day 82 having fallen to very low frequency. 161

Sequences collected on these days formed a distinct branch at the bottom of the phylogeny in 162

Figure 4 but were clearly associated with the remainder of the samples, suggesting that they 163

did not result from superinfection (Supplementary Figure 7), and further were not significantly 164

divergent from the bulk of the viral population (Supplementary Figure 5). 165

166

Sequencing of a nose and throat swab sample at day 93 again showed D796H along with 167

�H69/�V70 at <10% abundance, along with an increase in a virus population characterised by 168

Spike mutations P330S at the edge of the RBD and W64G in S1 NTD. This new lineage reached 169

near 100% abundance at day 93. Viruses with the P330S variant were detected in two 170

independent samples from different sampling sites, ruling out the possibility of contamination. 171

The divergence of these samples from the remainder of the population, noted above, suggests 172

the possibility of their resulting from the stochastic emergence, in the upper respiratory tract, 173

of a previously unobserved subpopulation of viruses (Supplementary Figure 5). 174

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 7: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

175

Following the third course of remdesivir (day 93) and third CP (day 95), we observed a re-176

emergence of the D796H + �H69/�V70 viral population. The inferred linkage of D796H and 177

�H69/�V70 was maintained as evidenced by the highly similar frequencies of the two variants, 178

suggesting that the third unit of CP led to the re-emergence of this viral strain under renewed 179

positive selection. Ct values remained low throughout this period with hyperinflammation, 180

eventually leading to multi-organ failure and death at day 102. The repeated increase in 181

frequency of the novel viral strain during CP therapy strongly supports the hypothesis that the 182

deletion/mutation combination conferred antibody escape properties. 183

184

�H69/�V70 + D796H confers impaired neutralisation by multiple convalescent plasma units 185

and sera from recovered COVID-19 patients 186

Using lentiviral pseudotyping we expressed wild type and �H69/�V70 + D796H mutant Spike 187

protein in enveloped virions and compared neutralisation activity of CP against these viruses. 188

This system has been shown to give similar results to replication competent virus12,13. We first 189

tested infection capacity over a single round of infection and found that �H69/�V70 + D796H 190

had similar infectivity to wild type (both in a D614G background, Figure 5A, B). The �H69/�V70 191

+ D796H mutant was partially resistant to the first two CP units (Figure 5C, Table 1A). In 192

addition, patient derived serum from days 64 and 66 (one day either side of CP2 infusion) 193

similarly showed lower potency against the mutant (Figure 5C, Table 1A). The repeated 194

observation of D796H + �H69/�V70 emergence and positive selection strengthens the 195

hypothesis that these variants were the key drivers of antibody escape. Experimentally, the 196

D796H + �H69/�V70 mutant also demonstrated reduced susceptibility to the CP3 administered 197

on day 95, explaining its re-emergence (Figure 5C, Table 1A). 198

199

Given reduced susceptibility of the mutants to at least two units of CP, and the expansion of 200

sequences bearing �H69/�V70, we hypothesised that this represented a broad escape 201

mechanism. We therefore screened antiviral neutralisation activity in sera from five recovered 202

patients against the mutant and wild type viruses (Figure 5D). We observed that the mutant 203

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 8: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

was indeed significantly less susceptible to four of five randomly selected sera, with the fifth 204

showing reduced susceptibility that did not reach statistical significance (Table 1B). Fold change 205

reductions in susceptibility of the mutant were as high as ten-fold compared to wild type (Table 206

1B). 207

208

In order to probe the impact of the D796H and �H69/�V70 mutations on potency of 209

monoclonal antibodies (mAbs) targeting Spike, we screened neutralisation activity of a panel of 210

seven neutralizing mAbs across a range of epitope clusters13 (Figure 5E). We observed no 211

differences in neutralisation between single mutants and wild type, suggesting that the 212

mechanism of escape was likely outside these epitopes. 213

214

In order to understand the mechanisms that might confer resistance to antibodies we 215

examined a published Spike structure and annotated it our residues of interest (Figure 6). This 216

analysis showed that �H69/�V70 is in a disordered, glycosylated loop at the very tip of the 217

NTD, and therefore could alter binding of antibodies. ΔH69/V70 is close to the binding site of 218

the polyclonal antibodies derived from COV57 plasma, indicating the sera tested here may 219

contain similar antibodies 14,15. D796H is in an exposed loop in S2 (Figure 6) and appears to be in 220

a region frequently targeted by antibodies16, despite mutations at position 796 being rare 221

(Supplementary table 4). 222

223

Discussion 224

Here we have documented a repeated evolutionary response by SARS-CoV-2 against antibody 225

therapy during the course of a persistent and eventually fatal infection in an 226

immunocompromised host. The observation of potential selection for specific variants 227

coinciding with the presence of antibodies from convalescent plasma is supported by the 228

experimental finding of reduced susceptibility of these viruses to plasma. Further, we were 229

able to document real-time emergence of a variant �H69/�V70 in the NTD of Spike that has 230

been increasing in frequency in Europe. In the case we report that it was not clear that the 231

emergence of the antibody escape variant was the primary reason for treatment failure. 232

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 9: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

However, given that both vaccines and therapeutics are aimed at Spike, our study raises the 233

possibility of virus evasion, particularly in immune suppressed individuals where prolonged viral 234

replication occurs. 235

236

Our observations represent a very rare insight, and only possible due to lack of antibody 237

responses in the individual following administration of the B cell depleting agent rituximab for 238

lymphoma, and an intensive sampling course undertaken due to concerns about persistent 239

shedding and risk of nosocomial transmission. Persistent viral replication and the failure of 240

antiviral therapy allowed us to define the viral response to convalescent plasma, similar to a 241

recent report on asymptomatic long term shedding with four sequences over 105 days9, 242

although the reported shifts in genetic composition of the viral population could not be 243

explained phenotypically. Another common finding is the very low neutralisation activity in 244

serum post transfusion of CP with waning as expected. Apart from the difference in the 245

outcome of infection (severe, fatal disease versus asymptomatic disease and clearance), 246

critically important differences in our study include: 1. The intensity of sampling and use of both 247

long and short read sequencing to verify variant calls, thereby providing a unique scientific 248

resource for longitudinal population genetic analysis. 2. The close alignment between the 249

genetic composition of the viral population and CP administration, with an experimentally 250

verified resistant strain emerging, falling to low frequency, and then rising again under CP 251

selection. 3. Real time detection of emergence of a variant, �H69/�V70, that is increasing in 252

frequency in Europe, and shown here to affect neutralization by multiple COVID-19 patient 253

derived sera. 254

255

We have noted in our analysis the potential influence of compartmentalised viral replication 256

upon the sequences recovered in upper respiratory tract samples. Both population genetic and 257

small animal studies have shown a lack of reassortment between influenza viruses within a 258

single host during an infection, suggesting that acute respiratory viral infection may be 259

characterised by spatially distinct viral populations17,18. In the analysis of data, it is important to 260

distinguish genetic changes which occur in the primary viral population from apparent changes 261

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 10: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

that arise from the stochastic observation of spatially distinct subpopulations in the host. While 262

the samples we observe on days 93 and 95 of infection are genetically distinct from the others, 263

the remaining samples are consistent with arising from a consistent viral population, supporting 264

the finding of a reversion and subsequent regain of antibody resistance. We note that in a 265

study of SARS-CoV-2, Choi et al reported the detection in postmortem tissue of viral RNA not 266

only in lung tissue, but also in the spleen, liver, and heart7. Mixing of virus from different 267

compartments, for example via blood, or movement of secretions from lower to upper 268

respiratory tract, could lead to fluctuations in viral populations at particular sampling sites. 269

Experiments in animal models with sampling of different replication sites could allow a better 270

understanding of SARS-CoV-2 population genetics and enable prediction of escape variants 271

following antibody based therapies. 272

273

This is a single case report and therefore limited conclusions can be drawn about 274

generalisability. 275

In addition to documenting the emergence of SARS-CoV-2 Spike �H69/�V70 + D796H in vivo, 276

conferring broad reduction in susceptibility to serum/plasma polyclonal, but no effect on a set 277

of predominantly RBD-targeting monoclonal antibodies, these data highlight that infection 278

control measures need to be specifically tailored to the needs of immunocompromised 279

patients. The data also highlight caution in interpretation of CDC guidelines that recommend 20 280

days as the upper limit of infection prevention precautions in immune compromised patients 281

who are afebrile19. Due to the difficulty with culturing clinical isolates, use of surrogates for 282

infectious virus such as sgRNA are warranted20. However, where detection of ongoing viral 283

evolution is possible, this serves as a clear proxy for the existence of infectious virus. In our case 284

we detected environmental contamination whilst in a single occupancy room and the patient 285

was moved to a negative-pressure high air-change infectious disease isolation room. 286

287

The clinical efficacy of CP has been called into question recently21, and our data suggest caution 288

in use of CP in patients with immune suppression of both T cell and B cell arms. In such cases, 289

the antibodies administered have little support from cytotoxic T cells, thereby reducing chances 290

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 11: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

of clearance and raising the potential for escape mutations. Whilst we await further data, CP 291

administered for clinical need in immune suppression, should ideally be undertaken in single 292

occupancy rooms with enhanced infection control precautions, including SARS-CoV-2 293

environmental sampling and real-time sequencing. 294

295

296

Table 1: Neutralisation activity of convalescent plasma and patient sera taken in between CP 297

doses against SARS-CoV-2 viral variants observed in A. patient case. B. five unselected 298

patients with recovered COVID-19. Shown are IC50 dilution titres - the CP or serum reciprocal 299

dilution at which 50% of virus infectivity is lost against wild type (baseline) or D796H + Δ69/70 300

mutant Spike protein expressed on the surface of a lentiviral particle. Fold change is the 301

difference between the two viruses, and values below 1 indicate that the D796H/Δ69/70 302

mutant is less sensitive to the CP or serum. Only the D66 serum sample failed to demonstrate 303

significantly higher resistance of the D796H/Δ69/70 mutant. 304

305

A. IC50 as reciprocal dilution (95% CI)

WT D796H + Δ69/70 Fold Change IC50 (95% CI)

CP1 2480.0 (2104.0 - 2965.0) 924.2 (634.1 - 1316.0) 0.37 (0.24 - 0.56)

D64 serum 163.7 (106.6 - 256.4) 55.6 (29.5 - 100.6) 0.27 (0.12 - 0.77)

CP2 447.1 (305.2 - 670.1) 196 (129.8 - 294.7) 0.47 (0.25 - 0.86)

D66 serum 175.7 (90.7 - 410.4) 85.5 (48.2 - 149.8) 0.82 (0.29 - 1.96)

CP3 280.2 (172.8 - 480.3) 115 (75.0 - 175.5) 0.47 (0.24 - 0.91)

306

307

308

B. IC50 as reciprocal dilution (95% CI)

WT D796H + Δ69/70 Fold Change IC50 (95% CI)

Serum 1 994.1 (852.7-1159.0) 755.8 (426.7-1269.0) 0.71 (0.39-1.33)

Serum 2 2107.0 (1680.0-2662.0) 1045.0 (745.9-1387.0) 0.42 (0.27-0.70)

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 12: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Serum 3 3241.0 (2377.0-4643.0) 347.4 (234.4-505.4) 0.11 (0.07-0.17)

Serum 4 17368.0 (14357.0-22130.0) 2965.0 (2287.0-3767.0) 0.26 (0.20-0.32)

Serum 5 52735.0 (34276.0-93957.0) 3871.0 (3525.0-4252.0) 0.20 (0.17-0.24)

309

310

Ethics 311

The study was approved by the East of England – Cambridge Central Research Ethics Committee 312

(17/EE/0025). Written informed consent was obtained from both the patient and family. 313

Additional controls with COVID-19 were enrolled to the NIHR BioResource Centre Cambridge 314

under ethics review board (17/EE/0025). 315

316

Acknowledgements 317

We are immensely grateful to the patient and his family. We would also like to thank the staff 318

at CUH and the NIHR Cambridge Clinical Research Facility. We would like to thank Dr Ruthiran 319

Kugathasan and Professor Wendy Barclay for helpful discussions and Dr Martin Curran, Dr 320

William Hamilton, and Dr. Dominic Sparkes. We would like to thank Prof Andres Floto and Prof 321

Ferdia Gallagher. We thank Dr James Voss for the kind gift of HeLa cells stably expressing ACE2. 322

COG-UK is supported by funding from the Medical Research Council (MRC) part of UK Research 323

& Innovation (UKRI), the National Institute of Health Research (NIHR) and Genome Research 324

Limited, operating as the Wellcome Sanger Institute. RKG is supported by a Wellcome Trust 325

Senior Fellowship in Clinical Science (WT108082AIA). LEM is supported by a Medical Research 326

Council Career Development Award (MR/R008698/1). SAK is supported by the Bill and Melinda 327

Gates Foundation via PANGEA grant: OPP1175094. DAC is supported by a Wellcome Trust 328

Clinical PhD Research Fellowship. CJRI acknowledges MRC funding (ref: MC_UU_00002/11). 329

This research was supported by the National Institute for Health Research (NIHR) Cambridge 330

Biomedical Research Centre, the Cambridge Clinical Trials Unit (CCTU) and by the UCL 331

Coronavirus Response Fund and made possible through generous donations from UCL’s 332

supporters, alumni, and friends (LEM). JAGB is supported by the Medical Research Council 333

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 13: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

(MC_UP_1201/16). IG is a Wellcome Senior Fellow and supported by the Wellcome Trust 334

(207498/Z/17/Z). 335

336

Author contributions 337

Conceived study: RKG, SAK, DAC, AS, TG, EGK 338

Designed experiments: RKG, SAK, DAC, LEM, JAGB, EGK, AC, NT, AC, CS 339

Performed experiments: SAK, DAC, LEM, RD, CRS, AJ, IATMF, KS, TG, CJRI, BB, JS, MJvG 340

Interpreted data: RKG, SAK, DAC, PM, LEM, JAGB, PM, SG, KS, TG, JB, KGCS, IG, CJRI, JAGB, IUL, 341

DR, JS, BB, RAG 342

343

Conflicts of interest 344

The authors declare no conflicts of interest 345

346

Methods 347

Clinical Sample Collection and Next generation sequencing 348

Serial samples were collected from the patient periodically from the lower respiratory tract 349

(sputum or endotracheal aspirate), upper respiratory tract (throat and nasal swab), and from 350

stool. Nucleic acid extraction was done from 500µl of sample with a dilution of MS2 351

bacteriophage to act as an internal control, using the easyMAG platform (Biomerieux, Marcy-352

l'Étoile) according to the manufacturers’ instructions. All samples were tested for presence of 353

SARS-CoV-2 with a validated one-step RT q-PCR assay developed in conjunction with the Public 354

Health England Clinical Microbiology 22. Amplification reaction were all performed on a 355

Rotorgene™ PCR instrument. Samples which generated a CT of ≤36 were considered to be 356

positive. 357

358

Sera from recovered patients in the COVIDx study23 were used for testing of neutralisation 359

activity by SARS-CoV-2 mutants. 360

361

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 14: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

For viral genomic sequencing, total RNA was extracted from samples as described. Samples 362

were sequenced using MinION flow cells version 9.4.1 (Oxford Nanopore Technologies) 363

following the ARTICnetwork V3 protocol (https://dx.doi.org/10.17504/protocols.io.bbmuik6w) 364

and BAM files assembled using the ARTICnetwork assembly pipeline 365

(https://artic.network/ncov-2019/ncov2019-bioinformatics-sop.html). A representative set of 366

10 sequences were selected and also sequenced using the Illumina MiSeq platform. Amplicons 367

were diluted to 2 ng/µl and 25 µl (50 ng) were used as input for each library preparation 368

reaction. The library preparation used KAPA Hyper Prep kit (Roche) according to manufacturer’s 369

instructions. Briefly, amplicons were end-repaired and had A-overhang added; these were then 370

ligated with 15mM of NEXTflex DNA Barcodes (Bio Scientific, Texas, USA). Post-ligation products 371

were cleaned using AMPure beads and eluted in 25 µl. Then, 20 µl were used for library 372

amplification by 5 cycles of PCR. For the negative controls, 1ng was used for ligation-based 373

library preparation. All libraries were assayed using TapeStation (Agilent Technologies, 374

California, USA) to assess fragment size and quantified by QPCR. All libraries were then pooled 375

in equimolar accordingly. Libraries were loaded at 15nM and spiked in 5% PhiX (Illumina, 376

California, USA) and sequenced on one MiSeq 500 cycle using a Miseq Nano v2 with 2x 250 377

paired-end sequencing. A minimum of ten reads were required for a variant call. 378

379

Bioinformatics Processes 380

For long-read sequencing, genomes were assembled with reference-based assembly and a 381

curated bioinformatics pipeline with 20x minimum coverage across the whole-genome 24. For 382

short-read sequencing, FASTQs were downloaded, poor-quality reads were identified and 383

removed, and both Illumina and PHiX adapters were removed using TrimGalore v0.6.6 25. 384

Trimmed paired-end reads were mapped to the National Center for Biotechnology Information 385

SARS-CoV-2 reference sequence MN908947.3 using MiniMap2-2.17 with arguments -ax and sr 386 26. BAM files were then sorted and indexed with samtools v1.11 and PCR optical duplicates 387

removed using Picard (http://broadinstitute.github.io/picard). Single nucleotide polymorphisms 388

(SNPs) were called using Freebayes v1.3.2 27 with a ploidy setting of 1, minimum allele 389

frequency of 0.20 and a minimum depth of five reads. Finally, a consensus sequences of nucleic 390

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 15: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

acids with a minimum whole-genome coverage of at least 20× were generated with BCFtools 391

using a 0% majority threshold. 392

393

Phylogenetic Analysis 394

All available full-genome SARS-CoV-2 sequences were downloaded from the GISAID database 395

(http://gisaid.org/) 28 on 17th September. Duplicate and low-quality sequences (>5% N regions) 396

were removed, leaving a dataset of 138,472 sequences with a length of >29,000bp. All 397

sequences were sorted by name and only sequences sequenced with United Kingdom / England 398

identifiers were retained. From this dataset, a subset of 250 sequences were randomly 399

subsampled using seqtk (https://github.com/lh3/seqtk). These 250 sequences were aligned to 400

the 23 patient sequences, as well as three other control patients (persistent long-term shedders 401

from the same hospital) (Supplementary Table 2) and the SARS-CoV-2 reference strain 402

MN908947.3, using MAFFT v7.473 with automatic flavour selection 29. Major SARS-CoV-2 clade 403

memberships were assigned to all sequences using the Nextclade server v0.8 404

(https://clades.nextstrain.org/). 405

406

Maximum likelihood phylogenetic trees were produced using the above curated dataset using 407

IQ-TREE v2.1.2 30. Evolutionary model selection for trees were inferred using ModelFinder 31 408

and trees were estimated using the GTR+F+I model with 1000 ultrafast bootstrap replicates 32. 409

All trees were visualised with Figtree v.1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/), rooted 410

on the SARS-CoV-2 reference sequence and nodes arranged in descending order. Nodes with 411

bootstraps values of <50 were collapsed using an in-house script. 412

413

Molecular substitution (clock) rates for the index patient, as well as three long-term shedders 414

and two recently described immunocompromised patients from literature, were estimated 415

using BEAST v2.6.3 34 using a HKY substitution model with 4 rate categories drawn from a 416

gamma distribution, a strict clock and a coalescent exponential population tree prior. MCMC 417

was run for 100 million iterations excluding a 15% burn-in. Tracer v1.7.1 was used to analyse 418

the BEAST trace in order extract the clock rate and ensure convergence had occurred. 419

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 16: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

420

In-depth allele frequency variant calling 421

The SAMFIRE package35 was used to call allele frequency trajectories from BAM file data. Reads 422

were included in this analysis if they had a median PHRED score of at least 30, trimming the 423

ends of reads to achieve this if necessary. Nucleotides were then filtered to have a PHRED 424

score of at least 30; reads with fewer than 30 such reads were discarded. Distances between 425

sequences, accounting for low-frequency variant information, was also conducted using 426

SAMFIRE. The sequence distance metric, described in an earlier paper 11, combines allele 427

frequencies across the whole genome. Where L is the length of the genome, we define q(t) as a 428

4 x L element vector describing the frequencies of each of the nucleotides A, C, G, and T at each 429

locus in the viral genome sampled at time t. For any given locus i in the genome we calculate 430

the change in allele frequencies between the times t1 and t2 via a generalisation of the 431

Hamming distance 432

433

��������, ������ 12 |������� � �������|����,,,��

434

where the vertical lines indicate the absolute value of the difference. These statistics were then 435

combined across the genome to generate the pairwise sequence distance metric 436

437

�������, ����� ��������, �������

438

The Mathematica software package was to conduct a regression analysis of pairwise sequence 439

distances against time, leading to an estimate of a mean rate of within-host sequence 440

evolution. In contrast to the phylogenetic analysis, this approach assumed the samples 441

collected on days 93 and 95 to arise via stochastic emission from a spatially separated 442

subpopulation within the host, leading to a lower inferred rate of viral evolution for the bulk of 443

the viral population. 444

445

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 17: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Inference of selection 446

Under the assumption of a large effective population size, a deterministic one-locus model of 447

selection was fitted to genome sequence data describing changes in the frequency of the 448

variant T39I. Where q(t) represents the frequency of a single variant at time t, we used a 449

maximum likelihood method to infer the initial variant frequency at time t=1, and the selection 450

coefficient s, for times from the initial time point to the disappearance of the variant from the 451

population. Specifically, where n(ti) and N(ti) were the number of observations of the variant 452

and the total read depth at that locus at time ti, we fitted the model 453

454

���� ������

�� ���� ������ 455

456

so as to maximise the binomial likelihood 457

458

� log ������ ��������1 � ���������

A similar calculation was performed to estimate the mean effective selection (incorporating 459

intrinsic selection for the variant plus linkage with other selected alleles) acting upon the 460

variant D796H during the final period of CP therapy. In this case selection was modelled as 461

being time-dependent, kicking in with the administration of therapy. We fitted the model: 462

463

���� � �� � � ����� ��������

1 � � ���������� � ! �

464

to the data, setting τ = 95. 465

466

Recombination Detection 467

All sequences were tested for potential recombination, as this would impact on evolutionary 468

estimates. Potential recombination events were explored with nine algorithms (RDP, MaxChi, 469

SisScan, GeneConv, Bootscan, PhylPro, Chimera, LARD and 3SEQ), implemented in RDP5 with 470

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 18: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

default settings 36. To corroborate any findings, ClonalFrameML v1.12 37 was also used to infer 471

recombination breakpoints. Neither programs indicated evidence of recombination in our data. 472

473

Structural Viewing 474

The Pymol Molecular Graphics System v2.4.0 (https://github.com/schrodinger/pymol-open-475

source/releases) was used to map the location of the four spike mutations of interested onto a 476

SARS-CoV-2 spike structure visualised by Wrobel et al (PDB: 6ZGE) 38. 477

478

Generation of Spike mutants 479

Amino acid substitutions were introduced into the D614G pCDNA_SARS-CoV-2_Spike plasmid 480

as previously described39 using the QuikChange Lightening Site-Directed Mutagenesis kit, 481

following the manufacturer’s instructions (Agilent Technologies, Inc., Santa Clara, CA). 482

483

Pseudotype virus preparation 484

Viral vectors were prepared by transfection of 293T cells by using Fugene HD transfection 485

reagent (Promega). 293T cells were transfected with a mixture of 11ul of Fugene HD, 1µg of 486

pCDNAΔ19Spike-HA, 1ug of p8.91 HIV-1 gag-pol expression vector40,41, and 1.5µg of pCSFLW 487

(expressing the firefly luciferase reporter gene with the HIV-1 packaging signal). Viral 488

supernatant was collected at 48 and 72h after transfection, filtered through 0.45um filter and 489

stored at -80˚C. The 50% tissue culture infectious dose (TCID50) of SARS-CoV-2 pseudovirus 490

was determined using Steady-Glo Luciferase assay system (Promega). 491

492

Serum/plasma pseudotype neutralization assay 493

Spike pseudotype assays have been shown to have similar characteristics as neutralisation 494

testing using fully infectious wild type SARS-CoV-212.Virus neutralisation assays were performed 495

on 293T cell transiently transfected with ACE2 and TMPRSS2 using SARS-CoV-2 Spike 496

pseudotyped virus expressing luciferase42. Pseudotyped virus was incubated with serial dilution 497

of heat inactivated human serum samples or convalescent plasma in duplicate for 1h at 37˚C. 498

Virus and cell only controls were also included. Then, freshly trypsinized 293T ACE2/TMPRSS2 499

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 19: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

expressing cells were added to each well. Following 48h incubation in a 5% CO2 environment at 500

37°C, the luminescence was measured using Steady-Glo Luciferase assay system (Promega). 501

502

mAb pseudotype neutralisation assay 503

Virus neutralisation assays were performed on HeLa cells stably expressing ACE2 and using 504

SARS-CoV-2 Spike pseudotyped virus expressing luciferase as previously described43. 505

Pseudotyped virus was incubated with serial dilution of purified mAbs13 in duplicate for 1h at 506

37˚C. Then, freshly trypsinized HeLa ACE2- expressing cells were added to each well. Following 507

48h incubation in a 5% CO2 environment at 37°C, the luminescence was measured using Bright-508

Glo Luciferase assay system (Promega) and neutralization calculated relative to virus only 509

controls. IC50 values were calculated in GraphPad Prism. 510

511

Data Availability 512

Long-read sequencing data that support the findings of this study have been deposited in the 513

NCBI SRA database with the accession codes SAMN16976824 - SAMN16976846 under 514

BioProject PRJNA682013 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA682013). Short reads 515

and data used to construct figures were deposited at https://github.com/Steven-516

Kemp/sequence_files. 517

518

References 519

1 Zhang, T., Wu, Q. & Zhang, Z. Probable pangolin origin of SARS-CoV-2 associated with the 520

COVID-19 outbreak. Current Biology (2020). 521

2 Xiao, K. et al. Isolation of SARS-CoV-2-related coronavirus from Malayan pangolins. Nature 583, 522

286-289, doi:10.1038/s41586-020-2313-x (2020). 523

3 Sanjuán, R. & Domingo-Calap, P. Mechanisms of viral mutation. Cell Mol Life Sci 73, 4433-4448, 524

doi:10.1007/s00018-016-2299-6 (2016). 525

4 Korber, B. et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases 526

Infectivity of the COVID-19 Virus. Cell, doi:10.1016/j.cell.2020.06.043 (2020). 527

5 Yurkovetskiy, L. et al. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein 528

Variant. Cell 183, 739-751 e738, doi:10.1016/j.cell.2020.09.032 (2020). 529

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 20: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

6 Korber, B. et al. Tracking changes in SARS-CoV-2 Spike: evidence that D614G increases infectivity 530

of the COVID-19 virus. Cell 182, 812-827. e819 (2020). 531

7 Choi, B. et al. Persistence and Evolution of SARS-CoV-2 in an Immunocompromised Host. N Engl 532

J Med, doi:10.1056/NEJMc2031364 (2020). 533

8 McCarthy, K. R. et al. Natural deletions in the SARS-CoV-2 spike glycoprotein drive antibody 534

escape. bioRxiv, 2020.2011.2019.389916, doi:10.1101/2020.11.19.389916 (2020). 535

9 Avanzato, V. A. et al. Case Study: Prolonged infectious SARS-CoV-2 shedding from an 536

asymptomatic immunocompromised cancer patient. Cell (2020). 537

10 Wright, S. Size of population and breeding structure in relation to evolution. Science, 430-431 538

(1938). 539

11 Lumby, C. K., Zhao, L., Breuer, J. & Illingworth, C. J. A large effective population size for 540

established within-host influenza virus infection. Elife 9, doi:10.7554/eLife.56915 (2020). 541

12 Schmidt, F. et al. Measuring SARS-CoV-2 neutralizing antibody activity using pseudotyped and 542

chimeric viruses. 2020.2006.2008.140871, doi:10.1101/2020.06.08.140871 %J bioRxiv (2020). 543

13 Brouwer, P. J. M. et al. Potent neutralizing antibodies from COVID-19 patients define multiple 544

targets of vulnerability. Science 369, 643-650, doi:10.1126/science.abc5902 (2020). 545

14 Robbiani, D. F. et al. Convergent antibody responses to SARS-CoV-2 in convalescent individuals. 546

Nature 584, 437-442, doi:10.1038/s41586-020-2456-9 (2020). 547

15 Barnes, C. O. et al. Structures of Human Antibodies Bound to SARS-CoV-2 Spike Reveal Common 548

Epitopes and Recurrent Features of Antibodies. Cell 182, 828-842 e816, 549

doi:10.1016/j.cell.2020.06.025 (2020). 550

16 Shrock, E. et al. Viral epitope profiling of COVID-19 patients reveals cross-reactivity and 551

correlates of severity. Science, doi:10.1126/science.abd4250 (2020). 552

17 Sobel Leonard, A. et al. The effective rate of influenza reassortment is limited during human 553

infection. PLoS pathogens 13, e1006203, doi:10.1371/journal.ppat.1006203 (2017). 554

18 Richard, M., Herfst, S., Tao, H., Jacobs, N. T. & Lowen, A. C. Influenza A Virus Reassortment Is 555

Limited by Anatomical Compartmentalization following Coinfection via Distinct Routes. Journal 556

of virology 92, doi:10.1128/JVI.02063-17 (2018). 557

19 CDC. Discontinuation of Transmission-Based Precautions and Disposition of Patients with COVID-558

19 in Healthcare Settings (Interim Guidance), <https://www.cdc.gov/coronavirus/2019-559

ncov/hcp/disposition-hospitalized-patients.html> (2020). 560

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 21: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

20 Boshier, F. A. T. et al. Remdesivir induced viral RNA and subgenomic RNA suppression, and 561

evolution of viral variants in SARS-CoV-2 infected patients. medRxiv, 2020.2011.2018.20230599, 562

doi:10.1101/2020.11.18.20230599 (2020). 563

21 Simonovich, V. A. et al. A Randomized Trial of Convalescent Plasma in Covid-19 Severe 564

Pneumonia. N Engl J Med, doi:10.1056/NEJMoa2031304 (2020). 565

22 Meredith, L. W. et al. Rapid implementation of SARS-CoV-2 sequencing to investigate cases of 566

health-care associated COVID-19: a prospective genomic surveillance study. The Lancet 567

Infectious Diseases 20, 1263-1272, doi:10.1016/S1473-3099(20)30562-4 (2020). 568

23 Collier, D. A. et al. Point of Care Nucleic Acid Testing for SARS-CoV-2 in Hospitalized Patients: A 569

Clinical Validation Trial and Implementation Study. Cell Rep Med, 100062, 570

doi:10.1016/j.xcrm.2020.100062 (2020). 571

24 Loman, N., Rowe, W. & Rambaut, A. (v1, 2020). 572

25 Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. 573

EMBnet. journal 17, 10-12 (2011). 574

26 Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics (Oxford, England) 575

34, 3094-3100, doi:10.1093/bioinformatics/bty191 (2018). 576

27 Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv 577

preprint arXiv:1207.3907 (2012). 578

28 Shu, Y. & McCauley, J. GISAID: Global initiative on sharing all influenza data - from vision to 579

reality. Euro surveillance : bulletin Europeen sur les maladies transmissibles = European 580

communicable disease bulletin 22, 30494, doi:10.2807/1560-7917.ES.2017.22.13.30494 (2017). 581

29 Katoh, K. & Standley, D. M. MAFFT Multiple Sequence Alignment Software Version 7: 582

Improvements in Performance and Usability. Molecular Biology and Evolution 30, 772-780, 583

doi:10.1093/molbev/mst010 (2013). 584

30 Minh, B. Q. et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in 585

the Genomic Era. Molecular Biology and Evolution 37, 1530-1534, doi:10.1093/molbev/msaa015 586

(2020). 587

31 Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: 588

fast model selection for accurate phylogenetic estimates. Nature Methods 14, 587-589, 589

doi:10.1038/nmeth.4285 (2017). 590

32 Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast Approximation for Phylogenetic 591

Bootstrap. Molecular Biology and Evolution 30, 1188-1195, doi:10.1093/molbev/mst024 (2013). 592

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 22: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

33 Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: Maximum-likelihood phylodynamic analysis. 593

Virus evolution 4, vex042-vex042, doi:10.1093/ve/vex042 (2018). 594

34 Bouckaert, R. et al. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS 595

Comput Biol 10, e1003537 (2014). 596

35 Illingworth, C. J. SAMFIRE: multi-locus variant calling for time-resolved sequence data. 597

Bioinformatics 32, 2208-2209, doi:10.1093/bioinformatics/btw205 (2016). 598

36 Martin, D. P., Murrell, B., Golden, M., Khoosal, A. & Muhire, B. RDP4: Detection and analysis of 599

recombination patterns in virus genomes. Virus evolution 1 (2015). 600

37 Didelot, X. & Wilson, D. J. ClonalFrameML: efficient inference of recombination in whole 601

bacterial genomes. PLoS Comput Biol 11, e1004041 (2015). 602

38 Wrobel, A. G. et al. SARS-CoV-2 and bat RaTG13 spike glycoprotein structures inform on virus 603

evolution and furin-cleavage effects. Nature Structural & Molecular Biology 27, 763-767, 604

doi:10.1038/s41594-020-0468-7 (2020). 605

39 Gregson, J. et al. HIV-1 viral load is elevated in individuals with reverse transcriptase mutation 606

M184V/I during virological failure of first line antiretroviral therapy and is associated with 607

compensatory mutation L74I. The Journal of infectious diseases, doi:10.1093/infdis/jiz631 608

(2019). 609

40 Naldini, L., Blomer, U., Gage, F. H., Trono, D. & Verma, I. M. Efficient transfer, integration, and 610

sustained long-term expression of the transgene in adult rat brains injected with a lentiviral 611

vector. Proceedings of the National Academy of Sciences of the United States of America 93, 612

11382-11388 (1996). 613

41 Gupta, R. K. et al. Full-length HIV-1 Gag determines protease inhibitor susceptibility within in 614

vitro assays. Aids 24, 1651-1655, doi:10.1097/QAD.0b013e3283398216 (2010). 615

42 Mlcochova, P. et al. Combined point of care nucleic acid and antibody testing for SARS-CoV-2 616

following emergence of D614G Spike Variant. Cell Rep Med, 100099, 617

doi:10.1016/j.xcrm.2020.100099 (2020). 618

43 Seow, J. et al. Longitudinal observation and decline of neutralizing antibody responses in the 619

three months following SARS-CoV-2 infection in humans. Nat Microbiol 5, 1598-1607, 620

doi:10.1038/s41564-020-00813-8 (2020). 621

622

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 23: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Figure 1

Figure 1. Analysis of 23 Patient derived whole SARS-CoV-2 genome sequences in context of national sequences and othercases of chronic SARS-CoV-2 shedding. A. Circularised maximum-likelihood phylogenetic tree rooted on the Wuhan-Hu-1reference sequence, showing a subset of 250 local SARS-CoV-2 genomes from GISAID. This diagram highlights significantdiversity of the case patient (green) compared to three other local patients with prolonged shedding (blue, red and purplesequences). All SARS-CoV-2 genomes were downloaded from the GISAID database and a random subset of 250 localsequences selected. B. Close-view maximum-likelihood phylogenetic tree indicating the diversity of the case patient andthree other long-term shedders from the local area (red, blue and purple), compared to recently published sequences fromChoi et al (orange) and Avanzato et al (gold). Control patients generally showed limited diversity temporally, though the Choiet al sequences were found to be even more divergent than the case patient. Environmental samples are indicated. 1000ultrafast bootstraps were performed and support at nodes is indicated.

2.0E-4

hCoV

-19/

Engla

nd/C

AMB-

7FB5

6/20

20|E

PI_I

SL_4

3348

1|202

0-04

-17|E

urop

e

NB15_CAMB-1B5CC9

hCoV-19/England/CAMB-1AABD9/2020|EPI_ISL_470190|2020-04-18|Europe

hCoV-19/England/CAMB-726F8/2020|EPI_ISL_439884|2020-03-24|Europe

hCoV-19/England/CAMB-77C79/2020|EPI_ISL_440815|2020-04-01|Europe

hCoV-19/England/CAMB-76BD7/2020|EPI_ISL_438431|2020-03-31|Europe

hCoV-19/England/CAMB-796C4/2020|EPI_ISL_441814|2020-04-03|EuropehC

oV-1

9/Eng

land/C

AMB-

1B05

B4/20

20|E

PI_IS

L_45

2896

|2020

-05-

11|E

urop

e

hCoV-19/England/CAMB-79828/2020|EPI_ISL_441182|2020-04-01|EuropehCoV-19/E

ngland/CAMB-843A1/2020|E

PI_ISL_443537|2020-04-16|E

urope

hCoV-19/England/CAMB-1AA2F2/2020|EPI_ISL_443682|2020-04-16|Europe

hCoV-19/England/CAMB-7D7BE/2020|EPI_ISL_442216|2020-04-07|Europe

hCoV-19/England/CAMB-1AA68D/2020|EPI_ISL_470215|2020-04-17|Europe

hCoV

-19/

Engla

nd/C

AMB-

8133

B/20

20|E

PI_I

SL_4

4289

4|20

20-0

4-11

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

7CB7

7/20

20|E

PI_I

SL_4

4212

1|202

0-04

-05|E

urop

e

hCoV-19/England/CAMB-72FFD/2020|EPI_ISL_440293|2020-03-21|Europe

hCoV-19/England/CAMB-8322D/2020|EPI_ISL_443485|2020-04-14|Europe

hCoV-19/England/CAMB-7DD16/2020|EPI_ISL_441669|2020-04-06|Europe

hCoV-19/England/CAMB-71B81/2020|EPI_ISL_439536|2020-03-31|Europe

hCoV-19/England/CAMB-73EA4/2020|EPI_ISL_440342|2020-03-20|Europe

hCoV-19/England/CAMB-1B194E/2020|EPI_ISL_452984|2020-05-13|Europe

hCoV-19/England/CAMB-1B1261/2020|EPI_ISL_459178|2020-05-06|Europe

NB13_CAMB-1B54A3

hCoV

-19/

Engla

nd/C

AMB-

1AB4

46/2

020|

EPI_

ISL_

4702

95|2

020-

04-2

1|Eu

rope

hCoV

-19/

Engla

nd/C

AMB-

77C6

A/20

20|E

PI_I

SL_4

4083

6|20

20-0

4-01

|Eur

ope

hCoV-19/England/CAMB-7D94F/2020|EPI_ISL_441749|2020-04-05|Europe

hCoV-19/England/CAMB-7A0C5/2020|EPI_ISL_440612|2020-04-02|Europe

hCoV-19/England/CAMB-1AA553/2020|EPI_ISL_443604|2020-04-17|Europe

hCoV-19/England/CAMB-77ECE/2020|EPI_ISL_441840|2020-04-01|Europe

hCoV-19/England/CAMB-1AAB06/2020|EPI_ISL_470191|2020-04-18|Europe

hCoV-19/England/CAMB-1AA8C3/2020|EPI_ISL_470314|2020-04-17|Europe

hCoV-19/England/C

AMB-81544/2020|EPI_IS

L_442980|2020-04-12|Europe

hCoV-19/England/CAMB-7855F/2020|EPI_ISL_433747|2020-00-00|Europe

hCoV-19/England/CAMB-80ACE/2020|EPI_ISL_470388|2020-04-10|Europe

hCoV-19/England/CAMB-1AF1D2/2020|EPI_ISL_459273|2020-04-29|EuropehCoV-19/England/CAMB-7F28E/2020|EPI_ISL_442868|2020-04-08|Europe

hCoV-19/England/CAMB-1B5DC6/2020|EPI_ISL_549391|2020-09-03|Europe

hCoV

-19/E

nglan

d/CAMB-8

3366

/2020

|EPI_I

SL_44

3440

|2020

-04-

11|E

urop

e

hCoV

-19/

Engla

nd/C

AMB-

1AAB

BB/2

020|

EPI_

ISL_

4701

85|2

020-

04-1

8|Eu

rope

hCoV-19/England/CAMB-74FBF/2020|EPI_ISL_440380|2020-03-14|Europe

hCoV

-19/

Engla

nd/C

AMB-

1B18

9C/2

020|

EPI_

ISL_

4585

27|2

020-

05-0

9|Eu

rope

hCoV-19/England/CAMB-79F9C/2020|EPI_ISL_440605|2020-04-02|Europe

hCoV

-19/

Engla

nd/C

AMB-

780A

9/20

20|E

PI_I

SL_4

4111

8|20

20-0

4-01

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

7D68

4/20

20|E

PI_I

SL_4

4156

3|202

0-04

-06|E

urop

e

hCoV-19/England/CAMB-1B65DC/2020|EPI_ISL_584291|2020-10-02|Europe

hCoV-19/England/CAMB-1AA483/2020|EPI_ISL_444316|2020-04-27|Europe

NB14_CAMB-1B5CBA

hCoV-19/England/CAMB-843CF/2020|EPI_ISL_443445|2020-04-16|Europe

hCoV-19/England/CAMB-7EDBB/2020|EPI_ISL_442824|2020-04-06|Europe

hCoV

-19/

Engla

nd/C

AMB-

80E8

6/20

20|E

PI_I

SL_4

4443

8|20

20-0

4-17

|Eur

ope

hCoV-19/England/CAMB-1ABFCD/2020|EPI_ISL_444356|2020-04-27|Europe

hCoV-19/England/CAMB-74CC7/2020|EPI_ISL_425364|2020-03-18|Europe

hCoV-19/England/CAMB-739A3/2020|EPI_ISL_425261|2020-00-00|Europe

hCoV-19/England/CAMB-1ACE74/2020|EPI_ISL_489505|2020-04-24|Europe

hCoV-19/England/C

AMB-1AF1B4/2020|EPI_ISL_459248|2020-04-29|Europe

hCoV-19/England/CAMB-73DB6/2020|EPI_ISL_433697|2020-04-01|Europe

hCoV-19/England/CAMB-72AB0/2020|EPI_ISL_439865|2020-03-22|Europe

hCoV

-19/

Engla

nd/C

AMB-

7C04

B/20

20|E

PI_I

SL_4

3398

3|20

20-0

4-13

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

807E

5/20

20|E

PI_I

SL_4

4261

7|202

0-04

-10|E

urop

e

hCoV

-19/

Engla

nd/C

AMB-

7850

4/20

20|E

PI_I

SL_4

3374

4|202

0-03

-27|E

urop

e

hCoV-19/England/CAMB-72607/2020|EPI_ISL_439906|2020-03-24|Europe

hCoV

-19/

Engla

nd/C

AMB-

74AB

E/20

20|E

PI_I

SL_4

2534

5|202

0-03

-17|E

urop

e

hCoV-19/England/CAMB-1AFC3B/2020|EPI_ISL_459205|2020-05-03|Europe

hCoV-19/England/CAMB-82ADB/2020|EPI_ISL_438658|2020-04-23|Europe

hCoV

-19/

Engla

nd/C

AMB-

7EF2

E/20

20|E

PI_I

SL_4

4294

5|20

20-0

4-07

|Eur

ope

hCoV

-19/E

nglan

d/CAM

B-1A

E425

/2020

|EPI

_ISL_

4595

04|20

20-0

4-29

|Eur

ope

hCoV-19/England/CAMB-82F72/2020|EPI_ISL_443425|2020-04-12|Europe

hCoV-19/England/CAMB-71C7F/2020|EPI_ISL_439650|2020-03-31|Europe

hCoV-19/England/CAMB-8456F/2020|EPI_ISL_443660|2020-04-16|Europe

hCoV

-19/E

nglan

d/CAMB-83

876/2

020|E

PI_ISL_

4434

53|20

20-04

-15|E

urope

hCoV-19/England/CAMB-72C32/2020|EPI_ISL_440024|2020-03-22|Europe

hCoV

-19/E

nglan

d/CAMB-7

50F7/2

020|E

PI_ISL_

4404

36|20

20-0

3-16

|Eur

ope

hCoV-19/England/CAMB-7C3CA/2020|EPI_ISL_434038|2020-04-14|Europe

hCoV-19/England/CAMB-727E6/2020|EPI_ISL_439580|2020-03-23|Europe

hCoV-19/England/CAMB-1AE355/2020|EPI_ISL_524592|2020-04-28|Europe

hCoV-19/England/CAMB-73FB0/2020|EPI_ISL_440355|2020-03-19|Europe

hCoV

-19/

Engla

nd/C

AMB-

82EA

2/20

20|E

PI_I

SL_4

4332

1|20

20-0

4-13

|Eur

ope

hCoV-19/England/CAMB-1B2CF9/2020|EPI_ISL_456720|2020-05-20

hCoV-19/England/CAMB-74A72/2020|EPI_ISL_425344|2020-03-17|Europe

hCoV-19/England/C

AMB-7AC3A/2020|EPI_IS

L_433855|2020-04-08|Europe

NB01_CAMB-1B5124

hCoV-19/England/CAMB-8406E/2020|EPI_ISL_438729|2020-04-26|Europe

hCoV-19/England/CAMB-81429/2020|EPI_ISL_442583|2020-04-10|Europe

hCoV-19/England/CAMB-1B67B8/2020|EPI_ISL_584306|2020-10-03|Europe

hCoV-19/England/CAMB-803A5/2020|EPI_ISL_470375|2020-04-09|Europe

hCoV-19/England/CAMB-7DED7/2020|EPI_ISL_470481|2020-04-06|Europe

hCoV-19/England/CAMB-7ED24/2020|EPI_ISL_442869|2020-04-07|Europe

hCoV-19/England/CAMB-1B1C27/2020|EPI_ISL_458562|2020-05-07|Europe

hCoV-19/England/CAMB-821E5/2020|EPI_ISL_438631|2020-04-21|Europe

hCoV-19/England/CAMB-1ACF44/2020|EPI_ISL_489425|2020-04-24|Europe

hCoV-19/England/CAMB-7D129/2020|EPI_ISL_441655|2020-04-06|Europe

NB14_CAMB-1B5607NB07_CAMB-1B5A29

NB15_CAMB-1B43A7

hCoV-19/England/CAMB-76375/2020|EPI_ISL_438289|2020-03-28|EuropehCoV-19/England/CAMB-7A229/2020|EPI_ISL_441231|2020-04-02|Europe

hCoV

-19/

Engla

nd/C

AMB-

8239

4/20

20|E

PI_I

SL_4

4352

6|20

20-0

4-14

|Eur

ope

hCoV

-19/E

nglan

d/CAM

B-7B

8F9/2

020|E

PI_IS

L_44

2174

|2020

-04-

04|E

urop

e

hCoV-19/England/CAMB-7658E/2020|EPI_ISL_439515|2020-03-30|Europe

hCoV-19/England/CAMB-744CF/2020|EPI_ISL_425298|2020-03-20|Europe

hCoV

-19/E

nglan

d/CAM

B-1A

B9CF

/2020

|EPI

_ISL_

4701

40|20

20-0

4-21

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

8401

3/20

20|E

PI_I

SL_4

3872

4|20

20-0

4-26

|Eur

ope

NB02_CAMB-1B5AFChC

oV-1

9/Eng

land/C

AMB-

82A1

7/202

0|EPI

_ISL_

4386

47|20

20-0

4-23

|Eur

ope

hCoV-19/England/CAMB-1AED75/2020|EPI_ISL_452859|2020-05-07|Europe

hCoV

-19/

Engla

nd/C

AMB-

8407

D/20

20|E

PI_I

SL_4

3873

0|202

0-04

-26|E

urop

e

NB09_CAMB-1B0095

hCoV-19/England/CAMB-7830A/2020|EPI_ISL_433716|2020-00-00|Europe

hCoV-19/England/CAMB-7B3CB/2020|EPI_ISL_433950|2020-04-12|Europe

hCoV-19/England/CAMB-841F2/2020|EPI_ISL_443565|2020-04-16|Europe

hCoV-19/England/CAMB-1AE07C/2020|EPI_ISL_459483|2020-04-27|Europe

MN908947.3

hCoV-19/England/CAMB-1AB6B9/2020|EPI_ISL_470324|2020-04-21|Europe

hCoV-19/England/CAMB-7377C/2020|EPI_ISL_425234|2020-03-29|Europe

hCoV-19/England/CAMB-7C791/2020|EPI_ISL_442282|2020-04-04|Europe

hCoV

-19/

Engla

nd/C

AMB-

8158

0/20

20|E

PI_I

SL_4

4296

1|20

20-0

4-12

|Eur

ope

hCoV-19/England/CAMB-7EAA5/2020|EPI_ISL_442607|2020-04-06|Europe

hCoV-19/England/CAMB-1AE838/2020|EPI_ISL_524609|2020-04-30|EuropehC

oV-1

9/En

gland

/CAM

B-76

1D5/

2020

|EPI

_ISL

_438

452|

2020

-03-

30|E

urop

e

hCoV-19/England/CAMB-1B4A66/2020|EPI_ISL_473492|2020-06-07|Europe

hCoV-19/England/CAMB-79CA4/2020|EPI_ISL_440604|2020-04-02|Europe

hCoV-19/England/CAMB-7FE20/2020|EPI_ISL_444421|2020-04-19

hCoV-19/England/CAMB-77972/2020|EPI_ISL_439409|2020-03-31|EuropehCoV-19/England/CAMB-1B6897/2020|EPI_ISL_584310|2020-10-06|EuropehCoV-19/England/CAMB-7D2CC/2020|EPI_ISL_442267|2020-04-06|Europe

hCoV-19/England/CAMB-7C89E/2020|EPI_ISL_442102|2020-04-04|EuropeNB11_CAMB-1B5616

hCoV-19/England/CAMB-78A50/2020|EPI_ISL_433809|2020-04-07|Europe

hCoV-19/England/CAMB-7382E/2020|EPI_ISL_425245|2020-03-31|Europe

hCoV

-19/

Engla

nd/C

AMB-

7A33

5/20

20|E

PI_I

SL_4

4131

5|20

20-0

4-02

|Eur

ope

hCoV-19/England/CAMB-821A9/2020|EPI_ISL_438628|2020-04-20|Europe

hCoV-19/England/CAMB-7CBC2/2020|EPI_ISL_442303|2020-04-04|Europe

hCoV-19/England/CAMB-823FE/2020|EPI_ISL_443449|2020-04-13|Europe

hCoV-19/England/CAMB-1AFD56/2020|EPI_ISL_452878|2020-05-11|Europe

hCoV-19/England/CAMB-1ACABC/2020|EPI_ISL_489479|2020-04-23|Europe

hCoV-19/England/CAMB-754DD/2020|EPI_ISL_425397|2020-03-17|EuropehCoV-19/England/CAMB-72142/2020|EPI_ISL_439660|2020-03-28|Europe

hCoV-19/England/CAMB-7EB39/2020|EPI_ISL_442917|2020-04-07|Europe

hCoV-19/England/CAMB-7D98B/2020|EPI_ISL_441683|2020-04-02|Europe

hCoV-19/England/CAMB-82CA8/2020|EPI_ISL_438678|2020-04-24|Europe

hCoV-19/England/CAMB-7B501/2020|EPI_ISL_433969|2020-04-12|Europe

hCoV-19/England/CAMB-77A42/2020|EPI_ISL_440818|2020-03-31|Europe

hCoV-19/England/CAMB-7D7CD/2020|EPI_ISL_441732|2020-04-06|Europe

NB02_CAMB-1B5BCC

hCoV-19/England/CAMB-80C04/2020|EPI_ISL_442940|2020-04-11|Europe

hCoV-19/England/CAMB-7328A/2020|EPI_ISL_440267|2020-03-21|Europe

hCoV-19/England/CAMB-83C98/2020|EPI_ISL_443350|2020-04-15|Europe

hCoV-19/England/CAMB-810BC/2020|EPI_ISL_438565|2020-04-19|Europe

hCoV-19/England/CAMB-82C3F/2020|EPI_ISL_438672|2020-04-24

hCoV-19/England/CAMB-763C0/2020|EPI_ISL_438310|2020-03-30|Europe

hCoV-19/England/CAMB-1B0CBF/2020|EPI_ISL_452960|2020-05-12|Europe

hCoV-19/England/CAMB-7568C/2020|EPI_ISL_440377|2020-03-17|Europe

hCoV-19/England/CAMB-7C05A/2020|EPI_ISL_433984|2020-00-00|Europe

hCoV-19/England/CAMB-7E98A/2020|EPI_ISL_442560|2020-04-07|Europe

NB06

_CAM

B-1A

DCF1

hCoV-19/England/CAMB-7F44C/2020|EPI_ISL_470359|2020-04-08|Europe

hCoV-19/England/CAMB-745DB/2020|EPI_ISL_425313|2020-03-28|Europe

hCoV-19/England/CAMB-1AE99F/2020|EPI_ISL_459286|2020-05-01|Europe

hCoV-19/England/CAMB-78470/2020|EPI_ISL_433736|2020-04-05|Europe

hCoV-19/England/CAMB-74641/2020|EPI_ISL_425318|2020-03-20|Europe

hCoV-19/England/C

AMB-1AF21B/2020|EPI_ISL_459322|2020-04-29|Europe

hCoV-19/England/CAMB-79E53/2020|EPI_ISL_441341|2020-04-01|Europe

hCoV-19/England/CAMB-1AB297/2020|EPI_ISL_470118|2020-04-20|Europe

hCoV

-19/E

nglan

d/CAMB-79

71C/20

20|E

PI_ISL_

4410

88|20

20-04

-04|E

urope

hCoV

-19/

Engla

nd/C

AMB-

83E0

B/20

20|E

PI_I

SL_4

3869

6|20

20-0

4-25

|Eur

ope

hCoV-19/England/CAMB-7EB66/2020|EPI_ISL_442932|2020-04-07|Europe

hCoV-19/England/CAMB-7814C/2020|EPI_ISL_441110|2020-04-01|Europe

hCoV-19/England/CAMB-83AE9/2020|EPI_ISL_443365|2020-04-15|Europe

NB20_CAMB-1B3654NB16_CAMB-1B19D5

hCoV-19/England/CAMB-1AD7F0/2020|EPI_ISL_448012|2020-05-04

hCoV

-19/

Engla

nd/C

AMB-

81DA

6/20

20|E

PI_I

SL_4

4259

0|20

20-0

4-13

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

1AE5

31/2

020|

EPI_

ISL_

4594

60|2

020-

04-2

8|Eu

rope

hCoV-19/England/CAMB-1AA993/2020|EPI_ISL_470176|2020-04-18|Europe

hCoV-19/England/CAMB-7B9AB/2020|EPI_ISL_441612|2020-04-05|Europe

NB13_CAMB-1B5CE7

hCoV-19/England/CAMB-77E37/2020|EPI_ISL_440835|2020-04-03|Europe

hCoV

-19/E

nglan

d/CAMB-8

3D95

/2020

|EPI_I

SL_43

8690

|2020

-04-

24|E

urop

e

hCoV-19/England/CAMB-7A8BE/2020|EPI_ISL_433826|2020-04-08|Europe

hCoV-19/England/CAMB-76C6B/2020|EPI_ISL_438257|2020-03-30|Europe

hCoV-19/England/CAMB-7228B/2020|EPI_ISL_439624|2020-03-28|Europe

hCoV-19/England/CAMB-77C3D/2020|EPI_ISL_440851|2020-03-31|Europe

hCoV

-19/

Engla

nd/C

AMB-

1B05

4B/2

020|

EPI_

ISL_

4528

90|2

020-

05-1

1|Eu

rope

hCoV-19/England/CAMB-7FD05/2020|EPI_ISL_444405|2020-04-19|Europe

hCoV-19/England/CAMB-766A9/2020|EPI_ISL_438512|2020-03-29|Europe

hCoV-19/England/C

AMB-789F9/2020|EPI_ISL_433805|2020-04-07|Europe

hCoV-19/England/CAMB-1AA957/2020|EPI_ISL_470251|2020-04-18|Europe

hCoV-19/England/CAMB-1B01FC/2020|EPI_ISL_459304|2020-05-02|Europe

hCoV-19/England/CAMB-79AF5/2020|EPI_ISL_433823|2020-03-27|Europe

hCoV-19/England/CAMB-7CF5D/2020|EPI_ISL_442305|2020-04-06|Europe

hCoV-19/England/CAMB-73AFB/2020|EPI_ISL_425279|2020-03-30|Europe

hCoV-19/England/CAMB-1AC102/2020|EPI_ISL_444373|2020-04-29

hCoV-19/England/CAMB-1B2093/2020|EPI_ISL_453003|2020-05-18

hCoV-19/England/CAMB-7A001/2020|EPI_ISL_440573|2020-04-03|Europe

hCoV-19/England/CAMB-7EF79/2020|EPI_ISL_442923|2020-04-07|Europe

hCoV-19/England/CAMB-7E61A/2020|EPI_ISL_524548|2020-04-08|Europe

NB08_CAMB-1B2CBD

hCoV-19/England/CAMB-1AE70E/2020|EPI_ISL_524583|2020-04-30|Europe hCoV

-19/E

nglan

d/CAMB-8

0CB9/2

020|E

PI_ISL_

4429

54|20

20-0

4-04

|Eur

ope

hCoV-19/England/CAMB-7F5B2/2020|EPI_ISL_442921|2020-04-08|Europe

hCoV-19/England/CAMB-83EDE/2020|EPI_ISL_438707|2020-04-25|Europe

hCoV-19/England/CAMB-7C069/2020|EPI_ISL_433985|2020-00-00|Europe

hCoV-19/England/CAMB-79D74/2020|EPI_ISL_440592|2020-04-02|Europe

hCoV-19/England/CAMB-7712F/2020|EPI_ISL_438515|2020-04-01|Europe

hCoV-19/England/CAMB-7682B/2020|EPI_ISL_438272|2020-03-30|Europe

hCoV-19/England/CAMB-1AC463/2020|EPI_ISL_489438|2020-04-22|Europe

hCoV

-19/

Engla

nd/C

AMB-

7C4F

4/20

20|E

PI_I

SL_4

4224

8|20

20-0

4-04

|Eur

ope

NB06_CAMB-1B55CE

hCoV-19/England/CAMB-80B16/2020|EPI_ISL_442848|2020-04-11|Europe

hCoV-19/England/CAMB-80DF2/2020|EPI_ISL_442991|2020-04-10|Europe

hCoV-19/England/CAMB-7DF4D/2020|EPI_ISL_441602|2020-04-06|Europe

hCoV-19/England/CAMB-77E46/2020|EPI_ISL_441312|2020-04-01|Europe

hCoV-19/England/CAMB-7A65A/2020|EPI_ISL_441303|2020-04-01|Europe

hCoV-19/England/CAMB-79A22/2020|EPI_ISL_433814|2020-03-26|Europe

hCoV-19/England/CAMB-838B2/2020|EPI_ISL_443409|2020-04-15|Europe

hCoV-19/England/CAMB-734DF/2020|EPI_ISL_440107|2020-03-21|Europe

hCoV-19/England/CAMB-80976/2020|EPI_ISL_442867|2020-04-09|Europe

hCoV-19/England/CAMB-72476/2020|EPI_ISL_439552|2020-03-21|Europe

hCoV-19/England/CAMB-810E9/2020|EPI_ISL_438567|2020-04-19|Europe

hCoV-19/England/CAMB-8268C/2020|EPI_ISL_443674|2020-04-13|Europe

NB09_CAMB-1B5C50

hCoV

-19/

Engla

nd/C

AMB-

726E

9/20

20|E

PI_I

SL_4

3987

9|202

0-03

-23|E

urop

e

NB01_CAMB-1B5ADE

hCoV-19/England/CAMB-7DA88/2020|EPI_ISL_441624|2020-04-06|Europe

hCoV-19/England/CAMB-841C5/2020|EPI_ISL_443480|2020-04-17|Europe

hCoV-19/England/CAMB-1AA9A2/2020|EPI_ISL_470199|2020-04-18|Europe

hCoV

-19/

Engla

nd/C

AMB-

1ACF

CC/2

020|

EPI_

ISL_

4895

29|2

020-

04-2

4|Eu

rope

hCoV-19/England/CAMB-817B7/2020|EPI_ISL_442538|2020-04-11|Europe

hCoV-19/England/CAMB-8466C/2020|EPI_ISL_438746|2020-04-27|Europe

hCoV-19/England/CAMB-7275F/2020|EPI_ISL_439922|2020-03-23|Europe

NB17_CAMB-1B5B71

hCoV-19/England/CAMB-81447/2020|EPI_ISL_443065|2020-04-10|Europe

hCoV-19/England/CAMB-1B2DE7/2020|EPI_ISL_477792|2020-05-18|Europe

NB10_CAMB-1B5643

hCoV-19/England/CAMB-7426B/2020|EPI_ISL_440359|2020-03-19|Europe

hCoV-19/England/CAMB-1AB534/2020|EPI_ISL_470183|2020-04-20|Europe

hCoV-19/England/CAMB-75D69/2020|EPI_ISL_438284|2020-03-29|Europe

NB10_CAMB-1B5C6F

hCoV-19/England/CAMB-1ACDE0/2020|EPI_ISL_489450|2020-04-23|Europe

hCoV-19/England/CAMB-75C8A/2020|EPI_ISL_438262|2020-03-29|Europe

hCoV-19/England/CAMB-7622D/2020|EPI_ISL_438366|2020-03-30|Europe

NB16_CAMB-1B5BAE hCoV-19/England/CAMB-1B102B/2020|EPI_ISL_459229|2020-05-05|Europe

hCoV-19/England/CAMB-78786/2020|EPI_ISL_433770|2020-04-05|Europe

hCoV-19/England/CAMB-1AA368/2020|EPI_ISL_443663|2020-04-17|Europe

hCoV-19/England/C

AMB-1AC427/2020|EPI_IS

L_489465|2020-04-21|Europe

hCoV-19/England/CAMB-84972/2020|EPI_ISL_440517|2020-03-08|Europe

hCoV-19/England/CAMB-755F8/2020|EPI_ISL_425412|2020-03-17|Europe

hCoV-19/England/CAMB-807D6/2020|EPI_ISL_470367|2020-04-09|Europe

NB08_CAMB-1B529A

hCoV-19/England/CAMB-1B5BCC/2020|EPI_ISL_529236|2020-08-15|Europe [&!color=#008026]

hCoV-19/England/CAMB-78A9C/2020|EPI_ISL_433811|2020-04-06|Europe

hCoV

-19/E

nglan

d/CAMB-7

3D7A

/2020

|EPI_I

SL_43

3693

|2020

-04-

01|E

urop

e

hCoV-19/England/CAMB-83C89/2020|EPI_ISL_443328|2020-04-16|Europe

hCoV-19/England/CAMB-82AAE/2020|EPI_ISL_438655|2020-00-00|Europe

hCoV-19/England/CAMB-77EFB/2020|EPI_ISL_440826|2020-03-31|Europe

hCoV-19/England/CAMB-825DA/2020|EPI_ISL_443430|2020-04-12|Europe

hCoV

-19/

Engla

nd/C

AMB-

71D9

A/20

20|E

PI_I

SL_4

3830

0|20

20-0

3-30

|Eur

ope

hCoV

-19/

Engla

nd/C

AMB-

815D

B/20

20|E

PI_I

SL_4

4301

7|20

20-0

4-10

|Eur

ope

hCoV

-19/E

nglan

d/CAM

B-75

BAB/

2020

|EPI

_ISL_

4383

07|20

20-0

3-28

|Eur

ope

hCoV-19/England/CAMB-720EB/2020|EPI_ISL_439629|2020-03-28|Europe

hCoV

-19/

Engla

nd/C

AMB-

1AD2

B3/2

020|

EPI_

ISL_

4479

86|2

020-

00-0

0|Eu

rope

hCoV-19/England/CAMB-7EA4B/2020|EPI_ISL_442553|2020-04-07|Europe

hCoV-19/England/CAMB-76849/2020|EPI_ISL_438269|2020-03-30|Europe

NB02_CAMB-1B0AA6

hCoV

-19/

Engla

nd/C

AMB-

8320

F/20

20|E

PI_I

SL_4

4339

8|202

0-04

-14|E

urop

e

hCoV-19/England/CAMB-7DEF5/2020|EPI_ISL_470494|2020-04-07|Europe

hCoV-19/England/CAMB-758C2/2020|EPI_ISL_440313|2020-03-19|Europe

hCoV

-19/

Engla

nd/C

AMB-

1AAC

5E/2

020|E

PI_I

SL_4

7018

7|202

0-04

-19|E

urop

e

hCoV-19/England/CAMB-7D60C/2020|EPI_ISL_442129|2020-04-07|Europe

hCoV-19/England/CAMB-1ADF61/2020|EPI_ISL_459475|2020-04-26|Europe

NB23_CAMB-1B4FB2

hCoV

-19/E

nglan

d/CAMB-7

84DA/20

20|E

PI_ISL_

4337

42|20

20-0

4-06

|Eur

ope

hCoV-19/England/CAMB-7524C/2020|EPI_ISL_440375|2020-03-12|Europe

NB05_CAMB-1B55FB

hCoV-19/England/CAMB-1AA6D8/2020|EPI_ISL_470277|2020-04-17|Europe

NB16_CAMB-1B5C9C

NB01_CAMB-1B5BDB

hCoV-19/England/CAMB-1AD8EE/2020|EPI_ISL_448022|2020-05-02|Europe

hCoV

-19/E

nglan

d/CAMB-81

7E4/2

020|E

PI_ISL_

4431

12|20

20-04

-11|E

urope

hCoV-19/England/CAMB-7DDF8/2020|EPI_ISL_470381|2020-04-07|Europe

hCoV

-19/

Engla

nd/C

AMB-

83E1

A/20

20|E

PI_I

SL_4

3869

7|202

0-04

-26|E

urop

e

hCoV

-19/

Engla

nd/C

AMB-

7832

8/20

20|E

PI_I

SL_4

3371

8|20

20-0

4-04

|Eur

ope

hCoV-19/England/CAMB-7DC37/2020|EPI_ISL_441686|2020-04-06|Europe

hCoV-19/England/CAMB-1AEA23/2020|EPI_ISL_524584|2020-04-27|Europe

hCoV-19/England/CAMB-7AECB/2020|EPI_ISL_433889|2020-04-10|EuropehC

oV-1

9/En

glan

d/CA

MB-8

0897

/202

0|EP

I_IS

L_44

2895

|202

0-04

-09|

Euro

pe

NB08_CAMB-1B5467

hCoV

-19/E

nglan

d/CAM

B-81

1AA/

2020

|EPI

_ISL_

4385

77|20

20-0

4-20

|Eur

ope

hCoV-19/England/CAMB-7A739/2020|EPI_ISL_441135|2020-04-02|Europe

hCoV

-19/E

nglan

d/CAMB-7

A159/2

020|E

PI_ISL_

4405

86|20

20-0

4-03

|Eur

ope

2.0E-4

A

B

2.0E-4

7 8

8 8

8 6

9 0

9 0

9 8

8 5

100

6 4

8 58 4

100

9 4

9 5

100

7 691

9 0

9 0

9 4

9 9

9 3

9 9

100

9 6

8 8

9 5

100

8 9

100

Wuhan-Hu-1

Patient’s Mobile Phone

Patient’s Call Bell

Case PatientControl Patient 1Control Patient 2Choi et al , NEJM (2020)Control Patient 3Avanzato et al , Cel l (2020)

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 24: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

ETAETA

ETAETA

ETA

Figure 2. Virus genetics and population structure in chronic SARS-CoV-2 infection. A. Highlighter plot indicating nucleotidechanges at consensus level in sequential respiratory samples compared to the consensus sequence at first diagnosis of COVID-19. Each row indicates the timepoint the sample was collected (number of days from first positive SARS-CoV-2 RT-PCR). Blackdashed lines indicate the RNA-dependent RNA polymerase (RdRp) and Spike regions of the genome. Of particular interest, atconsensus level, there were no nucleotide substitutions on days 1-57, despite the patient receiving two courses of remdesivir.The first major changes in the spike genome occurred on day 82, following convalescent plasma given on days 63 and 65. Thedouble amino acid deletion in S1, ΔH69/V70 is indicated by black lines. All samples are nose and throat swabs unless indicatedwith ETA (endotracheal aspirate). B. Whole genome amino acid variant trajectories based on Illumina short read ultra deepsequencing at 1000x coverage. All variants which reached a frequency of at least 10% in at least two samples were plotted.Black dashed line represents Δ69/70. CP, convalescent plasma; RDV, Remdesivir.

1374145505455565766828689939395989999100100101101

ORF3a

ORF1ab Spike N

Envelope

ORFs 6-8

Membrane

ORF10

RdRp

ORF1ab Spike N

1000

0

1500

0

2000

0

2500

0

3000

0

50001

A C T G

Nucleotide Position

RdRp

Days

from

firs

t pos

itive

SAR

S-Co

V-2

RT-P

CRdel

A

B

Figure 2

CP1,2 CP3 + RDVRDV RDV

1 21 41 61 81 101Days since 1st Positive RT-PCR for SARS-CoV-2

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 25: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Figure 3

Figure 3. Longitudinal variant frequencies and phylogenetic relationships for virus populations bearing six Spike (S)mutations A. At baseline, all four S variants (Illumina sequencing) were absent (<1% and <20 reads). Approximately twoweeks after receiving two units of convalescent plasma (CP), viral populations carrying ΔH69/V70 and D796H mutants rose tofrequencies >90% but decreased significantly four days later. This population was replaced by a population bearing Y200Hand T240I, detected in two samples over a period of six days. These viral populations were then replaced by virus carryingW64G and P330S mutations in Spike, which both reached near fixation at day 93. Following a 3rd course of remdesivir and anadditional unit of convalescent plasma, the ΔH69/V70 and D796H virus population re-emerged to become the dominant viralstrain reaching variant frequencies of >90%. Pairs of mutations arose and disappeared simultaneously indicating linkage onthe same viral haplotype. B.Maximum likelihood phylogenetic tree of the case patient with day of sampling indicated. Spikemutations defining each of the clades are shown ancestrally on the branches on which they arose. Number at node denotessupport by ultrafast bootstrapping consisting of 1000 replicates.

A

B

6.0E-5

7 6

100

7 8

9 3

75

9 8

61

6 6

6 4

6 9

81

8 8

Wuhan-Hu-1

Day 37

Day 56

Day 45

Day 54

Day 55

Day 57

Day 1

Day 45

Day 95Day 93

Day 93

Day 50

Day 100

Day 82

Day 99

Day 101

Day 101

Day 100

Day 98

Day 99

Day 66

Day 89

Day 86

W64G, P330S

D614G

Y200H, T240IW258S

H69/V70, D796HΔ

1 371s

t Rem

desi

vir 4

1 45 502n

d Re

mde

sivi

r 54

1st +

2nd

CP

65 66 82 86 89

3rd

Rem

desi

vir 9

3

3rd

CP 9

5 98 99 100

1010

10

20

30

40

50

60

70

80

90

100

Number of days since 1st positive SARS-COV-2 Swab RT-PCR

Mut

atio

n Pr

evel

ance

(%)

D796HΔH69/V70 Y200H T240IP330SW64G

15

20

25

30

Ct

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 26: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Figure 4: Spike mutant D796H + ΔH69/V70 has infectivity comparable to wild type but is less sensitive tomultiple units of convalescent plasma (CP) and sera from recovered individuals. A. Reverse transcriptase activityof virus supernatants containing lentivirus pseudotyped with SARS-CoV-2 Spike protein (WT versus mutant) B.Single round Infectivity of luciferase expressing lentivirus pseudotyped with SARS-CoV-2 Spike protein (WT versusmutant) on 293T cells co-transfected with ACE2 and TMPRSS2 plasmids. C. Patient serum (taken at indicated Day(D) ) and convalescent plasma (CP units 1-3) neutralization potency against Spike mutant D796H + ΔH69/V70measured using lentivirus pseudotyped with SARS-CoV-2 Spike protein (WT versus mutant). Indicated is serumdilution required to inhibit 50% of virus infection. D. Neutralization potency of sera from 5 unselectedconvalescent patients (with previous confirmed SARS-CoV-2 infection) against WT versus mutant virus as in panelC. E Neutralization potency of a panel of monoclonal antibodies against lentivirus pseudotyped with SARS-CoV-2Spike protein (WT Wuhan versus single mutants D614G, D796H, ΔH69/V70). Data are representative of at leasttwo independent experiments * p<0.05

WT

WT+D79

6H+∆

69∆7

00

20,000

40,000

60,000

80,000

100,000

Infe

ctiv

ity

(RLU

/µL)

WT

WT+D79

6H+∆

69∆7

00

500,000

1,000,000

1,500,000

2,000,000

RT A

ctiv

ity

(pU

/µL)

A B

C D

E

Figure 4

CP1 D64 CP2 D66 CP30

1000

2000

3000

Sera

IC50

(Neu

tral

izat

ion)

WT

D796H/Δ69/70

S 5 S 4 S 3 S 2 S 10

100020003000

2000030000400005000060000

IC50

(Neu

tral

izat

ion)

WT

D796H/Δ69/70

CP1 D64 CP2 D66 CP30

1000

2000

3000

Sera

IC50

(Neu

tral

izat

ion)

WT

D796H/Δ69/70

* **

* * *

**

WT WTΔ69/70D796H

Δ69/70D796H

-4 -3 -2 -1 0-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA1-18

-3 -2 -1 0 1-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA1-12

-3 -2 -1 0 1-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA2-29

-3 -2 -1 0 1 2-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA2-02

-3 -2 -1 0 1-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA1-16

-3 -2 -1 0 1-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

Δ6970

COVA1-21

D796HWT (D614G)

WT (Wuhan)

-3 -2 -1 0 1-25

0

25

50

75

100

Log10 IgG µg/ml

% N

eutr

aliz

atio

n

COVA2-07

. CC-BY-NC-ND 4.0 International licenseIt is made available under a

is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.(which was not certified by peer review)preprint The copyright holder for thisthis version posted December 19, 2020. ; https://doi.org/10.1101/2020.12.05.20241927doi: medRxiv preprint

Page 27: 2020/12/05  · cases of chronic SARS-CoV-2 shedding.. A

Figure 5. Location of Spike mutations ΔH69/Y70 in S1 and D796H in S2. Amino acid residues H69 andY70 deleted in the N-terminal domain (red) and D796H in subunit 2 (orange) are highlighted on a SARS-CoV-2 spike trimer (PDB: 6ZGE Wrobek et al., 2020). Each of the three protomers making up the Spikehomotrimer are coloured separately in shades of grey (centre). Close-ups of ΔH69/Y70 (above) andD796H (below) are shown in cartoon, stick representation. Both mutations are in exposed loops.

S1

S2

RBD

Viral Membrane

NTD

ΔH69/V70

D796

Figure 5