Update to "assessing far UVC interventions with an individual based infectious disease model"

Overview

In our initial report we introduced helios: an individual-based modelling framework for exploring the impact that installation of far UVC could have on the transmission and burden of respiratory infections. This report describes updates made to the modelling framework since that initial report (Section 1). Additionally, we present a series of analyses evaluating the impact that installation of far UVC in non-household settings could have on the burden of endemic respiratory pathogens, using “influenza-like” and “SARS-CoV-2-like” pathogen archetypes as case studies (Section 2).

1 Major Model Changes

Two major updates to the model have been made since the initial report. These are:

Using data from the United States of America (USA) to parameterise all aspects of the model’s representation of the physical environment (Section 1.1).
Including variation in the propensity for infectious disease transmission (which we call “riskiness”) across individual locations in the model (Section 1.2).

1.1 Use of Data from the USA

Previously, we used data both from the United Kingdom (UK) and from the USA. In particular, we used data from the UK to parameterise the age distribution, household and school Setting Types; and data from the USA to parameterise the workplace Setting Type. In this update to the model, we parameterise the age distribution, household and school Setting Types using data only from the USA.

1.1.1 Household Setting Type & Age Distribution

To parameterise the model’s household size and age distribution we use data from a synthetic USA population developed by RTI International for the purposes of disease modelling (Wheaton and RTI 2014). This dataset is synthetic, but was developed by calibrating to data sources from the USA; it is therefore a virtual, anonymised population that is representative of the USA. It contains individual-level information on household residency and age, which we use to construct the household Setting Type and age distribution used in the results presented here. State- and county-specific data are available; we selected data from San Francisco, California.

Figure 1.1A shows the previous household size distribution used (based on UK data); Figure 1.1B shows the household size distribution now being used (based on the USA data described above). Figure 1.2 shows the same but for the age distribution used.

Figure 1.1: A distribution of household sizes for a synthetic population of size 250,000 using data from the UK (A) and from the USA (B).

Figure 1.2: The distribution of population sizes by age group for a synthetic population of size 250,000 using data from the UK (A) and from the USA (B).

1.1.2 School Setting Type

To parameterise the model’s school sizes, we used US school size data from 2019-2020 from the National Center for Education Statistics, available from Brey, Zhang, and Duffy (2023). We included all schools categorised as “prekindergarten”, “elementary”, “middle”, or “secondary and high”. As school sizes were provided in ranges (e.g. “500 to 599”), we used the midpoint (e.g. 550) when constructing the distribution of school sizes (in contrast to the UK data, where exact school sizes were available). These data were available only at the national level, so we use the national distribution here.

Figure 1.3A shows, for a population of 250,000, the school size distribution used previously (based on UK data); Figure 1.3B shows the school size distribution now being used (based on the USA data described above).

Figure 1.3: A distribution of school sizes for a synthetic population of size 250,000 using data from the UK (A) and from the USA (B). As the UK has a higher proportion of children, the total school occupancy in the UK is higher than in the US.

1.2 Location-Specific Variability in “Riskiness”

Previously, we modelled differences in “riskiness” (i.e. how conducive to transmission it is) between different Setting Types (schools, workplaces, leisure settings and households). However, we made the simplifying assumption that all Locations within a particular Setting Type (e.g. individual schools) had the same riskiness. We now explicitly incorporate variation in riskiness between Locations belonging to the same Setting Type. Below is a mathematical description of this addition and the parameterisation used for the results presented in this report.

1.2.1 Modelling Variation in Transmission Riskiness Between Locations of the Same Setting Type

Previously, we modelled the force of infection experienced by individual $i$ in Location $j$ of Setting Type $m$ at time $t$ as follows: \[ \lambda_{i,j,m}(t) = \frac{\beta_{m} I_{j,m}(t)}{N_{j,m}} \] where $I_{j,m}(t)$ is the number of infectious individuals in Location $j$ of Setting Type $m$ at time $t$, $N_{j,m}$ is the total number of individuals visiting Location $j$ of Setting Type $m$ at time $t$, and $\beta_m$ is a Setting Type-specific (i.e. workplace, school, household or leisure-venue specific) transmissibility parameter that models how much transmission happens, on average, in a particular Setting Type (e.g. workplaces) compared to other Setting Types (e.g. schools). The force of infection an individual experiences is then summed across all Setting Types to give a total, $\lambda_{i,T}$, which forms the basis for calculating that individual’s probability of infection during one time-step $t$: \[ P(infection) = 1 - e^{- \lambda_{i,T}t}. \]

Previously, $\beta$ was specific to the Setting Type (i.e. it was different for schools, workplaces, households and leisure settings) but assumed to be the same for every Location of that Setting Type (e.g. all schools had the same $\beta$). We have now relaxed this assumption. The updated force of infection term is:

\[ \lambda_{i,j,m}(t) = \frac{\beta_{j,m} I_{j,m}(t)}{N_{j,m}} \]

where:
\[ \beta_{j,m} = \gamma_{j,m}\beta_{m} \] and: \[ \gamma_{j,m} \sim Lognormal(\mu_{m}, \sigma_{m}). \] The parameter $\gamma_{j,m}$ controls how risky a particular Location of a specific Setting Type is relative to the average for that Setting Type and is drawn from a Lognormal distribution. For example, $\gamma = 1$ describes a Location that is as “risky” as the average Location of that Setting Type, $\gamma = 2$ a Location that is twice as risky, and $\gamma = 0.5$ a Location that is half as risky. The parameterisation and estimation of $\gamma$ are described below.
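
The Location-level scaling described above can be sketched as follows. This is a minimal illustration rather than the helios implementation: the function names, the value of `sigma`, and the choice $\mu = -\sigma^2/2$ (which makes the mean of $\gamma$ equal to 1, so that $\beta_m$ remains the Setting Type average) are assumptions, as is reading $\sigma_m$ as the standard deviation on the log scale.

```python
import numpy as np

def sample_location_riskiness(n_locations, sigma, rng):
    """Draw one gamma per Location from a Lognormal with mean 1.

    Assumption: mu = -sigma**2 / 2, so that E[gamma] = 1. The report does
    not state the mu_m actually used; sigma is the log-scale std. dev.
    """
    mu = -0.5 * sigma**2
    return rng.lognormal(mean=mu, sigma=sigma, size=n_locations)

def force_of_infection(beta_m, gamma_jm, n_infectious, n_present):
    """lambda = gamma_{j,m} * beta_m * I_{j,m}(t) / N_{j,m}."""
    return gamma_jm * beta_m * n_infectious / n_present

def infection_probability(total_foi, dt):
    """P(infection) = 1 - exp(-lambda_{i,T} * dt) for a time-step of length dt."""
    return 1.0 - np.exp(-total_foi * dt)

rng = np.random.default_rng(seed=1)
# sigma, beta_m and the occupancy numbers below are illustrative values only.
gammas = sample_location_riskiness(n_locations=1000, sigma=0.5, rng=rng)
foi = force_of_infection(beta_m=0.3, gamma_jm=gammas, n_infectious=2, n_present=50)
p_inf = infection_probability(foi, dt=1.0)
print(gammas.mean(), p_inf.min(), p_inf.max())
```

Because $\gamma_{j,m}$ multiplies $\beta_m$ directly, a Location with $\gamma = 2$ has exactly twice the force of infection of an average Location with the same occupancy and number of infectious individuals.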

1.2.2 Parameterising $\gamma$ and Estimating Variation in Transmission Riskiness Across Locations of the Same Setting Type

To estimate $\gamma$, we combined a mathematical model of indoor air quality (as applied to infective virus concentrations) with a well-studied airborne infectious risk model (the Wells-Riley equation). We then parameterised these models with literature-derived estimates of:

Virus-specific parameters (with SARS-CoV-2 chosen as a case study),
Variation in ventilation rates across Locations belonging to the same Setting Type.

Together, these enable us to produce estimates of the variation in riskiness for SARS-CoV-2 transmission between different Locations belonging to the same Setting Type. Below we describe this process.

1.2.2.1 Step 1: Modelling the Concentration of Infective Virus in an Enclosed Space and Associated Infection Risk

Following the approach of Blatchley et al, 2024, we model the concentration of a pathogen in the air of an enclosed space via the following ordinary differential equation (ODE): \[ \nu \frac{dC(t)}{dt} = - QC - k_DC \nu + I\pi \] where $\nu$ is the volume of the space, $C$ is the infective virus concentration, $t$ is time, $Q$ is the flow rate of air into and out of the space, $k_D$ is the natural decay constant, $I$ is the number of infectious individuals in the space, and $\pi$ is the per capita (infective) virus emission rate. Assuming no infective virus is present at $t = 0$, the closed-form solution to this equation is: \[ C(t) = \frac{I\pi}{\alpha \nu}(1 - e^{-\alpha t}) \] where: \[ \alpha = \frac{Q}{\nu} + k_D. \] Here $\frac{Q}{\nu}$ is the number of air changes per hour, so $\alpha$ is the number of equivalent air changes per hour, taking into account both natural decay of the virus and ventilation-driven removal of infective virus. For the purposes of the analyses presented here, we assume the volume of the space is: \[ \nu = \frac{N H}{D_m} \] where $N$ is the number of people occupying the space, $D_m$ is the density of people in the space (i.e. the number of people per m², so that $N / D_m$ is the floor area) and $H$ is the height of the space.
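
As a check on the closed-form solution, the sketch below evaluates $C(t)$ directly and compares it with a simple forward-Euler integration of the ODE, starting from $C(0) = 0$. The function names, the room volume of 250 m³ and the ventilation rate of 2 air changes per hour are illustrative assumptions; $\pi = 27$ and $k_D = 0.64$ are taken from Table 1.1, with $k_D$ assumed to be expressed per hour.

```python
import math

def alpha(ach, k_d):
    """Equivalent air changes per hour: alpha = Q / nu + k_D."""
    return ach + k_d

def concentration(t, n_infectious, emission_rate, volume, ach, k_d):
    """Closed form: C(t) = I * pi / (alpha * nu) * (1 - exp(-alpha * t)), C(0) = 0."""
    a = alpha(ach, k_d)
    return n_infectious * emission_rate / (a * volume) * (1.0 - math.exp(-a * t))

def concentration_euler(t, n_infectious, emission_rate, volume, ach, k_d, steps=100_000):
    """Forward-Euler integration of dC/dt = -alpha * C + I * pi / nu, as a cross-check."""
    a = alpha(ach, k_d)
    dt = t / steps
    c = 0.0
    for _ in range(steps):
        c += dt * (-a * c + n_infectious * emission_rate / volume)
    return c

# Illustrative room: one infectious person, 250 m^3, 2 air changes per hour.
params = dict(n_infectious=1, emission_rate=27.0, volume=250.0, ach=2.0, k_d=0.64)
for hours in (0.5, 2.0, 8.0):
    print(hours, concentration(hours, **params), concentration_euler(hours, **params))
```

For large $t$ the exponential term vanishes and the concentration approaches the plateau $I\pi / (\alpha\nu)$, which is why a higher $\alpha$ (more ventilation or faster decay) lowers the long-run concentration proportionally.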

We then integrate this estimate of the concentration of infective virus in the space with the Wells-Riley equation, a simple and widely used model for the airborne transmission of infectious diseases. The Wells-Riley equation takes the following form: \[ P_{i}(t) = 1 - e^{-r C_{ss}Bt} \] where $P_i$ is the probability a person becomes infected conditional on them spending time $t$ in a space with $C_{ss}$ concentration of infective virus (the subscript denoting the steady-state concentration, i.e. the value $\frac{I\pi}{\alpha \nu}$ that $C(t)$ approaches for large $t$), $B$ is the volume rate at which air is breathed in, and $r$ is the risk parameter, the probability that a single inhaled infective virus will initiate an infection.

1.2.2.2 Step 2: Estimating Variation in Riskiness Across Locations of the Same Setting Type

With the modelling framework to assess risk in place, we carried out a sensitivity analysis to derive reasonable bounds for $\gamma$. We used the above equations in tandem with estimates of the variation in ventilation rates in different spaces (from Corsi et al, 2006 - Section 3.1) and parameter estimates for SARS-CoV-2 from Blatchley et al (see Table 1.1) to explore how Location riskiness changed as we varied:

Infection Prevalence (i.e. the number of infectious individuals in the space).
Air Changes Per Hour (considering the minimum and maximum of the ranges provided in Table 1.1).
The Time Spent in the Space (ranging from 0 to 8 hours).

The results of this analysis are shown in Figure 1.4. They show that the riskiness ratio (i.e. how much more risky the most risky setting is compared to the least risky setting) for each Setting Type is largely the same across the different modelled infection prevalences. The ratio does, however, depend on the assumed time spent in the space. For the purposes of the results presented here, we assume that individuals spend the following time in each Setting Type:

Household: 12 hours [corresponding to a riskiness ratio of 2.5]
Leisure: 2.5 hours [riskiness ratio of 5.5]
School: 8 hours [riskiness ratio of 4.75]
Workplace: 8 hours [riskiness ratio of 6.35]

and 1.5 hours in the community, noting that individuals spend time in either Schools (children and teachers) or Workplaces (all other adults) but not both. We use these ratios as the basis for the Setting Type and Location specific $\gamma_{j,m}$ values used in the modelling analyses presented in the next section.
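
The riskiness ratio described above can be sketched by combining the steady-state concentration with the Wells-Riley equation, using the values in Table 1.1. This is an illustrative sketch under stated assumptions, not the calculation used to produce Figure 1.4: it takes the floor area to be the number of occupants divided by the density, uses the steady-state concentration throughout, treats $k_D$ as a per-hour rate, and assumes a prevalence of 1% among occupants. Its outputs therefore need not match the ratios quoted above exactly, particularly where the build-up of concentration over time matters.

```python
import math

# Values from Table 1.1; k_D is assumed to be per hour.
PI_EMISSION = 27.0       # infective viruses emitted per infectious person per hour
K_D = 0.64               # natural decay constant
HEIGHT = 2.5             # height of the space, m
BREATHING_RATE = 0.45    # m^3 of air breathed in per hour
RISK_PER_VIRUS = 1.37e-2

SETTINGS = {
    # name: (people per m^2, minimum ACPH, maximum ACPH, hours spent)
    "household": (0.05, 0.21, 1.48, 12.0),
    "leisure": (0.50, 0.22, 4.84, 2.5),
    "school": (0.30, 0.10, 2.90, 8.0),
    "workplace": (0.10, 0.22, 4.84, 8.0),
}

def steady_state_concentration(prevalence, density, ach):
    """C_ss = I * pi / (alpha * nu) with I = prevalence * N and nu = N * H / density.

    N cancels, so only prevalence and density are needed (assumption of this sketch).
    """
    return prevalence * PI_EMISSION * density / ((ach + K_D) * HEIGHT)

def wells_riley(c_ss, hours):
    """P = 1 - exp(-r * C_ss * B * t)."""
    return 1.0 - math.exp(-RISK_PER_VIRUS * c_ss * BREATHING_RATE * hours)

def riskiness_ratio(setting, prevalence=0.01):
    """Infection risk at the lowest ventilation rate divided by risk at the highest."""
    density, ach_min, ach_max, hours = SETTINGS[setting]
    p_most = wells_riley(steady_state_concentration(prevalence, density, ach_min), hours)
    p_least = wells_riley(steady_state_concentration(prevalence, density, ach_max), hours)
    return p_most / p_least

for name in SETTINGS:
    print(name, round(riskiness_ratio(name), 2))
```

When the infection probabilities are small, the exponent is approximately linear and the ratio tends towards $\alpha_{max} / \alpha_{min}$, which is why it is insensitive to the assumed prevalence.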

Figure 1.4: Estimating the variation in riskiness between Locations belonging to the same Setting Type. A) For each Setting Type, the probability of infection in the least and most risky settings (i.e. with the lowest and highest ventilation rates) over time. B) The ratio of infection risk in the most vs least risky Locations over time for each Setting Type. Vertical dashed lines indicate the time assumed spent in each Setting Type for purposes of modelling. Note ratios are the same for Leisure and Workplace and so only Workplace is plotted.

Table 1.1: Parameter Estimates Used to Calculate Variation in Riskiness Across Locations Belonging to the Same Setting Type
Parameter	Value Used	Description	Source
$\pi$	27 viruses per person per hour	Rate of infective virus emission from infected individuals.	Blatchley III and Cui (2023) & Buonanno, Morawska, and Stabile (2020)
$D_{school}$	30 per 100m²	Assumed density of individuals in schools	ANSI/ASHRAE Standard 62.1-2022 Table 6.1
$A_{school}$	0.1-2.9 air changes per hour (ACPH)	Range of ACPH considered for schools.	Corsi et al, 2006 - Section 3.1
$D_{workplace}$	10 per 100m²	Assumed density of individuals in workplaces	ANSI/ASHRAE Standard 62.1-2022 Table 6.1
$A_{workplace}$	0.22-4.84 ACPH	Range of ACPH considered for workplaces.	Corsi et al, 2006 - Section 3.1
$D_{household}$	5 per 100m²	Density of individuals in households	Assumed
$A_{household}$	0.21-1.48 ACPH	Range of ACPH considered for households.	Corsi et al, 2006 - Section 3.1
$D_{leisure}$	50 per 100m²	Density of individuals in leisure settings	ANSI/ASHRAE Standard 62.1-2022 Table 6.1
$A_{leisure}$	0.22-4.84 ACPH	Range of ACPH considered for leisure locations.	Corsi et al, 2006 - Section 3.1
$k_D$	0.64 per hour	Natural decay constant for SARS-CoV-2 in aerosols	Blatchley III and Cui (2023) & Van Doremalen et al. (2020)
$H$	2.5m	Height of the space	Assumed
$B$	0.45 m³ per hour	Volume breathing rate assumes 15 breaths per minute and 500ml tidal volume = 0.45m³ per hour.	Here & here
$r$	1.37 × 10^-2	The probability that a single inhaled infective virus will initiate an infection.	Blatchley III and Cui (2023) & Killingley et al. (2022)

2 Assessing the impact of far UVC on the burden of endemic respiratory viruses

Using the modelling framework described above, we conducted analyses to evaluate the potential impact of far UVC deployment on the transmission and disease burden of a hypothetical endemic respiratory virus (i.e. one which is consistently present in a population and maintained at a particular baseline prevalence level).

2.1 Results

We considered two pathogen archetypes for this hypothetical virus:

“SARS-CoV-2-Like” Archetype - $R_0$ of 2.5, a mean latent period of 2 days, a mean duration of infectiousness of 4 days, and a mean duration of immunity of 365 days. This gives an approximate infection prevalence of 1% (i.e. approximately 1 in 100 individuals are infected at any given timepoint).
“Influenza-Like” Archetype - $R_0$ of 1.5, a mean latent period of 1 day, a mean duration of infectiousness of 2 days, and a mean duration of immunity of 365 days. This gives an approximate infection prevalence of 0.3% (i.e. 1 in 330 individuals infected at any given timepoint).

For each pathogen archetype, we investigated the effect of far UVC efficacy and far UVC coverage on the annual incidence of infection, varying:

far UVC Coverage: Either 10%, 25% or 50% - in all cases, far UVC was assumed to be installed in schools, workplaces and leisure locations only (i.e. not households), at a random set of locations within each Setting Type.
far UVC Efficacy: Either 40%, 60% or 80%, with the assumption that far UVC efficacy was identical across all Settings and Locations where it had been installed.

The results of these analyses are presented in Figure 2.1. In general, we estimate that far UVC installation would have a larger (proportional) impact on disease burden for an “Influenza-Like” pathogen than a “SARS-CoV-2-Like” pathogen; and that increasing far UVC coverage and/or increasing far UVC efficacy leads to increased impact. Assuming 10% coverage and 60% efficacy, far UVC installation led to a 7% (range 2%-11%) reduction in annual infection incidence for “Influenza-Like” and 3% (range 2%-5%) for “SARS-CoV-2-Like”. At 10% coverage and 80% assumed efficacy, these are 10% (range 6%-16%) and 5% (range 2%-8%) respectively.

Figure 2.1: The impact of varying far UVC coverage and efficacy on the annual incidence of a respiratory virus, for “Influenza-Like” and “SARS-CoV-2-Like” pathogen archetypes. The bars represent the mean percentage reduction in infection incidence averaged over the 10 stochastic model simulations run for each parametrisation, with the range of % reduction in incidence across those 10 simulations shown by the error bars.

We next compared these results to those produced by a simple, static multiplicative model, in which estimated impact is calculated by multiplying the coverage and efficacy of the modelled far UVC together and then multiplying this by the proportion of transmission that is “targetable” by far UVC (i.e. the proportion of transmission that occurs outside households). The results of this analysis are plotted in Figure 2.2. The simplified model provides similar estimates to those generated by helios for the “SARS-CoV-2-Like” archetype, but substantially underestimates the impact for the “Influenza-Like” archetype.
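
The static multiplicative model amounts to a single product, sketched below over the coverage and efficacy values considered in this report. The function name and the value used for the proportion of transmission occurring outside households are hypothetical, chosen only for illustration; the report does not state that proportion here.

```python
from itertools import product

def static_impact(coverage, efficacy, non_household_share):
    """Proportional reduction in incidence under the static multiplicative model."""
    for value in (coverage, efficacy, non_household_share):
        if not 0.0 <= value <= 1.0:
            raise ValueError("all inputs must be proportions between 0 and 1")
    return coverage * efficacy * non_household_share

# Hypothetical value: share of transmission occurring outside households.
NON_HOUSEHOLD_SHARE = 0.7

for coverage, efficacy in product((0.10, 0.25, 0.50), (0.40, 0.60, 0.80)):
    reduction = static_impact(coverage, efficacy, NON_HOUSEHOLD_SHARE)
    print(f"coverage={coverage:.0%} efficacy={efficacy:.0%} -> {reduction:.1%}")
```

Because this model is linear in each input, it cannot capture any dependence of impact on baseline transmissibility, which is one reason its estimates can diverge from those of a dynamic model.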

Figure 2.2: Comparing the estimates of impact for helios and a simplified multiplicative model. x-axis indicates the reduction estimated by the simple model; y-axis the estimate produced by helios. Points are coloured according to pathogen archetype. Dashed line indicates the line of y = x (i.e. any points lying on that line have the same impact estimate from helios and the simplified model). Grey shaded area indicates range where the simple model predicts lower impact than helios.

2.2 Discussion

The observed differences in predicted far UVC impact between the two pathogen archetypes arise primarily because of their different $R_0$ values. In Figure 2.3 we show the analytical relationship between $R_0$ and prevalence of infection, derived for a SEIRS compartmental model (see here and here) that shares a similar representation of a disease’s natural history to helios. We note that the results presented in Figure 2.3 are not results from running helios; they are the analytical solution of a simpler, more tractable mathematical model that is similar to helios, and are displayed here solely to illustrate the generally observed non-linear relationship between transmissibility ($R_{0}$) and infection prevalence.

The influence of $R_{0}$ on the prevalence of infection (and hence disease burden) is non-linear. For lower values of $R_{0}$, the relationship between the two quantities is near-linear. At higher values of $R_{0}$, however, infection prevalence saturates, and further increases in $R_0$ produce progressively smaller increases in infection prevalence. When $R_0$ is high (as for the “SARS-CoV-2-Like” archetype), small reductions in $R_0$ (e.g. due to far UVC) will have only a slight impact on infection prevalence. By contrast, when baseline $R_0$ is lower (as for the “Influenza-Like” archetype), the infection prevalence will decrease more (in relative terms) for the same reduction in $R_0$. Increasing the duration of infectiousness increases the prevalence of infection in a linear manner.
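
The saturation described above can be illustrated with the endemic equilibrium of a basic SEIRS model. The sketch below assumes a textbook SEIRS formulation with homogeneous mixing and no births or deaths, which may differ in detail from the model used for Figure 2.3; the function names are hypothetical, and counting both latent and infectious individuals as “infected” is an assumption. With the archetype parameters given in Section 2.1, this formula gives prevalences of roughly 1% and 0.3%.

```python
def endemic_prevalence(r0, latent_days, infectious_days, immune_days, include_latent=True):
    """Equilibrium prevalence for a SEIRS model without births or deaths.

    At equilibrium S* = 1 / R0, and the remaining 1 - 1 / R0 of the population
    is split across E, I and R in proportion to the mean time spent in each.
    """
    if r0 <= 1.0:
        return 0.0  # infection cannot persist
    cycle = latent_days + infectious_days + immune_days
    infected_days = infectious_days + (latent_days if include_latent else 0.0)
    return (1.0 - 1.0 / r0) * infected_days / cycle

def relative_drop(r0, r0_reduction, **durations):
    """Relative fall in equilibrium prevalence when R0 is cut by r0_reduction."""
    before = endemic_prevalence(r0, **durations)
    after = endemic_prevalence(r0 * (1.0 - r0_reduction), **durations)
    return 1.0 - after / before

archetypes = {
    "SARS-CoV-2-like": dict(r0=2.5, latent_days=2.0, infectious_days=4.0, immune_days=365.0),
    "Influenza-like": dict(r0=1.5, latent_days=1.0, infectious_days=2.0, immune_days=365.0),
}
for name, pars in archetypes.items():
    prevalence = endemic_prevalence(**pars)
    drop = relative_drop(r0_reduction=0.05, **pars)  # 5% cut in R0 is illustrative
    print(f"{name}: prevalence={prevalence:.2%}, drop for 5% lower R0={drop:.1%}")
```

In this simplified model the same proportional cut in $R_0$ produces a larger relative fall in prevalence at $R_0 = 1.5$ than at $R_0 = 2.5$, because the factor $1 - 1/R_0$ is steeper at low $R_0$.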

Figure 2.3: The relationship between $R_0$ and the prevalence of infection at endemic equilibrium for each of the two pathogen archetypes considered here, derived mathematically for a simpler model that is similar to helios. The vertical dashed lines indicate the value of $R_0$ used for each archetype in the analyses carried out.

Bibliography

Blatchley III, Ernest R, and Haiying Cui. 2023. “Quantitative Microbial Risk Assessment for Quantification of the Effects of Ultraviolet Germicidal Irradiation on COVID-19 Transmission.” Environmental Science & Technology 57 (45): 17393–403.

Brey, Cristobal de, Anlan Zhang, and Sarah Duffy. 2023. Digest of Education Statistics, 2021. Compendium. National Center for Education Statistics (NCES). https://nces.ed.gov/programs/digest/d21/.

Buonanno, Giorgio, Lidia Morawska, and Luca Stabile. 2020. “Quantitative Assessment of the Risk of Airborne Transmission of SARS-CoV-2 Infection: Prospective and Retrospective Applications.” Environment International 145: 106112.

Killingley, Ben, Alex J Mann, Mariya Kalinova, Alison Boyers, Niluka Goonawardane, Jie Zhou, Kate Lindsell, et al. 2022. “Safety, Tolerability and Viral Kinetics During SARS-CoV-2 Human Challenge in Young Adults.” Nature Medicine 28 (5): 1031–41.

Van Doremalen, Neeltje, Trenton Bushmaker, Dylan H Morris, Myndi G Holbrook, Amandine Gamble, Brandi N Williamson, Azaibi Tamin, et al. 2020. “Aerosol and Surface Stability of SARS-CoV-2 as Compared with SARS-CoV-1.” New England Journal of Medicine 382 (16): 1564–67.

Wheaton, WD, and Int RTI. 2014. “US Synthetic Population 2010 Version 1.0 Quick Start Guide.” RTI International.

Charlie Whittaker¹, Tom Brewer², Adam Howes³

July 2024
