Malariaverse Workshop Calibration

Pete Winskill

Welcome back!

Session aims:

Even given all the information about a site (vectors, seasonality, PfPr_2-10, etc.), we still need to make our best estimate of the “baseline” level of transmission.
One relatively simple way of doing this is by modifying the modelled baseline until our model output best matches some observations (e.g. PfPr_2-10).

Site files in 📦 [foresite] have already been calibrated; the calibrated values are stored in the $eir element of the site file.
The EIR (Entomological Inoculation Rate) is our “dial” we can turn to change baseline transmission.
Turning the EIR dial essentially changes the carrying capacity of the environment we are modelling - how many mosquitoes it can support.

Data 📑
- It is your decision which data (and how much of it) to calibrate to.
- It is important to understand how and when data were collected. For example, data collected at the peak of the malaria season may look very different from data collected in the low season.

Warmup 💪
- 📦 malariasimulation does not automatically run a warmup period, which is necessary for the model to fully equilibrate. In general, it is advisable to add a warmup period to your simulation runs. Some outputs are more sensitive to the warmup length than others, so explore a suitable warmup period before calibrating.
- Subsequent model runs after calibration should use the same warmup period as the calibration.
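
One way to picture the warmup: the summary step simply ignores the warmup timesteps before computing anything. The sketch below is a toy illustration in Python, not malariasimulation code. The function name, the input lists and the 365-day year are all assumptions made for the example.

```python
DAYS_PER_YEAR = 365  # assumption: daily timesteps, no leap years

def annual_prevalence(n_detected, n_population, warmup_years):
    """Mean daily prevalence for each 365-day block after the warmup period."""
    start = warmup_years * DAYS_PER_YEAR  # first timestep to keep
    annual = []
    for first in range(start, len(n_detected), DAYS_PER_YEAR):
        detected = n_detected[first:first + DAYS_PER_YEAR]
        population = n_population[first:first + DAYS_PER_YEAR]
        daily = [d / n for d, n in zip(detected, population)]
        annual.append(sum(daily) / len(daily))
    return annual

# Made-up output: 1 warmup year followed by 2 years to compare with data
n_detected = [100] * 365 + [200] * 365 + [300] * 365
n_population = [1000] * (3 * 365)
pfpr = annual_prevalence(n_detected, n_population, warmup_years=1)
print(pfpr)
```

Here the first year (prevalence 0.1) is discarded and only the two later years are summarised. Using the same warmup_years in calibration and in later runs keeps the comparison like-for-like.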

Calibration or re-calibration is not always required. In general, you will need to calibrate if:

- You have created a new site file
- You have modified any site inputs that occur before or at the same time as the data you are trying to calibrate to
- The data you are calibrating to have changed
- There has been a change to an underlying assumption that would impact the disease dynamics in a model run

We can use 📦cali for model calibration.
The calibrate() function takes a target, for example annual PfPr_2-10 for the years 2015-2020.
The user defines a summary_function, a function that takes the raw model output and produces the model estimate corresponding to the target. In this case our summary function will summarise the model-estimated PfPr_2-10 for the years 2015-2020.
calibrate() will then search a range of EIRs until the output of summary_function is within a user-defined tolerance of the target.
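
The target / summary_function / tolerance idea can be sketched with a toy example. This is not the 📦cali implementation or its API: toy_model is a made-up stand-in for a simulation run, and the bisection search on log(EIR) is just one simple search strategy chosen for illustration. It relies on the assumption that prevalence increases with EIR.

```python
import math

def toy_model(eir, years=6):
    """Hypothetical stand-in for a model run: annual PfPr_2-10 for each year.
    Prevalence rises with EIR and saturates below 1."""
    prevalence = eir / (eir + 10.0)
    return [prevalence] * years

def summary_function(output):
    """Reduce raw model output to the quantity comparable with the target."""
    return sum(output) / len(output)

def calibrate_eir(target, tolerance, low=0.01, high=1000.0, max_iter=50):
    """Bisect on log(EIR) until the summarised output is within tolerance."""
    for _ in range(max_iter):
        mid = math.exp((math.log(low) + math.log(high)) / 2)
        estimate = summary_function(toy_model(mid))
        if abs(estimate - target) <= tolerance:
            return mid, estimate
        if estimate < target:
            low = mid   # too little transmission: search higher EIRs
        else:
            high = mid  # too much transmission: search lower EIRs
    raise RuntimeError("no EIR found within tolerance")

eir, fitted = calibrate_eir(target=0.3, tolerance=0.005)
print(f"EIR = {eir:.2f}, fitted PfPr_2-10 = {fitted:.3f}")
```

Every pass through the loop is one full model run, which is why calibration gets expensive once toy_model is replaced by a real simulation.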

It is always good practice to run a simulation and test your summary_function before using it in calibrate() - this will help with debugging 🐞.
There are lots of other tuning and model options, so please see ?calibrate() for more detailed information.

📦cali is (hopefully) convenient, but it doesn’t remove the need to run the model multiple times when calibrating. Be prepared for the process to be lengthy, often requiring calibration runs to be performed on the high performance cluster.

Sometimes calibration can look good:

…and sometimes not 😢 :

Calibration can be difficult when results are dominated by stochasticity:
- low transmission
- small populations
- calibrating to a sub-group in the population
- etc…
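
A rough feel for why small groups are noisy: if we treat each person in the group as an independent draw (a simplifying assumption the model does not strictly satisfy), the standard error of an estimated prevalence p from n people is sqrt(p(1 - p) / n). The Python sketch below uses made-up values and a hypothetical helper name.

```python
import math

def prevalence_standard_error(prevalence, n_sampled):
    """Binomial standard error of a prevalence estimated from n_sampled people."""
    return math.sqrt(prevalence * (1 - prevalence) / n_sampled)

# Same low prevalence, increasingly large (sub-)populations
for n in (100, 1_000, 10_000):
    se = prevalence_standard_error(0.05, n)
    print(f"n = {n:>6}: PfPr = 0.05 +/- {se:.4f}")
```

Under this assumption the noise shrinks only with the square root of the group size, so a 100-fold larger population gives a 10-fold smaller standard error. At low prevalence in a small group, the noise can be comparable to the signal we are trying to calibrate to.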