Single nucleotide polymorphism (SNP) data

SNP barcoding data

SNP barcode data from the sanger 100 SNP Plasmodium falciparum barcode (Chang et al. 2019).

sanger101_snp_barcode_withGenes.bed

Field Samples

The barcode was subsetted from the above WGS data to just the sanger barcode for the Vietnam and DRC data. The results file can be found within directory snp_barcode/sangerBarcode_SNP_INDEL_Pf3D7_ALL_v3.combined.filtered.vqslod6.biallelic_snp.Vietnam.vcf.gz, snp_barcode/sangerBarcode_SNP_INDEL_Pf3D7_ALL_v3.combined.filtered.vqslod6.biallelic_snp.DRCongo.vcf.gz

Lab Isolates

The barcode was also explicitly called with several monoclonal lab isolates and then lab created mixtures of these isolates. Data can be found snp_barcode/controls_sanger100.vcf.gz with meta data with what mixtures are what found snp_barcode/allControlMixtures.tab.txt and snp_barcode/allControlSampNameToMixName.tab.txt

Simulated

The barcode was also simulated for 100 samples (50 Bangladesh and 50 Ghana). Data can be found snp_barcode/SpotMalariapfPanel_simData_sanger100.vcf.gz. The simulations were created by simulating super infections by sampling the barcode from each of these countries and selecting COIs based on the COIs observed for each country. To use data without indels, the data can be found snp_barcode/SpotMalariapfPanel_simData_snponly_sanger100.vcf.gz.

Back to top

References

Chang, Hsiao-Han, Amy Wesolowski, Ipsita Sinha, Christopher G Jacob, Ayesha Mahmud, Didar Uddin, Sazid Ibna Zaman, et al. 2019. “Mapping Imported Malaria in Bangladesh Using Parasite Genetic and Human Mobility Data.” Elife 8 (April).