C A L M O V E R V I E W
OVERVIEW
The ongoing Yemeni cholera outbreak has been deemed one of the worst cholera outbreaks in history, with over a million people impacted and thousands dead. Triggered by a civil war, the outbreak has been shaped by various political, environmental, and epidemiological factors and is continuing to accelerate.While cholera has several effective treatments, the untimely and ultimately inefficient distribution of existing medicines has been the primary cause of cholera mortality. With the hope of facilitating resource allocation, various mathematical models have been made to track the Yemeni outbreak and identify at-risk governorates (administrative divisions. These models, while useful, are not powerful enough to accurately and consistently forecast exact cholera cases per governorate over multiple timeframes. To address the need for a complex, reliable model, the Lambert iGEM team presents CALM, the Cholera Artificial Learning Model; a system of four extreme-gradient-boosting (XGBoost) machine learning models that forecast the exact number of cholera cases a Yemeni governorate will experience from a time range of 2 weeks to 2 months. CALM provides a novel machine learning approach that makes use of rainfall data, past cholera cases and deaths data, civil war fatalities, and inter-governorate interactions represented across multiple timeframes. Additionally, the use of machine learning, along with extensive feature engineering, allows CALM to easily learn complex non-linear relations apparent in an epidemiological phenomenon. CALM is able to forecast cholera incidence 6-8 weeks ahead within a margin of 4.607 cholera cases per 10,000 people in real world simulation. Similarly, CALM achieved a mean error of 3.921 for a 0-2 week forecast, 4.034 for a 2-4 week forecast, and 4.737 for a 4-6 week forecast. The model’s forecast system provides advanced notice of outbreaks on multiple levels to facilitating the timely allocation of cholera relief supplies and outbreak prevention.