Line 10: | Line 10: | ||
<link rel="shortcut icon" href="assets/images/findingflavi-logo-128x52-1.png" type="image/x-icon"> | <link rel="shortcut icon" href="assets/images/findingflavi-logo-128x52-1.png" type="image/x-icon"> | ||
<meta name="description" content=""> | <meta name="description" content=""> | ||
− | <title> | + | <title>Modeling</title> |
− | + | <link rel="stylesheet" href="https://2018.igem.org/Team:Tokyo_Tech/mobirise/mobirise/css/mbr-additional-css-model2?action=raw&ctype=text/css"> | |
− | <link rel="stylesheet" href="https://2018.igem.org/Team:Tokyo_Tech/mobirise/mobirise/css/mbr-additional-css- | + | |
− | + | ||
</head> | </head> | ||
<body> | <body> | ||
− | + | <section class="header9 cid-r5el55DVH6 mbr-fullscreen mbr-parallax-background" id="header9-g"> | |
− | + | ||
Line 27: | Line 24: | ||
<div class="media-container-column mbr-white col-md-8"> | <div class="media-container-column mbr-white col-md-8"> | ||
<h1 class="mbr-section-title align-left mbr-bold pb-3 mbr-fonts-style display-1">Modeling</h1> | <h1 class="mbr-section-title align-left mbr-bold pb-3 mbr-fonts-style display-1">Modeling</h1> | ||
− | <h3 class="mbr-section-subtitle align-left mbr-light pb-3 mbr-fonts-style display-2">Establishment of dengue virus serotype prediction | + | <h3 class="mbr-section-subtitle align-left mbr-light pb-3 mbr-fonts-style display-2">Establishment of dengue virus serotype prediction system</h3> |
Line 38: | Line 35: | ||
</a> | </a> | ||
</div> | </div> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6FjJbsElO" id="header5-1t"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | <h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Abstract</h1> | ||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7"><span style="font-size: 1.2rem;">In the past iGEM, the measurement theory model of artificial gene circuits has been centrally treated. We tried to apply a new model to predict the fluctuation in the number of infected people with dengue.</span> | ||
+ | <br>We used the data of yearly occurrence of dengue serotypes by primary or secondary dengue infection in children along with climate data in Bangkok, Thailand from 1973 to 1999 to create a model to predict change in the number and ratio of patients in each serotype. <br>This model can predict an epidemic of each serotype of dengue fever, thus can be used by health care institutions to prepare vaccine production distribution and production, furthermore allowing them to take special care of secondary dengue infection patients. <br>From the data we found that some region had distinct epidemic cycle of each serotype while some region had more chaotic pattern, therefore we decided to use simplex projection method for our prediction model, which is suitable for distinguishing chaotic time series from random noise. | ||
+ | <br>If the dengue fever inspection method we developed in the Wet experiment spreads and the data about patients infected with dengue virus is fulfilled, it is possible to estimate the number of infected people more precisely. This can be expected to be useful for measures against abandoned tropical infections. <br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
</section> | </section> | ||
Line 157: | Line 174: | ||
</section> | </section> | ||
− | <section class="header4 cid- | + | <section class="header4 cid-r6FuLREnGy" id="header4-1v"> |
Line 174: | Line 191: | ||
</div> | </div> | ||
<div class="mbr-figure pt-5"> | <div class="mbr-figure pt-5"> | ||
− | <img src=" | + | <img src="https://static.igem.org/mediawiki/2018/b/bf/T--Tokyo_Tech--03simplexprojection-883x714.png" alt="Mobirise" title="" style="width: 60%;"> |
</div> | </div> | ||
</div> | </div> | ||
Line 180: | Line 197: | ||
</section> | </section> | ||
− | <section class="cid- | + | <section class="header4 cid-r6zER2SFi4" id="header4-1m"> |
− | |||
− | |||
− | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-md-center"> | ||
+ | <div class="media-content col-md-10"> | ||
+ | |||
+ | |||
+ | <div class="mbr-text align-center mbr-white pb-3"> | ||
+ | <p class="mbr-text mbr-fonts-style display-7">Figure1 Algorithm of Simplex projection</p> | ||
+ | </div> | ||
+ | |||
+ | </div> | ||
+ | <div class="mbr-figure pt-5"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/0/0a/T--Tokyo_Tech--modeling10.png" alt="Mobirise" title="" style="width: 60%;"> | ||
+ | </div> | ||
</div> | </div> | ||
− | </ | + | </div> |
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6Fz5IjAvs" id="header5-1w"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure2 Simple illustration of algorithm of Simplex projection<br><br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
</section> | </section> | ||
Line 204: | Line 250: | ||
<div class="mbr-text align-center mbr-white pb-3"> | <div class="mbr-text align-center mbr-white pb-3"> | ||
− | <p class="mbr-text mbr-fonts-style display-7">We predicted how the number of people infected with dengue virus and the proportion of each serotype changes by using a mathematical method, Simplex Projection to the data about the yearly occurrence of dengue serotypes by primary or secondary dengue infection in children in Bangkok, Thailand from 1973 to 1999 and climate data in Bangkok, Thailand from 1973 to 1999. <br></p> | + | <p class="mbr-text mbr-fonts-style display-7">We predicted how the number of people infected with dengue virus and the proportion of each serotype changes by using a mathematical method, Simplex Projection to the data about the yearly occurrence of dengue serotypes by primary or secondary dengue infection in children in Bangkok, Thailand from 1973 to 1999 and climate data in Bangkok, Thailand from 1973 to 1999. <br>In this analysis, we evaluate the prediction accuracy by rho (Pearson’s correlation coefficient between predictions and observations) values. The following formula is the definition of rho. It can take a range of values from +1 to -1. A value of 0 indicates that there is no association between the two variables. A value greater than 0 indicates a positive association; that is, as the value of one variable increases, so does the value of the other variable. A value less than 0 indicates a negative association; that is, as the value of one variable increases, the value of the other variable decreases.<br></p> |
</div> | </div> | ||
</div> | </div> | ||
<div class="mbr-figure pt-5"> | <div class="mbr-figure pt-5"> | ||
− | <img src=" | + | <img src="https://static.igem.org/mediawiki/2018/2/2a/T--Tokyo_Tech--modeling12.png" alt="Mobirise" title="" style="width: 60%;"> |
</div> | </div> | ||
</div> | </div> | ||
Line 215: | Line 261: | ||
</section> | </section> | ||
− | <section class="cid- | + | <section class="header5 cid-r6FAjvhJVZ" id="header5-1x"> |
− | |||
− | |||
− | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure3 Definition of rho<br></p> | ||
+ | |||
+ | </div> | ||
</div> | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6IpIcKnV0" id="header5-28"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">The values of rho in each prediction are shown in the table below.<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="section-table cid-r6Ikg6ggPj" id="table1-22"> | ||
+ | |||
+ | |||
+ | |||
+ | <div class="container container-table"> | ||
+ | |||
+ | |||
+ | <div class="table-wrapper"> | ||
+ | <div class="container"> | ||
+ | |||
+ | </div> | ||
+ | |||
+ | <div class="container scroll"> | ||
+ | <table class="table" cellspacing="0"> | ||
+ | <thead> | ||
+ | <tr class="table-heads "> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <th class="head-item mbr-fonts-style display-7">Prediction type</th><th class="head-item mbr-fonts-style display-7">Pearson’s correlation coefficient between predictions and observations</th></tr> | ||
+ | </thead> | ||
+ | |||
+ | <tbody> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in scaled subjects of DENV1<br></td><td class="body-item mbr-fonts-style display-7">0.691924<br></td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in scaled subjects of DENV2</td><td class="body-item mbr-fonts-style display-7">0.4864992</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in scaled subjects of DENV3</td><td class="body-item mbr-fonts-style display-7">0.3443239</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in scaled subjects of DENV4</td><td class="body-item mbr-fonts-style display-7">0.4620425</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in existence rate of DENV1<br></td><td class="body-item mbr-fonts-style display-7">0.6788994</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in existence rate of DENV2</td><td class="body-item mbr-fonts-style display-7">0.7765715</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in existence rate of DENV3</td><td class="body-item mbr-fonts-style display-7">0.6569529</td></tr><tr> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <td class="body-item mbr-fonts-style display-7">Change in existence rate of DENV4</td><td class="body-item mbr-fonts-style display-7">0.8263565</td></tr></tbody> | ||
+ | </table> | ||
+ | </div> | ||
+ | <div class="container table-info-container"> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6IuHpzPHJ" id="header5-2i"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7"><br><br> | ||
+ | Prediction using data on the number of infected people was low in accuracy. Since we treated the data over 25 years, there was a possibility that correct data of the number of infected people could not be obtained in the past when technology was not advanced. There is a high possibility that the number of mothers that can be recognized as the number of dengue fever infected people has changed. | ||
+ | However, prediction using proportion data of infected persons occupying each serum was relatively accurate. Using ratio data of infected persons occupying each serotype, it is considered that the accuracy has increased because the difference in maternal fluctuation can be suppressed to some extent. | ||
+ | If the data are increased more, there is no doubt that the accuracy of the prediction model will improve. | ||
+ | The results of the predictions are shown in the following figures.<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6InaHWmgY" id="image1-23"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 66%;"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/a/aa/T--Tokyo_Tech--modeling4.png" width="1400" alt="Mobirise" title=""> | ||
+ | |||
+ | </div> | ||
</figure> | </figure> | ||
</section> | </section> | ||
− | <section class="header4 cid- | + | <section class="header5 cid-r6Inu5vSsi" id="header5-24"> |
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure4 Prediction of scaled subjects number infected with each dengue virus in Bangkok, Thailand from 1973 to 2000<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header4 cid-r6IqqHwnZA" id="header4-29"> | ||
Line 236: | Line 436: | ||
<div class="row justify-content-md-center"> | <div class="row justify-content-md-center"> | ||
<div class="media-content col-md-10"> | <div class="media-content col-md-10"> | ||
− | + | ||
<div class="mbr-text align-center mbr-white pb-3"> | <div class="mbr-text align-center mbr-white pb-3"> | ||
− | + | ||
</div> | </div> | ||
</div> | </div> | ||
<div class="mbr-figure pt-5"> | <div class="mbr-figure pt-5"> | ||
− | <img src=" | + | <img src="https://static.igem.org/mediawiki/2018/d/d2/T--Tokyo_Tech--modeling3.png" alt="Mobirise" title="" style="width: 60%;"> |
</div> | </div> | ||
</div> | </div> | ||
</div> | </div> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6IqyJ8tRz" id="header5-2a"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure5 Prediction of rate of each dengue virus in Bangkok, Thailand from 1973 to 2000<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6Iu9Dfv9v" id="header5-2g"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | <h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-5">Co-prediction</h1> | ||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">The goal of co-prediction is to quantify dynamic similarity between two time series. Given two time series, x and y, we assume that the dynamics can be represented as:</p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6Ir7GbZLv" id="image1-2b"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 36%;"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/6/63/T--Tokyo_Tech--modeling7.png" width="1400" alt="Mobirise" title=""> | ||
+ | |||
+ | </div> | ||
+ | </figure> | ||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6IruxJ7JK" id="image1-2d"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 36%;"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/5/54/T--Tokyo_Tech--modeling8.png" width="1400" alt="Mobirise" title=""> | ||
+ | |||
+ | </div> | ||
+ | </figure> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6Iur0ofQz" id="header5-2h"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Then co-prediction is a way to quantify how closely F and G resemble each other.<br>We thought that there might be something related to the temperature affecting the dengue-mediated mosquito activity and the total number of patients in each serotype of dengue.<br>In order to identify the relationship, co-prediction was performed using the total number of patients in each serotype and the annual average temperature data from 1973 to 1993 in Bangkok, Thailand.<br>The time series x is total value of infected individuals of each serotype and the time series y is the annual average temperature value.<br>The figure below shows how much accuracy of prediction was improved by co-prediction.</p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6IxgPAloQ" id="image1-2j"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 66%;"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/9/9e/T--Tokyo_Tech--modeling1.png" width="1400" alt="Mobirise" title=""> | ||
+ | |||
+ | </div> | ||
+ | </figure> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6IDKMV15k" id="header5-2m"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure6 Co-prediction by data of total infected individuals of each serotype (x) and annual average temperature data (y)<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6IDxrtJwX" id="header5-2k"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Here, we see that the predictions of total infected individuals of each serotype (x) are better when we try to use the inferred dynamics from annual average temperature data (y) to make forecasts. Likewise, the predictions of annual average temperature data (y) have better forecast skill by using the inferred dynamics from total infected individuals of each serotype (x).<br>Since the time series come from a model simulation where x and y have coefficients, we can infer their dynamical maps are actually related.</p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6JjyLuIyT" id="header5-2p"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | <h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Predicton model</h1> | ||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">In the above fluctuation prediction, prediction accuracy could be improved by using climate data for fluctuation prediction in addition to dengue virus infected patients’ data. It is expected that the prediction of the number of people infected with dengue virus of each serotype will be carried out accurately when the data of infected people with dengue fever increase. Here we constructed a prediction model for the fluctuation in the number of dengue infected people in the coming year from dengue fever infected person data and climate data. With this model we can see which serotype dengue virus will become popular next year from past data.<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6K1p0zJY1" id="image1-2s"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 84%;"> | ||
+ | <img src="https://static.igem.org/mediawiki/2018/2/28/T--Tokyo_Tech--modeling5.png" width="1400" alt="Mobirise" title=""> | ||
+ | |||
+ | </div> | ||
+ | </figure> | ||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6JdPuqigd" id="header5-2n"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">Figure7 Conceptual diagram of the prediction model<br></p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="header5 cid-r6K1cfGpVd" id="header5-2r"> | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | <div class="container"> | ||
+ | <div class="row justify-content-center"> | ||
+ | <div class="mbr-white col-md-10"> | ||
+ | |||
+ | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">The GitHub repository about this prediction model is <a href="https://github.com/iGEMTokyoTech/Dengue-Prediction" target="_blank">here</a>.</p> | ||
+ | |||
+ | </div> | ||
+ | </div> | ||
+ | </div> | ||
+ | |||
+ | |||
+ | </section> | ||
+ | |||
+ | <section class="cid-r6JjRyR17M" id="image1-2q"> | ||
+ | |||
+ | |||
+ | |||
+ | <figure class="mbr-figure container"> | ||
+ | <div class="image-block" style="width: 39%;"> | ||
+ | <a href="https://github.com/iGEMTokyoTech/Dengue-Prediction" target="_blank"><img src="https://static.igem.org/mediawiki/2018/f/ff/T--Tokyo_Tech--modeling9.png" width="1400" alt="Mobirise" title=""></a> | ||
+ | |||
+ | </div> | ||
+ | </figure> | ||
</section> | </section> | ||
Line 259: | Line 663: | ||
<div class="mbr-white col-md-10"> | <div class="mbr-white col-md-10"> | ||
<h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Discussion</h1> | <h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Discussion</h1> | ||
− | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7"></p> | + | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7"><br>When using Simplex projection, it is said that time series data of at least 35 to 40 points is required. Despite very few data on dengue fever infected people, we were able to make predictions with some degree of accuracy. Furthermore, by adding climate data to infected data, it is possible to improve prediction accuracy.If we can gather data on dengue-infected individuals of each serotype from Wet's dengue infection detection kit created in this project, we can improve the accuracy of prediction and further help prevent infection with dengue.<br></p> |
</div> | </div> | ||
Line 277: | Line 681: | ||
<div class="mbr-white col-md-10"> | <div class="mbr-white col-md-10"> | ||
<h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Reference</h1> | <h1 class="mbr-section-title align-center pb-3 mbr-fonts-style display-2">Reference</h1> | ||
− | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7"></p> | + | <p class="mbr-text align-center display-5 pb-3 mbr-fonts-style display-7">http://www.who.int/en/news-room/fact-sheets/detail/dengue-and-severe-dengue |
+ | <br>http://deepeco.ucsd.edu/simplex/ | ||
+ | <br>https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-tutorial.html<br> | ||
+ | https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-coprediction.html<br>https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-algorithms.pdf <br>https://statistics.laerd.com/statistical-guides/pearson-correlation-coefficient-statistical-guide.php<br>http://www.ajtmh.org/content/journals/10.4269/ajtmh.2003.68.191?ijkey=a7c6f78b661b24d3734e97302ea1bc8e754b0b01&keytype2=tf_ipsecsha#dataandmedia<br></p> | ||
</div> | </div> | ||
Line 285: | Line 692: | ||
</section> | </section> | ||
− | |||
</body> | </body> | ||
</html> | </html> | ||
+ | |||
{{Tokyo_Tech/mobirise/footer}} | {{Tokyo_Tech/mobirise/footer}} | ||
{{Tokyo_Tech/mobirise/script}} | {{Tokyo_Tech/mobirise/script}} |
Revision as of 00:56, 18 October 2018
<!DOCTYPE html>
Modeling
Establishment of dengue virus serotype prediction system
Abstract
In the past iGEM, the measurement theory model of artificial gene circuits has been centrally treated. We tried to apply a new model to predict the fluctuation in the number of infected people with dengue.
We used the data of yearly occurrence of dengue serotypes by primary or secondary dengue infection in children along with climate data in Bangkok, Thailand from 1973 to 1999 to create a model to predict change in the number and ratio of patients in each serotype.
This model can predict an epidemic of each serotype of dengue fever, thus can be used by health care institutions to prepare vaccine production distribution and production, furthermore allowing them to take special care of secondary dengue infection patients.
From the data we found that some region had distinct epidemic cycle of each serotype while some region had more chaotic pattern, therefore we decided to use simplex projection method for our prediction model, which is suitable for distinguishing chaotic time series from random noise.
If the dengue fever inspection method we developed in the Wet experiment spreads and the data about patients infected with dengue virus is fulfilled, it is possible to estimate the number of infected people more precisely. This can be expected to be useful for measures against abandoned tropical infections.
Introduction
Dengue is a mosquito-borne viral disease that has rapidly spread in all regions of World Health Organization in recent years. Dengue is widespread throughout the tropics, with local variations in risk influenced by rainfall, temperature and unplanned rapid urbanization.
There are 4 distinct, but closely related, serotypes of the virus that cause dengue (DEN-1, DEN-2, DEN-3 and DEN-4). Recovery from infection by one provides lifelong immunity against that particular serotype. However, cross-immunity to the other serotypes after recovery is only partial and temporary. Subsequent infections by other serotypes increase the risk of developing severe dengue. Severe dengue is a leading cause of serious illness and death among children in some Asian and Latin American countries.
Reference: http://www.who.int/en/news-room/fact-sheets/detail/dengue-and-severe-dengue
Overview of the model
We predicted how the number of people infected with dengue virus and the proportion of each serotype changes by using a mathematical method, Simplex Projection to the data about the yearly occurrence of dengue serotypes by primary or secondary dengue infection in children and climate data in Bangkok, Thailand from 1973 to 1999. We constructed a model that automatically predicts the number of infected individuals and the variation of each serotype from dengue fever dengue-infected subjects data and climate data.
Simplex projection
Simplex projection, originally proposed by George Sugihara and Bob May in 1990 is a powerful tool for detecting patterns (chaos) in what otherwise appears to be randomness. Moreover, it will also tell you how complicated this pattern is, and then predict the future, often with far greater accuracy than any other technique.
Ignoring ties in distances, minimal distances, minimal weights and other potential hazards, the following algorithm performs Simplex projection to predict Tp time-steps ahead.
Notation
Description | |
---|---|
E | E denotes the embedding dimension. |
K | k denotes the number of nearest neighbors we use. For the simplex method, the default is k = E + 1 but for the S-map method it can be much larger. |
Tp | Tp denotes how many time-steps into the future we are trying to predict. |
X | X ∈ R denotes a (potentially long) time series. |
y | y ∈ R is a vector of lagged observations for which we want to make a prediction — in the simplest case where all components of the vector are single time step lags, y1 represents the current value, y2 is the value one time step prior and yE is the value E − 1 time steps prior. |
Figure1 Algorithm of Simplex projection
Figure2 Simple illustration of algorithm of Simplex projection
Estimation of distribution of dengue serotypes
We predicted how the number of people infected with dengue virus and the proportion of each serotype changes by using a mathematical method, Simplex Projection to the data about the yearly occurrence of dengue serotypes by primary or secondary dengue infection in children in Bangkok, Thailand from 1973 to 1999 and climate data in Bangkok, Thailand from 1973 to 1999.
In this analysis, we evaluate the prediction accuracy by rho (Pearson’s correlation coefficient between predictions and observations) values. The following formula is the definition of rho. It can take a range of values from +1 to -1. A value of 0 indicates that there is no association between the two variables. A value greater than 0 indicates a positive association; that is, as the value of one variable increases, so does the value of the other variable. A value less than 0 indicates a negative association; that is, as the value of one variable increases, the value of the other variable decreases.
Figure3 Definition of rho
The values of rho in each prediction are shown in the table below.
Prediction type | Pearson’s correlation coefficient between predictions and observations |
---|---|
Change in scaled subjects of DENV1 | 0.691924 |
Change in scaled subjects of DENV2 | 0.4864992 |
Change in scaled subjects of DENV3 | 0.3443239 |
Change in scaled subjects of DENV4 | 0.4620425 |
Change in existence rate of DENV1 | 0.6788994 |
Change in existence rate of DENV2 | 0.7765715 |
Change in existence rate of DENV3 | 0.6569529 |
Change in existence rate of DENV4 | 0.8263565 |
Prediction using data on the number of infected people was low in accuracy. Since we treated the data over 25 years, there was a possibility that correct data of the number of infected people could not be obtained in the past when technology was not advanced. There is a high possibility that the number of mothers that can be recognized as the number of dengue fever infected people has changed.
However, prediction using proportion data of infected persons occupying each serum was relatively accurate. Using ratio data of infected persons occupying each serotype, it is considered that the accuracy has increased because the difference in maternal fluctuation can be suppressed to some extent.
If the data are increased more, there is no doubt that the accuracy of the prediction model will improve.
The results of the predictions are shown in the following figures.
Figure4 Prediction of scaled subjects number infected with each dengue virus in Bangkok, Thailand from 1973 to 2000
Figure5 Prediction of rate of each dengue virus in Bangkok, Thailand from 1973 to 2000
Co-prediction
The goal of co-prediction is to quantify dynamic similarity between two time series. Given two time series, x and y, we assume that the dynamics can be represented as:
Then co-prediction is a way to quantify how closely F and G resemble each other.
We thought that there might be something related to the temperature affecting the dengue-mediated mosquito activity and the total number of patients in each serotype of dengue.
In order to identify the relationship, co-prediction was performed using the total number of patients in each serotype and the annual average temperature data from 1973 to 1993 in Bangkok, Thailand.
The time series x is total value of infected individuals of each serotype and the time series y is the annual average temperature value.
The figure below shows how much accuracy of prediction was improved by co-prediction.
Figure6 Co-prediction by data of total infected individuals of each serotype (x) and annual average temperature data (y)
Here, we see that the predictions of total infected individuals of each serotype (x) are better when we try to use the inferred dynamics from annual average temperature data (y) to make forecasts. Likewise, the predictions of annual average temperature data (y) have better forecast skill by using the inferred dynamics from total infected individuals of each serotype (x).
Since the time series come from a model simulation where x and y have coefficients, we can infer their dynamical maps are actually related.
Predicton model
In the above fluctuation prediction, prediction accuracy could be improved by using climate data for fluctuation prediction in addition to dengue virus infected patients’ data. It is expected that the prediction of the number of people infected with dengue virus of each serotype will be carried out accurately when the data of infected people with dengue fever increase. Here we constructed a prediction model for the fluctuation in the number of dengue infected people in the coming year from dengue fever infected person data and climate data. With this model we can see which serotype dengue virus will become popular next year from past data.
Figure7 Conceptual diagram of the prediction model
The GitHub repository about this prediction model is here.
Discussion
When using Simplex projection, it is said that time series data of at least 35 to 40 points is required. Despite very few data on dengue fever infected people, we were able to make predictions with some degree of accuracy. Furthermore, by adding climate data to infected data, it is possible to improve prediction accuracy.If we can gather data on dengue-infected individuals of each serotype from Wet's dengue infection detection kit created in this project, we can improve the accuracy of prediction and further help prevent infection with dengue.
Reference
http://www.who.int/en/news-room/fact-sheets/detail/dengue-and-severe-dengue
http://deepeco.ucsd.edu/simplex/
https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-tutorial.html
https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-coprediction.html
https://cran.r-project.org/web/packages/rEDM/vignettes/rEDM-algorithms.pdf
https://statistics.laerd.com/statistical-guides/pearson-correlation-coefficient-statistical-guide.php
http://www.ajtmh.org/content/journals/10.4269/ajtmh.2003.68.191?ijkey=a7c6f78b661b24d3734e97302ea1bc8e754b0b01&keytype2=tf_ipsecsha#dataandmedia