Difference between revisions of "Team:NCTU Formosa/Dry Lab/Microbiota Prediciton"

Revision as of 03:09, 18 October 2018

MENU

HOME

TEAM

PROJECT

PARTS

HUMAN PRACTICES

AWARDS

Navigation Bar

☰

Project

Dry Lab

Microbiota Prediction

Wet Lab

Parts

Human Practice

Education and Public Engagement

Team

Notebook

Microbiota Prediction

Artificial intelligence and machine learning allow us to realize the seemingly insurmountable goal of predicting the fluctuations of entire microbiotas due to the specific effects of bio-stimulators. While traditional ecologists may find it too difficult to consider every unique microbial relationship in an ecosystem, machine learning programs use numerical analysis to not only quickly determine these associations but also use them to predict overall population shifts caused by stimuli. For our modelling purposes we choose Weka, a software with strong classification capabilities, to establish accurate connections between every genera of bacteria in our soil.

Considering General Factors

To begin modelling the relationship between bio-stimulators and microbiota, we first determine the most important factors that affect bacterial growth in soil. Three conditions immediately came to mind: temperature, pH and salinity, whose effects are modelled through the respective equations below:

$$Ratkowsky Equation:
R_{temp}(T)=a\cdot[(T-T_{min})\cdot(1-e^{(b\cdot(T-T_{max}))})]^2$$
$$Cardinal pH Equation:
R_{pH}(pH)=\frac{c\cdot(pH-pH_{min})\cdot(pH-pH_{max})}{d\cdot((pH-pH_{min})\cdot(pH-pH_{max})-e\cdot(pH-pH_{opt})^2)}$$
$$Salinity Equation:
R_{sal}(sal)=(f\cdot sal^2)+(g\cdot sal)+h$$

These factors heavily influence bacterial fluctuation in any environment and are especially important in determining soil microbiota, according to soil expert Professor Young of National Chung Hsing University. Professor Young also suggested we consider the relationship between nitrogen, phosphorus and potassium and soil bacteria, because farm soil is regularly applied with fertilizers containing these vital macronutrients. To take these elements into account, we collected literature discussing their impact on bacterial levels and found the following functions:

These equations model the direct relationship between levels of the elements and levels of bacteria in soil – specifically, the levels of bacteria that metabolize said elements.

Combining these general equations together gives a method of obtaining a rough estimation of how our microbiota will change, based on fluctuation of these factors; our universal factors temperature, pH and salinity assist in modeling general fluctuations in the microbiota, while the more specific factors nitrogen, phosphorus and potassium are quite helpful in predicting how amount of nutrient metabolizing bacteria oscillates when dealing with the effects of fertilizers. But how do we deduce the change in level of bacteria that are unaffected by said nutrients? We turn to our NGS analysis to find missing link.

From our NGS report we calculate the Spearman correlation value of each pair of genera in our soil. This coefficient, assigned a value between -1 and +1, describes the degree of correlation between each pair, with values closer to -1 representing stronger negative correlation and values closer to +1 representing stronger positive correlation. Correlation values between the 20 most abundant bacterial genera in our soil samples are shown in the following heat map.

Figure 1: Top-20 heat map of June

Weka

Once we have our 6 general equations and our correlation values we’re ready to begin using Weka to construct a prediction model. Weka is split into two parts: regression analysis to filter out the non-correlated pairs of bacteria, and cross validation to determine the weighting each bacterial relationship has under different conditions

Regression Analysis

We first take advantage of the machine learning software’s classification ability, using the built-in regression analysis module to determine which pairs of bacteria are heavily affected by correlation. To do this we define coefficient values below -0.7 to be truly negatively correlated and coefficient values above +0.7 to be truly positively correlated; pairs assigned a value in between are ignored. Weka then separates truly correlated pairs from the rest; these are the bacteria that will change as an indirect effect of bio-stimulator application. We start with one genus of bacteria and assess the correlation coefficient it has with each other genus in soil. Any pairs with significant correlation are collected into a fold belonging to that bacteria. Once all pairs are assessed, the resulting fold should contain all the bacteria that are correlated with our starting genus.

For every pair in any particular fold, Weka plots that pair’s data (link) on a graph to find a curve of regression to describe their relationship. For example:

Cross Validation

The resulting curve is the theoretical relationship between the two bacteria; however, the wide range of soil conditions that vary between different samples may alter the relationship. To account for this, Weka assigns weights to each correlation regression curve by performing cross validation, in this case with three folds. The steps are as follows:

Three folds of three different genera of bacteria are compared in pairs to determine the accuracy of each pair’s correlational relationship.

If they exhibit a relationship in line with Weka’s initial assessment, nothing changes and the pair keeps its assigned weight.

If they show unexpected associations, they are said to exhibit paradox. Paradox alerts Weka to the discrepancy between prediction and reality, causing it to adjust waiting accordingly.

Through this cross validation Weka calibrates weighting of each pair and can predict how an entire microbiota is related after analyzing all folds. The result can be expressed in a pie chart describing predicted microbial ratios

Artificial Intelligence

Once our initial model is complete we can begin to make rough predictions about microbiota changes based on a volume of bio-stimulator. The basic rules we established regarding different soil conditions point us in the right direction in terms of bacteria shifts, but to achieve true precise control over soil we must improve our prediction accuracy through artificial intelligence. Artificial intelligence feeds actual data back into our system; more data allows for more calibration and more cross validations, adapting our predictions to the specific nature of our soil sample and improving the accuracy of subsequent predictions.

Model Learning

We began by generating our model using NGS data from April through June. We entered a volume of bio-stimulator as well NGS data from before and after application, thus generating an initial prediction model. The accuracy of our model with only one month of data was approximately 21%, while inclusion of a second month’s data increased accuracy to 51% - at this point if we were to predict results for June, we would get about 51% of the total microbiota correct. Again, we applied bio-stimulator to our soil and waited for our data. Using June data to calibrate our model increased the prediction accuracy of our model by over another 25%, resulting in a microbiota prediction model with 78% accuracy.

Conclusion

Increasingly accurate prediction of microbial shifts due to bio-stimulators is a vital element of our smart farming system. Our goal is to regulate soil microbiota precisely, and we need accurate models to do so. Luckily, machine learning and artificial intelligence can provide just that. A general model formed using established relationships between key environmental factors and bacterial growth is supported by correlation values calculated from NGS data to allow for rough initial predictions of microbial shifts. Raw data obtained after subsequent applications of bio-stimulators is reintroduced into our models through a feedback system, calibrating the weightings of each bacteria correlational relationship to improve accuracy with each cycle. With increasingly precise regulation we can manipulate soil microbiota to produce any desired effect. Visit our real farm demonstration (link) to find out how we use artificial intelligence to increase curcumin concentration in turmeric while maintaining soil health.

Template

@@ Line 220: / Line 220: @@
        display: inline-block;
        margin-left: 10%;
-    }
-    .otuchu{
-        width : 30%;
-        margin-left: 35%;
-        margin-top : 50px;
      }
@@ Line 274: / Line 269: @@
        <div class="text">
          <p>
-           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Machine learning allows us to realize the seemingly insurmountable goal of predicting the fluctuations of entire microbiotas due to the specific effects of bio-stimulators. While traditional ecologists may find it too difficult to consider every unique microbial relationship in an ecosystem, machine learning programs uses numerical analysis to not only quickly determine these associations but also use them to predict overall population shifts caused by stimuli. For our modelling purposes we choose Weka, a software with strong classification capabilities, to establish accurate connections between every genera of bacteria in our soil.
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Artificial intelligence and machine learning allow us to realize the seemingly insurmountable goal of predicting the fluctuations of entire microbiotas due to the specific effects of bio-stimulators. While traditional ecologists may find it too difficult to consider every unique microbial relationship in an ecosystem, machine learning programs use numerical analysis to not only quickly determine these associations but also use them to predict overall population shifts caused by stimuli. For our modelling purposes we choose Weka, a software with strong classification capabilities, to establish accurate connections between every genera of bacteria in our soil.
          </p>
        </div>
-       <div class="title_1">Construction</div>
+       <div class="title_1">Considering General Factors</div>
        <div class="text">
          <p>
-           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Weka predicts overall shifts in microbiota by determining how a target genus is altered, then using correlation values between the target genus and other genera to calculate how the rest of the bacteria change.
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;To begin modelling the relationship between bio-stimulators and microbiota, we first determine the most important factors that affect bacterial growth in soil. Three conditions immediately came to mind: temperature, pH and salinity, whose effects are modelled through the respective equations below:
          </p>
        </div>
-       <img src="https://static.igem.org/mediawiki/2018/9/9d/T--NCTU_Formosa--biosti.png" class="process">
+       <div class="equation">
+        $$Ratkowsky Equation: <br>R_{temp}(T)=a\cdot[(T-T_{min})\cdot(1-e^{(b\cdot(T-T_{max}))})]^2$$<br>
+        $$Cardinal pH Equation: <br>R_{pH}(pH)=\frac{c\cdot(pH-pH_{min})\cdot(pH-pH_{max})}{d\cdot((pH-pH_{min})\cdot(pH-pH_{max})-e\cdot(pH-pH_{opt})^2)}$$<br>
+        $$Salinity Equation: <br>R_{sal}(sal)=(f\cdot sal^2)+(g\cdot sal)+h$$
+      </div>
        <div class="text">
          <p>
-           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;This is where Weka’s support of classification is useful to us, as it can quickly determine which genera affect each other and which do not. We begin with our analyzed <a href="https://2018.igem.org/Team:NCTU_Formosa/Dry_Lab/NGS_Data_Analysis">NGS data</a> – specifically, a heat map detailing correlation values of bacteria in our soil sample. Weka allows us to set our own conditions to detect correlation between two genera of bacteria and will filter the results to yield all the pairs of bacteria determined to be correlated, either positively or negatively. Here, we take June's correlation heatmap of the top 20 bacteria as an example:
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;These factors heavily influence bacterial fluctuation in any environment and are especially important in determining soil microbiota, according to soil expert Professor Young of National Chung Hsing University. Professor Young also suggested we consider the relationship between nitrogen, phosphorus and potassium and soil bacteria, because farm soil is regularly applied with fertilizers containing these vital macronutrients. To take these elements into account, we collected literature discussing their impact on bacterial levels and found the following functions:
          </p>
        </div>
-       <img src="https://static.igem.org/mediawiki/2018/9/9a/T--NCTU_Formosa--June_heatmap.png" class="heatmap">
+       <div class="equation"><p></p></div>
-       <div class="explanation">
+       <div class="text">
-         <svg class="icon" aria-hidden="true" data-prefix="fas" data-icon="arrow-circle-up" class="svg-inline--fa fa-arrow-circle-up fa-w-16" role="img" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 512 512"><path fill="currentColor" d="M8 256C8 119 119 8 256 8s248 111 248 248-111 248-248 248S8 393 8 256zm143.6 28.9l72.4-75.5V392c0 13.3 10.7 24 24 24h16c13.3 0 24-10.7 24-24V209.4l72.4 75.5c9.3 9.7 24.8 9.9 34.3.4l10.9-11c9.4-9.4 9.4-24.6 0-33.9L273 107.7c-9.4-9.4-24.6-9.4-33.9 0L106.3 240.4c-9.4 9.4-9.4 24.6 0 33.9l10.9 11c9.6 9.5 25.1 9.3 34.4-.4z"></path></svg>
+         <p>
-        Figure 1: Temperature growth curve model progress
+          &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;These equations model the direct relationship between levels of the elements and levels of bacteria in soil – specifically, the levels of bacteria that metabolize said elements.
+        </p>
        </div>
        <div class="text">
          <p>
-           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Next, We define positive correlation as any Spearman's Correlation value above 0.7, and negative correlation as any value below -0.7. For example:
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Combining these general equations together gives a method of obtaining a rough estimation of how our microbiota will change, based on fluctuation of these factors; our universal factors temperature, pH and salinity assist in modeling general fluctuations in the microbiota, while the more specific factors nitrogen, phosphorus and potassium are quite helpful in predicting how amount of nutrient metabolizing bacteria oscillates when dealing with the effects of fertilizers. But how do we deduce the change in level of bacteria that are unaffected by said nutrients?  We turn to our NGS analysis to find missing link.
          </p>
        </div>
-      <img src="https://static.igem.org/mediawiki/2018/2/2d/T--NCTU_Formosa--example2.png" class="exp2">
        <div class="text">
          <p>
-           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Once we’ve determined the significantly correlated pairs of bacteria, Weka can further establish the exact nature of these relationships. Not all bacterial relationships exhibit linear regression, so Weka then takes the correlated pairs and plots the data of each pair into a graph to determine the true nature of each association.
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;From our NGS report we calculate the Spearman correlation value of each pair of genera in our soil. This coefficient, assigned a value between -1 and +1, describes the degree of correlation between each pair, with values closer to -1 representing stronger negative correlation and values closer to +1 representing stronger positive correlation. Correlation values between the 20 most abundant bacterial genera in our soil samples are shown in the following heat map.
          </p>
        </div>
-       <div class="ex" style="margin-left: 14.5%;">
+       <img src="https://static.igem.org/mediawiki/2018/9/9a/T--NCTU_Formosa--June_heatmap.png" class="heatmap">
-        <img src="https://static.igem.org/mediawiki/2018/6/63/T--NCTU_Formosa--strepto5.png" class="positive">
+      <div class="explanation">
-        <div class="explanation"><p>$$y=-4.1248x^2+0.2061x+0.0008;\  R^2=0.725$$</p></div>
+         <svg class="icon" aria-hidden="true" data-prefix="fas" data-icon="arrow-circle-up" class="svg-inline--fa fa-arrow-circle-up fa-w-16" role="img" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 512 512"><path fill="currentColor" d="M8 256C8 119 119 8 256 8s248 111 248 248-111 248-248 248S8 393 8 256zm143.6 28.9l72.4-75.5V392c0 13.3 10.7 24 24 24h16c13.3 0 24-10.7 24-24V209.4l72.4 75.5c9.3 9.7 24.8 9.9 34.3.4l10.9-11c9.4-9.4 9.4-24.6 0-33.9L273 107.7c-9.4-9.4-24.6-9.4-33.9 0L106.3 240.4c-9.4 9.4-9.4 24.6 0 33.9l10.9 11c9.6 9.5 25.1 9.3 34.4-.4z"></path></svg>
-         <div class="explanation"><p>
+        Figure 1: Top-20 heat map of June
-          <svg class="icon" aria-hidden="true" data-prefix="fas" data-icon="arrow-circle-up" class="svg-inline--fa fa-arrow-circle-up fa-w-16" role="img" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 512 512"><path fill="currentColor" d="M8 256C8 119 119 8 256 8s248 111 248 248-111 248-248 248S8 393 8 256zm143.6 28.9l72.4-75.5V392c0 13.3 10.7 24 24 24h16c13.3 0 24-10.7 24-24V209.4l72.4 75.5c9.3 9.7 24.8 9.9 34.3.4l10.9-11c9.4-9.4 9.4-24.6 0-33.9L273 107.7c-9.4-9.4-24.6-9.4-33.9 0L106.3 240.4c-9.4 9.4-9.4 24.6 0 33.9l10.9 11c9.6 9.5 25.1 9.3 34.4-.4z"></path></svg>
-          Figure 2: Example of positive non-linear correlation
-          </p>
-        </div>
        </div>
-       <div class="ex">
+       <div class="title_1">Weka</div>
-        <img src="https://static.igem.org/mediawiki/2018/4/46/T--NCTU_Formosa--Negative.png" class="negative">
+      <div class="text">
-        <div class="explanation"><p>$$y=-0.061x+0.0003;\  R^2=0.714$$</p></div>
+         <p>
-         <div class="explanation"><p>
+           &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Once we have our 6 general equations and our correlation values we’re ready to begin using Weka to construct a prediction model. Weka is split into two parts: regression analysis to filter out the non-correlated pairs of bacteria, and cross validation to determine the weighting each bacterial relationship has under different conditions
-           <svg class="icon" aria-hidden="true" data-prefix="fas" data-icon="arrow-circle-up" class="svg-inline--fa fa-arrow-circle-up fa-w-16" role="img" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 512 512"><path fill="currentColor" d="M8 256C8 119 119 8 256 8s248 111 248 248-111 248-248 248S8 393 8 256zm143.6 28.9l72.4-75.5V392c0 13.3 10.7 24 24 24h16c13.3 0 24-10.7 24-24V209.4l72.4 75.5c9.3 9.7 24.8 9.9 34.3.4l10.9-11c9.4-9.4 9.4-24.6 0-33.9L273 107.7c-9.4-9.4-24.6-9.4-33.9 0L106.3 240.4c-9.4 9.4-9.4 24.6 0 33.9l10.9 11c9.6 9.5 25.1 9.3 34.4-.4z"></path></svg>
-          Figure 3: Example of negative linear correlation
-        </p>
-        </div>
        </div>
-       <div class="title_1">Results</div>
+       <div class="title_1">Regression Analysis</div>
-       <div class="pie">
+       <div class="text">
-      <img src="https://static.igem.org/mediawiki/2018/3/37/T--NCTU_Formosa--real.png" class="real">
+        <p>
-      <img src="https://static.igem.org/mediawiki/2018/1/11/T--NCTU_Formosa--Predictive.png" class="predictive">
+          &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;We first take advantage of the machine learning software’s classification ability, using the built-in regression analysis module to determine which pairs of bacteria are heavily affected by correlation. To do this we define coefficient values below -0.7 to be truly negatively correlated and coefficient values above +0.7 to be truly positively correlated; pairs assigned a value in between are ignored. Weka then separates truly correlated pairs from the rest; these are the bacteria that will change as an indirect effect of bio-stimulator application. We start with one genus of bacteria and assess the correlation coefficient it has with each other genus in soil. Any pairs with significant correlation are collected into a fold belonging to that bacteria. Once all pairs are assessed, the resulting fold should contain all the bacteria that are correlated with our starting genus.
-      </div>
+        </p>
-      <div class="table7">
-      <img src="https://static.igem.org/mediawiki/2018/4/4e/T--NCTU_Formosa--weka_OTU_table.jpg" class="otuchu">
        </div>
-    </div>
+      <div class="text">
-    <div class="title_1"><p>References</p></div>
-     <div class="text">
          <p>
-. Bouckaert, R. R., et al. (2013). "WEKA Manual for Version 3-7-8, 2013."  21.<br><br>
+          &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;For every pair in any particular fold, Weka plots that pair’s data (link) on a graph to find a curve of regression to describe their relationship. For example:
-. WI, H., et al. (2011). "Practical machine learning tools and techniques."<br><br>
-. Barabasz, W. and J. J. P. J. o. E. S. Lipiec (2002). "Biological effects of mineral nitrogen fertilization on soil microorganisms."  11(3): 193-198.<br><br>
-. KUMAR, A. and L. C. J. P. RAI (2017). "Soil Organic Carbon and Availability of Soil Phosphorus Regulate Abundance of Culturable Phosphate Solubilizing Bacteria in Paddy Fields of the Indo-Gangetic Plain."<br><br>
-. Lambert, R. J. J. J. o. a. m. (2011). "A new model for the effect of pH on microbial growth: An extension of the Gamma hypothesis."  110(1): 61-68.<br><br>
-. Nihala Jabin, P. (2017). Screening of potash solubilizing bacteria for plant growth promotional activity and nutrient uptake of brinjal, Vasantrao Naik Marathwada Krishi Vidyapeeth, Parbhani.<br><br>
-. Ratkowsky, D. A., et al. (1983). "Model for bacterial culture growth rate throughout the entire biokinetic temperature range." J Bacteriol 154(3): 1222-1226.<br><br>
-. Rousk, J., et al. (2011). "Bacterial salt tolerance is unrelated to soil salinity across an arid agroecosystem salinity gradient."  43(9): 1881-1887.<br><br>
-. Wikipedia contributors. (2018, October 12). Bacillus. In Wikipedia, The Free Encyclopedia. Retrieved 18:17, October 16, 2018, from https://en.wikipedia.org/w/index.php?title=Bacillus&oldid=863696818<br><br>
-. Wikipedia contributors. (2018, March 23). Geobacter. In Wikipedia, The Free Encyclopedia. Retrieved 18:34, October 16, 2018, from https://en.wikipedia.org/w/index.php?title=Geobacter&oldid=831990733<br><br>
-. Espenberg, M., et al. (2018). "Differences in microbial community structure and nitrogen cycling in natural and drained tropical peatland soils." Scientific Reports 8(1): 4742.<br><br>
-. Hou, J., et al. (2015). "PGPR enhanced phytoremediation of petroleum contaminated soil and rhizosphere microbial community response." Chemosphere 138: 592-598.<br><br>
-. Hruska, K., Vyzkumny Ustav Veterinarniho Lekarstvi, Brno (Czech Republic) and M. Kaevska, Vyzkumny Ustav Veterinarniho Lekarstvi, Brno (Czech Republic) (dec2012). "Mycobacteria in water, soil, plants and air: a review."  v. 57.<br><br>
-. Jiao, S., et al. (2016). "Microbial succession in response to pollutants in batch-enrichment culture." Scientific Reports 6: 21791.<br><br>
-. Leys, N. M. E. J., et al. (2004). "Occurrence and Phylogenetic Diversity of Sphingomonas Strains in Soils Contaminated with Polycyclic Aromatic Hydrocarbons." Applied and Environmental Microbiology 70(4): 1944-1955.<br><br>
-. Ma, M., et al. (2018). "Effect of long-term fertilization strategies on bacterial community composition in a 35-year field experiment of Chinese Mollisols." AMB Express 8(1): 20.<br><br>
-. Martineau, C., et al. (2015). "Comparative analysis of denitrifying activity in Hyphomicrobium nitrativorans, Hyphomicrobium denitrificans and Hyphomicrobium zavarzinii." AEM. 00848-00815.<br><br>
-. Rodgers-Vieira, E. A., et al. (2015). "Identification of Anthraquinone-Degrading Bacteria in Soil Contaminated with Polycyclic Aromatic Hydrocarbons." Applied and Environmental Microbiology.<br><br>
-. Sangwan, P., et al. (2005). "Detection and cultivation of soil verrucomicrobia." Appl Environ Microbiol 71(12): 8402-8410.<br><br>
-. Sorensen, J. and O. Nybroe (2004). Pseudomonas in the Soil Environment. Pseudomonas: Volume 1 Genomics, Life Style and Molecular Architecture. J.-L. Ramos. Boston, MA, Springer US: 369-401.<br><br>
-. Umadevi, P., et al. (2018). "Trichoderma harzianum MTCC 5179 impacts the population and functional dynamics of microbial community in the rhizosphere of black pepper (Piper nigrum L.)." Brazilian Journal of Microbiology 49(3): 463-470.<br><br>
-. van Dijl, J. M. and M. Hecker (2013). "Bacillus subtilis: from soil bacterium to super-secreting cell factory." Microb Cell Fact 12: 3.<br><br>
-. Wang, R., et al. (2017). "Microbial community composition is related to soil biological and chemical properties and bacterial wilt outbreak." Scientific Reports 7(1): 343.<br><br>
-. Winston, M. E., et al. (2014). "Understanding Cultivar-Specificity and Soil Determinants of the Cannabis Microbiome." PLOS ONE 9(6): e99641.<br><br>
-. Yan, G., et al. (2017). "Effects of different nitrogen additions on soil microbial communities in different seasons in a boreal forest."  8(7): e01879.<br><br>
          </p>
-     </div>
+      </div>
+      <div class="title_1">Cross Validation</div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;The resulting curve is the theoretical relationship between the two bacteria; however, the wide range of soil conditions that vary between different samples may alter the relationship. To account for this, Weka assigns weights to each correlation regression curve by performing cross validation, in this case with three folds. The steps are as follows:</p></div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Three folds of three different genera of bacteria are compared in pairs to determine the accuracy of each pair’s correlational relationship.</p></div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;If they exhibit a relationship in line with Weka’s initial assessment, nothing changes and the pair keeps its assigned weight.</p></div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;If they show unexpected associations, they are said to exhibit paradox. Paradox alerts Weka to the discrepancy between prediction and reality, causing it to adjust waiting accordingly.</p></div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Through this cross validation Weka calibrates weighting of each pair and can predict how an entire microbiota is related after analyzing all folds. The result can be expressed in a pie chart describing predicted microbial ratios</p></div>
+      <div class="title_1">Artificial Intelligence</div>
+      <div class="text">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Once our initial model is complete we can begin to make rough predictions about microbiota changes based on a volume of bio-stimulator. The basic rules we established regarding different soil conditions point us in the right direction in terms of bacteria shifts, but to achieve true precise control over soil we must improve our prediction accuracy through artificial intelligence. Artificial intelligence feeds actual data back into our system; more data allows for more calibration and more cross validations, adapting our predictions to the specific nature of our soil sample and improving the accuracy of subsequent predictions.</div>
+      <div class="title_1">Model Learning</div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;We began by generating our model using NGS data from April through June. We entered a volume of bio-stimulator as well NGS data from before and after application, thus generating an initial prediction model. The accuracy of our model with only one month of data was approximately 21%, while inclusion of a second month’s data increased accuracy to 51% - at this point if we were to predict results for June, we would get about 51% of the total microbiota correct. Again, we applied bio-stimulator to our soil and waited for our data. Using June data to calibrate our model increased the prediction accuracy of our model by over another 25%, resulting in a microbiota prediction model with 78% accuracy. </p></div>
+      <div class="title_1">Conclusion</div>
+      <div class="text"><p>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Increasingly accurate prediction of microbial shifts due to bio-stimulators is a vital element of our smart farming system. Our goal is to regulate soil microbiota precisely, and we need accurate models to do so. Luckily, machine learning and artificial intelligence can provide just that. A general model formed using established relationships between key environmental factors and bacterial growth is supported by correlation values calculated from NGS data to allow for rough initial predictions of microbial shifts. Raw data obtained after subsequent applications of bio-stimulators is reintroduced into our models through a feedback system, calibrating the weightings of each bacteria correlational relationship to improve accuracy with each cycle. With increasingly precise regulation we can manipulate soil microbiota to produce any desired effect. Visit our real farm demonstration (link) to  find out how we use artificial intelligence to increase curcumin concentration in turmeric while maintaining soil health.</p></div>
+    </div>
 <!----------------------------------------------------------------------------->