

Part 2: The Differential Equation Model
As the first step in the modeling process, we identify the independent and dependent variables. The independent variable is time t, measured in days. We consider two related sets of dependent variables.
The first set of dependent variables counts people in each of the groups, each as a function of time:
S = S(t)  is the number of susceptible individuals, 
I = I(t)  is the number of infected individuals, and 
R = R(t)  is the number of recovered individuals. 
The second set of dependent variables represents the fraction of the total population in each of the three categories. So, if N is the total population (7,900,000 in our example), we have
s(t) = S(t)/N,  the susceptible fraction of the population, 
i(t) = I(t)/N,  the infected fraction of the population, and 
r(t) = R(t)/N,  the recovered fraction of the population. 
It may seem more natural to work with population counts, but some of our calculations will be simpler if we use the fractions instead. The two sets of dependent variables are proportional to each other, so either set will give us the same information about the progress of the epidemic.
Next we make some assumptions about the rates of change of our dependent variables:
No one is added to the susceptible group, since we are ignoring births and immigration. The only way an individual leaves the susceptible group is by becoming infected. We assume that the timerate of change of S(t), the number of susceptibles, depends on the number already susceptible, the number of individuals already infected, and the amount of contact between susceptibles and infecteds. In particular, suppose that each infected individual has a fixed number b of contacts per day that are sufficient to spread the disease. Not all these contacts are with susceptible individuals. If we assume a homogeneous mixing of the population, the fraction of these contacts that are with susceptibles is s(t). Thus, on average, each infected individual generates b s(t) new infected individuals per day.
Let's see what these assumptions tell us about derivatives of our dependent variables.
(a) Explain carefully how each component of the differential equation
(1) 
(2) 
(3) 
(a) Explain why
(4) 
(b) Now explain carefully how each component of the equation
(5) 
Finally, we complete our model by giving each differential equation an initial condition. For this particular virus  Hong Kong flu in New York City in the late 1960's  hardly anyone was immune at the beginning of the epidemic, so almost everyone was susceptible. We will assume that there was a trace level of infection in the population, say, 10 people. Thus, our initial values for the population variables are
S(0) = 7,900,000 
I(0) = 10 
R(0) = 0 
In terms of the scaled variables, these initial conditions are
s(0) = 1 
i(0) = 1.27 x 10^{ 6} 
r(0) = 0 
(Note: The sum of our starting populations is not exactly N, nor is the sum of our fractions exactly 1. The trace level of infection is so small that this won't make any difference.) Our complete model is
(6) 
We don't know values for the parameters b and k yet, but we can estimate them, and then adjust them as necessary to fit the excess death data. We have already estimated the average period of infectiousness at three days, so that would suggest k = 1/3. If we guess that each infected would make a possibly infecting contact every two days, then b would be 1/2. We emphasize that this is just a guess. The following plot shows the solution curves for these choices of b and k.
Because of the standard choice of symbols for the dependent variables, this model is called the SIR Model. In Part 3, we will see how solution curves can be computed even without formulas for the solution functions. [Note: If you have already worked through the Systems of Differential Equations module, you may skip Part 3 of this module and go straight to Part 4.]


modules at math.duke.edu  Copyright CCP and the author(s), 2000 