Skip to main content


next previous up

Next 4- Virus Prevalence Statistics
Previous 3.2.1- Viral Influx
Up 3.2- Viral Spread in Organizations

3.2.2- Internal Spread

We can get some insight into the second issue -- that of internal spread -- by the following simple model.

Let us assume that central reporting and response are perfectly effective, so that an incident is completely cleaned up as soon as any machine is found to be infected. We wish to know:

  1. How many machines are typically infected before the incident is discovered and cleaned up (i.e. what is the distribution of incident sizes and its average)?
  2. What is the average duration of an incident, both in general and as a function of the incident size?

To make the problem tractable, let us assume that homogeneous mixing applies within the organization. Then, an excellent approximation to the distribution of incident sizes can be derived as follows. Suppose that a virus has infected a machine in an organization, and that after some period of time the number of infected machines stemming from this initial event is n. The next event will be either a birth (resulting in n+1 infections) or a death (resulting in 0 infections, assuming that the clean-up is instantaneous and contemporaneous with detection by one of the machines). The rate at which deaths occur is simply tex2html_wrap_inline1102 , and the rate at which births occur is tex2html_wrap_inline1104 , where N is the total number of machines in the organization. Thus the probability of going from n infections to n+1 infections is

 

equation173

with the approximation being valid to the extent that tex2html_wrap_inline1112 . Then the probability that an incident will be discovered and cleaned-up after n machines are infected is:

 

equation181

Thus the size distribution is very nearly exponential, with mean tex2html_wrap_inline1116 given by:

 

equation190

which is valid provided that tex2html_wrap_inline1118 (or equivalently tex2html_wrap_inline1120 ). Previously, we found that in the absence of centralized response, an epidemic can occur if tex2html_wrap_inline1122 . However, Eq. 5 shows that, given perfect centralized response, the average incident size is tex2html_wrap_inline1124 even when tex2html_wrap_inline1126 , provided that tex2html_wrap_inline1128 is not so small as O(1/N).

Note that, if the average incident size is less than two, the organization is below the epidemic threshold, and viruses would not propagate much even if central response were suddenly eliminated. However, if the average incident size is greater than two, the organization is intrinsically above the epidemic threshold, and elimination of central response would make the organization highly susceptible to widespread propagation of any virus that happened to enter it.

As a first step in deriving the distribution of incident durations, we can calculate the probabilities p(n,t) for there to be n infections at time t. Suppose that there are n infected machines at time t. Then the probability per unit time of making a transition to n+1 infected machines is tex2html_wrap_inline1144 . The probability per unit time of discovering the virus on one machine (and thus making an instantaneous transition to 0 infected machines) is tex2html_wrap_inline1146 . From these considerations we obtain the coupled differential equations:

 

eqnarray201

valid for tex2html_wrap_inline1148 . p(0,t) can be obtained either from the rate equation:

 

equation211

or the normalization condition:

 

equation217

Typically, we are interested in solving Eq. 6 given the initial condition p(1,t) = 1; p(n,t) = 0, tex2html_wrap_inline1156 .

If we make the approximation tex2html_wrap_inline1158 , we can solve Eq. 6 analytically. Consider the equation for p(1,t):

 

equation223

Given the initial condition p(1,0) = 1, we immediately obtain:

 

equation228

The equation for p(2,t) is:

 

equation232

Using the method of integrating factors and the initial condition p(2,0)=0, we obtain the solution:

 

eqnarray237

In general, the solution for p(n,t) can be expressed as a convolution involving p(n-1,t):

 

eqnarray246

as can be shown by induction. To obtain p(0,t), we can insert Eq. 13 into the normalization condition given by Eq. 8. Summing the resulting geometric series, we obtain:

 

equation258

It is straightforward to verify that this solution for p(0,t) also satisfies the rate equation (Eq. 7). As one would expect, p(0,t) increases monotonically from 0 at t=0 towards 1 as tex2html_wrap_inline1180 .

Having obtained analytic formulas for the probabilities p(n,t) of n infections at time t, we can now use them to calculate several quantities of interest. As a simple warmup exercise, we can calculate the distribution of incident sizes, which was derived earlier by another method. The probability for there to be n infections at time t followed by a transition to 0 infections at some time t' in the infinitesimal interval tex2html_wrap_inline1192 is tex2html_wrap_inline1194 . Integrating over all possible ``extinction'' times t, we obtain the probability P(n) that the incident size was n:

 

eqnarray264

in agreement with the result given by Eq. 4. (The substitution tex2html_wrap_inline1202 was made in going from the first line to the second in the above derivation.)

The duration distribution Q(n,t) for an incident of size n is simply the extinction time distribution normalized such that tex2html_wrap_inline1208 for all n:

 

eqnarray276

To obtain the average duration Q(n) of an incident of size n, we need to solve the following integral:

 

eqnarray283

For sufficiently large n, Eq. 17 is approximately

 

equation297

where tex2html_wrap_inline1218 is Euler's constant. Thus the expected duration of an incident scales logarithmically with its size. This can be attributed to the exponential growth in the number of infections with time, a hallmark of the homogeneous approximation.

To obtain the overall duration distribution Q(t), we can average the distribution Q(n,t) over all incident sizes n (using the weighting factor given by Eq. 4). Alternatively (and more simply), we can note that

 

eqnarray305

Finally, the overall average duration Q is given by:

 

eqnarray314

In the above derivation, the order of summation was switched in going from the first line to the second, and the fourth line was obtained from the third by identifying the Taylor series expansion for tex2html_wrap_inline1228 . Of course, the same result could have been obtained by performing the integral tex2html_wrap_inline1230 .

The rates tex2html_wrap_inline1232 and tex2html_wrap_inline1234 figure prominently in the various expressions for probability distributions and averages of the incident size and incident duration. By measuring the average incident size tex2html_wrap_inline1236 in a particular organization with good central reporting and response, we might hope to use Eq. 5 to estimate tex2html_wrap_inline1238 in that organization. In order to estimate tex2html_wrap_inline1240 and tex2html_wrap_inline1242 separately, we could combine this estimate of tex2html_wrap_inline1244 with a measurement of the average incident duration and use Eq. 20.

For several reasons, such an exercise might be difficult. Although data on the incident size distribution can be collected (see Section 4), data on incident durations are very difficult to obtain because it is hard to tell when an incident began. In addition, there are several idealizations in this particular model that may not reflect the real world. In principle (if they can be measured), the various probability distributions derived in this section can be used as independent checks of the validity of the approximations made. For example, in a population of individuals in which tex2html_wrap_inline1246 and tex2html_wrap_inline1248 vary somewhat from one individual to another, we might expect the distribution of incident sizes to deviate from the exponential distribution predicted by Eq. 4. Indeed, as will be seen in the next section, the incident size distributions of our sample population exhibit a non-exponential tail. Another potential difficulty is the use of the homogeneous-mixing approximation in deriving these results. In the future, simulations will be used to assess the degree to which topology alters the theoretical results of this section. We expect the results for incident duration to be affected significantly because they appear to contain quantities associated with exponential growth. The results for incident size may be somewhat less affected, because they do not depend on the time scales involved.

Thus, a model based on the organizational perspective has the potential to help us measure important theoretical parameters, but attempts to do so now are probably premature. In the future, by incorporating topological and other effects into the theory and by finding ways of measuring either the average incident duration, tex2html_wrap_inline1250 , or tex2html_wrap_inline1252 , we should be able to tie many attributes of virus incidents together and to estimate parameters that will help us predict virus spread on a global scale.

An additional point should be rescued from the morass of equations and emphasized very clearly here. Central reporting and response appears to be a powerfully effective policy. Even if an organization is intrinsically above the epidemic threshold, central reporting and response prevent the incident size from scaling with the number of machines in the organization. Not only do incidents remain small; their duration is finite (rather than infinite). As will be seen in the next section, our virus prevalence statistics also suggest that organizations should adopt this policy.


next previous up

Next 4- Virus Prevalence Statistics
Previous 3.2.1- Viral Influx
Up 3.2- Viral Spread in Organizations


Back To Index