Probability is the heart and soul of statistics: at a very simple level, we collect data about something we wish to understand, then we use statistics to assess the likelihood—or probability—of the situation described by the data.
In practice, this simple idea gets complicated very quickly. Anyone who's been involved in a 6 Sigma project can confirm that sometimes the data we collect, and the results of our statistical analyses, seem far from simple.
Two Kinds of ProbabilityIf you ask a statistician or mathematician about "probability," you probably need to get more specific. You know how Inuit peoples supposedly have dozens of words for snow, because they have such a detailed awareness of different types of snow? Same thing applies to statisticians and types of probability. You've got subjective probability, conditional probability, joint probability, physical probability, evidential probability...you get the idea.
Now, you may not need to worry about all of those variations unless you're either an Inuit or a statistician (or both). Snow is snow, and probability is probability, right?
Sure...but there's dry snow and then there's the heavy, wet snow that causes tree limbs to break. If you own an orchard, you'd better know the difference. Similarly, there's single-event probability and there's cumulative probability, and if you're looking at data, you need to understand the difference.
Probability of a Single EventThe simplest type of probability is the measure of the chance that a single event will occur. For example, let's say we have a fair, six-sided die. The probability that a single throw will be a 4 is 1/6, because only 1 of the six sides is a 4.
Simple, right? Similarly, the probability that a single roll of the die will be a 1 is 1/6. The same holds true for 2, and for 3, and for 5, and for 6. The single-event probability that a roll of the die will result in any one face you select is 1 in 6.
Cumulative ProbabilityCumulative probability measures the odds of two, three, or more events happening. There's just one catch involved: each event needs to be independent of the others—you can't have two events that occur at the same time, or have the outcome of a first event influence the probability of the next (which would be conditional probability).
An easy way to get the concept of independence is to think about tossing a coin: for any one toss, it cannot land on both heads and tails, right? Moreover, getting a head or a tail on your first toss has no effect on whether you get a head or tail on your second toss. So tossing a coin is an independent event.
Now, back to cumulative probability itself, which tells you the odds of one, two, or more events taking place. The word to remember is "or," because that's what cumulative probability tells you. What are the chances that when you toss a coin five times, you'll get four OR fewer heads? That's cumulative probability.
The events in cumulative probability may be sequential, like coin tosses in a row, or they may be in a range. For example, if you're observing a response with three categories, the cumulative probability for an observation with response 2 would be the probability that the predicted response is 1 OR 2. So to find the odds of ONE of these two events occurring, we add—or accumulate—the chances of either one occurring.
Probability vs. Cumulative Probability: The ReckoningIf all this sounds a little confusing, that's okay. We can very easily illustrate the difference between cumulative and single-event probability by putting the data for rolling a die into a table.
Probability of rolling face value
(odds of 1 possible event)
|1/6 or 16.6%||1/6 or 16.6%||1/6 or 16.6%||1/6 or 16.6%||1/6 or 16.6%||1/6 or 16.6%|
Cumulative Probability of rolling face value or less
(odds of 1, 2, or more possible events)
|1/6 or 16.6%||2/6 or 33.3%||3/6 or 50%||4/6 or 66.6%||5/6 or 83.3%||6/6 or 100%|
The odds that we'll roll a 1 on a single roll of the die will be 1/6, right? That's a single-event probability. But if we roll the die and want to know the probability that we will roll a 1 or a 2, that's cumulative probability, because it is the accumulated value of the odds of one OR the other happening.
So the next time you're analyzing your data—or placing a bet on the craps table in Vegas—make sure you understand the type of probability you're dealing with!