# Statistical Analysis

Blog posts and articles about analyzing data and performing statistical analysis.

As we start off 2018, our eyes are on the winter weather, specifically low temperatures and snowfall. After 2015-2016's warmest winter on record and Chicago breaking records in 2017 with no snow sticking to the ground in January or February, our luck might have run out. We shall see, though. The Old Farmer's Almanac is reporting that 2017-2018 winter temperatures will be colder than last winter. If... Continue Reading
Wildfires in California have killed at least 40 people and burned more than 217,000 acres in the past few weeks. Nearly 8,000 firefighters are trying to contain the blazes with the aid of more than 800 firetrucks, 70 helicopters and 30 planes. In remote areas difficult to access by firetruck, smokejumpers may be needed to parachute in to fight the fires. But danger looms before a smokejumper even... Continue Reading
Overfitting a model is a real problem you need to beware of when performing regression analysis. An overfit model result in misleading regression coefficients, p-values, and R-squared statistics. Nobody wants that, so let's examine what overfit models are, and how to avoid falling into the overfitting trap. Put simply, an overfit model is too complex for the data you're analyzing. Rather than... Continue Reading
Maybe you're just getting started with analyzing data. Maybe you're reasonably knowledgeable about statistics, but it's been a long time since you did a particular analysis and you feel a little bit rusty. In either case, the Assistant menu in Minitab Statistical Software gives you an interactive guide from start to finish. It will help you choose the right tool quickly, analyze your data... Continue Reading
Control charts take data about your process and plot it so you can distinguish between common-cause and special-cause variation. Knowing the difference is important because it permits you to address potential problems without over-controlling your process.   Control charts are fantastic for assessing the stability of a process. Is the process mean unstable, too low, or too high? Is observed... Continue Reading
In statistics, as in life, absolute certainty is rare. That's why statisticians often can't provide a result that is as specific as we might like; instead, they provide the results of an analysis as a range, within which the data suggest the true answer lies. Most of us are familiar with "confidence intervals," but that's just of several different kinds of intervals we can use to characterize the... Continue Reading
by Matthew Barsalou, guest blogger At the end of the first part of this story, a group of evil trouble-making chickens had convinced all of their fellow chickens to march on the walled city of Wetzlar, where, said the evil chickens, they all would be much happier than they were on the farm. The chickens marched through the night and arrived at Wetzlar on the Lahn as the sun came up. “Let us in!”... Continue Reading
by Matthew Barsalou, guest blogger Once upon a time, in the Kingdom of Wetzlar, there was a farm with over a thousand chickens, two pigs, and a cow. The chickens were well treated, but a few rabble-rousers among them got the rest of the chickens worked up. These trouble-making chickens looked almost like the other chickens, but in fact they were evil chickens.  By HerbertT - Eigenproduktion, CC... Continue Reading
The Six Sigma quality improvement methodology has lasted for decades because it gets results. Companies in every country around the world, and in every industry, have used this logical, step-by-step method to improve the quality of their processes, products, and services. And they've saved billions of dollars along the way. However, Six Sigma involves a good deal of statistics and data analysis,... Continue Reading
Six Sigma is a quality improvement method that businesses have used for decades—because it gets results. A Six Sigma project follows a clearly defined series of steps, and companies in every industry in every country around the world have used this method to resolve problems. Along the way, they've saved billions of dollars. But Six Sigma relies heavily on statistics and data analysis, and many... Continue Reading
In April 2017, overbooking of flight seats hit the headlines when a United Airlines customer was dragged off a flight. A TED talk by Nina Klietsch gives a good, but simplistic explanation of why overbooking is so attractive to airlines. Overbooking is not new to the airlines; these strategies were officially sanctioned by The American Civil Aeronautics Board in 1965, and since that time complex... Continue Reading
Can you trust your data?  That's the very first question we need to ask when we perform a statistical analysis. If the data's no good, it doesn't matter what statistical methods we employ, nor how much expertise we have in analyzing data. If we start with bad data, we'll end up with unreliable results. Garbage in, garbage out, as they say. So, can you trust your data? Are you positive?... Continue Reading
We had solar panels fitted on our property in 2011. Last year, we had a few problems with the equipment. It was shutting down at various times throughout the day, typically when it was very sunny, resulting in no electricity being generated. In summer 2016, I completed a statistical analysis in Minitab to confirm my suspicions that my solar panels were not working as well as they did when they were... Continue Reading
There may not be a situation more perilous than being a character on Game of Thrones. Warden of the North, Hand of the King, and apparent protagonist of the entire series? Off with your head before the end of the first season! Last male heir of a royal bloodline? Here, have a pot of molten gold poured on your head! Invited to a wedding? Well, you probably know what happens at weddings in the show. ... Continue Reading
Previously, I’ve written about when to choose nonlinear regression and how to model curvature with both linear and nonlinear regression. Since then, I’ve received several comments expressing confusion about what differentiates nonlinear equations from linear equations. This confusion is understandable because both types can model curves. So, if it’s not the ability to model a curve, what isthe... Continue Reading
It’s usually not a good idea to rely solely on a single statistic to draw conclusions about your process. Do that, and you could fall into the clutches of the “duck-rabbit” illusion shown here: If you fix your eyes solely on the duck, you’ll miss the rabbit—and vice-versa. If you're using Minitab Statistical Software for capability analysis, the capability indices Cp and Cpk are good examples of... Continue Reading
One of the biggest pieces of international news last year was the so-called "Brexit" referendum, in which a majority of voters in the United Kingdom cast their ballots to leave the European Union (EU). That outcome shocked the world. Follow-up media coverage has asserted that the younger generation prefers to remain in the EU since that means more opportunities on the continent. The older... Continue Reading
Last week I was fielding questions on social media about Minitab 18, the latest version of our statistical software. Almost as soon as the new release was announced, we received a question that comes up often from people in pharmaceutical and medical device companies: "Is Minitab 18 FDA-validated?" How Software Gets Validated That's a great question. To satisfy U.S. Food and Drug Administration (FDA)... Continue Reading