Data Analysis

Blog posts and articles with tips for data analysis for quality improvement methodologies, including Six Sigma and Lean.

Wildfires in California have killed at least 40 people and burned more than 217,000 acres in the past few weeks. Nearly 8,000 firefighters are trying to contain the blazes with the aid of more than 800 firetrucks, 70 helicopters and 30 planes. In remote areas difficult to access by firetruck, smokejumpers may be needed to parachute in to fight the fires. But danger looms before a smokejumper even... Continue Reading
Research out of the Juran Institute, which specializes in training, certification, and consulting on quality management globally, reveals that only 30 percent of improvement initiatives succeed.   And why do these initiatives fail so frequently? This research concludes that a lack of management support is the No. 1 reason quality improvement initiatives fail. But this is certainly not a problem... Continue Reading

7 Deadly Statistical Sins Even the Experts Make

Do you know how to avoid them?

Get the facts >
Overfitting a model is a real problem you need to beware of when performing regression analysis. An overfit model result in misleading regression coefficients, p-values, and R-squared statistics. Nobody wants that, so let's examine what overfit models are, and how to avoid falling into the overfitting trap. Put simply, an overfit model is too complex for the data you're analyzing. Rather than... Continue Reading
Maybe you're just getting started with analyzing data. Maybe you're reasonably knowledgeable about statistics, but it's been a long time since you did a particular analysis and you feel a little bit rusty. In either case, the Assistant menu in Minitab Statistical Software gives you an interactive guide from start to finish. It will help you choose the right tool quickly, analyze your data... Continue Reading
Control charts take data about your process and plot it so you can distinguish between common-cause and special-cause variation. Knowing the difference is important because it permits you to address potential problems without over-controlling your process.   Control charts are fantastic for assessing the stability of a process. Is the process mean unstable, too low, or too high? Is observed... Continue Reading
In statistics, as in life, absolute certainty is rare. That's why statisticians often can't provide a result that is as specific as we might like; instead, they provide the results of an analysis as a range, within which the data suggest the true answer lies. Most of us are familiar with "confidence intervals," but that's just of several different kinds of intervals we can use to characterize the... Continue Reading
by Matthew Barsalou, guest blogger At the end of the first part of this story, a group of evil trouble-making chickens had convinced all of their fellow chickens to march on the walled city of Wetzlar, where, said the evil chickens, they all would be much happier than they were on the farm. The chickens marched through the night and arrived at Wetzlar on the Lahn as the sun came up. “Let us in!”... Continue Reading
by Matthew Barsalou, guest blogger Once upon a time, in the Kingdom of Wetzlar, there was a farm with over a thousand chickens, two pigs, and a cow. The chickens were well treated, but a few rabble-rousers among them got the rest of the chickens worked up. These trouble-making chickens looked almost like the other chickens, but in fact they were evil chickens.  By HerbertT - Eigenproduktion, CC... Continue Reading
The Six Sigma quality improvement methodology has lasted for decades because it gets results. Companies in every country around the world, and in every industry, have used this logical, step-by-step method to improve the quality of their processes, products, and services. And they've saved billions of dollars along the way. However, Six Sigma involves a good deal of statistics and data analysis,... Continue Reading
Six Sigma is a quality improvement method that businesses have used for decades—because it gets results. A Six Sigma project follows a clearly defined series of steps, and companies in every industry in every country around the world have used this method to resolve problems. Along the way, they've saved billions of dollars. But Six Sigma relies heavily on statistics and data analysis, and many... Continue Reading
In April 2017, overbooking of flight seats hit the headlines when a United Airlines customer was dragged off a flight. A TED talk by Nina Klietsch gives a good, but simplistic explanation of why overbooking is so attractive to airlines. Overbooking is not new to the airlines; these strategies were officially sanctioned by The American Civil Aeronautics Board in 1965, and since that time complex... Continue Reading
Can you trust your data?  That's the very first question we need to ask when we perform a statistical analysis. If the data's no good, it doesn't matter what statistical methods we employ, nor how much expertise we have in analyzing data. If we start with bad data, we'll end up with unreliable results. Garbage in, garbage out, as they say. So, can you trust your data? Are you positive?... Continue Reading
We had solar panels fitted on our property in 2011. Last year, we had a few problems with the equipment. It was shutting down at various times throughout the day, typically when it was very sunny, resulting in no electricity being generated. In summer 2016, I completed a statistical analysis in Minitab to confirm my suspicions that my solar panels were not working as well as they did when they were... Continue Reading
All processes have variation, some of which is inherent in the process, and isn't a reason for concern. But when processes show unusual variation, it may indicate a change or a "special cause" that requires your attention.  Control charts are the primary tool quality practitioners use to detect special cause variation and distinguish it from natural, inherent process variation. These charts graph... Continue Reading
In my time at Minitab, I’ve gotten a good understanding of what types of graphs users create. Everyone knows about histograms, bar charts, and time series plots. Even relatively less familiar plots like the interval plot and individual value plot are still used quite often. However, one of the most underutilized graphs we have available is the area graph. If you’re not familiar with an Area... Continue Reading
There may not be a situation more perilous than being a character on Game of Thrones. Warden of the North, Hand of the King, and apparent protagonist of the entire series? Off with your head before the end of the first season! Last male heir of a royal bloodline? Here, have a pot of molten gold poured on your head! Invited to a wedding? Well, you probably know what happens at weddings in the show. ... Continue Reading
If you have a process that isn’t meeting specifications, using the Monte Carlo simulation and optimization tools in Companion by Minitab can help. Here’s how you, as an engineer in the medical device industry, could use Companion to improve a packaging process and help ensure patient safety. Your product line at AlphaGamma Medical Devices is shipped in heat-sealed packages with a minimum seal... Continue Reading
How many samples do you need to be “95% confident that at least 95%—or even 99%—of your product is good? The answer depends on the type of response variable you are using, categorical or continuous. The type of response will dictate whether you 'll use: Attribute Sampling: Determine the sample size for a categorical response that classifies each unit as Good or Bad (or, perhaps, In-spec or... Continue Reading
The two previous posts in this series focused on manipulating data using Minitab’s calculator and the Data menu. In this third and final post, we continue to explore helpful features for working with text data and will focus on some features in Minitab’s Editor menu. Using the Editor Menu  The Editor menu is unique in that the options displayed depend on what is currently active (worksheet, graph,... Continue Reading
Have you ever had a probability plot that looks like this? The probability plot above is based on patient weight (in pounds) after surgery minus patient weight (again, in pounds) before surgery. The red line appears to go through the data, indicating a good fit to the Normal, but there are clusters of plotting points at the same measured value. This occurs on a probability plot when there are many... Continue Reading