Regression Analysis

Blog posts and articles about regression analysis methods applied to Lean and Six Sigma projects.

Dear Readers, As 2016 comes to a close, it’s time to reflect on the passage of time and changes. As I’m sure you’ve guessed, I love statistics and analyzing data! I also love talking and writing about it. In fact, I’ve been writing statistical blog posts for over five years, and it’s been an absolute blast. John Tukey, the renowned statistician, once said, “The best thing about being a statistician... Continue Reading
by Matt Barsalou, guest blogger. I know that Thanksgiving is always on the last Thursday in November, but somehow I failed to notice it was fast approaching until the Monday before Thanksgiving. This led to frantically sending a last-minute invitation, and a hunt for a turkey. I live in Germany and this greatly complicated the matter. Not only is Thanksgiving not celebrated, but also actual turkeys... Continue Reading

This week we’re celebrating the annual Thanksgiving holiday in the United States, which is not only a good time to reflect on the things we’re grateful for, but it’s also a good time to stuff yourself with turkey, mashed potatoes, green bean casserole, and the usual suspects that find their way to the Thanksgiving table! While I’m of course very thankful for my family, friends, home, etc., I’m also... Continue Reading
With another Halloween almost upon us, here's a look back at some of the posts we've written about this holiday specifically, and about various creepy things in general. I hope that you enjoy this roundup of 13 scary statistics posts...and that they won't keep you up at night! 1. How to Make Minitab Wear a Halloween Costume As Halloween nears, you can customize your Minitab interface to match the... Continue Reading
Data mining can be helpful in the exploratory phase of an analysis. If you're in the early stages and you're just figuring out which predictors are potentially correlated with your response variable, data mining can help you identify candidates. However, there are problems associated with using data mining to select variables. In my previous post, we used data mining to settle on the following... Continue Reading
Since the release of Minitab Express in 2014, we’ve often received questions in technical support about the differences between Express and Minitab 17.  In this post, I’ll attempt to provide a comparison between these two Minitab products. What Is Minitab 17? Minitab 17 is an all-in-one graphical and statistical analysis package that includes basic analysis tools such as hypothesis testing,... Continue Reading
October 16–22 is National Healthcare Quality Week, started by the National Association for Healthcare Quality to increase awareness of healthcare quality programs and to highlight the work of healthcare quality professionals and their influence on improved patient care outcomes. This event deserves your attention because the quality of healthcare affects every one of us, and so does the cost of... Continue Reading
If you were among the 300 people who attended the first-ever Minitab Insights conference in September, you already know how powerful it was. Attendees learned how practitioners from a wide range of industries use data analysis to address a variety of problems, find solutions, and improve business practices. In the coming weeks and months, we will share more of the great insights and guidance shared... Continue Reading
Face it, you love regression analysis as much as I do. Regression is one of the most satisfying analyses in Minitab: get some predictors that should have a relationship to a response, go through a model selection process, interpret fit statistics like adjusted R² and predicted R², and make predictions. Yes, regression really is quite wonderful. Except when it’s not. Dark, seedy corners of the data... Continue Reading
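For readers who want to see how the two fit statistics named in this teaser are computed, here is a minimal sketch for a one-predictor least-squares fit. It is an illustration under stated assumptions, not Minitab's implementation: the function name `fit_statistics` and the five data points are invented, and predicted R² is obtained from the PRESS statistic using the standard leverage shortcut for simple linear regression.

```python
def fit_statistics(x, y):
    """Return (r_sq, adj_r_sq, pred_r_sq) for a one-predictor least-squares fit."""
    n = len(x)
    p = 1  # number of predictors (assumption: simple linear regression)
    x_bar = sum(x) / n
    y_bar = sum(y) / n

    # Least-squares slope and intercept.
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar

    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    ss_total = sum((yi - y_bar) ** 2 for yi in y)
    ss_error = sum(e ** 2 for e in residuals)

    r_sq = 1 - ss_error / ss_total
    # Adjusted R-squared penalizes each additional predictor.
    adj_r_sq = 1 - (ss_error / (n - p - 1)) / (ss_total / (n - 1))

    # PRESS: squared leave-one-out prediction errors, computed from leverages.
    leverages = [1 / n + (xi - x_bar) ** 2 / sxx for xi in x]
    press = sum((e / (1 - h)) ** 2 for e, h in zip(residuals, leverages))
    pred_r_sq = 1 - press / ss_total
    return r_sq, adj_r_sq, pred_r_sq


# Hypothetical data, invented for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r_sq, adj_r_sq, pred_r_sq = fit_statistics(x, y)
print(f"R-sq={r_sq:.4f}  R-sq(adj)={adj_r_sq:.4f}  R-sq(pred)={pred_r_sq:.4f}")
```

With so few points, predicted R² falls well below adjusted R², which is the usual warning sign that a model will not predict new observations as well as it fits the sample.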
Data mining uses algorithms to explore correlations in data sets. An automated procedure sorts through large numbers of variables and includes them in the model based on statistical significance alone. No thought is given to whether the variables and the signs and magnitudes of their coefficients make theoretical sense. We tend to think of data mining in the context of big data, with its huge... Continue Reading
Today, September 16, is World Ozone Day. You don't hear much about the ozone layer any more. In fact, if you’re under 30, you might think this is just another trivial, obscure observance, along the lines of International Dot Day (yesterday) or National Apple Dumpling Day (tomorrow). But there’s a good reason that, almost 30 years ago, the United Nations designated today as a day to raise... Continue Reading
You’ve performed multiple linear regression and have settled on a model which contains several predictor variables that are statistically significant. At this point, it’s common to ask, “Which variable is most important?” This question is more complicated than it first appears. For one thing, how you define “most important” often depends on your subject area and goals. For another, how you collect... Continue Reading
There may be huge potential benefits waiting in the data in your servers. These data may be used for many different purposes. Better data allows better decisions, of course. Banks, insurance firms, and telecom companies already own a large amount of data about their customers. These resources are useful for building a more personal relationship with each customer. Some organizations already use... Continue Reading
The college football season is here, and this raises a very important question: Is Alabama going to be undefeated when they win the national championship, or will they lose a regular-season game along the way? Okay, so it's not a given that Alabama is going to win the championship this year, but when you've won 4 of the last 7 you're definitely the odds-on favorite. However, what if we wanted to take... Continue Reading
If you’re in the market for statistical software, there are many considerations and more than a few options for you to evaluate. Check out these seven questions to ask yourself before choosing statistical software—your answers should help guide you towards the best solution for your needs! 1. Who uses statistical software in your organization? Are they expert statisticians, novices, or a mix of both?... Continue Reading
In regression, "sums of squares" are used to represent variation. In this post, we’ll use some sample data to walk through these calculations. The sample data used in this post is available within Minitab by choosing Help > Sample Data, or File > Open Worksheet > Look in Minitab Sample Data folder (depending on your version of Minitab).  The dataset is called ResearcherSalary.MTW, and contains data... Continue Reading
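As a rough illustration of the calculation this post describes, the sketch below splits the total variation in a response into a regression part and an error part for a one-predictor least-squares fit. The function name `sums_of_squares` and the five data points are invented for illustration; they are not taken from ResearcherSalary.MTW or from Minitab.

```python
def sums_of_squares(x, y):
    """Return (ss_total, ss_regression, ss_error) for a one-predictor fit."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n

    # Least-squares slope and intercept.
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    fitted = [intercept + slope * xi for xi in x]

    # Total variation of the response around its mean.
    ss_total = sum((yi - y_bar) ** 2 for yi in y)
    # Variation explained by the fitted line.
    ss_regression = sum((fi - y_bar) ** 2 for fi in fitted)
    # Variation left over in the residuals.
    ss_error = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    return ss_total, ss_regression, ss_error


# Hypothetical data, invented for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
ss_total, ss_regression, ss_error = sums_of_squares(x, y)
print(f"SS Total={ss_total:.2f}  SS Regression={ss_regression:.2f}  SS Error={ss_error:.2f}")
```

For a least-squares fit with an intercept, the regression and error sums of squares add up to the total, and the ratio of regression to total sum of squares is R².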
Have you ever accidentally done statistics? Not all of us can (or would want to) be “stat nerds,” but the word “statistics” shouldn’t be scary. In fact, we all analyze things that happen to us every day. Sometimes we don’t realize that we are compiling data and analyzing it, but that’s exactly what we are doing. Yes, there are advanced statistical concepts that can be difficult to understand—but... Continue Reading
Statistics is all about modelling. But that doesn’t mean strutting down the catwalk with a pouty expression.  It means we’re often looking for a mathematical form that best describes relationships between variables in a population, which we can then use to estimate or predict data values, based on known probability distributions. To aid in the search and selection of a “top model,” we often utilize... Continue Reading
You need to consider many factors when you’re buying a used car. Once you narrow your choice down to a particular car model, you can get a wealth of information about individual cars on the market through the Internet. How do you navigate through it all to find the best deal?  By analyzing the data you have available.   Let's look at how this works using the Assistant in Minitab 17. With the... Continue Reading
Design of Experiments (DOE) is the perfect tool to efficiently determine if key inputs are related to key outputs. Behind the scenes, DOE is simply a regression analysis. What’s not simple, however, is all of the choices you have to make when planning your experiment. What X’s should you test? What ranges should you select for your X’s? How many replicates should you use? Do you need center... Continue Reading