Data mining can be helpful in the exploratory phase of an
analysis. If you're in the early stages and you're just figuring
out which predictors are potentially correlated with your response
variable, data mining can help you identify candidates. However,
there are problems associated with using data mining to select
In my previous post, we used data mining to settle on
the following... Continue Reading
The ultimate goal of most quality improvement projects is clear:
reducing the number of defects, improving a response, or making a
change that benefits your customers.
We often want to jump right in and start gathering and analyzing
data so we can solve the problems. Checking your measurement
systems first, with methods like attribute agreement analysis or
Gage R&R, may seem like a needless waste... Continue Reading
We hosted our first-ever Minitab Insights conference in
September, and if you were among the attendees, you already know
the caliber of the speakers and the value of the information they
shared. Experts from a wide range of industries offered a lot of
great lessons about how they use data analysis to improve business
practices and solve a variety of problems.
I blogged earlier about five key... Continue Reading
If you were among the 300 people who attended the first-ever
Minitab Insights conference in September, you already know how
powerful it was. Attendees learned how practitioners from a
wide range of industries use data analysis to address a variety of
problems, find solutions, and improve business practices.
In the coming weeks and months, we will share more of the great
insights and guidance shared... Continue Reading
Face it, you love regression analysis as much as I do.
Regression is one of the most satisfying analyses in Minitab:
get some predictors that should have a relationship to a response,
go through a model selection process, interpret fit statistics like
adjusted R² and predicted R², and make
predictions. Yes, regression really is quite wonderful.
Except when it’s not. Dark, seedy corners of the data... Continue Reading
We’ve got a plethora of case studies showing how businesses from different
industries solve problems and implement solutions with data
analysis. Take a look for ideas about how you can use data analysis
to ensure excellence at your business!
Boston Scientific, one of the world’s leading developers of
medical devices, is just one organization that has shared its
story. A team at their Heredia,... Continue Reading
True or false: When comparing a parameter for two sets of
measurements, you should always use a hypothesis test to determine
whether the difference is statistically significant.
The answer? (drumroll...) True!
To understand this paradoxical answer, you need to keep in mind
the difference between samples, populations, and descriptive and
Descriptive Statistics and... Continue Reading
Data mining uses algorithms to explore correlations in data sets. An
automated procedure sorts through large numbers of variables and
includes them in the model based on statistical significance alone.
No thought is given to whether the variables and the signs and
magnitudes of their coefficients make theoretical sense.
We tend to think of data mining in the context of big data, with
its huge... Continue Reading
September 16 is World Ozone Day. You don't hear much about the
ozone layer any more.
In fact, if you’re under 30, you might think this is just
another trivial, obscure observance, along the lines of International Dot Day (yesterday) or National Apple Dumpling Day (tomorrow).
But there’s a good reason that, almost 30 years ago, the United
Nations designated today as a day to raise... Continue Reading
I confess: I'm not a natural-born decision-maker. Some people—my
wife, for example—can assess even very complex situations, consider
the options, and confidently choose a way forward. Me? I get
anxious about deciding what to eat for lunch. So you can imagine
what it used to be like when I
needed to confront a really big decision or problem. My approach,
to paraphrase the Byrds, was "Re:... Continue Reading
There may be huge potential benefits waiting in the data in your
servers. These data may be used for many different purposes. Better
data allows better decisions, of course. Banks, insurance firms,
and telecom companies already own a large amount of data about
their customers. These resources are useful for building a more
personal relationship with each customer.
Some organizations already use... Continue Reading
In 2011 we had solar panels fitted on our property. In the last
few months we have noticed a few problems with the inverter (the
equipment that converts the electricity generated by the panels
from DC to AC, and manages the transfer of unused electricity to the
power company). It was shutting down at various times throughout
the day, typically when it was very sunny, resulting in no
electricity being... Continue Reading
In regression, "sums of squares" are used to represent
variation. In this post, we’ll use some sample data to walk through
sample data used in this post is available within Minitab by
choosing Help > Sample Data,
or File > Open Worksheet >
Look in Minitab Sample Data folder (depending on
your version of Minitab). The dataset is called
ResearcherSalary.MTW, and contains data... Continue Reading
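For readers who want to see the idea in code, here is a minimal sketch of how sums of squares split up the variation in a simple linear regression. It uses plain Python rather than Minitab, and the x and y values are made-up numbers for illustration, not the ResearcherSalary.MTW data. The function name `sums_of_squares` is hypothetical.

```python
def sums_of_squares(x, y):
    """Fit y = intercept + slope * x by least squares and return
    (total, regression, error) sums of squares."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n

    # Least-squares slope and intercept for one predictor.
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    fitted = [intercept + slope * xi for xi in x]

    # Total variation, variation explained by the fit, and leftover variation.
    ss_total = sum((yi - y_bar) ** 2 for yi in y)
    ss_regression = sum((fi - y_bar) ** 2 for fi in fitted)
    ss_error = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    return ss_total, ss_regression, ss_error


# Made-up illustrative data (an assumption, not from the post).
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
ss_total, ss_regression, ss_error = sums_of_squares(x, y)
print(ss_total, ss_regression, ss_error)
```

With a least-squares fit that includes an intercept, the total sum of squares equals the regression sum of squares plus the error sum of squares, up to floating-point rounding.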
So the data you nurtured, that you worked so hard to format and
make useful, failed the normality test.
Time to face the truth: despite your best efforts, that data set
is never going to measure up to the assumption you may
have been trained to fervently look for.
Your data's lack of normality seems to make it poorly suited for
analysis. Now what?
Take it easy. Don't get uptight. Just let your data... Continue Reading
See if this sounds fair to you. I flip a coin.
Heads: You win $1.
Tails: You pay me $1.
You may not like games of chance, but you have to admit it seems
like a fair game. At least, assuming the coin is a normal, balanced
coin, and assuming I’m not a sleight-of-hand magician who can
control the coin.
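One way to check that intuition is to work out the expected payoff. The short sketch below assumes a balanced coin (probability 1/2 for each side), and the helper name `expected_value` is hypothetical.

```python
from fractions import Fraction


def expected_value(outcomes):
    """Return the expected payoff given (probability, payoff) pairs."""
    return sum(p * payoff for p, payoff in outcomes)


# Balanced coin: heads wins $1, tails loses $1.
coin_game = [(Fraction(1, 2), 1), (Fraction(1, 2), -1)]
fair_game_ev = expected_value(coin_game)
print(fair_game_ev)  # 0
```

An expected payoff of zero is the usual sense in which a game like this is called fair: over many plays, neither side expects to come out ahead.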
How about this next
You pay me $2 to play.
I flip a coin over and over until it comes up heads.
Your... Continue Reading
The Centers for Medicare and Medicaid Services (CMS) updated
their star ratings on July 27. Turns out, the list of hospitals
provides a great way to look at how easy it is to get random samples
from data within Minitab.
Say, for example, that you wanted to look at the association
between the government’s new star ratings and the safety rating
scores provided by hospitalsafetyscore.org. The CMS score... Continue Reading
Often, when we start analyzing
new data, one of the very first things we look at is whether
certain pairs of variables are correlated. Correlation can tell if two variables have a
linear relationship, and the strength of that relationship. This
makes sense as a starting point, since we're usually looking for
relationships and correlation is an easy way to get a quick handle
on the data set we're... Continue Reading
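As a rough illustration of what that first look involves, here is a small sketch that computes the Pearson correlation coefficient by hand in plain Python. The function name `pearson_r` and the data values are assumptions made up for this example. In practice you would let your statistics package do this for you.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Made-up data: y rises with x, but not perfectly.
x = [1, 2, 3, 4, 5]
y = [2.0, 4.5, 5.5, 8.5, 9.0]
r = pearson_r(x, y)
print(round(r, 3))
```

Values near +1 or -1 suggest a strong linear relationship, and values near 0 suggest little or no linear relationship. Keep in mind that this statistic only describes linear association.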
My recent beach vacation began with the kind of unfortunate
incident that we all dread: killing a distant relative.
It was about 3 a.m. Me, my two sons, and our dog had been on the
road since about 7 p.m. the previous day to get to our beach house
on Plum Island, Massachusetts. Google Maps said our exit was coming
up and that we were only about 15 minutes away from our palace.
Buoyed by that... Continue Reading
Minitab is the leading provider of software and services for quality
improvement and statistics education. More than 90% of Fortune 100 companies
use Minitab Statistical Software, our flagship product, and more students
worldwide have used Minitab to learn statistics than any other package.
Minitab Inc. is a privately owned company headquartered in State College,
Pennsylvania, with subsidiaries in the United Kingdom, France, and
Australia. Our global network of representatives serves more than 40
countries around the world.