Research
Error
Every
statistic contains both a true score and an error
score.
A true score is the part of the statistic or
number that truly represents what was being
measured.
An error score is that part of the statistic
or number that represents something other than what
is being measured.
Imagine standing on your bathroom scale and
weighing 140 pounds then standing on your doctor’s
scale an hour later and weighing 142.
Is it likely you gain 2 pounds on the way to
the doctor’s office?
The difference between the two numbers has
much more to do with error than it does weight gain,
especially in that short of a time span.
When a scale, or any measuring device,
provides a score, this score is actually only an
estimate of what your true score really is.
When your bathroom scale reads 140 pounds, it
should be interpreted as an estimate of your true
weight which may actually be 141.
If this is the case, then your score (weight,
in this case) of 140 represents 141 pounds of true
weight and one pound of error.
Confidence
Level.
When we use statistics to summarize any phenomenon,
we are always concerned with how much of that
statistic represents the true score and how much is
error. Imagine
a person scores a 100 on a standardized IQ test.
Is his true IQ really 100 or could this score
be off some due to an unknown level of error?
Chances are that there is error associated
with his score and therefore we must use this score
of 100 as an estimate of his true IQ.
When using an achieved score to estimate a
true score, we must determine how much error is
associated with it.
Methods to estimate a true score are called
estimators, and fall into three main groups: Point
Estimation; Interval Estimation; and Confidence
Interval Estimation.
Point
Estimation.
In point estimation, the value of a sample
statistic or achieved score is used as a best guess
or quick estimate of the population statistic or
true score. In
other words, if a sample of students average 78 on a
final examination, you could estimate that all
students would average 78 on the same test.
The major concern of point estimation is the
lack of concern for error; the achieved score is
assumed to be the true score.
Interval
Estimation.
Interval estimation goes a step further and
assumes that some level of error has occurred in the
achieved score, which is almost always the case.
If the sample students achieve an average of
78, we could estimate the amount of error and then
provide an estimate of the true score based on an
interval rather than a single point.
There are different methods to determine
error but perhaps the most commonly used is called
the standard error of the mean.
Using
a simple statistical formula, the amount of error is
determined and the true score is said to be the
achieved score plus or minus the standard error of
the mean. For
instance, if the students average 78 on their exam
and the standard error of the mean is determined to
be 3 points, the students’ true average would be
estimated as 78 +/- 3 or between 75 and 81.
Confidence
Interval Estimation.
The confidence interval estimation uses the
same method as the interval estimation but provides
a level of confidence or certainty in the true
score. Through
more complex statistics, a specific level of
confidence in an interval can be determined.
We might say then, based on these statistics,
that we are 95% confident that the true score lies
somewhere between 78 and 81.
The more confident we are, the larger the
interval.
Imagine
this exam has a possibility of 100 points.
We would be 100% sure than a student will
score somewhere between 0 and 100.
In fact, we are always 100% confident that a
true score falls somewhere between the minimum
possible score and the maximum possible score.
Narrowing the true score down, however,
reduces our level of confidence. We might only be 98% sure that the true score falls somewhere
between 70 and 90, and only 95% confident that the
true score falls somewhere between 75 and 81.
A
good way to look at confidence interval estimation
is to consider the role of a six-sided dice.
How confident would you be that rolling the
die once would result in a number between one and
six? You
should be 100% confident because those are the only
possible scores.
How sure would you be that the role would net
an even number or an odd number?
Since half of the numbers are even and half
are odd, you would be 50% confident that one of
these two possibilities would occur.
Now, what about rolling only a one?
Since there are six possible scores and you
are estimating the roll to net only one of those
six, you should see the odds as 1:6.
Therefore you would be about 17% confident
that the next roll would result in a score of one.
The more we pinpoint the score, the less
confident we are in our prediction.
|