Chapter 06 – Training Evaluation
© 2013 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in
any manner. This document may not be copied, scanned, duplicated, forwarded, distributed, or posted on a website, in whole or part.
Threats to Validity: Alternative Explanations for Evaluation Results.
A. Threats to validity refer to factors that will lead an evaluator to question either (1) the
believability of the study results or (2) the extent to which the evaluation results are
generalizable to other groups of trainees and situations.
B. Internal validity is the believability of the study. An evaluation study needs internal
validity to provide confidence that the results of the evaluation are due to the training
program and not to another factor.
C. External validity refers to the generalizability of the evaluation results to other groups
and other situations.
D. Methods to control for threats to validity:
1. Use pre-tests and post-tests to determine the extent to which trainees’ knowledge,
skills or behaviors have changed from pretraining to post-training measures. The
pretraining measure essentially establishes a baseline.
2. Use a comparison (or control) group (i.e., a group that participates in the evaluation
study, but does not receive the training) to rule out factors other than training as the
cause of changes in the trainees. The group that does receive the training is referred to
as the training group or treatment group. Often employees in an evaluation will
perform higher just because of the attention they are receiving. This is known as the
Hawthorne effect.
3. Random assignment refers to assigning employees to the control and training groups
on the basis of chance. Randomization helps to ensure that members of the control
group and training group are of similar makeup prior to the training. It can be
impractical and/or even impossible to employ in company settings.
Types of Evaluation Designs
Types of evaluation designs vary as to whether they include a pretest and posttest, a control
or comparison group and randomization.
A. The posttest only design involves collecting only posttraining outcome measures. It
would be strengthened by the use of a control group, which would help to rule out
alternative explanations for changes in performance.
B. The pretest/posttest design involves collecting both pretraining and posttraining outcome
measures to determine whether a change has occurred, but without a control group which
helps to rule out alternative explanations for any change that does occur.
C. The pretest/posttest with comparison group design includes pretraining and posttraining
outcome measurements as well as a comparison group in addition to the group that
receives training. If the posttraining improvement is greater for the group that receives
training, as we would expect, this provides evidence that training was responsible for the
change.
D. The time series design involves collecting outcome measurements at periodic intervals
pre- and posttraining. A comparison group may also be used. The strength of this design
can be improved by using reversal, which refers to a time period in which participants no
longer receive the training intervention. Its advantage are: it allows an analysis of the
stability of training outcomes over time, and using both the reversal and comparison
group helps to rule out alternative explanations for the evaluation results.
E. The Solomon Four-Group design combines the pretest/posttest comparison group design
and the posttest-only control group design. It involves the use of four groups: a training