Psychological - W2 - Chapter 5 - Reliability (DN) Part 3
This deck covers key concepts related to psychometric testing, including reliability, error types, and statistical measures used in test analysis.
restriction of range/variance
SAMPLING PROCEDURES may impact the variance of either variable in a correlation analysis
OUTCOME
if variance of EITHER variable is RESTRICTED by sampling procedure used, then tends to be a LOWER CORRELATION COEFFICIENT (i.e., masking true correlation)
(thought to self - is this also a validity issue e.g., failing to detect - a miss!!!)
conversely referred to as INFLATION OF RANGE/VARIANCE
if variance of EITHER variable is INFLATED by sampling procedure then the resulting CC tends to be HIGHER (i.e., giving a false indicator of correlation
(thought to self - is this also a validity issue e.g., false positive)
p.162
Key Terms
restriction of range/variance
SAMPLING PROCEDURES may impact the variance of either variable in a correlation analysis
OUTCOME
if variance of EITHER variable is REST...
Spearman-Brown formula
allows a test developer/user to estimate the INTERNAL consistency reliability from a correlation of TWO HALVES of a test that has been LENGTHENED o...
speed test
a test, usually of achievement or ability which has a TIME LIMIT
usually contains ITEMS of UNIFORM difficulty (usually uniformly low)
s...
split-half reliability
an ESTIMATE of the INTERNAL CONSISTENCY of a test - obtained by CORRELATING two PAIRS of SCORES taken from EQUIVALENT HALVES of a SINGLE TEST admin...
standard error of a score
in TRUE SCORE THEORY
a STATISTIC designed to ESTIMATE how far an OBSERVED SCORE DEVIATES from a TRUE SCORE
(also called standard error ...
standard error of measurement (SEM)
in TRUE SCORE THEORY
a STATISTIC designed to ESTIMATE how far an OBSERVED SCORE DEVIATES from a TRUE SCORE
(also called STANDARD ERROR ...
Related Flashcard Decks
Study Tips
- Press F to enter focus mode for distraction-free studying
- Review cards regularly to improve retention
- Try to recall the answer before flipping the card
- Share this deck with friends to study together
| Term | Definition |
|---|---|
restriction of range/variance | SAMPLING PROCEDURES may impact the variance of either variable in a correlation analysis OUTCOME if variance of EITHER variable is RESTRICTED by sampling procedure used, then tends to be a LOWER CORRELATION COEFFICIENT (i.e., masking true correlation) (thought to self - is this also a validity issue e.g., failing to detect - a miss!!!) conversely referred to as INFLATION OF RANGE/VARIANCE if variance of EITHER variable is INFLATED by sampling procedure then the resulting CC tends to be HIGHER (i.e., giving a false indicator of correlation (thought to self - is this also a validity issue e.g., false positive) p.162 |
Spearman-Brown formula | allows a test developer/user to estimate the INTERNAL consistency reliability from a correlation of TWO HALVES of a test that has been LENGTHENED or SHORTENED. inappropriate for use with HETEROGENEOUS tests or SPEED tests p. 153-154 |
speed test | a test, usually of achievement or ability which has a TIME LIMIT usually contains ITEMS of UNIFORM difficulty (usually uniformly low) so that when given GENEROUS TIME ALL TESTTAKERS should be able to complete ALL ITEMS CORRECTLY (so its isolating the SPEED variable) (contrast with 'power test') p.163, 272 |
split-half reliability | an ESTIMATE of the INTERNAL CONSISTENCY of a test - obtained by CORRELATING two PAIRS of SCORES taken from EQUIVALENT HALVES of a SINGLE TEST administered ONCE - p.152- 154 |
standard error of a score | in TRUE SCORE THEORY a STATISTIC designed to ESTIMATE how far an OBSERVED SCORE DEVIATES from a TRUE SCORE (also called standard error of measurement (SEM) p.175 |
standard error of measurement (SEM) | in TRUE SCORE THEORY a STATISTIC designed to ESTIMATE how far an OBSERVED SCORE DEVIATES from a TRUE SCORE (also called STANDARD ERROR OF A SCORE) p.132, 175-178 |
standard error of the difference | a STATISTIC designed to aid in determining HOW LARGE a DIFFERENCE between two scores should be BEFORE it is considered STATISTICALLY SIGNIFICANT p. 132, 178 |
static characteristic | a TRAIT, STATE or ABILITY presumed to be relatively STATIC OVER TIME (contrast with dynamic characteristic) p.162 |
systematic error | a source of ERROR in the measurement process typically CONSTANT or PROPORTIONATE to what is presumed to be the TRUE VALUE of the target variable being measured once known, it is predictable & FIXABLE - relative standings remain unchanged may not be VALID but is RELIABLE - p. 146 |
test battery | typically composed of TESTS designed to measure DIFFERENT VARIABLES. quite often psychologists rely on a BATTERY of tests in the process of EVALUATION. p. 155n5, 502-504 see also specific batteries |
test-retest reliability | an estimate of reliability obtained by CORRELATING pairs of scores from the SAME PEOPLE on TWO DIFFERENT administrations of the test appropriate when EVALUATING the RELIABILITY of a test purporting to measure something relatively STABLE over TIME e.g., a personality trait p.150-151, 161 |
theta level (in IRT) | a reference to the DEGREE of the underlying ability or trait that a TESTTAKER is presumed to BRING TO the test also referred to as THETA p. 170 |
transient error | a source of error attributable to the testtaker's FEELINGS, MOODS, or MENTAL STATE OVER TIME p.160 |
true score | according to CLASSICAL TEST THEORY | - a value that GENUINELY reflects an individual's ABILITY or TRAIT level as measured by a particular test p.164 |
true variance | in the TRUE SCORE MODEL the COMPONENT of a score attributable to TRUE DIFFERENCES in the ability or trait being measured can be in an OBSERVED SCORE or a DISTRIBUTION of SCORES p.146 |