rating scale
a system of ordered numerical or verbal descriptors
used to make judgements about the presence, absence, or magnitude of a particular trait, attitude, emotion, or other variable
p.205, 247, 371
Key Terms
rating scale
a system of ordered numerical or verbal descriptors
used to make judgements about the presence, absence, or magnitude of a particular trait, ...
scaling
1) in test construction
the process of setting rules for assigning numbers in measurement
2) the process by which a measuring device
scalogram analysis
an item-analysis procedure
entails graphic mapping of a testtaker's responses
p.250
scoring drift
a discrepancy between the scoring in an anchor protocol and the scoring of another protocol
p. 280
selected-response format
a form of test item
requiring testtakers to select a response
(e.g., true/false, multiple choice, and matching items)
as opposed ...
sensitivity review
a study of test items
usually during test development
items are examined for fairness to all prospective testtakers
for the prese...
Related Flashcard Decks
Study Tips
- Press F to enter focus mode for distraction-free studying
- Review cards regularly to improve retention
- Try to recall the answer before flipping the card
- Share this deck with friends to study together
| Term | Definition |
|---|---|
rating scale | a system of ordered numerical or verbal descriptors used to make judgements about the presence, absence, or magnitude of a particular trait, attitude, emotion, or other variable p.205, 247, 371 |
scaling | 1) in test construction the process of setting rules for assigning numbers in measurement 2) the process by which a measuring device is designed and calibrated & the way numbers (or other indices) are assigned to different amounts of a trait, attribute, or characteristic being measured p.244-251 |
scalogram analysis | an item-analysis procedure entails graphic mapping of a testtaker's responses p.250 |
scoring drift | a discrepancy between the scoring in an anchor protocol and the scoring of another protocol p. 280 |
selected-response format | a form of test item requiring testtakers to select a response (e.g., true/false, multiple choice, and matching items) as opposed to creating one - contrast with constructed-response format p.252 |
sensitivity review | a study of test items usually during test development items are examined for fairness to all prospective testtakers for the presence of offensive language, stereotypes, or situations p.274 |
short-answer item | may also be referred to as a completion item a word, term, sentence or a paragraph may qualify anything beyond this is an essay item p.254 |
summative scale | an index derived from the summing of selected scores on a test or sub-test p. 247 |
test conceptualization | an early stage of the test development process when an idea for a particular test or test revision is conceived p.240, 241-244 |
test construction | a stage in the process of test development entails writing test items (or rewriting/revising existing items) as well as formatting items, setting scoring rules, and otherwise designing and building a test p.240 |
test development | an umbrella term for all that goes into the process of creating a test p. 240-284 |
test revision | action taken to modify a test's content or format for the purpose of improving the test's effectiveness as a tool of measurement p.240 |
test tryout | a stage in the process of test development that entails administering a preliminary version of a test to a representative sample of testtakers under conditions that simulate the conditions under which the final version of the test will be administered p.240, 261-262 |
"think aloud" test administration | a method of qualitative item analysis examinees verbalize their thoughts as they take the test useful in understanding how individual items function in a test testtakers interpret or misinterpret the meaning of the individual items p.274 |
true-false item | a binary-choice item i.e., contains only one of two responses requires testtaker to indicate whether a statement is or is not a fact p.254 |
validity shrinkage | the decrease in item validities that inevitably occurs after cross-validation p. 278 |
What is the optimal item difficulty? | usually midpoint between 1.0 and the probability of answering correctly by guessing which is called the chance success proportion multi choice (50% chance of getting it right by guessing) - .5 +1.00 = 1.5 divided by 2 = .60 10:00 p.263 |
How can you create a visual representation of the best items on a test (i.e., if the objective is to maximise criterion-related validity)? | this can be achieved by plotting each item's item-validity index and item-reliability index p.265 Fig 8-5 |