Back to AI Flashcard MakerEducation /Psychological - W3 - Chapter 8 - Test Development - DN Part 3

Psychological - W3 - Chapter 8 - Test Development - DN Part 3

Education18 CardsCreated 25 days ago

This deck covers key concepts and definitions related to test development, including rating scales, scaling, item analysis, test construction, and more.

rating scale

a system of ordered numerical or verbal descriptors

used to make judgements about the presence, absence, or magnitude of a particular trait, attitude, emotion, or other variable

p.205, 247, 371

Tap or swipe ↕ to flip
Swipe ←→Navigate
1/18

Key Terms

Term
Definition

rating scale

a system of ordered numerical or verbal descriptors

used to make judgements about the presence, absence, or magnitude of a particular trait, ...

scaling

1) in test construction

the process of setting rules for assigning numbers in measurement

2) the process by which a measuring device

scalogram analysis

an item-analysis procedure

entails graphic mapping of a testtaker's responses

p.250

scoring drift

a discrepancy between the scoring in an anchor protocol and the scoring of another protocol

p. 280

selected-response format

a form of test item

requiring testtakers to select a response

(e.g., true/false, multiple choice, and matching items)

as opposed ...

sensitivity review

a study of test items

usually during test development

items are examined for fairness to all prospective testtakers

for the prese...

Related Flashcard Decks

Study Tips

  • Press F to enter focus mode for distraction-free studying
  • Review cards regularly to improve retention
  • Try to recall the answer before flipping the card
  • Share this deck with friends to study together
TermDefinition

rating scale

a system of ordered numerical or verbal descriptors

used to make judgements about the presence, absence, or magnitude of a particular trait, attitude, emotion, or other variable

p.205, 247, 371

scaling

1) in test construction

the process of setting rules for assigning numbers in measurement

2) the process by which a measuring device

is designed and calibrated &

the way numbers (or other indices) are assigned to different amounts of a trait, attribute, or characteristic being measured

p.244-251

scalogram analysis

an item-analysis procedure

entails graphic mapping of a testtaker's responses

p.250

scoring drift

a discrepancy between the scoring in an anchor protocol and the scoring of another protocol

p. 280

selected-response format

a form of test item

requiring testtakers to select a response

(e.g., true/false, multiple choice, and matching items)

as opposed to creating one - contrast with constructed-response format p.252

sensitivity review

a study of test items

usually during test development

items are examined for fairness to all prospective testtakers

for the presence of offensive language, stereotypes, or situations

p.274

short-answer item

may also be referred to as a completion item

a word, term, sentence or a paragraph may qualify

anything beyond this is an essay item

p.254

summative scale

an index derived from the summing of selected scores on a test or sub-test

p. 247

test conceptualization

an early stage of the test development process

when an idea for a particular test or test revision is conceived

p.240, 241-244

test construction

a stage in the process of test development

entails writing test items (or rewriting/revising existing items)

as well as formatting items, setting scoring rules, and otherwise designing and building a test

p.240

test development

an umbrella term for all that goes into the process of creating a test

p. 240-284

test revision

action taken to modify a test's content or format

for the purpose of improving the test's effectiveness as a tool of measurement

p.240

test tryout

a stage in the process of test development that entails administering a preliminary version of a test to a representative sample of testtakers

under conditions that simulate the conditions under which the final version of the test will be administered

p.240, 261-262

"think aloud" test administration

a method of qualitative item analysis

examinees verbalize their thoughts as they take the test

useful in understanding how

individual items function in a test

testtakers interpret or misinterpret the meaning of the individual items

p.274

true-false item

a binary-choice item

i.e., contains only one of two responses

requires testtaker to indicate whether a statement is or is not a fact

p.254

validity shrinkage

the decrease in item validities that inevitably occurs after cross-validation

p. 278

What is the optimal item difficulty?

usually midpoint between 1.0 and the probability of answering correctly by guessing

which is called the chance success proportion

multi choice (50% chance of getting it right by guessing) - .5 +1.00 = 1.5 divided by 2 = .60 10:00

p.263

How can you create a visual representation of the best items on a test

(i.e., if the objective is to maximise criterion-related validity)?

this can be achieved by plotting each item's

item-validity index and

item-reliability index

p.265

Fig 8-5