Unit Four: Basic Principles of Test Construction: Measurement and Evaluation in Education (PDE 105)
Unit Four: Basic Principles of Test Construction: Measurement and Evaluation in Education (PDE 105)
Unit Four: Basic Principles of Test Construction: Measurement and Evaluation in Education (PDE 105)
INTRODUCTION
This unit is a very important one. You need to know how to construct different kinds of tests.
Indeed, tests are not just designed casually or in a haphazard manner. There are rules and
regulations guiding this activity. The unit gives you basic principles to follow when
constructing tests. Before you study this unit, quickly revise the previous unit on
characteristics of good tests.
OBJECTIVES
By the end of this unit, you should be able to:
a. recognize how different types of tests are being constructed;
b. determine the basic principles to follow in constructing tests; and
c. apply these principles in the practical construction of tests.
61
Measurement and Evaluation in Education (PDE 105)
Defining Objectives
As a competent teacher, you should be able to develop instructional objectives that are
behavioural, precise, realistic and at an appropriate level of generality that will serve as a
useful guide to teaching and evaluation.
This job has been made easier as these are already stated in the various curriculum packages
designed by the Federal Ministry of Education, which are available in schools.
However, when you write your behavioural objectives, use such action verbs like define,
compare, contrast, draw, explain, describe, classify, summarize, apply, solve, express, state,
list and give. You should avoid vague and global statements involving the use of verbs such
as appreciate, understand, feel, grasp, think etc.
It is important that we state objectives in behavioural terms so as to determine the terminal
behaviour of a student after having completed a learning task. Martin Haberman (1964) says
the teacher receives the following benefits by using behavioural objectives:
1. Teacher and students get clear purposes.
2. Broad content is broken down to manageable and meaningful pieces.
3. Organizing content into sequences and hierarchies is facilitated.
4. Evaluation is simplified and becomes self-evident.
5. Selecting of materials is clarified (The result of knowing precisely what youngsters
are to do leads to control in the selection of materials, equipment and the management
of resources generally).
asking pupils to give the date of a particular event, capital of a state or recite
multiplication tables.
Examples:
Behavioural objectives: To determine whether students are able to define technical
terms by giving their properties, relations or attributes.
Question:
Volt is a unit of
(a) weight (b) force (c) distance (d) work (e) volume
You can also use picture tests to test knowledge of classification and matching tests to
test knowledge of relationships.
(iii) Application
Here you want to test the ability of the students to use principles; rule and
generalizations in solving problems in novel situations, e.g. how would you recover
table salt from water?
(iv) Analysis
This is to analyze or break an idea into its parts and show that the student understands
their relationships.
(v) Synthesis
The student is expected to synthesize or put elements together to form a new matter
and produce a unique communication, plan or set of abstract relations.
(vi) Evaluation
The student is expected to make judgments based upon evidence.
63
Measurement and Evaluation in Education (PDE 105)
of the content and the process objectives you have been trying to achieve through your series
of lessons.
Percentages are usually assigned to the topics of the content and the process objectives such
that each dimension will add up to 100%. (see the table below).
After this, you should decide on the type of test you want to use and this will depend on the
process objective to be measured, the content and your own skill in constructing the different
types of tests.
Determination of the Total Number of Items
At this stage, you consider the time available for the test, types of test items to be used (essay
or objective) and other factors like the age, ability level of the students and the type of
process objectives to be measured.
When this decision is made, you then proceed to determine the total number of items for each
topic and process objectives as follows:
(i) To obtain the number of items per topic, you multiply the percentage of each by the
total number of items to be constructed and divide by 100. This you will record in the
column in front of each topic in the extreme right corner of the blueprint. In the table
below, 25% was assigned to soil. The total number of items is 50 hence 12 items for
the topic (25% of 50 items = 12 items).
(ii) To obtain the number of items per process objective, we also multiply the percentage
of each by the total number of items for test and divide by 100. These will be
recorded in the bottom row of the blueprint under each process objective. In the table
below:
(a) the percentage assigned to comprehension is 30% of the total number of items
which is 50. Hence, there will be 15 items for this objective (30% of 50
items).
Blue Print for Mid-Term Continuous Assessment Test (Objective Items)
Content
Process Objectives
Areas
Knowledge Comprehension Analysis Synthesis Application Evaluation Number
Recognizes Identifies facts, Break idea Put ele- Applies Judge the of items
Terms & Principles, into its ments to- knowledge worth of
vocabularies Concepts and parts gether to in new Information
Generalizations form new situation
matter
30% 30% 10% 10% 10% 10%
A Soil
4 4 1 1 1 1 12
25%
B Water
3 3 1 1 1 1 10
20%
C Weather
4 4 2 1 1 2 15
30%
D Food
4 4 1 2 2 2 13
25%
Number of
15 15 5 5 5 5 50
Items
64
Measurement and Evaluation in Education (PDE 105)
(b) To decide the number of items in each cell of the blue print, you simply
multiply the total number of items in a topic by the percentage assigned to the
process objective in each row and divide by 100. This procedure is repeated
for all the cells in the blue print. For example, to obtain the number of items
on water under knowledge, you multiply 30% by 10 and divide by 100 i.e. 3.
In summary, planning for a test involves the following basic steps:
(1) Outlining content and process objectives.
(2) Choosing what will be covered under each combination of content and process
objectives.
(3) Assigning percentage of the total test by content area and by process
objectives and getting an estimate of the total number of items.
(4) Choosing the type of item format to be used and an estimate of the number of
such items per cell of the test blue print.
ACTIVITY 1
Fill in the gaps
Test can be grouped into ------------------ major groups. These are ------------------ and---------
The ------------- test has alternatives called -----, which usually follow the -------------------- of
the question.
Crosscheck your answers by re-reading the section on the different kinds of tests to be
constructed.
65
Measurement and Evaluation in Education (PDE 105)
the judgment and personality of the marker cannot influence the correction in any way.
Indeed, many objective tests are scored by machines. This kind of test may be graded more
quickly and objectively than the subjective or the easy type.
In constructing objective tests, the following basic principles must be borne in mind.
1. The instruction of what the candidate should do must be clear, unambiguous and
precise. Do not confuse the candidates. Let them know whether they are to choose
by ticking (√ ), by circling ( o ) or shading the box in the answer sheet.
ANSWER SHEET
ACTIVITY 111
1. Study the instruction above and put on a piece of paper or in your exercise book the
characteristics of the instructions that were presented for the examination.
Crosscheck your answers with the ones below after you have attempted the activity.
As could be seen in the example just presented, instructions of a test must be:
• unambiguous
• clear
• written in short sentences
• numbered in sequence
66
Measurement and Evaluation in Education (PDE 105)
• underlined or bold-faced to show the most important part of the instruction or call
attention to areas of the instruction that must not be overlooked or forgotten.
2. The options (or alternatives) must be discriminating: some may be obviously wrong
but there must be options that are closely competing with the correct option in terms
of characteristics, related concept or component parts.
In question 1, choose the option opposite in meaning to the underlined word.
i. Albert thinks Mary is antagonistic because of her angry tone.
a. noble
b. hostile
c. friendly
d. harsh
Answer the question above. State the options that are competing with each
other. State the options that are obviously wrong.
Compare your answer with the discussion below.
If you have done Activity III very well, you will agree with me that option C
is the correct answer and that options A and B are competing. A if the
candidate remembers that the answer should be in the opposite and B, if
he/she has forgotten this fact.
ii. The correct option should not be loner or shorter than the rest, i.e. the incorrect
options. Differences in length of options may call the attention of the
candidate. The stem of an item must clearly state the problem. The options
should be brief.
iii. As much as possible, you should make alternatives difficult to guess.
Guessing reduces the validity of the test and makes undeserved candidates
pass with no academic effort. The distractions must be plausible, adequate
and attractive. They should be related to the stem.
iv. Only one option must be correct. Do not set objective tests where two or more
options are correct. You confuse a brilliant student and cause undeserved
failure.
v. The objective tests should be based on the syllabus, what is taught, or
expected to be taught. It must provoke deep reasoning, critical thinking, and
value judgments.
vi. Avoid the use of negative statements in the stem of an item. When used, you
should underline the negative word.
vii. Every item should be independent of other items.
viii. Avoid the use of phrases like “all of the above, all of these, none of these or
none of the above”
67
Measurement and Evaluation in Education (PDE 105)
ix. The reading difficulty and vocabulary level must be as simple as possible.
ACTIVITY IV
Answer the questions in this short answer test and bring out the characteristics of the test.
Fill in the gaps with the appropriate words or expressions.
1. In multiple choice tests each student has an ----------------. Candidates have no
opportunity to -------- a different --------- or special --------. But in short answer test,
candidates are allowed to write ----- by filling ------ or writing ---------- sentences.,
Go back to the relevant section to fish out answers that fill the gaps.
68
Measurement and Evaluation in Education (PDE 105)
iii. Essays should not merely require registration of facts learnt in the class. They should
not also be satisfied with only the examples given in class.
iv. Some of the words that can be used in an essay type of test are: compare and contrast,
criticize, critically examine, discuss, describe, outline, enumerate, define, state, relate,
illustrate, explain, summarize, construct, produce, design, etc. Remember, some of
the words are mere words that require regurgitation of facts, while others require
application of facts.
SUMMARY
• In this unit, you have been exposed to the basic principles for constructing multiple-
choice, short answer and essay types of tests. In all tests, instructions must be clear,
unambiguous, precise, and goal-oriented. All tests must be relevant to what is learnt
or expected to be learnt. They must meet the learning needs and demands of the
candidates. Tests should not be too easy or difficult.
ASSIGNMENT
Construct three multiple-choice, short answers and essay-tests each. Use each test
constructed to analyze the basic principles of testing.
REFERENCES
Carroll, J. B. (1983), Psychometric Theory and Language Testing. Rowley, Mass:
Newbury House.
69