Sample Test Specification

The document describes the development of an achievement test to measure student progress after a 3-month English reading course. It details the test specifications, including content, structure, timing, scoring procedures, and trials conducted during development.

Stages of test development

• a description of the test, giving details of sections, timings, etc. (which may include a version of the specifications);
• sample items (or a complete sample test);
• advice on preparing for taking the test;
• an explanation of how test scores are to be interpreted;
• training materials (for interviewers, raters, etc.);
• details of test administration.

The handbooks should be made available in print and/or online.

10. Training staff


Using the handbook and other materials, all staff who will be
involved in the test process should be trained. This may include
interviewers, raters, scorers, computer operators and invigilators
(proctors).

11. Test maintenance


If a test is to be used repeatedly over time, statistical and qualitative
analysis should be carried out regularly in order to identify any
problems that may have crept in. At some point, alternative versions
are likely to become necessary, as word spreads of the original
test’s content. In this case, the development process will have to be
repeated, beginning with the writing of items (assuming there is no
perceived need to change the speci cations).

Two examples of test development follow.

EXAMPLE OF TEST DEVELOPMENT 1: AN ACHIEVEMENT TEST


Statement of the problem
There is a need for an achievement test to be administered at the end of a
pre-sessional course of training in the reading of academic texts in the social
sciences and business studies (the students are graduates who are about
to follow postgraduate courses in English-medium universities). The teaching
institution concerned (as well as the sponsors of the students) wants to know
just what progress is being made during the three-month course. The test must
therefore be sufficiently sensitive to measure gain over that relatively short
period. While there is no call for diagnostic information on individuals, it would
be useful to know, for groups, where the greatest difficulties remain at the end
of the course, so that future courses may give more attention to these areas.
Backwash is considered important; the test should encourage the practice of
the reading skills that the students will need in their university studies. This is,
in fact, intended to be only one of a battery of tests, and a maximum of two
hours can be allowed for it. It will not be possible at the outset to write separate
tests for different subject areas (social sciences and business studies).

https://doi.org/10.1017/9781009024723.007 Published online by Cambridge University Press

Specifications

Content
Operations These are based on the stated objectives of the course, and
include expeditious and slower, careful reading.

Expeditious reading: Skim for main ideas; search read for information; scan to
find specific items in lists, indexes, etc.

Slower, careful reading: Construe the meaning of complex, closely argued
passages.

Underlying skills that are given particular attention in the course:


• Guessing the meaning of unfamiliar words from context;
• Identifying referents of pronouns, etc., often some distance removed in the text.

Types of text The texts should be authentic, academic (taken from textbooks
and journal articles).

Addressees Academics at postgraduate level and beyond.

Lengths of texts Expeditious: c. 3,000 words. Careful: c. 800 words.

Topics The subject areas will have to be as ‘neutral’ as possible, since the
students are from a variety of social science and business disciplines
(economics, sociology, management etc.).

Readability Not specified.

Structural range Unlimited.

Vocabulary range General academic, not specialist technical.

Dialect and style Standard American or British English dialect. Formal, academic
style.

Speed of processing Expeditious: 300 words per minute (not reading all words).
Careful: 100 words per minute.

Structure, timing, medium and techniques


Test structure Two sections: expeditious reading; careful reading.

Number of items 30 expeditious; 20 careful. Total: 50 items.

Number of passages 3 expeditious; 2 careful.

Timing Expeditious: 15 minutes per passage (each passage collected after 15
minutes).
Careful: 30 minutes (passage only handed out after 45 minutes, when
expeditious reading has been completed).
TOTAL: 75 minutes.
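The timing figures above can be checked arithmetically. A minimal sketch (the passage counts, section times, text lengths and reading speeds are all taken from the specifications; the snippet itself is purely illustrative):

```python
# Totals implied by the specifications: 3 expeditious passages at 15 minutes
# each, plus 30 minutes of careful reading.
EXPEDITIOUS_PASSAGES = 3
MINUTES_PER_EXPEDITIOUS = 15
CAREFUL_MINUTES = 30

total_minutes = EXPEDITIOUS_PASSAGES * MINUTES_PER_EXPEDITIOUS + CAREFUL_MINUTES

# Reading time implied by the stated speeds: c. 3,000 words at 300 wpm for
# expeditious reading, c. 800 words at 100 wpm for careful reading. The
# difference between these and the allotted times is what remains for
# answering the items.
expeditious_reading_minutes = 3000 / 300
careful_reading_minutes = 800 / 100
```

This confirms the 75-minute total and shows that each expeditious passage leaves roughly 5 minutes for item-answering.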

Medium Paper-and-pencil. Each passage in a separate booklet.

Techniques Short answer and gap-filling for both sections.

Examples:

a) For inferring meaning from context:


For each of the following, find a single word in the text with an
equivalent meaning. Note: the word in the text may have an ending
such as -ing, -s, etc.
highest point (lines 20–35)

b) For identifying referents:


What does each of the following refer to in the text? Be very precise.
the former (line 43)

Criterial levels of performance


Satisfactory performance is represented by 80 percent accuracy in each of the
two sections.
The number of students reaching this level will be the number who have
succeeded in terms of the course’s objectives.
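The 80 per cent criterial level translates directly into raw-score cut-offs on the item counts given above. A minimal sketch (item numbers are from the specifications; the rounding-up choice is an assumption, as the text does not say how fractional cut-offs are handled):

```python
import math

# Items per section, from the test structure above.
ITEMS = {"expeditious": 30, "careful": 20}
CRITERIAL_LEVEL = 0.80

def passes(section_scores):
    """True if the candidate reaches 80% accuracy in *each* section.

    section_scores: dict mapping section name to raw score.
    """
    return all(
        section_scores[section] >= math.ceil(CRITERIAL_LEVEL * n_items)
        for section, n_items in ITEMS.items()
    )
```

With these figures the cut-offs come out at 24/30 for expeditious reading and 16/20 for careful reading; a candidate must reach both, not merely the combined total.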

Scoring procedures
There will be independent double scoring. Scorers will be trained to ignore
irrelevant (for example, grammatical) inaccuracy in responses.
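Independent double scoring can be operationalised very simply: both scorers mark every response, and any disagreements are flagged for adjudication rather than silently resolved. A sketch (the flag-and-adjudicate policy is an assumption; the text specifies only that scoring is double and independent):

```python
def flag_disagreements(scorer_a_marks, scorer_b_marks):
    """Return the item indices where two independent scorers disagree.

    Each argument is a list of 0/1 marks, one per item, in item order.
    Flagged items would then go to a third scorer or adjudicator.
    """
    return [
        i for i, (a, b) in enumerate(zip(scorer_a_marks, scorer_b_marks))
        if a != b
    ]
```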

Sampling
Texts will be chosen from as wide a range of topics and types of writing as
is compatible with the speci cations. Draft items will only be written after the
suitability of the texts has been agreed.

Item writing and moderation


Items will be based on a consideration of what a competent non-specialist
reader should be able to obtain from the texts. Considerable time will be set
aside for moderation and rewriting of items.

Informal trialling
This will be carried out on 20 expert speaker postgraduate students in the
university.

Trialling and analysis


Trialling of texts and items sufficient for at least two versions will be carried out
with students currently taking the course, with full qualitative and statistical
analysis. An overall reliability coefficient of 0.90 and a percent agreement (see
Chapter 5) of 0.85 are required.
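The two statistics named above can be sketched as follows. Note the assumptions: the text does not say which reliability coefficient is intended, so Cronbach's alpha is used here as one common internal-consistency choice, and percent agreement is computed between two pass/fail classifications of the same candidates:

```python
def cronbach_alpha(item_scores):
    """Cronbach's alpha for dichotomously scored items.

    item_scores: list of candidates, each a list of 0/1 item scores.
    """
    n_items = len(item_scores[0])
    totals = [sum(candidate) for candidate in item_scores]

    def variance(xs):
        mean = sum(xs) / len(xs)
        return sum((x - mean) ** 2 for x in xs) / len(xs)

    item_variances = [
        variance([candidate[i] for candidate in item_scores])
        for i in range(n_items)
    ]
    return (n_items / (n_items - 1)) * (
        1 - sum(item_variances) / variance(totals)
    )

def percent_agreement(decisions_a, decisions_b):
    """Proportion of candidates classified the same way on two occasions."""
    agreements = sum(a == b for a, b in zip(decisions_a, decisions_b))
    return agreements / len(decisions_a)
```

Under the specification, alpha must reach 0.90 and the pass/fail agreement 0.85 before the test is accepted.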

Validation
There will be immediate content validation carried out by staff experienced in
teaching and testing.
Concurrent validation will be against tutors’ ratings of the students.
Predictive validation will be against subject supervisors’ ratings one month after
the students begin their postgraduate studies.

Handbooks
One handbook will be written for the students, their sponsors, and their future
supervisors.
Another handbook will be written for internal use.

EXAMPLE OF TEST DEVELOPMENT 2: A PLACEMENT TEST


Statement of the problem
A commercial English language teaching organisation (which has a number
of schools) needs a placement test. Its purpose will be to assign new
students to classes at five levels: false beginners; lower intermediate; middle
intermediate; upper intermediate; advanced. Course objectives at all levels
are expressed in rather general ‘communicative’ terms, with no one skill being
given greater attention than any other. As well as information on overall ability
in the language, some indication of oral ability would be useful. Sufficient
accuracy is required for there to be little need for changes of class once
teaching is under way. Backwash is not a serious consideration. More than two
thousand new students enrol within a matter of days. The test must be brief
(not more than 45 minutes in length), quick and easy to administer, score and
interpret. Scoring by clerical staff should be possible. The organisation has
previously conducted interviews but the number of students now entering the
school is making this impossible.

Specifications

Content
Operations Ability to predict missing words (based on the notion of ‘reduced
redundancy’5).

Length of text One turn (of a maximum of about 20 words) per person.

Types of text Constructed ‘spoken’ exchanges involving two people. It is hoped


that the spoken nature of the texts will, however indirectly, draw on students’
oral abilities.

5.
See Chapter 14 for a discussion of reduced redundancy.

Topics ‘Everyday’. Those found in the textbooks used by the organisation.



Structural range All those found in the textbooks (listed in the specifications but
omitted here to save space).

Vocabulary range As found in the textbooks, plus any other common lexis.

Dialect and style Standard English English. Mostly informal style, some formal.

Structure, timing, medium and techniques


Test structure No separate sections.

Number of items 100 (though this will be reduced if the test is shown to do its
job well with fewer items).

Timing 30 minutes (Note: this seems very little time, but the more advanced
students will find the early passages extremely easy, and will take very little time. It
does not matter whether lower-level students reach the later passages.)

Medium Pencil-and-paper.

Technique All items will be gap-filling. One word per gap. Contractions count as
one word. Gaps will relate to vocabulary as well as structure (not always possible
to distinguish what is being tested).

Examples: A: Whose book ________ that?
B: It's mine.
A: How did you learn French?
B: I just picked it ________ as I went along.

Criterial levels of performance


These will only be decided when comparison is made between performance
on the test and (a) the current assignment of students by the interview and
(b) the teachers’ view of each student’s suitability to the class they have been
assigned to by the interview.

Scoring procedures
Responses will be on a separate response sheet. A template with a key will be
constructed so that scoring can be done rapidly by clerical staff.
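Template scoring by clerical staff amounts to matching each response sheet against a fixed key. A sketch (the key entries here are invented for illustration, not taken from the actual test; one word per gap and contractions counting as one word follow the specifications):

```python
# Hypothetical key: acceptable answers per gap number. A real key would be
# built from the moderated items; multiple acceptable answers per gap are
# allowed for.
KEY = {1: {"is"}, 2: {"up"}}

def score_sheet(responses):
    """Score one response sheet against the key.

    responses: dict mapping gap number to the single word the candidate
    wrote. Matching ignores case and surrounding whitespace, so clerical
    scorers need no linguistic judgement.
    """
    return sum(
        1 for gap, acceptable in KEY.items()
        if responses.get(gap, "").strip().lower() in acceptable
    )
```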

Informal trialling
This will be carried out on 20 first-year expert speaker undergraduate students.

Trialling and analysis


Many more items will be constructed than will finally be used. All of them (in as
many as three different test forms, with linking anchor items) will be trialled on
current students at all levels in the organisation. Problems in administration and
scoring will be noted.
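The anchor items are what allow scores from the different trial forms to be placed on a common scale. The text names anchor items but not the linking method, so the mean-sigma linear equating sketched here is an assumption, chosen as one of the simplest common-item methods:

```python
def mean_sigma_link(anchor_mean_a, anchor_sd_a, anchor_mean_b, anchor_sd_b):
    """Return a function mapping form-B scores onto the form-A scale.

    The arguments are each group's mean and standard deviation on the
    shared anchor items; differences on the anchors are taken to reflect
    group ability, and the linear transformation removes them.
    """
    slope = anchor_sd_a / anchor_sd_b
    intercept = anchor_mean_a - slope * anchor_mean_b
    return lambda score_b: slope * score_b + intercept
```

For example, if group A averages 50 (sd 10) on the anchors and group B averages 40 (sd 8), a form-B score of 48 maps to 60 on the form-A scale.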

After statistical and qualitative analysis, one test form made up of the ‘best’ items
will be constructed and trialled on a different set of current students. The total
score for each of the students will then be compared with his or her level in the
institution, and decisions as to criterial levels of performance made.
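Comparing total scores with institutional levels yields one cut score per boundary between adjacent classes. The decision rule is not given in the text; placing each boundary midway between the mean scores of adjacent levels, as sketched here, is one simple assumed choice:

```python
def cut_scores(mean_score_by_level):
    """Derive cut scores from current students' mean test scores.

    mean_score_by_level: mean score of the students at each of the five
    levels, ordered from lowest class to highest. Each boundary is placed
    midway between adjacent means.
    """
    return [
        (lower + upper) / 2
        for lower, upper in zip(mean_score_by_level, mean_score_by_level[1:])
    ]
```

Five levels therefore give four cut scores; a new student's total is simply compared against these to assign a class.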

Validation
The final version of the test will be checked against the list of structures in the
specifications. If one is honest, however, one must say that at this stage content
validity will be only a matter of academic interest. What will matter is whether the
test does the job it is intended for. Thus the most important form of validation will be
criterion-related, the criterion being placement of students in appropriate classes,
as judged by their teachers (and possibly by the students themselves). The smaller
the proportion of misplacements, the more valid the test.
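The validity index described above is just the proportion of students whose teachers judge them to be in the wrong class, sketched directly:

```python
def misplacement_rate(assigned_classes, judged_classes):
    """Proportion of students judged misplaced.

    assigned_classes: the class each student was placed in by the test.
    judged_classes: the class their teacher thinks they belong in.
    The lower the rate, the more valid the placement test.
    """
    misplaced = sum(a != j for a, j in zip(assigned_classes, judged_classes))
    return misplaced / len(assigned_classes)
```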

Handbook
A handbook will be written for distribution by the organisation to its various schools.

READER ACTIVITIES
On the basis of experience or intuition, try to write a specification for a
test designed to measure the level of language proficiency of students
applying to study an academic subject in the medium of a foreign
language at an overseas university. Compare your specification with those
of tests that have actually been constructed for that purpose.

FURTHER READING

Test development process


O’Sullivan (2012b) presents an outline of the test development process.
Davidson and Fulcher (2012) offer advice on the development of test
specifications. Specifications for a test designed to assess the level
of English of students wishing to study at tertiary level in the UK, the
Test of English for Educational Purposes (TEEP), are to be found in Weir
(1988, 1990).
For other models of test development see Alderson et al. (1995) and
Bachman and Palmer (1996). The model used by Bachman and Palmer is
highly detailed and complex but their book gives information on ten test
development projects.
Alderson and Buck (1993) report on the test development procedures of
certain British testing bodies.

Common European Framework


Language Testing 22, 3 (2005) includes a number of articles about the
use of the Common European Framework (see Online resources, below) in
language testing.
