Chapter One1

Uploaded by

Hans Penda Hilundwa

CHAPTER ONE: DESCRIPTIVE STATISTICS

Definitions:

Statistics: The science of collecting, organizing, presenting, analyzing, and interpreting data to assist in
making more effective decisions.

Descriptive statistics: Methods of organizing, summarizing, and presenting data in an informative
way.

Inferential statistics: The methods used to determine something about a population on the basis of a
sample.

Summary of types of variables:

Types of variables

Qualitative, e.g.
• Brand of PC
• Marital status
• Hair color

Quantitative

Discrete, e.g.
• Children in a family
• Strokes on a golf hole
• TV sets owned

Continuous, e.g.
• Amount of income tax paid
• Weight of a student
• Yearly rainfall
• Temperature

Levels of measurements

Data can be classified according to levels of measurements. The level of measurement of the data
often dictates the calculations that can be done to summarize and present the data. It also determines
the statistical tests that should be performed.

Levels of Measurement

Nominal: Data may only be classified, e.g.
• Make of car
• Eye color

Ordinal: Data are ranked, e.g.
• Your rank in class
• Team standing in the premiership

Interval: Meaningful difference between values, e.g.
• Temperature

Ratio: Meaningful 0 point and ratio between values, e.g.
• Number of patients
• Distance to school

1.1. Frequency Distributions and Graphical Descriptive Techniques

Constructing a frequency distribution

Definition:

A frequency distribution is a grouping of data into mutually exclusive classes showing the number of
observations in each class.

Steps for organizing data into a frequency distribution:

Step 1: Decide on the number of classes (k): Use the “2 to the k” rule.
This rule suggests that you select the smallest number (k) for the number of classes such that $2^k$ is
greater than the number of observations ( n ):
$2^k > n$

Step 2: Determine the class width or class interval.

Generally the class width should be the same for all classes. The class width is determined using the
following formula:
$i \geq \frac{H - L}{k}$
where i is the class width, H is the highest observed value, L is the lowest observed value, and k
is the number of classes.

Step 3: Set the individual class limits

Avoid overlapping or unclear class limits.

Step 4: Tally the observations into classes.

Step 5: Count the number of items in each class (the class frequencies).
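
The five steps can be sketched in Python. This is a minimal illustration, not a prescribed method: the function name `frequency_distribution` is hypothetical, the data are the 26 observations from the range example in section 1.4, and two choices are assumptions made here (the class width is rounded up to a whole number, and the first class starts at the lowest observed value).

```python
import math

def frequency_distribution(data):
    """Group data into equal-width classes chosen with the 2-to-the-k rule."""
    n = len(data)
    # Step 1: smallest k such that 2**k > n
    k = 1
    while 2 ** k <= n:
        k += 1
    # Step 2: class width i >= (H - L) / k, rounded up to a whole number (assumption)
    low, high = min(data), max(data)
    width = math.ceil((high - low) / k)
    if low + k * width <= high:
        width += 1  # make sure the highest value falls inside the last class
    # Step 3: non-overlapping class limits of the form lower -< upper
    limits = [(low + j * width, low + (j + 1) * width) for j in range(k)]
    # Steps 4 and 5: tally and count the observations in each class
    counts = [sum(1 for x in data if lo <= x < hi) for lo, hi in limits]
    return limits, counts

data = [18, 26, 17, 10, 7, 27, 24, 17, 17, 23, 29, 28,
        18, 10, 23, 16, 9, 12, 26, 5, 12, 23, 22, 24, 16, 5]
limits, counts = frequency_distribution(data)
for (lo, hi), f in zip(limits, counts):
    print(f"{lo}-<{hi}: {f}")
```

With n = 26 the rule gives k = 5 (since 2^5 = 32 > 26), and (29 − 5)/5 = 4.8 rounds up to a class width of 5.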

Relative frequency distribution

Definition

A relative frequency distribution expresses each class frequency as a percentage of the total number of observations.

Graphical Descriptive Techniques

Three charts that help portray a frequency distribution graphically are the histogram, the frequency
polygon, and the cumulative frequency polygon.

HISTOGRAM

A graph in which the classes are marked on the horizontal axis and the class frequencies on the
vertical axis. The class frequencies are represented by the heights of the bars, and the bars are drawn
adjacent to each other.
Example: [figure: histogram]

FREQUENCY POLYGON

A frequency polygon consists of line segments connecting the points formed by intersections of the
class midpoints and the frequencies.

CUMULATIVE FREQUENCY DISTRIBUTIONS AND CUMULATIVE FREQUENCY POLYGON

Cumulative frequency distributions

Selling prices   Frequency   Cumulative frequency   Found by
15-18             8           8
18-21            23          31                     8+23
21-24            17          48                     8+23+17
24-27            18          66                     8+23+17+18
27-30             8          74
30-33             4          78
33-36             2          80
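
As a rough check of the table, the cumulative frequencies (running totals) and the relative frequencies (each frequency as a percentage of the total) can be computed in a few lines of Python. The variable names are illustrative only.

```python
from itertools import accumulate

classes = ["15-18", "18-21", "21-24", "24-27", "27-30", "30-33", "33-36"]
frequencies = [8, 23, 17, 18, 8, 4, 2]

total = sum(frequencies)                                # total number of observations
cumulative = list(accumulate(frequencies))              # running totals
relative_pct = [100 * f / total for f in frequencies]   # relative frequency in %

for c, f, cf, r in zip(classes, frequencies, cumulative, relative_pct):
    print(f"{c:>6} {f:>3} {cf:>3} {r:6.2f}%")
```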

Cumulative frequency polygon (OGIVE)

LINE GRAPHS

Line charts are particularly effective for business and economic data because they show changes or
trends in a variable over time.

Example:

Year 1992 1993 1994 1995 1996 1997 1998 1999 2000
Unemployment_rate 17.8 13.7 11 10.2 11.3 9.1 8.8 8.5 7.9

PIE CHARTS

A pie chart is especially useful for illustrating nominal level data

BAR CHART

1.2. Measures of central location

1. Arithmetic mean (AM)

The AM is the most commonly used measure of central tendency. To calculate the AM for ungrouped
data, we use
$\bar{x} = \frac{x_1 + x_2 + \dots + x_n}{n} = \frac{\sum_{i=1}^{n} x_i}{n}$
Example:
For 10 years a company declared its percentage dividends as follows:

year 1 2 3 4 5 6 7 8 9 10
Dividend(xi) 5 6 14 20 30 10 15 20 20 30

Calculate the average percentage dividend declared by the company during the 10 years.

Solution
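
One way to carry out this calculation is the short Python sketch below, which applies the ungrouped-data formula directly (the variable names are illustrative).

```python
# Percentage dividends declared over the 10 years
dividends = [5, 6, 14, 20, 30, 10, 15, 20, 20, 30]

# Arithmetic mean: sum of the observations divided by their number
mean_dividend = sum(dividends) / len(dividends)
print(mean_dividend)
```

The sum of the ten dividends is 170, so the mean is 170 / 10 = 17%.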

Calculating the AM from frequency distribution

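The AM of a discrete frequency distribution is calculated as
$\bar{x} = \frac{\sum_{i=1}^{k} f_i x_i}{\sum_{i=1}^{k} f_i}$
where $x_i$ is the ith distinct value, $f_i$ is its frequency, and k is the number of distinct values.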

Annual profit ( $x_i$ )   Number of outlets ( $f_i$ )
10                         3
15                         8
20                        23
25                        10
30                         6

Solution
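
A minimal Python sketch of the frequency-weighted mean for this table follows; the variable names are illustrative.

```python
profits = [10, 15, 20, 25, 30]   # x_i: annual profit
outlets = [3, 8, 23, 10, 6]      # f_i: number of outlets

sum_fx = sum(f * x for f, x in zip(outlets, profits))  # sum of f_i * x_i
sum_f = sum(outlets)                                   # sum of f_i
mean_profit = sum_fx / sum_f
print(sum_fx, sum_f, mean_profit)
```

Here the weighted sum is 1040 and the total frequency is 50, giving a mean annual profit of 20.8.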

Calculating the AM from Grouped frequency distribution

The AM of a grouped frequency distribution is calculated as
$\bar{x} = \frac{\sum_{i=1}^{k} f_i x_i}{\sum_{i=1}^{k} f_i}$

where $x_i$ is the class mid-point value of the ith class,
$f_i$ is the number of observations falling in the ith class, and k is the number of classes.

Example:

The following frequency distribution summarizes data on service times in minutes at the
checkout counter of a supermarket.

Time interval   Customers
2.00-<2.50       3
2.50-<3.00       8
3.00-<3.50      23
3.50-<4.00      10
4.00-<4.50       6

Calculate the estimated average time a customer takes for a checkout at the counter in this
supermarket.

Solution
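
The estimate can be sketched in Python by replacing each class with its mid-point. Two assumptions are made here: the first class is taken as 2.00-<2.50, as in the median example later in this chapter, and the last class as 4.00-<4.50. The variable names are illustrative.

```python
lower_limits = [2.00, 2.50, 3.00, 3.50, 4.00]  # lower class limits (minutes)
width = 0.5                                    # common class width
customers = [3, 8, 23, 10, 6]                  # f_i

# Class mid-points x_i stand in for the unknown individual service times
midpoints = [lo + width / 2 for lo in lower_limits]

sum_fx = sum(f * x for f, x in zip(customers, midpoints))
mean_time = sum_fx / sum(customers)
print(round(mean_time, 2))
```

The weighted sum of mid-points is 166.5 over 50 customers, so the estimated mean service time is about 3.33 minutes.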

2. Median (Mdn/Md)

The median is defined as the middle value when the data are arranged in ascending order. It divides
the data set into two equal parts.

Calculating the median for ungrouped data set

Steps:

1. Arrange the data set in ascending order.

2. If the number of observations ( n ) in the data set is odd, then the median position is given by
$\frac{n+1}{2}$

3. If the number of observations ( n ) in the data set is even, then the median is given by the
average of the values in positions $\frac{n}{2}$ and $\frac{n}{2} + 1$
Example:
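
As an illustration, the steps can be applied to the ten dividend values from the arithmetic mean example. The function below is a sketch with a hypothetical name; positions in the steps are 1-based, while Python lists are 0-based.

```python
def median_ungrouped(values):
    """Median of ungrouped data following the three steps above."""
    ordered = sorted(values)              # step 1: ascending order
    n = len(ordered)
    if n % 2 == 1:                        # step 2: odd n, position (n + 1) / 2
        return ordered[(n + 1) // 2 - 1]
    # step 3: even n, average of the values in positions n/2 and n/2 + 1
    return (ordered[n // 2 - 1] + ordered[n // 2]) / 2

dividends = [5, 6, 14, 20, 30, 10, 15, 20, 20, 30]
print(median_ungrouped(dividends))
```

Sorted, the data are 5, 6, 10, 14, 15, 20, 20, 20, 30, 30; with n = 10 the median is the average of the 5th and 6th values, (15 + 20) / 2 = 17.5.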

Calculating the median for grouped data set:

Calculating the median for discrete frequency distribution

Steps:

1. Construct the less than cumulative frequency distribution

2. Calculate $\frac{n}{2}$, where n is the total cumulative frequency

3. Find the cumulative frequency equal to or just greater than the value of $\frac{n}{2}$ calculated in step 2

4. The value whose cumulative frequency was found in step 3 is the median for the data set.

Example:

In a survey of 50 retail outlets, the following data were collected.

Annual profit   Number of outlets
10               3
15               8
20              23
25              10
30               6
Calculate the median for the annual profit.
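
A short Python sketch of the four steps for this table is given below; the variable names are illustrative.

```python
from itertools import accumulate

profits = [10, 15, 20, 25, 30]
outlets = [3, 8, 23, 10, 6]

cumulative = list(accumulate(outlets))   # step 1: less than cumulative frequencies
half = cumulative[-1] / 2                # step 2: n / 2
# steps 3 and 4: first value whose cumulative frequency is >= n / 2
median_profit = next(x for x, cf in zip(profits, cumulative) if cf >= half)
print(cumulative, half, median_profit)
```

The cumulative frequencies are 3, 11, 34, 44, 50 and n/2 = 25, so the first cumulative frequency at or above 25 is 34 and the median annual profit is 20.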

Calculating the median for grouped frequency distribution with equal intervals

Steps:

1. Construct the less than cumulative frequency distribution

2. Calculate $\frac{n}{2}$, where n is the total cumulative frequency

3. Find the cumulative frequency equal to or just greater than the value of $\frac{n}{2}$ calculated in
step 2
4. The median class is the class whose cumulative frequency was found in step 3.
5. Calculate the median using the following formula:

$M_d = l_{M_d} + \frac{h}{f}\left(\frac{n}{2} - F\right)$

Where,

$M_d$ is the median of the data set

$l_{M_d}$ is the lower class limit of the median class

h is the width of the median class

f is the frequency of the median class

n is the total cumulative frequency

F is the cumulative frequency of the class immediately before the median class.

Example:

The following frequency distribution summarizes data on service times in minutes at the checkout
counter of a supermarket.

Time interval   Customers
2.00-<2.50       3
2.50-<3.00       8
3.00-<3.50      23
3.50-<4.00      10
4.00-<4.50       6

Calculate the median for the time it takes for a customer to be checked out at counter in this
supermarket.

Solution

Step 1

Time interval   Customers ( $f_i$ )   Cumulative frequency ( $F_i$ )
2.00-<2.50       3                      3
2.50-<3.00       8                     11
3.00-<3.50      23                     34
3.50-<4.00      10                     44
4.00-<4.50       6                     50
Step 2:

$\frac{n}{2} = \frac{50}{2} = 25$

Step 3: The cumulative frequency equal to or just greater than $\frac{n}{2}$ is 34.

Step 4:

The median class is 3.00-<3.50

Step 5: The median is found by using the interpolation formula $M_d = l_{M_d} + \frac{h}{f}\left(\frac{n}{2} - F\right)$

$l_{M_d} = 3.00$ is the lower class limit of the median class

h = 0.5 is the width of the median class

f = 23 is the frequency of the median class

n = 50 is the total cumulative frequency

F = 11 is the cumulative frequency of the class immediately before the median class.

$M_d = 3 + \frac{0.5}{23}\left(\frac{50}{2} - 11\right) = 3.30$
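
The interpolation can be checked with a small Python sketch. The function name `grouped_median` is hypothetical, and the code assumes equal class widths and classes listed in ascending order.

```python
def grouped_median(lower_limits, width, freqs):
    """Median of a grouped frequency distribution by linear interpolation."""
    half = sum(freqs) / 2          # n / 2
    F = 0                          # cumulative frequency before the current class
    for lower, f in zip(lower_limits, freqs):
        if F + f >= half:          # this class is the median class
            return lower + (width / f) * (half - F)
        F += f

lower_limits = [2.00, 2.50, 3.00, 3.50, 4.00]
customers = [3, 8, 23, 10, 6]
md = grouped_median(lower_limits, 0.5, customers)
print(round(md, 2))
```

For the service-time data this reproduces 3 + (0.5/23)(25 − 11) ≈ 3.30 minutes.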

3. Mode ( M o )

The mode of a data set is the value that occurs with the greatest frequency, that is, the data point
that appears most often among the measurements that constitute the data set.

Calculating the mode from ungrouped data set.

To find the mode of ungrouped data set we simply observe the data value that occurs most
frequently in the data set.

Calculating the mode from grouped data set:

Calculating the mode from a discrete frequency distribution:

The mode is the value that has the highest frequency.

Example:

In a survey of 50 retail outlets, the following data were collected.

Annual profit   Number of outlets
10               3
15               8
20              23
25              10
30               6
Calculate the mode for the annual profit.

Solution

The highest frequency is 23. Therefore, 20 is the mode.

Calculating the mode for grouped frequency distribution.

The mode is calculated using the following interpolation formula:

$M_o = l_{M_o} + \frac{f_1 - f_0}{(f_1 - f_0) + (f_1 - f_2)} \times h = l_{M_o} + \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \times h$

Where

M o is the mode

lM o is lower limit of the modal class

h is the width of the modal class

f1 is the frequency of the modal class

f 0 is the frequency of the class immediately before the modal class.

f 2 is the frequency of the class immediately after the modal class

Definition:

A modal class is the class interval having the highest frequency.

Example

The following frequency distribution summarizes data on service times in minutes at the
checkout counter of a supermarket.

Time interval   Customers
2.00-<2.50       3
2.50-<3.00       8
3.00-<3.50      23
3.50-<4.00      10
4.00-<4.50       6

Calculate the mode for the time it takes for a customer to be checked out at the counter in this
supermarket.

Solution:

The modal class is 3.00-<3.50 as it has the highest frequency 23.

lM o =3.0 is lower limit of the modal class

h =0.5 is the width of the modal class

f1 =23 is the frequency of the modal class

f 0 =8 is the frequency of the class immediately before the modal class.

f 2 =10 is the frequency of the class immediately after the modal class

Therefore,

$M_o = 3 + \frac{23 - 8}{(23 - 8) + (23 - 10)} \times 0.5 = 3 + \frac{15}{28} \times 0.5 = 3.27$
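
A Python sketch of the same interpolation is shown below. The function name `grouped_mode` is hypothetical. Two assumptions are made for cases the text does not cover: if the modal class is the first or last class, the missing neighbouring frequency is treated as 0, and if several classes tie for the highest frequency, the first one is used.

```python
def grouped_mode(lower_limits, width, freqs):
    """Mode of a grouped frequency distribution by interpolation."""
    m = freqs.index(max(freqs))                      # index of the modal class
    f1 = freqs[m]                                    # frequency of the modal class
    f0 = freqs[m - 1] if m > 0 else 0                # class before (0 if none)
    f2 = freqs[m + 1] if m < len(freqs) - 1 else 0   # class after (0 if none)
    return lower_limits[m] + (f1 - f0) / (2 * f1 - f0 - f2) * width

lower_limits = [2.00, 2.50, 3.00, 3.50, 4.00]
customers = [3, 8, 23, 10, 6]
mo = grouped_mode(lower_limits, 0.5, customers)
print(round(mo, 2))
```

With f1 = 23, f0 = 8 and f2 = 10, the result is 3 + (15/28)(0.5) ≈ 3.27 minutes.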

MEASURES OF DISPERSION

1.3. Partition values: Quartiles and Percentiles

Partition values are values of a variable that divide a data set into a number of equal parts
e.g. Quartiles, Percentiles, deciles

1. Quartiles
Quartiles of a data set are values (partition values) that divide the data set into four equal parts
when data are arranged in ascending order.
There are three quartiles called lower quartile ( Q1 ), the middle quartile (second quartile Q2 ),
and upper quartile ( Q3 ).
Calculating quartiles from frequency distributions

To calculate the kth quartile from grouped frequency distributions, we use the following
procedure:
Step 1: Construct the less than cumulative frequency distribution.
Step 2: Calculate $n_k = \frac{k}{4} \times n$
For $Q_1$, the value of k=1
For $Q_2$, the value of k=2
For $Q_3$, the value of k=3
Step 3: Find the cumulative frequency equal to or just greater than the value of $\frac{k}{4} \times n$ calculated
in step 2.
Step 4: The kth quartile class is the class whose cumulative frequency was found in step 3.
Step 5: The kth quartile is calculated using the following interpolation formula:
$Q_k = l_k + \frac{h}{f_k}\left(\frac{k}{4} \times n - F\right)$
Where
$Q_k$ is the kth quartile for the data set;
$l_k$ is the lower class limit of the kth quartile class;
h is the width of the kth quartile class;
$f_k$ is the frequency of the kth quartile class;
F is the cumulative frequency of the class immediately before the kth quartile class;
n is the total cumulative frequency.

Example:

The human resource department of a company analyzed the level of absenteeism of 56
employees who reported ill over the past year.

Absenteeism level (days absent)   Number of employees ( $f_i$ )
3-<7                              14
7-<11                             22
11-<15                            11
15-<19                             6
19-<23                             3

Determine the first quartile, the second quartile, and the third quartile.
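
The quartile procedure can be sketched in Python as follows. The function name `grouped_quartile` is hypothetical, and the code assumes equal class widths and classes listed in ascending order.

```python
def grouped_quartile(k, lower_limits, width, freqs):
    """k-th quartile (k = 1, 2, 3) of a grouped frequency distribution."""
    target = k * sum(freqs) / 4    # step 2: (k / 4) * n
    F = 0                          # cumulative frequency before the current class
    for lower, f in zip(lower_limits, freqs):
        if F + f >= target:        # steps 3 and 4: the k-th quartile class
            return lower + (width / f) * (target - F)   # step 5
        F += f

lower_limits = [3, 7, 11, 15, 19]   # days absent
employees = [14, 22, 11, 6, 3]      # n = 56
q1, q2, q3 = (grouped_quartile(k, lower_limits, 4, employees) for k in (1, 2, 3))
print(round(q1, 2), round(q2, 2), round(q3, 2))
```

The cumulative frequencies are 14, 36, 47, 53, 56, so the targets 14, 28 and 42 fall in the first, second and third classes, giving roughly Q1 = 7.00, Q2 = 9.55 and Q3 = 13.18 days.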

2. Percentiles

The percentiles of a data set are values of a random variable that divide the data set into one hundred
equal parts, each containing 1% of the values, when the values are arranged in ascending order.
There are ninety-nine percentiles, called the first percentile, the second percentile, …, and the ninety-ninth
percentile.
The 50th percentile is the median of the data set,
the 25th percentile is the 1st quartile,
and the 75th percentile is the 3rd quartile.

Calculating percentiles from frequency distributions

To calculate the kth percentile from grouped frequency distributions, we use the following
procedure:
Step 1: Construct the less than cumulative frequency distribution.
Step 2: Calculate $n_k = \frac{k}{100} \times n$
For $p_1$, the value of k=1
For $p_2$, the value of k=2
For $p_3$, the value of k=3
.
.
.
For $p_{99}$, the value of k=99

Step 3: Find the cumulative frequency equal to or just greater than the value of $\frac{k}{100} \times n$
calculated in step 2.

Step 4: The kth percentile class is the class whose cumulative frequency was found in step 3.

Step 5: The kth percentile is calculated using the following interpolation formula:

$p_k = l_k + \frac{h}{f_k}\left(\frac{k}{100} \times n - F\right)$

Where
$p_k$ is the kth percentile for the data set;
$l_k$ is the lower class limit of the kth percentile class;
h is the width of the kth percentile class;
$f_k$ is the frequency of the kth percentile class;
F is the cumulative frequency of the class immediately before the kth percentile class;
n is the total cumulative frequency.

Example:

The human resource department of a company analyzed the level of absenteeism of 56
employees who reported ill over the past year.

Absenteeism level (days absent)   Number of employees ( $f_i$ )
3-<7                              14
7-<11                             22
11-<15                            11
15-<19                             6
19-<23                             3

Determine the 65th percentile, the 70th percentile, and the 90th percentile
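
The percentile procedure differs from the quartile procedure only in the divisor (100 instead of 4). A minimal Python sketch, with the hypothetical function name `grouped_percentile` and the same assumptions of equal class widths and ascending classes:

```python
def grouped_percentile(k, lower_limits, width, freqs):
    """k-th percentile (k = 1, ..., 99) of a grouped frequency distribution."""
    target = k * sum(freqs) / 100  # step 2: (k / 100) * n
    F = 0                          # cumulative frequency before the current class
    for lower, f in zip(lower_limits, freqs):
        if F + f >= target:        # steps 3 and 4: the k-th percentile class
            return lower + (width / f) * (target - F)   # step 5
        F += f

lower_limits = [3, 7, 11, 15, 19]   # days absent
employees = [14, 22, 11, 6, 3]      # n = 56
p65, p70, p90 = (grouped_percentile(k, lower_limits, 4, employees)
                 for k in (65, 70, 90))
print(round(p65, 2), round(p70, 2), round(p90, 2))
```

The targets are 36.4, 39.2 and 50.4 observations, which gives approximately 11.15, 12.16 and 17.27 days for the 65th, 70th and 90th percentiles.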

1.4. Measures of dispersion

Two or more data sets may have the same mean and yet be very different in the way they spread
out. To describe this difference quantitatively, we use measures of dispersion. A measure of
dispersion indicates the amount of variation in a data set. Some of the commonly used measures of
spread are the range, the inter-quartile range, the semi-interquartile range (quartile deviation), the
variance, the standard deviation, and the coefficient of variation.

1. Range

The range is the difference between the highest and lowest values in a data set.

It measures the distance across the entire data set.

Range = Maximum value − Minimum value

Example:

18 26 17 10 7 27 24 17 17 23 29 28
18 10 23 16 9 12 26 5 12 23 22 24
16 5

xmax = 29
xmin = 5
Range = 29 − 5 = 24

2. Inter-quartile range (IQR)

Definition

Quartiles of a data set are values (partition values) that divide the data set into four equal parts when
data are arranged in ascending order.

There are three quartiles called lower quartile, the middle quartile (second quartile), and upper quartile.

The inter-quartile range is the difference between the upper quartile and the lower quartile:

$IQR = Q_3 - Q_1$

3. Semi interquartile range or quartile deviation

$SIQR\,(Q.D) = \frac{Q_3 - Q_1}{2}$
Example:
Let
$Q_1 = 14.5$ days
$Q_2 = 18.89$ days
$Q_3 = 23.93$ days

$SIQR\,(Q.D) = \frac{23.93 - 14.5}{2} = 4.715$ days
Interpretation: 50% of all observations are expected to lie within 4.715 days either side of the
median of 18.89 days. Or 25% of observations are considered to lie within 4.715 days below the
median and 25% of observations are expected to lie within 4.715 days above the median value.
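
The two quartile-based measures for this example can be verified with a few lines of Python (variable names are illustrative):

```python
q1, q3 = 14.5, 23.93     # lower and upper quartiles, in days

iqr = q3 - q1            # inter-quartile range
siqr = iqr / 2           # semi-interquartile range (quartile deviation)
print(round(iqr, 3), round(siqr, 3))
```

This gives an inter-quartile range of 9.43 days and a quartile deviation of 4.715 days.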

4. Variance and standard deviation

The most useful and reliable measures of dispersion are those that:
• Take every observation into account, and
• Are based on average deviation from the central value.

Because the variance is such a measure that satisfies these properties, it has become the most
commonly used measure of dispersion. It is extensively used in statistical analysis.

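The variance is calculated as the average of the squared deviations from the mean.

$\text{variance} = \frac{\text{sum of squared deviations}}{\text{sample size} - 1}$

For ungrouped data, the variance is calculated using the following formula:

$S_x^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1} = \frac{\sum_{i=1}^{n} x_i^2 - n\bar{x}^2}{n - 1}$

The computational formula for grouped data is

$S_x^2 = \frac{\sum_{i=1}^{k} f_i (x_i - \bar{x})^2}{n - 1} = \frac{\sum_{i=1}^{k} f_i x_i^2 - n\bar{x}^2}{n - 1}$

where k is the number of classes, $x_i$ is the mid-point of the ith class, and $n = \sum f_i$ is the total number of observations.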

The variance is a measure of the average squared deviation about the arithmetic mean. It is
expressed in squared units. Consequently, its meaning in a practical sense is obscure.

Because of this interpretation problem, a measure that uses the original units is derived from the
variance: the standard deviation.

5. Standard deviation

$S_x = \sqrt{S_x^2}$
The standard deviation describes how observations are spread about the mean.
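
As an illustration, the sample variance and standard deviation of the ten dividend values from the arithmetic mean example can be computed in Python with both forms of the ungrouped formula. The variable names are illustrative, and the result is compared with `statistics.variance` from the standard library, which also divides by n − 1.

```python
import math
import statistics

dividends = [5, 6, 14, 20, 30, 10, 15, 20, 20, 30]
n = len(dividends)
mean = sum(dividends) / n

# Definitional form: sum of squared deviations divided by n - 1
var_definition = sum((x - mean) ** 2 for x in dividends) / (n - 1)
# Computational form: (sum of x_i^2 - n * mean^2) divided by n - 1
var_shortcut = (sum(x * x for x in dividends) - n * mean ** 2) / (n - 1)

std_dev = math.sqrt(var_definition)   # standard deviation, in the original units
print(round(var_definition, 2), round(std_dev, 2), statistics.variance(dividends))
```

The sum of squared deviations is 692, so the variance is 692 / 9 ≈ 76.89 and the standard deviation is about 8.77 percentage points.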

6. Coefficient of variation
Sometimes it is necessary to compare samples of data from different random variables to
establish which sample shows greater variability. A direct comparison of their respective
standard deviations would be misleading, as the random variables may be measured in different
units. Thus, a meaningful comparison should be based on a measure of variability expressed in the
same units. This is achieved by producing a measure of relative variability (i.e. relative to the
mean) expressed in percentage terms, called the coefficient of variation.

$CV = \frac{S_x}{\bar{x}} \times 100\%$
Example

                     Turnover/month   Employee age
Mean                 R54588           38.2 yrs
Standard deviation   R8444            7.9 yrs
CV                   15.47%           20.68%

The age characteristic shows greater variability than turnover/month.
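
The two CV values in the table can be reproduced with a short Python sketch; the function name `coefficient_of_variation` is hypothetical.

```python
def coefficient_of_variation(std_dev, mean):
    """Standard deviation expressed as a percentage of the mean."""
    return std_dev / mean * 100

cv_turnover = coefficient_of_variation(8444, 54588)   # Rand per month
cv_age = coefficient_of_variation(7.9, 38.2)          # years
print(round(cv_turnover, 2), round(cv_age, 2))
```

Because the CV is unit-free, the two figures can be compared directly even though one variable is measured in Rand and the other in years.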

TUTORIAL 1

1. A company employs 12 persons in managerial positions. Their seniority (in years of service)
and sex are listed below:

Sex F M F M F M M F F F F M
Seniority (yrs) 8 15 6 2 9 21 9 3 4 7 2 10
Find the seniority mean, the seniority median and the seniority mode for the above data set.
2. The daily percentage change (to the nearest percentage) of equity traded on the JSE was
monitored for 100 days by an investment analyst. These daily percentage changes were
summarized into the frequency distribution below.

Daily percentage change of an equity (%)   Number of days
2                                          15
3                                          30
4                                          25
5                                          19
6                                           8
7                                           2
8                                           1
Find the mean daily percentage change, the median daily percentage change, and the modal
daily percentage change.

3. Mary secully is employed as an “Affirmative Action Officer” by Ortex electronics. Mary
reports directly to the plant manager, and is responsible for monitoring and making
recommendations on Ortex hiring procedures, working conditions and compensation plans.
As part of her ongoing monitoring of compensation plans, Mary collected data on hourly
earnings of all non-salaried employees at Ortex. To aid in interpreting the data, Mary
organized the data into the following frequency distribution:

                          Number of
Hourly earnings (Rands)   Women   Men
4.70-4.90                  6       5
4.90-5.10                 31      16
5.10-5.30                 15      25
5.30-5.50                 29      30
5.50-5.70                 19      24
Calculate the mean, the median and the mode of the hourly earnings for the men.

4. The annual earnings of a company’s salesmen at its Johannesburg and Cape Town offices
are as follows:

                     Number of salesmen
Earnings (R1000s)    Johannesburg   Cape Town
6-<8                  3              2
8-<10                 7              3
10-<12               13              6
12-<14               17              8
14-<16                4              3
16-<20                4              2
20-<25                2              6

(a) Compare the salesmen’s earnings in the Johannesburg and Cape Town offices by finding the
means, medians and quartile deviations.
(b) Find the standard deviation.
