Summary statistic for ungrouped data
Summary statistic summarize and provide information about the sample data. It includes the minimum value of the data, first quartile ($Q_1$), median (i.e., $Q_2$), mean ($\overline{x}$), third quartile ($Q_3$) and the minimum value of the data.
Summary statistic includes
- minimum value ($\min$),
- first quartile ($Q_1$),
- $\text{median }$ ($Q_2$),
- sample mean ($\overline{x}$),
- third quartile ($Q_3$),
- maximum value ($\max$).
Formula
$\min$, $Q_1$, $\text{median}$, $\overline{x}$, $Q_3$ and $\max$
The mean of $X$ is denoted by $\overline{x}$ and is given by
$\overline{x} =\dfrac{1}{n}\sum_{i=1}^{n}x_i$
Quartiles
The formula for $i^{th}$ quartile is
$Q_i =$ Value of $\bigg(\dfrac{i(n+1)}{4}\bigg)^{th}$ observation, $i=1,2,3$
where $n$ is the total number of observations.
Example 1
The age (in years) of 6 randomly selected students from a class are
22,25,24,23,24,20.
Find the summary statistics of the data.
Solution
Arrange the data in ascending order
20, 22, 23, 24, 24, 25.
Minumum Value
The minimum age of students is $\min = 20$
years.
Maximum Value
The maximum age of students is $\max = 25$
years.
Sample mean
The sample mean of $X$ is
$$ \begin{aligned} \overline{x} &=\frac{1}{n}\sum_{i=1}^n x_i\\ &=\frac{138}{6}\\ &=23\text{ years} \end{aligned} $$
The sample mean of age of students is $23$ years.
Quartiles
The formula for $i^{th}$ quartile is
$Q_i =$ Value of $\bigg(\dfrac{i(n+1)}{4}\bigg)^{th}$ observation, $i=1,2,3$
where $n$ is the total number of observations.
First Quartile $Q_1$
The first quartle $Q_1$ can be computed as follows:
$$ \begin{aligned} Q_{1} &=\text{Value of }\bigg(\dfrac{1(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{1(6+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{ Value of }\big(1.75\big)^{th} \text{ observation}\\ &= \text{Value of }\big(1\big)^{th} \text{ obs.}+0.75 \big(\text{Value of } \big(2\big)^{th}\text{ obs.}-\text{Value of }\big(1\big)^{th} \text{ obs.}\big)\\ &=20+0.75\big(22 -20\big)\\ &=21.5 \text{ years} \end{aligned} $$
Thus, $25$ % of the students had age less than or equal to $21.5$ years.
Median (M) (i.e., Second Quartile $Q_2$)
The median ($M$) or second quartile $Q_2$ can be computed as follows:
$$ \begin{aligned} M &=\text{Value of }\bigg(\dfrac{2(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{2(6+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{ Value of }\big(3.5\big)^{th} \text{ observation}\\ &= \text{Value of }\big(3\big)^{th} \text{ obs.}+0.5 \big(\text{Value of } \big(4\big)^{th}\text{ obs.}-\text{Value of }\big(3\big)^{th} \text{ obs.}\big)\\ &=23+0.5\big(24 -23\big)\\ &=23.5 \text{ years} \end{aligned} $$
Thus, $50$ % of the students had age less than or equal to $23.5$ years.
Third Quartile $Q_3$
The third quartile $Q_3$ can be computed as follows:
$$ \begin{aligned} Q_{3} &=\text{Value of }\bigg(\dfrac{3(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{3(6+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{ Value of }\big(5.25\big)^{th} \text{ observation}\\ &= \text{Value of }\big(5\big)^{th} \text{ obs.}+0.25 \big(\text{Value of } \big(6\big)^{th}\text{ obs.}-\text{Value of }\big(5\big)^{th} \text{ obs.}\big)\\ &=24+0.25\big(25 -24\big)\\ &=24.25 \text{ years} \end{aligned} $$
Thus, $75$ % of the students had age less than or equal to $24.25$ years.
Thus the summary statistics of given data set is
$\min = 20$ years, $Q_1 = 21.5$ years, $\text{median }=23.5$ years,$\text{mean }=23$ years, $Q_3=24.25$ years and $\max = 25$ years.
Example 2
A random sample of 15 patients yielded the following data on the length of stay (in days) in the hospital.
5, 6, 9, 10, 15, 10, 14, 12, 10, 13, 13, 9, 8, 10, 12.
Find the summary statistics of the data.
Solution
Arrange the data in ascending order
5, 6, 8, 9, 9, 10, 10, 10, 10, 12, 12, 13, 13, 14, 15.
Minumum Value
The minimum age of students is $\min = 5$
days.
Maximum Value
The maximum age of students is $\max = 15$
days.
Sample mean
The sample mean of $X$ is
$$ \begin{aligned} \overline{x} &=\frac{1}{n}\sum_{i=1}^n x_i\\ &=\frac{156}{15}\\ &=10.4\text{ days} \end{aligned} $$
The sample mean of length of stay in the hospital of students is $10.4$ days.
Quartiles
The formula for $i^{th}$ quartile is
$Q_i =$ Value of $\bigg(\dfrac{i(n+1)}{4}\bigg)^{th}$ observation, $i=1,2,3$
where $n$ is the total number of observations.
First Quartile $Q_1$
The first quartle $Q_1$ can be computed as follows:
$$ \begin{aligned} Q_{1} &=\text{Value of }\bigg(\dfrac{1(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{1(15+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{Value of }\big(4\big)^{th} \text{ observation}\\ &=9 \text{ days} \end{aligned} $$
Thus, $25$ % of the patients had length of stay in the hospital less than or equal to $9$ days.
Second Quartile $Q_2$
The second quartile $Q_2$ can be computed as follows:
$$ \begin{aligned} Q_{2} &=\text{Value of }\bigg(\dfrac{2(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{2(15+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{Value of }\big(8\big)^{th} \text{ observation}\\ &=10 \text{ days} \end{aligned} $$
Thus, $50$ % of the patients had length of stay in the hospital less than or equal to $10$ days.
Third Quartile $Q_3$
The third quartile $Q_3$ can be computed as follows:
$$ \begin{aligned} Q_{3} &=\text{Value of }\bigg(\dfrac{3(n+1)}{4}\bigg)^{th} \text{ observation}\\ &=\text{Value of }\bigg(\dfrac{3(15+1)}{4}\bigg)^{th} \text{ observation}\\ &= \text{Value of }\big(12\big)^{th} \text{ observation}\\ &=13 \text{ days} \end{aligned} $$
Thus, $75$ % of the patients had length of stay in the hospital less than or equal to $13$ days.
Thus the summary statistics of given data set is
$\min = 5$ days, $Q_1 = 9$ days, $\text{median }=10$ days,$\text{mean }=10.4$ days, $Q_3=13$ days and $\max = 15$ days.
Related Resources
Suggestions and comments will be appreciated.