Karl Pearson coefficient of skewness for grouped data
Let $(x_i,f_i), i=1,2, \cdots , n$
be given frequency distribution.
Karl Pearson coefficient of skewness formula
The Karl Pearson’s coefficient skewness for grouped data is given by
$S_k =\dfrac{Mean-Mode)}{sd}=\dfrac{\overline{x}-\text{Mode}}{s_x}$
OR
$S_k =\dfrac{3(Mean-Median)}{sd}=\dfrac{\overline{x}-M}{s_x}$
where,
- $\overline{x}$ is the sample mean,
- $M$ is the median,
- $s_x$ is the sample standard deviation.
Sample mean
The sample mean $\overline{x}$ is given by
$$ \begin{eqnarray*} \overline{x}& =\frac{1}{N}\sum_{i=1}^{n}f_ix_i \end{eqnarray*} $$
Sample median
The median is given by
$\text{Median } = l + \bigg(\dfrac{\frac{N}{2} - F_<}{f}\bigg)\times h$
where,
- $N$, total number of observations
- $l$, the lower limit of the median class
- $f$, frequency of the median class
- $F_<$, cumulative frequency of the pre median class
- $h$, the class width
Sample mode
The mode of the distribution is given by
$\text{Mode } = l + \bigg(\dfrac{f_m - f_1}{2f_m-f_1-f_2}\bigg)\times h$
where,
- $l$, the lower limit of the modal class
- $f_m$, frequency of the modal class
- $f_1$, frequency of the class pre-modal class
- $f_2$, frequency of the class post-modal class
- $h$, the class width
Sample Standard deviation
Sample standard deviation is given by
$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{\dfrac{1}{N-1}\bigg(\sum_{i=1}^{n}f_ix_i^2-\frac{\big(\sum_{i=1}^n f_ix_i\big)^2}{N}\bigg)} \end{aligned} $$
Karl Pearson coefficient of skewness formula with Example 1
The number of students absent in a class was recorded every day for 60 days and the information is given in the following frequency distribution.
No.of Students absent (x) | 0 | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|---|
No.of days (f) | 3 | 6 | 18 | 18 | 8 | 5 | 2 |
Find the Karl Pearson coefficient of skewness.
Solution
$x_i$ | $f_i$ | $f_i*x_i$ | $f_i*x_i^2$ | $cf$ | |
---|---|---|---|---|---|
0 | 3 | 0 | 0 | 3 | |
1 | 6 | 6 | 6 | 9 | |
2 | 18 | 36 | 72 | 27 | |
3 | 18 | 54 | 162 | 45 | |
4 | 8 | 32 | 128 | 53 | |
5 | 5 | 25 | 125 | 58 | |
6 | 2 | 12 | 72 | 60 | |
Total | 60 | 165 | 565 |
Sample mean
The sample mean of $X$ is
$$ \begin{aligned} \overline{x} &=\frac{1}{N}\sum_{i=1}^n f_ix_i\\ &=\frac{165}{60}\\ &=2.75 \end{aligned} $$
The average of no. of students absent is $2.75$ students.
Since the given frequency distribution is bimodal,Karl Pearson’s coefficient of skewness can be calculated by using empirical formula.
For asymmetric distribution,
$$ \begin{aligned} \text{Mean} - \text{Mode} &= 3(\text{Mean} - \text{Median}) \end{aligned} $$
Thus, Karl Pearson’s coefficient of skewness can be calculated by
$$ \begin{aligned} S_k &=\dfrac{3(Mean-Median)}{sd}\\ &=\dfrac{\overline{x}-M}{s_x} \end{aligned} $$
Sample Median
Median no. of students absent is
$$ \begin{aligned} \text{Median} &=\bigg(\dfrac{N}{2}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{60}{2}\bigg)^{th}\text{ value}\\ &=\big(30\big)^{th}\text{ value} \end{aligned} $$
The cumulative frequency just greater than or equal to $30$ is $45$. The corresponding value of $x$ is median. That is, $M =3$.
Thus, median number of accidents $M$ = $3$.
Sample variance
Sample variance of $X$ is
$$ \begin{aligned} s_x^2 &=\dfrac{1}{N-1}\bigg(\sum_{i=1}^{n}f_ix_i^2-\frac{\big(\sum_{i=1}^n f_ix_i\big)^2}{N}\bigg)\\ &=\dfrac{1}{59}\bigg(565-\frac{(165)^2}{60}\bigg)\\ &=\dfrac{1}{59}\big(565-\frac{27225}{60}\big)\\ &=\dfrac{1}{59}\big(565-453.75\big)\\ &= \frac{111.25}{59}\\ &=1.8856 \end{aligned} $$
Sample standard deviation
The standard deviation is the positive square root of the variance.
The sample standard deviation is
$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{1.8856}\\ &=1.3732 \end{aligned} $$
Thus the standard deviation of no. of students absent is $1.3732$ students.
Karl Pearson’s coefficient of skewness
The Karl Pearson coefficient of skewness can be calculated by
$$ \begin{aligned} s_k &=\frac{3(Mean-Median)}{sd}\\ &=\frac{3\times(2.75-3)}{2.1602}\\ &= -0.5462 \end{aligned} $$
As the value of $s_k < 0$, the data is $\text{negatively skewed}$.
Karl Pearson coefficient of skewness formula with Example 2
The following table gives the distribution of weight (in pounds) of 100 newborn babies at certain hospital in 2012.
Weight (in pounds) | 3-5 | 5-7 | 7-9 | 9-11 | 11-13 |
---|---|---|---|---|---|
No.of babies | 10 | 30 | 28 | 18 | 14 |
Calculate Karl Pearson coefficient of skewness.
Solution
Class Interval | mid-value ($x$) | $f$ | $f*x$ | $f*x^2$ | |
---|---|---|---|---|---|
3-5 | 4 | 10 | 40 | 160 | |
5-7 | 6 | 30 | 180 | 1080 | |
7-9 | 8 | 28 | 224 | 1792 | |
9-11 | 10 | 18 | 180 | 1800 | |
11-13 | 12 | 14 | 168 | 2016 | |
Total | 100 | 792 | 6848 |
Mean
The mean weight of babies is
$$ \begin{aligned} \overline{x} &=\frac{1}{N}\sum_{i=1}^n f_ix_i\\ &=\frac{792}{100}\\ &=7.92 \text{ pounds} \end{aligned} $$
Sample Mode
The maximum frequency is $30$, the corresponding class $5-7$ is the modal class.
Mode of the given frequency distribution is:
$$ \begin{aligned} \text{Mode } &= l + \bigg(\frac{f_m - f_1}{2f_m-f_1-f_2}\bigg)\times h\\ \end{aligned} $$
where,
- $l = 5$, the lower limit of the modal class
- $f_m =30$, frequency of the modal class
- $f_1 = 10$, frequency of the pre-modal class
- $f_2 = 28$, frequency of the post-modal class
- $h =2$, the class width
Thus mode of a frequency distribution is
$$ \begin{aligned} \text{Mode } &= l + \bigg(\frac{f_m - f_1}{2f_m-f_1-f_2}\bigg)\times h\\ &= 5 + \bigg(\frac{30 - 10}{2\times30 - 10 - 28}\bigg)\times 2\\ &= 5 + \bigg(\frac{20}{22}\bigg)\times 2\\ &= 5 + \big(0.9091\big)\times 2\\ &= 5 + \big(1.8182\big)\\ &= 6.8182 \text{ pounds} \end{aligned} $$
Sample variance
Sample variance of $X$ is
$$ \begin{aligned} s_x^2 &=\dfrac{1}{N-1}\bigg(\sum_{i=1}^{n}f_ix_i^2-\frac{\big(\sum_{i=1}^n f_ix_i\big)^2}{N}\bigg)\\ &=\dfrac{1}{99}\bigg(6848-\frac{(792)^2}{100}\bigg)\\ &=\dfrac{1}{99}\big(6848-\frac{627264}{100}\big)\\ &=\dfrac{1}{99}\big(6848-6272.64\big)\\ &= \frac{575.36}{99}\\ &=5.8117 \end{aligned} $$
Sample standard deviation
The sample standard deviation is
$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{5.8117}\\ &=2.4107 \text{ pounds} \end{aligned} $$
Thus the standard deviation of weight of babies is $2.4107$ pounds.
Karl Pearson’s coefficient of skewness
The Karl Pearson coefficient of skewness can be calculated by
$$ \begin{aligned} s_k &=\frac{Mean-\text{Mode}}{sd}\\ &=\frac{7.92-6.8182}{3.1623}\\ &= 0.457 \end{aligned} $$
As the value of $s_k > 0$, the data is $\text{positively skewed}$.
Hope you like Karl Pearson coefficient of skewness for grouped data and step by step explanation about how to find Karl Pearson coefficient of skewness with examples.
Calculate Pearson coefficient of skewness for grouped data using Calculator link given below under resource section.
You can also refer Karl Pearson coefficient of skewness formula using formula link given below under resource section.