Karl Pearson coefficient of skewness for ungrouped data

Let $x_i, i=1,2, \cdots , n$ be $n$ observations.

Formula

The Karl Pearson’s coefficient skewness is given by

$S_K =\dfrac{3(Mean-Median)}{sd}=\dfrac{3(\overline{x}-M)}{s_x}$

where,

  • $\overline{x}$ is the sample mean,
  • $M$ is the median,
  • $s_x$ is the sample standard deviation.

Sample mean

The sample mean $\overline{x}$ is given by

$$ \begin{eqnarray*} \overline{x}& =\frac{1}{n}\sum_{i=1}^{n}x_i \end{eqnarray*} $$

Sample Median

Arrange the data in ascending order of magnitude.

Median of $X$ is given by

$$ \begin{equation*} Md= \left\{ \begin{array}{ll} \text{value of }\big(\frac{n+1}{2}\big)^{th}\text{ obs.}, & \hbox{if $n$ is odd;} \\ \text{average of }\big(\frac{n}{2}\big)^{th}\text{ and }\big(\frac{n}{2}+1\big)^{th} \text{ obs.}, & \hbox{if $n$ is even.} \end{array} \right. \end{equation*} $$

Sample Standard deviation

sample standard deviation is given by

$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{\dfrac{1}{n-1}\bigg(\sum_{i=1}^{n}x_i^2-\frac{\big(\sum_{i=1}^n x_i\big)^2}{n}\bigg)} \end{aligned} $$

Example 1

The age (in years) of 6 randomly selected students from a class are

22,25,24,23,24,20.

Find the Karl Pearson’s coefficient of skewness.

Solution

$x_i$ $x_i^2$
22 484
25 625
24 576
23 529
24 576
20 400
Total 138 3190

Sample mean

The sample mean of $X$ is

$$ \begin{aligned} \overline{x} &=\frac{1}{n}\sum_{i=1}^n x_i\\ &=\frac{138}{6}\\ &=23\text{ years} \end{aligned} $$

The average of age of students is $23$ years.

Sample Median The data in ascending order of magnitude is $20, 22, 23, 24, 24, 25$.

Here $n = 6$ which is even.

Sample median = average of $(\frac{n}{2})^{th}$ and $(\frac{n}{2}+1)^{th}$ observations.

Thus the median age of students is

$$ \begin{aligned} M &= \frac{\big(\frac{6}{2}\big)^{th}\text{Obs.} +\big(\frac{6}{2}+1\big)^{th}\text{Obs.}}{2}\\ &= \frac{\big(3\big)^{th}\text{Obs.} +\big(4\big)^{th}\text{Obs.}}{2}\\ &=\frac{23 +24}{2} \\ &= 23.5 \text{ years}. \end{aligned} $$ The median age of students is $M=23.5$ years.

Sample variance

Sample variance of $X$ is

$$ \begin{aligned} s_x^2 &=\dfrac{1}{n-1}\bigg(\sum_{i=1}^{n}x_i^2-\frac{\big(\sum_{i=1}^n x_i\big)^2}{n}\bigg)\\ &=\dfrac{1}{5}\bigg(3190-\frac{(138)^2}{6}\bigg)\\ &=\dfrac{1}{5}\big(3190-\frac{19044}{6}\big)\\ &=\dfrac{1}{5}\big(3190-3174\big)\\ &= \frac{16}{5}\\ &=3.2 \end{aligned} $$

Sample standard deviation

The standard deviation is the positive square root of the variance.

The sample standard deviation is

$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{3.2}\\ &=1.7889 \text{ years} \end{aligned} $$

Thus the standard deviation of age of students is $1.7889$ years.

Karl Pearson’s coefficient of skewness

The Karl Pearson’s coefficient skewness is

$$ \begin{aligned} s_k &=\frac{3(Mean-Median)}{sd}\\ &=\frac{3\times(23-23.5)}{1.7889}\\ &= -0.8385 \end{aligned} $$

As the value of $s_k < 0$, the data is $\text{negatively skewed}$.

Example 2

A random sample of 11 patients yielded the following data on the length of stay (in days) in the hospital.

12,9,10,15,10,14,7,10,8,11,15

Find the Karl Pearson’s coefficient of skewness.

Solution

$x_i$ $x_i^2$
12 144
9 81
10 100
15 225
10 100
14 196
7 49
10 100
8 64
11 121
15 225
Total 121 1405

Sample mean

The sample mean of $X$ is

$$ \begin{aligned} \overline{x} &=\frac{1}{n}\sum_{i=1}^n x_i\\ &=\frac{121}{11}\\ &=11\text{ days} \end{aligned} $$

The average of length of stay in the hospital is $11$ days.

Sample Median

$n = 11$ which is odd.

The data in ascending order of magnitude is $7, 8, 9, 10, 10, 10, 11, 12, 14, 15, 15$.

Sample median = average of $(\frac{n}{2})^{th}$ and $(\frac{n}{2}+1)^{th}$ observations

That is

$$ \begin{aligned} M &= \text{value of }\bigg(\frac{n+1}{2}\bigg)^{th}\text{ obs.}\\ &= \text{value of }\bigg(\frac{11+1}{2}\bigg)^{th}\text{ obs.}\\ &= \text{value of } \big(6\big)^{th}\text{Obs.}\\ &=10 \text{ days} \end{aligned} $$ The median length of stay in the hospital is $M=10$ days.

Sample variance

Sample variance of $X$ is

$$ \begin{aligned} s_x^2 &=\dfrac{1}{n-1}\bigg(\sum_{i=1}^{n}x_i^2-\frac{\big(\sum_{i=1}^n x_i\big)^2}{n}\bigg)\\ &=\dfrac{1}{10}\bigg(1405-\frac{(121)^2}{11}\bigg)\\ &=\dfrac{1}{10}\big(1405-\frac{14641}{11}\big)\\ &=\dfrac{1}{10}\big(1405-1331\big)\\ &= \frac{74}{10}\\ &=7.4 \end{aligned} $$

Sample standard deviation

The standard deviation is the positive square root of the variance.

The sample standard deviation is

$$ \begin{aligned} s_x &=\sqrt{s_x^2}\\ &=\sqrt{7.4}\\ &=2.7203 \text{ days} \end{aligned} $$

Thus the standard deviation of length of stay in the hospital is $2.7203$ days.

Karl Pearson’s coefficient of skewness

The Karl Pearson’s coefficient skewness is

$$ \begin{aligned} s_k &=\frac{3(Mean-Median)}{sd}\\ &=\frac{3\times(11-10)}{2.7203}\\ &= 1.1028 \end{aligned} $$

As the value of $s_k > 0$, the data is $\text{positively skewed}$.

Related Resources