Kelly’s coefficient of skewness for grouped data

For a symmetric distribution, the first decile namely $D_1$ and nineth decile $D_9$ are equidistance from the median i.e. $D_5$. Thus, $D_9 - D_5 = D_5 -D_1$.

Kelly’s coefficient of skewness is based on deciles $D_{1}$, $1^{st}$ decile, $D_{5}$, $5^{th}$ decile, and $D_{9}$, $9^{th}$ decile). Only 20% of the observations are excluded from the measure.

Formula

$S_k =\dfrac{D_9+D_1 -2*Median}{D_9-D_1}$

OR

$S_k =\dfrac{P_{90}+P_{10} -2*Median}{P_{90}-P_{10}}$

where,

  • $D_1 (=P_{10})$ is the first decile (or tenth percentile) of the data
  • $Median=D_5 (=P_{50})$ is the median of the data
  • $D_9 (=P_{90})$ is the ninth decile (or nineteenth percentile) of the data.

Interpretation

  • If $S_k<0$, the data is negatively skewed.
  • If $S_k=0$, the data is symmetric(i.e., not skewed).
  • If $S_k>0$, the data is positively skewed.

Example 1

A librarian keeps the records about the amount of time spent (in minutes) in a library by college students. Data is as follows:

Time spent 30 32 35 38 40
No. of students 8 12 20 10 5

Calculate Kelly’s coefficient of skewness.

Solution

$x_i$ $f_i$ $cf$
30 8 8
32 12 20
35 20 40
38 10 50
40 5 55
Total 55

Deciles

The formula for $i^{th}$ deciles is

$D_i =\bigg(\dfrac{i(N)}{4}\bigg)^{th}$ value, $i=1,2,\cdots, 9$

where $N$ is the total number of observations.

First Decile $D_1$

$$ \begin{aligned} D_{1} &=\bigg(\dfrac{1(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{1(55)}{10}\bigg)^{th}\text{ value}\\ &=\big(5.5\big)^{th}\text{ value} \end{aligned} $$

The cumulative frequency just greater than or equal to $5.5$ is $8$. The corresponding value of $X$ is the $1^{st}$ decile. That is, $D_1 =30$ minutes.

Fifth Decile $D_5$

$$ \begin{aligned} D_{5} &=\bigg(\dfrac{5(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{5(55)}{10}\bigg)^{th}\text{ value}\\ &=\big(27.5\big)^{th}\text{ value} \end{aligned} $$

The cumulative frequency just greater than or equal to $27.5$ is $40$. The corresponding value of $X$ is the $5^{th}$ decile. That is, $D_5 =35$ minutes.

Ninth Decile $D_9$

$$ \begin{aligned} D_{9} &=\bigg(\dfrac{9(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{9(55)}{10}\bigg)^{th}\text{ value}\\ &=\big(49.5\big)^{th}\text{ value} \end{aligned} $$

The cumulative frequency just greater than or equal to $49.5$ is $50$. The corresponding value of $X$ is the $9^{th}$ decile. That is, $D_9 =38$ minutes.

Kelly’s coefficient of skewness

Kelly’s coefficient of skewness is

$$ \begin{aligned} S_k &= \frac{D_9+D_1 - 2D_5}{D_9 -D_1}\\ &=\frac{38+30 - 2* 35}{38 - 30}\\ &=\frac{-2}{8}\\ &=-0.25 \end{aligned} $$

As the coefficient of skewness $S_k$ is $\text{less than zero}$ (i.e., $S_k < 0$), the distribution is $\text{negatively skewed}$.

Example 2

The following table gives the amount of time (in minutes) spent on the internet each evening by a group of 56 students.

Time spent on Internet ($x$) 10-12 13-15 16-18 19-21 22-24
No. of students ($f$) 3 12 15 24 2

Calculate Kelly’s coefficient of skewness.

Solution

Let $X$ denote the amount of time (in minutes) spent on the internet.

Here the classes are inclusive. To make them exclusive type subtract 0.5 from the lower limit and add 0.5 to the upper limit of each class.

Class Interval Class Boundries $f_i$ $cf$
10-12 9.5-12.5 3 3
13-15 12.5-15.5 12 15
16-18 15.5-18.5 15 30
19-21 18.5-21.5 24 54
22-24 21.5-24.5 2 56
Total 56

The formula for $i^{th}$ decile is

$D_i =\bigg(\dfrac{i(N)}{10}\bigg)^{th}$ value, $i=1,2,\cdots, 9$

where $N$ is the total number of observations.

First Decile ($D_1$)

$$ \begin{aligned} D_{1} &=\bigg(\dfrac{1(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{1(56)}{10}\bigg)^{th}\text{ value}\\ &=\big(5.6\big)^{th}\text{ value} \end{aligned} $$ The cumulative frequency just greater than or equal to $5.6$ is $15$, the corresponding class $12.5-15.5$ is the $1^{st}$ decile class.

Thus

  • $l = 12.5$, the lower limit of the $1^{st}$ decile class
  • $N=56$, total number of observations
  • $f =12$, frequency of the $1^{st}$ decile class
  • $F_< = 3$, cumulative frequency of the class previous to $1^{st}$ decile class
  • $h =3$, the class width

The first decile $D_1$ can be computed as follows:

$$ \begin{aligned} D_1 &= l + \bigg(\frac{\frac{1(N)}{10} - F_<}{f}\bigg)\times h\\ &= 12.5 + \bigg(\frac{\frac{1*56}{10} - 3}{12}\bigg)\times 3\\ &= 12.5 + \bigg(\frac{5.6 - 3}{12}\bigg)\times 3\\ &= 12.5 + \big(0.2167\big)\times 3\\ &= 12.5 + 0.65\\ &= 13.15 \text{ minutes} \end{aligned} $$

Fifth Decile ($D_5$)

$$ \begin{aligned} D_{5} &=\bigg(\dfrac{5(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{5(56)}{10}\bigg)^{th}\text{ value}\\ &=\big(28\big)^{th}\text{ value} \end{aligned} $$

The cumulative frequency just greater than or equal to $28$ is $30$, the corresponding class $15.5-18.5$ is the $5^{th}$ decile class.

Thus

  • $l = 15.5$, the lower limit of the $5^{th}$ decile class
  • $N=56$, total number of observations
  • $f =15$, frequency of the $5^{th}$ decile class
  • $F_< = 15$, cumulative frequency of the class previous to $5^{th}$ decile class
  • $h =3$, the class width

The fifth decile $D_5$ can be computed as follows:

$$ \begin{aligned} D_5 &= l + \bigg(\frac{\frac{5(N)}{10} - F_<}{f}\bigg)\times h\\ &= 15.5 + \bigg(\frac{\frac{5*56}{10} - 15}{15}\bigg)\times 3\\ &= 15.5 + \bigg(\frac{28 - 15}{15}\bigg)\times 3\\ &= 15.5 + \big(0.8667\big)\times 3\\ &= 15.5 + 2.6\\ &= 18.1 \text{ minutes} \end{aligned} $$

Ninth Decile ($D_9$)

$$ \begin{aligned} D_{9} &=\bigg(\dfrac{9(N)}{10}\bigg)^{th}\text{ value}\\ &= \bigg(\dfrac{9(56)}{10}\bigg)^{th}\text{ value}\\ &=\big(50.4\big)^{th}\text{ value} \end{aligned} $$

The cumulative frequency just greater than or equal to $50.4$ is $54$, the corresponding class $18.5-21.5$ is the $9^{th}$ decile class.

Thus

  • $l = 18.5$, the lower limit of the $9^{th}$ decile class
  • $N=56$, total number of observations
  • $f =24$, frequency of the $9^{th}$ decile class
  • $F_< = 30$, cumulative frequency of the class previous to $9^{th}$ decile class
  • $h =3$, the class width

The ninth decile $D_9$ can be computed as follows:

$$ \begin{aligned} D_9 &= l + \bigg(\frac{\frac{9(N)}{10} - F_<}{f}\bigg)\times h\\ &= 18.5 + \bigg(\frac{\frac{9*56}{10} - 30}{24}\bigg)\times 3\\ &= 18.5 + \bigg(\frac{50.4 - 30}{24}\bigg)\times 3\\ &= 18.5 + \big(0.85\big)\times 3\\ &= 18.5 + 2.55\\ &= 21.05 \text{ minutes} \end{aligned} $$

Kelly’s coefficient of skewness

Kelly’s coefficient of skewness is

$$ \begin{aligned} S_k &= \frac{D_9+D_1 - 2D_5}{D_9 -D_1}\\ &=\frac{21.05+13.15 - 2* 18.1}{21.05 - 13.15}\\ &=\frac{-2}{7.9}\\ &=-0.25316 \end{aligned} $$

As the coefficient of skewness $S_k$ is $\text{less than zero}$ (i.e., $S_k < 0$), the distribution is $\text{negatively skewed}$.

Related Resources