Skewness Overview
Skewness measures the asymmetry of a distribution. It indicates whether data is symmetrically distributed or skewed toward one side.
Interpretation:
- Skewness = 0: Perfectly symmetric (bell-shaped)
- Skewness > 0: Positively skewed (right-skewed, tail extends right)
- Skewness < 0: Negatively skewed (left-skewed, tail extends left)
Pearson’s Coefficient of Skewness
Pearson’s coefficient uses the relationship between mean, median, and standard deviation.
Formula
$$S_K = \frac{3(\text{Mean} - \text{Median})}{s}$$
where:
- Mean = $\bar{x}$
- Median = $M$
- s = Standard deviation
Pearson’s Skewness for Ungrouped Data
Example:
Age (in years) of 6 students: 22, 25, 24, 23, 24, 20
Calculate Pearson’s skewness.
Solution:
| $x_i$ | $x_i^2$ |
|---|---|
| 22 | 484 |
| 25 | 625 |
| 24 | 576 |
| 23 | 529 |
| 24 | 576 |
| 20 | 400 |
| 138 | 3190 |
Mean: $$\bar{x} = \frac{138}{6} = 23 \text{ years}$$
Median: Arranged: 20, 22, 23, 24, 24, 25 $$M = \frac{23 + 24}{2} = 23.5 \text{ years}$$
Variance: $$s^2 = \frac{1}{5}\left(3190 - \frac{138^2}{6}\right) = \frac{1}{5}(3190 - 3174) = 3.2$$
Standard Deviation: $$s = \sqrt{3.2} = 1.789$$
Pearson’s Skewness: $$S_K = \frac{3(23 - 23.5)}{1.789} = \frac{-1.5}{1.789} = -0.839$$
Interpretation: Negatively skewed (slightly left-skewed)
Pearson’s Skewness for Grouped Data
Example:
| Class | f | Midpoint |
|---|---|---|
| 0-10 | 5 | 5 |
| 10-20 | 12 | 15 |
| 20-30 | 18 | 25 |
| 30-40 | 10 | 35 |
| 40-50 | 5 | 45 |
Calculate Pearson’s skewness (μ = 25.5, M ≈ 23.3, s ≈ 12.1)
$$S_K = \frac{3(25.5 - 23.3)}{12.1} = 0.54$$
Interpretation: Positively skewed
Bowley’s Coefficient of Skewness
Bowley’s coefficient uses quartiles and is based on the position of the median relative to Q₁ and Q₃.
Formula
$$S_{Bowley} = \frac{(Q_3 - M) - (M - Q_1)}{Q_3 - Q_1} = \frac{Q_3 + Q_1 - 2M}{Q_3 - Q_1}$$
Interpretation:
- Falls between -1 and 1
- 0 = symmetric
- Positive value = right-skewed
- Negative value = left-skewed
Bowley’s Skewness for Ungrouped Data
Example:
Hospital stay (days) for 15 patients: 5, 6, 8, 9, 9, 10, 10, 10, 10, 12, 12, 13, 13, 14, 15
Calculate Bowley’s skewness.
Solution:
- Q₁ = 9
- M = 10 (median, 8th value)
- Q₃ = 13
$$S_{Bowley} = \frac{(13 - 10) - (10 - 9)}{13 - 9} = \frac{3 - 1}{4} = \frac{2}{4} = 0.5$$
Interpretation: Positively skewed
Bowley’s Skewness for Grouped Data
Using grouped data quartile formulas: $$S_{Bowley} = \frac{Q_3 + Q_1 - 2M}{Q_3 - Q_1}$$
Kelly’s Coefficient of Skewness
Kelly’s coefficient uses deciles (D₉ and D₁) instead of quartiles, making it more stable for large datasets.
Formula
$$S_{Kelly} = \frac{(D_9 - M) - (M - D_1)}{D_9 - D_1} = \frac{D_9 + D_1 - 2M}{D_9 - D_1}$$
Interpretation:
- Similar to Bowley’s, ranges from -1 to 1
- Uses outer deciles (10th and 90th percentiles)
- More resistant to extreme values
Example: Kelly’s Skewness
For a distribution with:
- D₁ = 15
- M = 45
- D₉ = 75
$$S_{Kelly} = \frac{(75 - 45) - (45 - 15)}{75 - 15} = \frac{30 - 30}{60} = 0$$
Interpretation: Symmetric distribution
Moment Coefficient of Skewness
The moment coefficient uses the third central moment, providing the most theoretically rigorous measure.
Formula
For ungrouped data: $$\gamma_1 = \frac{m_3}{s^3}$$
where:
- $m_3 = \frac{\sum(x_i - \bar{x})^3}{n}$ (third central moment)
- $s$ = standard deviation
For grouped data: $$\gamma_1 = \frac{\sum f_i(m_i - \bar{x})^3}{n \cdot s^3}$$
Moment Coefficient for Ungrouped Data
Example:
Test scores: 45, 52, 58, 62, 68
Calculate moment coefficient skewness.
Solution:
| $x_i$ | $(x_i - \bar{x})$ | $(x_i - \bar{x})^2$ | $(x_i - \bar{x})^3$ |
|---|---|---|---|
| 45 | -12 | 144 | -1728 |
| 52 | -5 | 25 | -125 |
| 58 | 1 | 1 | 1 |
| 62 | 5 | 25 | 125 |
| 68 | 11 | 121 | 1331 |
Mean: $\bar{x} = 57$
Third Central Moment: $$m_3 = \frac{-1728 - 125 + 1 + 125 + 1331}{5} = \frac{-396}{5} = -79.2$$
Variance: $s^2 = \frac{316}{5} = 63.2$, so $s = 7.95$
Moment Coefficient: $$\gamma_1 = \frac{-79.2}{(7.95)^3} = \frac{-79.2}{502.46} = -0.158$$
Moment Coefficient for Grouped Data
$$\gamma_1 = \frac{\sum f_i(m_i - \bar{x})^3}{n \cdot s^3}$$
Calculate using class midpoints, frequencies, and deviations from mean.
Comparison of Skewness Methods
| Method | Advantages | Disadvantages |
|---|---|---|
| Pearson | Simple, intuitive | Only uses 2 measures (mean, median) |
| Bowley | Based on quartiles, robust | Less sensitive to extreme values |
| Kelly | Uses deciles, more stable | More complex to calculate |
| Moment | Theoretically rigorous | Affected by extreme outliers |
Interpretation Guidelines
| Skewness Value | Distribution Type | Characteristics |
|---|---|---|
| -1 to 0 | Left-skewed | Tail extends left, mean < median |
| 0 | Symmetric | Balanced distribution |
| 0 to 1 | Right-skewed | Tail extends right, mean > median |
| > 1 or < -1 | Highly skewed | Extreme asymmetry |
Visual Representation
Left-Skewed Symmetric Right-Skewed
(Negative) (Zero) (Positive)
| | | | |
| | | | |
| | | | | | | | |
| | | | | | | | | | | |
────────────────────────────────────────────────
Mean Mean Mean
Median Median Median
Tail ← Symmetric → Tail
Related Articles
Related Shape Measures:
- Kurtosis - Tail behavior and peakedness
- Central Moments - Mathematical basis for shape measures
Related Concepts:
- Mean, Median, and Mode - Relationship to skewness
- Variance and Standard Deviation - Measuring spread
- Quartiles and Percentiles - Distribution division
Applications:
- Position Measures and Quantiles - Understanding distribution
- Outlier Detection Methods - Identifying extreme values
- Descriptive Statistics Complete Guide - Master all concepts
Applications
- Income Distribution: Identifying wealth inequality
- Quality Control: Detecting process imbalances
- Data Transformation: Deciding normalization techniques
- Statistical Testing: Checking assumptions
- Pattern Recognition: Understanding data characteristics
References
-
Anderson, D.R., Sweeney, D.J., & Williams, T.A. (2018). Statistics for Business and Economics (14th ed.). Cengage Learning. - Comprehensive treatment of skewness calculation methods and interpretation.
-
NIST/SEMATECH. (2023). e-Handbook of Statistical Methods. Retrieved from https://www.itl.nist.gov/div898/handbook/ - Authoritative reference on statistical measures including moment-based skewness.