Confidence Interval for Proportions examples

Feb 20, 2024 by Dr. Raju Chaudhari

CI for difference between two population proportions

In this tutorial we will discuss some examples on confidence interval for difference between two population proportions.

Example 1

A random sample of 85 parts manufactured by machine A yields 10 defective and a random sample of 110 parts manufactured by machine B shows 28 defective.

Compute 95% confidence interval for difference between the proportions of defectives.

Are the two machine differ significantly with respect to the proportion of defectives?

Solution

Given information

.	Machine A	Machine B
Sample size	$n_1=85$	$n_2=110$
Observed no. of defectives	$X_1=10$	$X_2=28$

We wish to determine $95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$.

Step 1 Specify the confidence level

Confidence level is $1-\alpha = 0.95$. Thus, the level of significance is $\alpha = 0.05$.

Step 2 Given information

Given that $X_1 = 10$, $X_2 = 28$, $n_1 = 85$, $n_2 = 110$.

The estimate of the population proportions $p_1$ is $\hat{p}_1 =\frac{X_1}{n_1} =\frac{10}{85}=0.1176$ and the estimate of the population proportion $p_2$ is $\hat{p}_2 =\frac{X_2}{n_2} =\frac{28}{110}=0.2545$.

Step 3 Specify the formula

$100(1-\alpha)$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E \leq (p_1 -p_2) \leq (\hat{p}_1 -\hat{p}_2)+ E. \end{aligned} $$ where $E = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}$ and $Z_{\alpha/2}$ is the $Z$ value providing an area of $\alpha/2$ in the upper tail of the standard normal probability distribution.

Step 4 Determine the critical value

The critical value of $Z$ for given level of significance is $Z_{\alpha/2}$.

Thus $Z_{\alpha/2} = Z_{0.025} = 1.96$ .

Step 5 Compute the margin of error

The margin of error for the difference $(p_1-p_2)$ is $$ \begin{aligned} E & = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}\\ & = 1.96 \sqrt{\frac{0.1176*(1-0.1176)}{85}+\frac{0.2545*(1-0.2545)}{110}}\\ &= 0.1064. \end{aligned} $$

Step 6 Determine the confidence interval

$95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E &\leq (p_1-p_2) \leq (\hat{p}_1-\hat{p}_2) + E\\ (0.1176-0.2545) - 0.1064 & \leq (p_1-p_2) \leq (0.1176-0.2545) + 0.1064\\ -0.2433 & \leq (p_1-p_2) \leq -0.0305 \end{aligned} $$ Thus, $95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $(-0.2433,-0.0305)$.

Interpretation

We can be $95$% confident that the difference between two population proportions $(p_1-p_2)$ is between $-0.2433$ and $-0.0305$.

Because the $95$% confidence interval does not include the point zero, we conclude that at $0.05$ level of significance the two machine differ significantly with respect to the proportion of defectives.

Example 2

In a recent survey of randomly selected adults 65 or older, 411 of 1012 men and 525 of 1062 women say they suffer from some form of arthritis.

a. Construct a 98% confidence interval for the difference between senior men and women who suffer from arthritis.

b. Does there appear to be a difference between senior men and women as far as suffering from arthritis?

Solution

Given information

.	Men	Women
Sample size	$n_1=1012$	$n_2=1062$
Suffering from arthritis	$X_1=411$	$X_2=525$

We want to determine $98$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$.

Step 1 Specify the confidence level

Confidence level is $1-\alpha = 0.98$. Thus, the level of significance is $\alpha = 0.02$.

Step 2 Given information

Given that $X_1 = 411$, $X_2 = 525$, $n_1 = 1012$, $n_2 = 1062$.

The estimate of the population proportions $p_1$ is $\hat{p}_1 =\frac{X_1}{n_1} =\frac{411}{1012}=0.4061$ and the estimate of the population proportion $p_2$ is $\hat{p}_2 =\frac{X_2}{n_2} =\frac{525}{1062}=0.4944$.

Step 3 Specify the formula

$100(1-\alpha)$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E \leq (p_1 -p_2) \leq (\hat{p}_1 -\hat{p}_2)+ E. \end{aligned} $$ where $E = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}$ and $Z_{\alpha/2}$ is the $Z$ value providing an area of $\alpha/2$ in the upper tail of the standard normal probability distribution.

Step 4 Determine the critical value

The critical value of $Z$ for given level of significance is $Z_{\alpha/2}$.

Thus $Z_{\alpha/2} = Z_{0.01} = 2.33$ .

Step 5 Compute the margin of error

The margin of error for the difference $(p_1-p_2)$ is $$ \begin{aligned} E & = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}\\ & = 2.33 \sqrt{\frac{0.4061*(1-0.4061)}{1012}+\frac{0.4944*(1-0.4944)}{1062}}\\ &= 0.0507. \end{aligned} $$

Step 6 Determine the confidence interval

$98$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E &\leq (p_1-p_2) \leq (\hat{p}_1-\hat{p}_2) + E\\ (0.4061-0.4944) - 0.0507 & \leq (p_1-p_2) \leq (0.4061-0.4944) + 0.0507\\ -0.1389 & \leq (p_1-p_2) \leq -0.0375 \end{aligned} $$

Thus, $98$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $(-0.1389,-0.0375)$.

Interpretation

We can be $98$% confident that the difference between two population proportions $(p_1-p_2)$ is between $-0.1389$ and $-0.0375$.

Because the $98$% confidence interval does not include the point zero, we conclude that at $0.02$ level of significance there appears to be significant difference between the proportion of senior men and proportion of women who suffer from arthritis.

Example 3

Two machines used in the same operation are to be compared. A random sample of 80 parts from the first machine yields 6 non-conforming ones. A random sample of 120 parts from the second machine shows 14 non-conforming ones.

Find a 95% confidence interval for difference in the proportion of non-conforming parts between the two machines.

Solution

Given information

.	First Machine	Second Machine
Sample size	$n_1=80$	$n_2=200$
no.of non-confirming	$X_1=6$	$X_2=14$

We want to determine $95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$.

Step 1 Specify the confidence level

Confidence level is $1-\alpha = 0.95$. Thus, the level of significance is $\alpha = 0.05$.

Step 2 Given information

Given that $X_1 = 6$, $X_2 = 14$, $n_1 = 80$, $n_2 = 200$.

The estimate of the population proportions $p_1$ is $\hat{p}_1 =\frac{X_1}{n_1} =\frac{6}{80}=0.075$ and the estimate of the population proportion $p_2$ is $\hat{p}_2 =\frac{X_2}{n_2} =\frac{14}{200}=0.07$.

Step 3 Specify the formula

$100(1-\alpha)$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E \leq (p_1 -p_2) \leq (\hat{p}_1 -\hat{p}_2)+ E. \end{aligned} $$ where $E = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}$ and $Z_{\alpha/2}$ is the $Z$ value providing an area of $\alpha/2$ in the upper tail of the standard normal probability distribution.

Step 4 Determine the critical value

The critical value of $Z$ for given level of significance is $Z_{\alpha/2}$.

Thus $Z_{\alpha/2} = Z_{0.025} = 1.96$ .

Step 5 Compute the margin of error

The margin of error for the difference $(p_1-p_2)$ is $$ \begin{aligned} E & = Z_{\alpha/2} \sqrt{\frac{\hat{p}_1*(1-\hat{p}_1)}{n_1}+\frac{\hat{p}_2*(1-\hat{p}_2)}{n_2}}\\ & = 1.96 \sqrt{\frac{0.075*(1-0.075)}{80}+\frac{0.07*(1-0.07)}{200}}\\ &= 0.0677. \end{aligned} $$

Step 6 Determine the confidence interval

$95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $$ \begin{aligned} (\hat{p}_1-\hat{p}_2) - E &\leq (p_1-p_2) \leq (\hat{p}_1-\hat{p}_2) + E\\ (0.075-0.07) - 0.0677 & \leq (p_1-p_2) \leq (0.075-0.07) + 0.0677\\ -0.0627 & \leq (p_1-p_2) \leq 0.0727 \end{aligned} $$

Thus, $95$% confidence interval estimate of the difference between two population proportions $(p_1-p_2)$ is $(-0.0627,0.0727)$.

Interpretation

We can be $95$% confident that the difference between two population proportions $(p_1-p_2)$ is between $-0.0627$ and $0.0727$.

Because the $95$% confidence interval include the point zero, we conclude that at $0.05$ level of significance there is no significant difference between the proportion of non-confirming parts between the two machines.

Related Resources

Theory

Calculator

Confidence Interval for Ratio of variances

Feb 20, 2024
TUTORIALS

Confidence Interval for Proportions

Feb 20, 2024
TUTORIALS