UNISTAT - the ultimate Excel statistics add-in

6.4.4. Unpaired Proportions

A 2 x 2 table is formed to perform the procedures listed under this topic. This can be done from raw data consisting of two columns (not necessarily of equal length) or by directly entering the four cell frequencies in the following order:

(1,1) contains the number of category 1s in the first sample

(1,2) contains the number of category 1s in the second sample

(2,1) contains the number of category 2s in the first sample

(2,2) contains the number of category 2s in the second sample

This data structure is explained in detail at the beginning of this chapter (see 6.0.7. 2 x 2 Tables). The user should take care to distinguish this table from a table formed on the same pair of columns by the Paired Proportions procedure. Here, the total table frequency is the sum of valid cases in sample 1 and sample 2, whereas in Paired Proportions (as in 2 x 2 cross-tabulation) the total frequency is the number of valid pairs.

Nonparametric Tests-Unpaired Proportions

When the four frequency values for a 2 x 2 table are already available in the spreadsheet, you do not have to type them again into the Cell Frequencies are Given dialogue. All statistics available under Binomial Proportion, Unpaired Proportions and Paired Proportions procedures are also available in Contingency Table and Cross-Tabulation procedures (see 6.6.2.3. 2 x 2 Table Statistics).

6.4.4.1. Difference Between Unpaired Proportions

The null hypothesis “the two proportions are equal” is tested. See Armitage, P. & G. Berry (1994) p. 129. Let:

     Nonparametric Tests-Unpaired Proportions

and:

     Nonparametric Tests-Unpaired Proportions

The confidence interval for the difference between two proportions uses the separate sample variance, which is defined as:

     Nonparametric Tests-Unpaired Proportions

and the confidence interval is:

     Nonparametric Tests-Unpaired Proportions

The pooled variance is used to test the difference between two proportions. The test statistic is based on the following normal approximation:

     Nonparametric Tests-Unpaired Proportions

where:

     Nonparametric Tests-Unpaired Proportions

and:

     Nonparametric Tests-Unpaired Proportions

The pooled confidence limits are computed as:

     Nonparametric Tests-Unpaired Proportions

Example 1

Example 4.11 on p. 129 from Armitage, P. & G. Berry (1994). The effect of two treatments on the mortality rates of two random groups is assessed. The data is given in the form of a 2 x 2 table.

Select Statistics 1Nonparametric Tests (1-2 Samples) → Unpaired Proportions and select the data option 3 Cell Frequencies are Given. Enter 41 in (Sample 1 = 1), 64 in (Sample 2 = 1), 216 in (Sample 1 = 2) and 180 in (Sample 2 = 2). Select only the Difference Between Unpaired Proportions output option to obtain the following results:

Unpaired Proportions

Data option: Cell Frequencies are Given

 

 

Sample 1

Sample 2

Total

 1

 41

 64

 105

 2

 216

 180

 396

Total

 257

 244

 501

 

Difference Between Unpaired Proportions

 

Proportion 1 =

 0.1595

Proportion 2 =

 0.2623

 

 

Difference

Standard Error

Z-Statistic

1-Tail Probability

2-Tail Probability

Pooled Variance

-0.1028

 0.0364

-2.8247

 0.0024

 0.0047

Separate Variance

 

 0.0363

-2.8341

 0.0023

 0.0046

 

 

Lower 95%

Upper 95%

Pooled Variance

-0.1741

-0.0315

Separate Variance

-0.1738

-0.0317

 

In this example, Armitage and Berry report the Z-statistic and pooled variance confidence interval only.

Example 2

Example on p. 32, Gardner M. J., Altman, D. G. (1989). Number of patients responding to treatment in two groups is given in the form of a 2 x 2 table.

Select Statistics 1Nonparametric Tests (1-2 Samples) → Unpaired Proportions and select the data option 3 Cell Frequencies are Given. Enter 61 in (Sample 1 = 1), 45 in (Sample 2 = 1), 19 in (Sample 1 = 2) and 35 in (Sample 2 = 2) to obtain the following results:

Unpaired Proportions

Data option: Cell Frequencies are Given

 

 

Sample 1

Sample 2

Total

 1

 61

 45

 106

 2

 19

 35

 54

Total

 80

 80

 160

 

Difference Between Unpaired Proportions

Proportion 1 =

 0.7625

Proportion 2 =

 0.5625

 

 

Difference

Standard Error

Z-Statistic

1-Tail Probability

2-Tail Probability

Pooled Variance

 0.2000

 0.0748

 2.6750

 0.0037

 0.0075

Separate Variance

 

 0.0731

 2.7369

 0.0031

 0.0062

 

 

Lower 95%

Upper 95%

Pooled Variance

 0.0535

 0.3465

Separate Variance

 0.0568

 0.3432

 

In this example, Gardner and Altman (1989) report the separate variance confidence interval only.

6.4.4.2. Risk Ratio

Risk Ratio is defined as (see Gardner M. J., Altman, D. G. 1989, p. 53.):

      Nonparametric Tests-Unpaired Proportions

where the logarithm of R has a standard error of:

     Nonparametric Tests-Unpaired Proportions

and the confidence limits are reported as:

     Nonparametric Tests-Unpaired Proportions

Example

Example on p. 52, Gardner M. J., Altman, D. G. (1989). Susceptibility to rubella in Asian and non Asian antenatal patients is studied. Data is a 2 x 2 table.

Select Statistics 1Nonparametric Tests (1-2 Samples) → Unpaired Proportions and select the data option 3 Cell Frequencies are Given. Enter 161 in (Sample 1 = 1), 748 in (Sample 2 = 1), 2475 in (Sample 1 = 2) and 34020 in (Sample 2 = 2). Select only the Risk Ratio output option.

Unpaired Proportions

Data option: Cell Frequencies are Given

 

 

Sample 1

Sample 2

Total

 1

 161

 748

 909

 2

 2475

 34020

 36495

Total

 2636

 34768

 37404

 

Risk Ratio

 

Value

Lower 95%

Upper 95%

Risk Ratio

 2.8390

 2.4057

 3.3503

 

6.4.4.3. Odds Ratio and Relative Risks

The last part of output reports the odds ratio:

     Nonparametric Tests-Unpaired Proportions

where the logarithm of OR has a standard error of:

     Nonparametric Tests-Unpaired Proportions

and the confidence intervals, which are also known as logit limits, are:

     Nonparametric Tests-Unpaired Proportions

For the odds ratio for paired cases see 6.4.5.4. Odds Ratio (Paired).

The relative risk for column 1 (cohort 1) is given as:

     Nonparametric Tests-Unpaired Proportions

and its confidence interval:

     Nonparametric Tests-Unpaired Proportions

where:

     Nonparametric Tests-Unpaired Proportions

The relative risk for column 2 (cohort 2) is found by interchanging the indices.

Example

Example 4.12 on p. 132 from Armitage, P. & G. Berry (1994). Association of bronchial carcinoma and asbestos exposure is investigated. The data is given in the form of a 2 x 2 table.

Select Statistics 1Nonparametric Tests (1-2 Samples) → Unpaired Proportions and select the data option 3 Cell Frequencies are Given. Enter 148 in (Sample 1 = 1), 372 in (Sample 2 = 1), 75 in (Sample 1 = 2) and 343 in (Sample 2 = 2). Select only the Odds Ratio and Relative Risks output option to obtain the following results:

Unpaired Proportions

Data option: Cell Frequencies are Given

 

 

Sample 1

Sample 2

Total

 1

 148

 372

 520

 2

 75

 343

 418

Total

 223

 715

 938

 

Odds Ratio and Relative Risks

 

Value

Lower 95%

Upper 95%

Odds Ratio

 1.8195

 1.3290

 2.4911

Exact

 

 1.3152

 2.5283

Relative Risk (Cohort 1)

 1.5863

 1.2401

 2.0290

Relative Risk (Cohort 2)

 0.8718

 0.8126

 0.9353