UNISTAT - the ultimate Excel statistics add-in

6.4.2. Paired Samples

Like the nonparametric tests on unpaired samples of the previous section, the tests in this section are also used to assess the significance of the difference between population distributions of two samples. In this case the two samples are assumed to consist of matched pairs.

In general, a test is run on paired data by selecting two numeric data columns as [Variable]s. When three or more columns are selected the test will be performed on all possible pairs with equal length (see 6.0.3. Tests with Paired Data). The missing values are omitted pairwise.

Despite the section title Paired Samples, it is also possible to select a single column. When only one column is selected, the test is performed against a hypothetical second variable consisting of zeros.

An Output Options Dialogue offering four options is displayed next.

Nonparametric Tests-Paired Samples

6.4.2.1. Wilcoxon Signed Rank Test

This test is used to assess the significance of the difference between population distributions of the two samples consisting of matched pairs. The absolute values of the difference between the pairs are ranked and the rank sums of negative and positive differences are computed. The signed ranks can be displayed using the Table of Ranks option below.

A very small or a very large sum indicates that the two samples do not have similar distributions. The smaller of the two values is selected as the test statistic.

table displaying the number of positive and negative differences, rank sums and mean ranks. One- and two-tailed asymptotic probabilities are reported without and with continuity correction, as well as the one- and two-tailed exact probabilities.

Asymptotic without Continuity Correction: The Z-statistic is defined as:

     Nonparametric Tests-Paired Samples

Asymptotic with Continuity Correction: In this case the Z-statistic is:

     Nonparametric Tests-Paired Samples

where the mean of the Wilcoxon Signed Rank distribution is given as:

     Nonparametric Tests-Paired Samples

and its standard deviation:

     Nonparametric Tests-Paired Samples

R is the smaller sum of the like-signed ranks (the test statistic) and T is the sum of t3 - t where t is the number of ties at a given rank.

If n > 20 then the Z statistic provides a good approximation for the distribution of the test statistic. The exact p‑value is reported for n £ 150 and it is accurate for data sets with or without ties. To change this limit, the following line should be entered and edited in the [Options] section of Documents\Unistat60 \Unistat60.ini file:

WMWMaxExactSize=150

It is also possible to save the complete exact one-tailed cumulative probability distribution of the test statistic by including the following line in the [Options] section of Unistat60.ini:

WMWSaveDist=1

By default, the distribution will be saved to the following file:

..\Documents\Unistat60\WMWExactDist.txt

This file name can be changed by entering and editing the following line in the [Options] section of Documents\Unistat60\Unistat60.ini file:

WMWSaveDistFile=..\Documents\Unistat60\WMWExactDist.txt

Example 1

Example 13.2 on p. 451 from Armitage, P. & G. Berry (1994). The null hypothesis “there is no difference between the effects of the drug and the placebo” is tested.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Drug (C5) and Placebo (C6) as [Variable]s and check only the Wilcoxon Signed Rank Test output option to obtain the following results:

Paired Samples

Wilcoxon Signed Rank Test

For Drug and Placebo

 

 

Cases

Rank Sum

Mean Rank

Negative Differences

 6

 38.0000

 6.3333

Positive Differences

 4

 17.0000

 4.2500

Total

 10

 55.0000

 5.5000

 

Correction for Ties =

 1.5000

 

 

W

Z-Statistic

1-Tail Probability

2-Tail Probability

Asymptotic

 17.0000

-1.0273

 0.1521

 0.3043

Asymptotic with CC

 

-1.0787

 0.1404

 0.2807

Exact

 

 

 0.1611

 0.3223

 

Since the two-tailed probability is far greater than 5% the test result is not significant. Therefore, do not reject the null hypothesis.

section of Documents\Unistat60\Unistat60.ini file, the exact one-tailed cumulative distribution of the rank sum is saved to the WMWExactDist.txt file as follows (shortened):

 


Rank Sum

One Tail Probability

 0

 0.0009765625

 2.5

 0.0048828125

 5

 0.01171875

 6.5

 0.013671875

 7.5

 0.021484375

 8

 0.0224609375

 9

 0.0302734375

 9.5

 0.0322265625

 10

 0.0390625

 10.5

 0.04296875

 11.5

 0.056640625

 12

 0.064453125

 12.5

 0.068359375

 13

 0.076171875


 

 42

 0.931640625

 42.5

 0.935546875

 43

 0.943359375

 43.5

 0.95703125

 44.5

 0.9609375

 45

 0.9677734375

 45.5

 0.9697265625

 46

 0.9775390625

 47

 0.978515625

 47.5

 0.986328125

 48.5

 0.98828125

 50

 0.9951171875

 52.5

 0.9990234375

 55

 1


Example 2

Example 9.4, p. 185 from Zar, J. H. (2010). The null hypothesis “deer hindleg length is the same as foreleg length” is tested.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Hindleg (C7) and Foreleg (C8) as [Variable]s and check only the Wilcoxon Signed Rank Test output option to obtain the following results:

Paired Samples

Wilcoxon Signed Rank Test

For Hindleg and Foreleg

 

 

Cases

Rank Sum

Mean Rank

Negative Differences

 2

 4.0000

 2.0000

Positive Differences

 8

 51.0000

 6.3750

Total

 10

 55.0000

 5.5000

 

Correction for Ties =

 0.7500

 

 

W

Z-Statistic

1-Tail Probability

2-Tail Probability

Asymptotic

 4.0000

-2.3536

 0.0093

 0.0186

Asymptotic with CC

 

-2.4047

 0.0081

 0.0162

Exact

 

 

 0.0059

 0.0117

 

Since the probability is less than 5%, reject the null hypothesis.

6.4.2.2. Hodges-Lehmann Estimator (Paired)

This statistic will estimate the median difference. First, the difference between each pair is computed for n cases. Then the averages of all combinations of differences (also known as Walsh averages) are computed. The n(n + 1)/2 averages are sorted in increasing order and their median (the Hodges-Lehmann estimator or the shift parameter) is found.

and standard deviation of the test statistic are displayed.

The limits of the asymptotic confidence interval are the K*th smallest and the K*th largest difference:

      Nonparametric Tests-Paired Samples

where K* is rounded up to the nearest integer and the mean and standard deviation of the signed rank statistic are as given in the previous section.

The exact confidence interval is also displayed, which is based on the exact distribution of the test statistic. To determine the lower bound of the exact interval (the K*l smallest difference), find K*l such that:

      Nonparametric Tests-Paired Samples

round K*l up to the nearest integer. The upper limit is determined likewise, for:

      Nonparametric Tests-Paired Samples

For the unpaired case of this test see 6.4.1.2. Hodges-Lehmann Estimator (Unpaired).

Example 1

Example 13.2 on p. 452 from Armitage, P. & G. Berry (1994). An estimate of the median difference is required.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Drug (C5) and Placebo (C6) as [Variable]s and check only the Hodges-Lehmann Estimator (Paired) output option to obtain the following results:

Paired Samples

Hodges-Lehmann Estimator (Paired)

For Drug and Placebo

 

 

Minimum

Maximum

Mean

Standard Deviation

Rank Sum

 0.0000

 55.0000

 27.5000

 9.7340

 

 

K*

Median of Walsh Differences

Lower 95%

Upper 95%

Normal Approximation

 9

-1.0000

-4.5000

 1.0000

Exact

 

 

-4.5000

 1.0000

 

Example 2

Example on pp. 76-77, Gardner M. J., Altman, D. G. (1989). Beta endorphin concentrations in subjects before and after running in a half marathon are measured. We would like to estimate the sample median for the pairwise averages between differences and the 95% confidence intervals.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select After (C9) and Before (C10) as [Variable]s and check only the Hodges-Lehmann Estimator (Paired) output option to obtain the following results:

Paired Samples

Hodges-Lehmann Estimator (Paired)

For After and Before

 

 

Minimum

Maximum

Mean

Standard Deviation

Rank Sum

 0.0000

 66.0000

 33.0000

 11.2472

 

 

K*

Median of Walsh Differences

Lower 95%

Upper 95%

Normal Approximation

 11

 18.8250

 11.9000

 25.1000

Exact

 

 

 11.9000

 25.1000

 

6.4.2.3. Sign Test

This is a weaker version of Wilcoxon Signed Rank Test. The negative and positive differences are counted and the ties are ignored. Since the probability that either sum exceeds the other is 0.5, it is equivalent to a Binomial Test with p = 0.5.

The exact probability is calculated from the binomial distribution. The asymptotic probability is based on a normal approximation:

     Nonparametric Tests-Paired Samples

where np and nn are the numbers of positive and negative differences respectively. In both cases a two-tailed probability is reported. The output consists of the number of negative and positive differences, number of ties, test statistic and the exact binomial and asymptotic two-tailed probabilities.

Example 1

Example 13.1 on p. 450 from Armitage, P. & G. Berry (1994). The null hypothesis “there is no difference between the effects of the drug and the placebo” is tested.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Drug (C5) and Placebo (C6) as [Variable]s and check only the Sign Test output option to obtain the following results:

Paired Samples

Sign Test

For Drug and Placebo

 

 

Cases

Negative Differences

 6

Positive Differences

 4

Ties

 0

Total

 10

 

 

Value

Z-Statistic

1-Tail Probability

2-Tail Probability

Asymptotic

 6.0000

 0.3162

 0.3759

 0.7518

Exact

 

 

 0.3770

 0.7539

 

Example 2

Example 24.10, p. 538 from Zar, J. H. (2010). The null hypothesis is “deer hindleg length is the same as foreleg length”.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Hindleg (C7) and Foreleg (C8) as [Variable]s and check only the Sign Test output option to obtain the following results:

Paired Samples

Sign Test

For Hindleg and Foreleg

 

 

Cases

Negative Differences

 2

Positive Differences

 8

Ties

 0

Total

 10

 

 

Value

Z-Statistic

1-Tail Probability

2-Tail Probability

Asymptotic

 8.0000

 1.5811

 0.0569

 0.1138

Exact

 

 

 0.0547

 0.1094

 

Since the two-tailed exact binomial probability is greater than 5%, do not reject the null hypothesis.

6.4.2.4. Table of Ranks

When this output option is selected, a table will be formed displaying the two columns, their differences, the signed rank of their absolute value and the differences ordered in ascending order. The signed ranks are the intermediate values used in Wilcoxon Signed Rank Test.

The last column, ordered difference, can be used to run a Walsh Test. This test is used to determine whether two samples have been drawn from symmetrically distributed populations. It is assumed that the distributions are continuous. The test can be performed meaningfully only on small data sets with n £ 15.

First the signed difference for each matched pair is computed and then differences are ranked in increasing size. The null hypothesis is that “the average of differences is equal to zero” against the alternative hypothesis that “the population mean is other than zero”. Output displays the two data columns, their differences and the ranked difference. For probability values, tables for the Walsh Test must be consulted.

Example

Table 99 on p. 248 from Cohen, L. & M. Holliday (1983). With and without practice errors in a manual dexterity selection test are given for 11 candidates.

Open NONPAR12 and select Statistics 1Nonparametric Tests (1-2 Samples) → Paired Samples. Select Without (C11) and With (C12) as [Variable]s and check only the Table of Ranks output option to obtain the following results:

Paired Samples

Table of Ranks

For Without and With

 

Row

Without

With

Difference

Signed Rank

Ordered Difference

1

 11.0000

 6.0000

 5.0000

 7.5000

-1.0000

2

 4.0000

 2.0000

 2.0000

 3.0000

 0.0000

3

 5.0000

 4.0000

 1.0000

 1.5000

 1.0000

4

 9.0000

 3.0000

 6.0000

 9.5000

 2.0000

5

 5.0000

 5.0000

 0.0000

 0.0000

 3.0000

6

 13.0000

 7.0000

 6.0000

 9.5000

 4.0000

7

 5.0000

 6.0000

-1.0000

-1.5000

 4.0000

8

 7.0000

 3.0000

 4.0000

 5.5000

 5.0000

9

 8.0000

 4.0000

 4.0000

 5.5000

 5.0000

10

 10.0000

 7.0000

 3.0000

 4.0000

 6.0000

11

 12.0000

 7.0000

 5.0000

 7.5000

 6.0000

 

Consult tables for critical values of the Walsh Test with n = 11. We see from the table that a two-tailed test with n =11 is significant at the 5.6% level if:

max[d7, ½(d5+d11)] < 0 or min[d5, ½(d1+d7)] > 0

In this example:

max[4, ½(3+6)] < 0 or min[3, ½(-1+4)] > 0

max[4, 4½] < 0 or min[3, 2½] > 0

Since 3 > 0, this result is significant at the 5.6% level. Hence reject the null hypothesis that “manual dexterity does not change with practice”.