UNISTAT - the ultimate Excel statistics add-in

5.3.6. Box and Whisker and Dot Plots

Box and Whisker and Dot Plots

Multisample data can be entered in the form of multiple columns or data columns classified by factor columns. If at least one factor is selected, then a further dialogue will pop up asking for the combination of the categories on the X-axis. Although an unlimited number of data series can be plotted, properties of only the first nine can be individually controlled from the EditData Series dialogue. The rest of the series will repeat the properties of the first nine in a circular fashion. The Apply to all variables check box allows you to apply the current variable’s settings to all selected variables.

Box and Whisker and Dot Plots

Symbol type, symbol size, colour and Point Labels can be controlled for outlying points on Box and Whisker Plot for each data series individually.

Box and Whisker and Dot Plots

The EditWidth / Notch / Dots dialogue can be used to control the statistical parameters represented on the graph. The three check boxes in the Type panel allows drawing any combination of Box and Whisker Plot, Dot Plot and Error Bar Plot on the same graph. The Confidence although it is also available in the Variable Selection Dialogue. When the Type is changed, the change will apply to all data series.

5.3.6.1. Box and Whisker Plot

Box and Whisker and Dot Plots

A box and whisker plot conveys the following information:

Lower Quartile: Bottom line of the box.

Median: Middle line of the box.

Upper Quartile: Top line of the box.

Lower Whisker: Lower adjacent value. This is equal to the maximum of (i) lower quartile minus 1.5 times the inter-quartile range and (ii) the minimum observation. Any values below this are outliers and are plotted individually.

Upper Whisker: Upper adjacent value. This is equal to the minimum of (i) upper quartile plus 1.5 times the inter-quartile range and (ii) the maximum observation. Any values above this are outliers and are plotted individually.

Box Width: The variable box width conveys information about the size of the sample. See below.

Notch: When there is a notch, it conveys information about the dispersion of data about the median. See below.

On the Width / Notch / Dots dialogue, the first group of controls concerns the Box and Whisker plots.

Box and Whisker and Dot Plots

Width: The width of boxes can be used to convey information about sample sizes:

      Fixed: No information is conveyed.

      Sqr(n): The widths are proportional to the square root of their sample size.

      Log(n): The widths are proportional to the 10 based logarithm of their sample size.

      n: The widths are proportional to their sample size.

Box and Whisker and Dot Plots

Notch: The extent of notches represents the following dispersion measures:

      None: A notch is not drawn.

      t-interval:

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      where Box and Whisker and Dot Plotsis the critical value from t-distribution with n - 1 degrees of freedom.

      Z-interval:

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      Standard Error:

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      Standard Deviation: As above, but with sample standard deviation.

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      Variance: As above, but with sample variance.

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      Robust Confidence Interval: The robust standard error (SE*) is defined as:

           Box and Whisker and Dot Plots

where IQR is the inter-quartile range and n is the sample size. The robust confidence interval is then defined as:

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      where Box and Whisker and Dot Plots is the critical value from the standard normal distribution (see McGill, R., Tukey, J. W. and Larsen, W. A. 1978).

5.3.6.2. Dot Plot

Box and Whisker and Dot Plots

The second group of controls concerns the Dot plots.

Number of Classes: The dot plot is essentially a histogram and this parameter controls the number of classes (the default is 20). The size of dots can be adjusted from the EditData SeriesSymbol panel to obtain the desired appearance.

Centre Dots: When this box is checked, dots will be centred. Otherwise they will be left-justified.

5.3.6.3. Error Bar Plot

Box and Whisker and Dot Plots

Central Tendency and Confidence Interval: The following central tendency measures and their confidence limits can be drawn.

Box and Whisker and Dot Plots

· Mean

·      t-interval

·      Z-interval

·      Standard Error

·      Standard Deviation

·      Variance

·  Geometric Mean

·      t-interval

·      Z-interval

·  Harmonic Mean

·      t-interval

·      Z-interval

·  Median

·      Quartiles

·      95% Quantile

·      Robust Confidence Interval

      When Central Tendency is Mean and one of Standard Error or Standard Deviation options is selected, a dialogue pops up asking for a multiplier.

Box and Whisker and Dot Plots

      Error bars for standard error will then be calculated as:

           Box and Whisker and Dot Plots

           Box and Whisker and Dot Plots

      and for standard deviation:

           Box and Whisker and Dot Plots

            Box and Whisker and Dot Plots

      where k is the multiplier defined by the user.