UNISTAT - the ultimate Excel statistics add-in

5.3.1. Histogram

This procedure accepts ungrouped data, data with frequency counts, or grouped data (see 5.0.2. One-Sample Data Types). Frequency and cumulative histograms can be drawn for string variables, and histograms can have irregular class widths.

Using the Edit → Bars dialogue, you can choose to display on the X-axis either the midpoints or the lower and upper limits of each class.

5.3.1.1. Regular and Irregular Class Intervals

After variable selection, a second dialogue allows you to edit the number of classes suggested by the program and to choose between regular and irregular class intervals. At this stage, the program will already have calculated default values for the lower and upper bounds and the class interval.

Regular Class Intervals: If this (default) option is selected, the program proceeds to draw the graph. The lower bound, upper bound and class interval can be edited afterwards in the Edit → Axes dialogue. A warning is issued if the lower limit is higher than the minimum observation, if the upper limit is lower than the maximum observation, or if more than 200 classes are generated. In such cases the program still plots a histogram. If a wider class interval is entered, the program does not rescale the Y-axis to accommodate the higher bars; the axis can be rescaled manually.
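The three warning conditions described above can be illustrated with a short Python sketch. This is not UNISTAT code: the function name `regular_classes`, the warning texts and the sample observations are hypothetical, and the bounds are taken from the sample Frequency Table in 5.3.1.2.

```python
import math

MAX_CLASSES = 200  # class-count limit quoted in the text


def regular_classes(data, lower, upper, width):
    """Return (edges, warnings) for equal-width classes from lower to upper."""
    n_classes = math.ceil((upper - lower) / width)
    edges = [lower + i * width for i in range(n_classes + 1)]
    warnings = []
    if lower > min(data):
        warnings.append("lower limit is above the minimum observation")
    if upper < max(data):
        warnings.append("upper limit is below the maximum observation")
    if n_classes > MAX_CLASSES:
        warnings.append("more than 200 classes generated")
    # As in the text, warnings do not stop the histogram from being built.
    return edges, warnings


# Hypothetical observations; bounds 440 to 1160 with a class interval of 80.
sample = [455, 610, 722, 805, 1150]
edges, warnings = regular_classes(sample, lower=440, upper=1160, width=80)
```

With these bounds the sketch produces nine classes and no warnings; narrowing the bounds to 500 and 1000 triggers the first two warnings.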

Irregular Class Intervals: If this option is selected then the program will open a new dialogue to allow you to edit the suggested class intervals.

The dialogue contains a vertical scroll bar to edit up to 200 lower limits and the upper limit for the last class. Changes to the number of classes should be made before entering these values. The program will not proceed until a valid selection is made for all classes.
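The manual does not spell out what makes a selection valid. The Python sketch below assumes that lower limits must be strictly increasing, that the upper limit of the last class must exceed its lower limit, and that each class is the half-open interval from its lower limit up to (but excluding) the next one. All names are hypothetical.

```python
from bisect import bisect_right

MAX_CLASSES = 200  # maximum number of lower limits quoted in the text


def validate_limits(lower_limits, last_upper):
    """Return a list of problems; an empty list means the selection is valid."""
    problems = []
    if not 1 <= len(lower_limits) <= MAX_CLASSES:
        problems.append("number of classes must be between 1 and 200")
    for previous, current in zip(lower_limits, lower_limits[1:]):
        if current <= previous:
            problems.append(f"lower limit {current} does not exceed {previous}")
    if lower_limits and last_upper <= lower_limits[-1]:
        problems.append("upper limit of the last class must exceed its lower limit")
    return problems


def count_irregular(data, lower_limits, last_upper):
    """Count observations per class, assuming classes of the form [lower, next lower)."""
    counts = [0] * len(lower_limits)
    for x in data:
        if lower_limits[0] <= x < last_upper:
            counts[bisect_right(lower_limits, x) - 1] += 1
    return counts


# Hypothetical limits: four classes of unequal width.
limits, upper = [0, 10, 50, 100], 500
problems = validate_limits(limits, upper)
counts = count_irregular([5, 10, 49, 50, 99, 100, 499, 500], limits, upper)
```

Because irregular classes differ in width, raw counts are not directly comparable between bars; how bar heights are scaled in that case is not covered by this sketch.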

5.3.1.2. Histogram Output Options

This version of UNISTAT introduces an Output Options Dialogue for the Histogram and 3D Histogram procedures. The Summary Information option produces three tables: frequencies, statistics and fitted distribution parameters (if any). As in other output options (see 2.1.5. Output Options Dialogue), when you click on the [Finish] button, the summary information and the histogram are sent to the Output Medium with default options. If you want to edit the properties of the histogram, you can send it to the Graphics Editor by clicking on the [Opt] button situated to the left of the Draw Chart check box.

The Summary Information option will produce output in the following format:

Frequency Table

Room Averages    Frequency
440 - 520                7
520 - 600               28
600 - 680               36
680 - 760               39
760 - 840               30
840 - 920               11
920 - 1000               8
1000 - 1080              6
1080 - 1160              3
Total                  168

 

Descriptive Statistics

                  Room Averages
Mean                   722.2976
Median                 709.5000
Mode                   720.0000
Lower Quartile         612.0000
Upper Quartile         805.5000

 

Distribution Fitting

                     Parameter 1    Parameter 2    Parameter 3
Normal                  722.2976       142.6569
Student's t             722.2976       142.6569         2.0001
Gamma                    25.6358         0.0355
Erlang                   26.0000         0.0360
Discrete Uniform        168.0000         0.0000
Negative Binomial       722.2976        26.5791
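The Gamma, Erlang and Negative Binomial rows above are numerically consistent with method-of-moments estimates computed from the mean and standard deviation shown in the Normal row. The Python sketch below checks this arithmetic. It is an observation about the sample output only, not a statement of how UNISTAT estimates parameters (see Appendix for that), and the variable names are illustrative.

```python
# Values copied from the Normal row of the Distribution Fitting table.
mean, sd = 722.2976, 142.6569
var = sd ** 2

# Gamma by moments: shape = mean^2 / variance, rate = mean / variance.
gamma_shape = mean ** 2 / var           # table shows 25.6358
gamma_rate = mean / var                 # table shows 0.0355

# Erlang: the Gamma shape rounded to an integer, with the rate chosen to match the mean.
erlang_shape = round(gamma_shape)       # table shows 26.0000
erlang_rate = erlang_shape / mean       # table shows 0.0360

# Negative binomial by moments: size = mean^2 / (variance - mean).
negbin_size = mean ** 2 / (var - mean)  # table shows 26.5791

for name, value in [("gamma shape", gamma_shape), ("gamma rate", gamma_rate),
                    ("erlang shape", erlang_shape), ("erlang rate", erlang_rate),
                    ("negative binomial size", negbin_size)]:
    print(f"{name}: {value:.4f}")
```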

 

 

If the Class Intervals box is unchecked in the Edit → Bars dialogue (i.e. if midpoints are displayed), then the Frequency Table will look as follows:

Frequency Table

Room Averages    Frequency
480                      7
560                     28
640                     36
720                     39
800                     30
880                     11
960                      8
1040                     6
1120                     3
Total                  168
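Each midpoint label in this table is the average of the lower and upper limits of the corresponding class in the first Frequency Table. A minimal Python sketch of that conversion follows; the helper name `midpoint_label` is hypothetical.

```python
def midpoint_label(interval_label):
    """Convert a label such as '440 - 520' into its midpoint label '480'."""
    lower, upper = (float(part) for part in interval_label.split(" - "))
    return f"{(lower + upper) / 2:g}"


# Class labels copied from the first Frequency Table.
interval_labels = ["440 - 520", "520 - 600", "600 - 680", "680 - 760", "760 - 840",
                   "840 - 920", "920 - 1000", "1000 - 1080", "1080 - 1160"]
midpoint_labels = [midpoint_label(label) for label in interval_labels]
```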

5.3.1.3. Fitting Distribution Functions

When a histogram is displayed with default options, the program will already have fitted eighteen Distribution Functions (except for the negative binomial distribution) to the data. Up to six of these can be displayed simultaneously by means of the Edit → Distributions dialogue. The type, parameters and appearance of these Distribution Functions can be controlled by the user.

The Edit → Distributions dialogue features a Distribution and Parameters group at the bottom, containing a drop-down list of all supported distributions. When a distribution is selected from this list, up to three text fields are displayed immediately to the right of the list. These fields contain the estimated parameters for that distribution function (see Appendix). For instance, for the normal distribution two fields display the estimated mean and standard deviation, while for the t-distribution a third field displays the estimated degrees of freedom. A parameter which cannot be estimated is assigned the value -99. You can edit the value in each parameter field. For each distribution function you can also select the line style, thickness, colour, symbols, etc.

Any combination of continuous and discrete distribution functions can be selected for up to six distributions. The same distribution can be selected more than once. This may be useful for displaying one or more theoretical curves of the same distribution with different parameters against the fitted parameters.

Distributions in the drop-down list are in the same order as in the Distribution Functions dialogue (see 5.2.1. Cumulative Probability). The hypergeometric distribution, for which the estimated frequencies procedure is not implemented, is excluded. It is also possible to plot Distribution Functions without fitting them to a frequency histogram, by means of the Plot of Distribution Functions procedure.

Colour: This controls the colour of the fitted curves.

Symbol: The usual symbol selection group can be used to display Symbols for discrete distributions. When a selection is made other than None for a discrete distribution, a symbol will be drawn on the line at each distinct value of the X-axis variable.

Plot Frequency: This control determines the resolution of fitted distribution curves. The default value of 10 means that the functions are evaluated at every 10th pixel. The minimum value is 1, in which case the functions are evaluated at every pixel. This takes about 10 times longer to compute than the default, and the individual curves may be harder to distinguish.
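The effect of the Plot Frequency setting on the amount of computation can be sketched in Python as follows. The plot width of 800 pixels, the X-axis range and the function names are assumptions made for illustration; the normal parameters are those from the sample output in 5.3.1.2.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of the normal distribution at x."""
    return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))


def sample_curve(x_min, x_max, width_px, plot_frequency, mean, sd):
    """Evaluate the density at every plot_frequency-th pixel column."""
    points = []
    for px in range(0, width_px, plot_frequency):
        # Map the pixel column linearly onto the X-axis range.
        x = x_min + (x_max - x_min) * px / (width_px - 1)
        points.append((px, normal_pdf(x, mean, sd)))
    return points


# Hypothetical 800-pixel-wide plot area covering 440 to 1160 on the X-axis.
coarse = sample_curve(440, 1160, 800, 10, 722.2976, 142.6569)  # default setting
fine = sample_curve(440, 1160, 800, 1, 722.2976, 142.6569)     # every pixel
```

The setting of 1 requires ten times as many function evaluations as the default of 10, which is where the longer computing time comes from.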

5.3.1.4. Bars

This dialogue provides controls for editing aspects of the histogram bars.

Function: The available options are (i) Frequency and (ii) Cumulative. Distributions can be fitted in either case.

Bar Fill Style: Bars can be filled with solid colours or with one of the six cross-hatch patterns provided.

Bar Colour: This controls the colour of the histogram bars.

Mean / Median / Mode: For numeric variables, this will draw a vertical line for each statistic along the X-axis. For string variables only the mode is drawn.

Quartiles: For numeric variables, vertical lines are drawn along the X-axis at the 25% and 75% quantiles.

Class Intervals: X-axis tick marks and their corresponding value labels can be drawn either in the middle of a class or at its lower and upper boundaries. This option is available only for histograms with regular class intervals. Irregular histograms always display class intervals. If the selected column contains String Data, tick marks are always drawn at midpoints.
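The difference between the Frequency and Cumulative settings of the Function option can be illustrated with the sample Frequency Table from 5.3.1.2. In this Python sketch, which is not UNISTAT code, each cumulative bar height is the running total of the class frequencies up to and including that class.

```python
from itertools import accumulate

# Class frequencies copied from the sample Frequency Table.
frequencies = [7, 28, 36, 39, 30, 11, 8, 6, 3]

# Running totals: the last value equals the Total row of the table.
cumulative = list(accumulate(frequencies))
print(cumulative)
```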