|
A frequency table lists in one column the data categories
or classes and in another column the corresponding frequencies. |
| Grade | Frequency |
|---|---|
| 9 (freshmen) | 30 |
| 10 (sophomores) | 26 |
| 11 (juniors) | 28 |
| 12 (seniors) | 20 |
| Test Score | Frequency |
|---|---|
| 0 - 19 | 2 |
| 20 - 39 | 11 |
| 40 - 59 | 9 |
| 60 - 79 | 11 |
| 80 - 99 | 8 |
| 100-119 | 7 |
| 120-139 | 2 |
Often, the category column will have continuous data and hence be presented via a range of values. In such a case, terms used to identify the class limits, class boundaries, class widths, and class marks must be well understood. For the following examples, use the data above right (1998 Algebra Diagnostic score distribution).
| Class limits are the largest or smallest numbers which can actually belong to each class. |
For this example, the class limits are as displayed above in the left table column. For the largest class they are 120 and 139. Each class has a lower class limit and an upper class limit.
|
Class boundaries are the numbers which separate classes. They are equally spaced halfway between neighboring class limits. |
For this example, the boundaries would be -0.5, 19.5, 39.5, 59.5, 79.5, 99.5, 119.5, and 139.5. Note that 19.49999... is another name for and identical with 19.50000....
| Class marks are the midpoints of the classes. |
For this example, the class marks are 9.5, 29.5, 49.5, .... It may be necessary to utilize class marks to find the mean and standard deviation, etc. of data summarized in a frequency table.
| Class width is the difference between two class boundaries (or corresponding class limits). |
For this example, the class width is 20.0. Following are guidelines for constructing frequency tables.
If your limits are not immediately obvious based on the data, try to find an appropriate width by rounding up the range divided by the number of classes. Your lower limit should be either the lowest score, or a convenient value slightly less. Avoid irrelevant decimal places. Large data sets justify having more classes. One published guide is: number of classes = 1 + log2n. This gives you 5 classes for small data sets of 12 to 22 elements and 10 classes for larger data sets of 362 to 724 elements. The seven classes used above for 50 elements is right on target. It is not uncommon to omit empty classesbe alert for such guideline violations! Omitted classes do not change the class width, but can be a real source of confusion!
| Relative freqency tables contain the relative frequency instead of absolute frequency. |
Relative frequencies can be expressed either as percentages or their decimal fraction equivalents.
| Cumulative frequency tables contain frequencies which are cumulative for subsequent classes. |
In a cumulative frequency table, the words less than usually also appear in the left column.
|
|
|
| A relative frequency histogram has the same shape and horizontal scale as a histogram, but the vertical scale is now the relative frequency. |
| A Pareto chart is a bar graph for qualitative data. |
The bars in a pareto chart should be arranged in descending order of frequency, from left to right.
| Frequency polygons are similar to histograms, but use line segments to connect the points. |
When construction a frequency polygon, the class marks should be used on the horizontal scale. The graph should also be extended to the left and right so that it begins and ends with a frequency of 0.
| Cumulative frequency polygons, also known as ogives, are also commonly encountered. |
A pictograph depicts data by using pictures of an object, such as coins, money bags, airplanes, etc. Those which use multiple objects the same size are ok. Those which use similar objects, scaled linearly to represent data, can easily distort things. There may be many other variations, but those listed above are most common.
| 4 | | | 23667899 |
| 5 | | | 0111112244444555566677778 |
| 6 | | | 0111244589 |
Please note that the separation line should be continuous, but time constraints limited accomplishing that feat. The following rules should be observed when constructing stem-and-leaf diagrams.
| 4 | | | 23 |
| 4 | | | 667899 |
| 5 | | | 0111112244444 |
| 5 | | | 555566677778 |
| 6 | | | 0111244 |
| 6 | | | 589 |
|
|
|
Please note that you can press TRACE and obtain the 5-number summary of: 42, 51?, 55.5, 58?, 69. The whiskers extend from either 1.5 inner quartile range above and below the quartiles or from the minimum to maximum values. The former is terms a modified box plot and will have outliers individually plotted via a symbol of your choice. They also can be traced. You may want to try the data given in the previous lesson illustrating outliers.
| BACK | HOMEWORK | ACTIVITY | CONTINUE |
|---|