2, ANOVA
Our two hypotheses have special names: the null hypothesis and the alternative hypothesis. Historically, the null (invalid, void, amounting to nothing) hypothesis was what the researcher hoped to reject. However, these days it is common practice not to associate any special meaning to which hypothesis is which. The null hypothesis is represented by H0 and the alternative hypothesis by Ha. Although simple hypotheses would be easiest to test, it is much more common to have one of each type or for both to be composite. If the values specified by Ha are all on one side of the value specified by H0, then we have a one-sided test (one-tailed), whereas if the Ha values lie on both sides of H0, then we have a two-sided test (two tailed).
The outcome of our test regarding the population parameter will be that we either reject the null hypothesis or fail to reject the null hypothesis. It is now considered poor form to "accept" the null hypothesis, although if we fail to reject it, that is in fact essentially what we are doing.
| Reject\Truth | H0 True | Ha True |
|---|---|---|
| Reject Ha | no error | False positive, Type II, beta=P(Reject Ha|Ha true) |
| Reject H0 | False negative, Type I, alpha=P(Reject H0|H0 true) | no error |
The term false positive for type II errors comes from perhaps a blood test where the test results came back positive, but it is not the case (false) that the person has whatever was being tested for. The term false negative for type I errors then would mean that the person does indeed have whatever was being tested for, but the test didn't find it. When testing for pregnancy, AIDS, or other medical conditions, both types of errors can be a very serious matter. Formally, alpha=P(Accept Ha|H0 true), meaning the probability that we "accepted" Ha when in fact H0 was true. This meaning for alpha is very similar to that encountered earlier and is often called the level of significance. Alpha and beta usually cannot both be minimized---there is a trade-off between the two. Historically, a fixed level of significance was selected (alpha=0.05 for the social sciences and alpha=0.01 or alpha=0.001 for the natural sciences, for instance). This was due to the fact that the null hypothesis was considered the "current theory" and the size of Type I errors was much more important than that of Type II errors. Now both are usually considered together when determining an adequately sized sample. Instead of testing against a fixed level of alpha, now a P-value is often reported.
| The P-value of a test is the probability that the test statistic would take a value as extreme or more extreme than that actually observed, assuming H0 is true. |
Obviously, the smaller the P-value, the stronger the evidence (higher significance, smaller alpha) provided by the data is against H0.
It is easy to misspeak power (1-beta) and P-value (alpha).
The
2 distribution is
characterized by one parameter called the degrees of freedom which is
often denoted by v (the greek letter nu) and used as a subscript:
2v.
2.
Although he wasn't able to prove this mathematically, he
demonstrated it by dividing a prison population of 3000 into
750 random samples of size four and used their heights.A common application of the chi-square statistics is in a test for goodness of fit as described in the homework. It is also use for tests of indepedence. Chi-square contingency tables are often formed and a contingency coefficient may also be used, especially when working with nonparametric measurements.
When comparing standard deviations the test is called analysis of variance or more commonly by its acronym ANOVA. The ANOVA F allows us to compare sevaral means, not just two as was done earlier with the t statistic.
Since we have use the term F several times it now
behooves us to look at the underlying F distribution.
The F distribution is named in honor of R. A. Fisher who
first studied it in 1924. (As you can see by this date and Gauss's work,
Statistics really only recently developed.) Specifically, the F
distribution compares the variance of two normal populations. If
12=
22,
then we expect s12 - s22
to be distributed about zero or equivalently the ratio
s12/s22
to be close to 1.0. However, this will depend on both sample sizes,
or more precisely, on the degrees of freedom.
|
The ratio of the variances of two independent random samples
taken from normal parent populations with equal variances has an F-distribution characterized by the degrees of freedom: v1=n1-1 and v2=n2-1 |
| T. OF CONTENTS | HOMEWORK | SOLUTIONS | ACTIVITY |
|---|