![]() |
Brighton Webs Ltd. statistical and data services for industry |
|
Home Index Feedback |
Hypothesis Testing The reason we mess with statistics is to help make decisions. Experience teaches us that many aspects of day-to-day life are subject to random variations. For example, on most days the bus to work arrives within a few minutes of the advertised time (buses can be early as well as late, however, most people only notice when its late). There will be days when the bus appears to be very late. Most people will form an opinion if the late arrival of the bus is typical, in which case they will do something else, e.g. catch an earlier bus, walk to work, become a monk etc.. However, if the late arrival is judged to be exceptional, they will keep catching the the bus each day. Hypothesis testing is simply the formalisation of this process. In practice, hypothesis tests are reduced to testing to see if an observed value is either less than or greater than a critical value. Students are expected to appreciate the origins of this process which is based on two mutually exclusive hypotheses, the Null Hypothesis and the Alternative Hypothesis.
The key skill is designing a test which allows the acceptance or rejection of the Null hypothesis. Each case will have its own test, what works for a week's experience of hanging round bus shelters may not be suitable determining if a pharmaceutical product has any therapeutic value. Level of Significance The outcome of a significance test will be influenced by the threshold value used to test against. Example 1 - Population Mean and standard deviation known, Is a sample mean equal to the population mean. The average weight of a apples from an orchard is 200gm with a standard deviation of 24gm.. The average weight of nine apples from a shady part of the orchard is 175gm, we want to know if the average weight of the sample apples is less than that for the whole orchard. The Hypothesises are:
The central limit theorem tells us that the sample mean will be normally distributed with a standard deviation of:
Thus the 5% confidence limits for the sample are:
Thus as the population mean is greater than the upper bound of the confidence interval, we reject the null hypothesis. Thus the average weight of the sample is less than that for the whole orchard. Example 2 - Is a Single value typical of a Population The example is based on winter rainfall in southern England. This was reportedly low for the winter of 05/06. The histogram shows data for the period 1959 to 2005. The dataset is a left skewed distribution which suggests a modal value of 325mm. The 90% confidence interval is 250mm to 550mm. The value for the winter of 05/06 is approximately 260mm. The null hypothesis is that the experience of 05/06 due to chance (i.e. rainfall is subject to random variation) and the alternative hypothesis is that 05/06 was an exceptional winter. The Null hypothesis is accepted because the 05/06 value is within the 90% confidence interval. This means that whilst the winter of 05/06 is drier than common experience (i.e. its less than the modal value), it is not exceptional in the context of a 90% confidence interval. Page updated: 13-May-2006 |
|
For more information: info@brighton-webs.co.uk |