![]() |
Brighton Webs Ltd. statistical and data services for industry |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Home Index Feedback |
Comparison of Two Sample Means - StDev of Population Unknown A common problem facing researchers is to determine if two samples are from the same population or represent two separate populations and the only data available is the two samples. An example of this type of problem is salinity data from two adjacent groups of boreholes, if the mean salinity from the two groups is the same, there is an inference that all the boreholes have penetrated the same reservoir (the null hypothesis is that variation is due to random fluctuations), however, if the means are different, then it maybe that each group of wells is in different reservoirs (the alternative hypothesis is that the difference in means is not due to chance). Hypothesies and t statistic It is not the intention of this page to work through the underlying maths, but the process is based on the null hypothesis: Which can be rearranged to: And this leads to the t statistic: If the null hypothesis is accepted, t will follow a t distribution with: Estimation of Standard Deviation The unbiased estimate of the population's standard deviation is obtained by using data from both samples: Sample Data The sample data consists of a pair of datasets, each with two samples. Set 1 is composed of two samples both drawn from a population with a mean of 10.00 and standard deviation of 0.5. For Set 2, sample A is the same as sample A in Set1, however, Sample B is drawn from a population with a mean of 11.00 and standard deviation of 0.5. Without resorting to any form of statistical analysis, knowledge of the composition of the data suggests that the samples from Set 1 will have similar means and that those of Set 2 will be significantly different. Example Some intermediate values are missing from the calculation below, however, they can easily be recreated using Excel if required. Values are shown to 2 d.p. thus rounding errors may be present.
For Set 1, the null hypothesis is accepted (i.e. the difference in the means of the samples is due to chance) because the value of t is less than the critical value (t with 16 degrees of freedom). However, it is rejected for Set 2 (i.e. the difference between the means is greater than would be expected from chance alone) because the value of t is outside the critical values. Page updated: 26-Sep-2006 |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
For more information: info@brighton-webs.co.uk |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||