N Sample Size The Variances example essay topic
To do draw a conclusion, we must first assume normal distribution. We must also set the null hypothesis to m 1 - m 2 = 0. And per this assignment we must set the a-level at. 05 and the hypothesis alternative to m 1 - m 2 ^1 0; thus requiring a two-tailed test. The random samples have a mean of 11.6 days absent for the smoker and 8.76 days absent for the non-smoker. All of my calculations were done using the data analysis tool in Excel but can be done manually with given equations: Sample Mean : n = sample size The variances of each sample are 37.33333333 for the smoking population and 18.94 for the non-smoking population.
Their standard deviations are 6.110100927 and 4.352011029 respectively. Manually: Sample Variance (s 2): Std Dev (s): Next we must determine if the population variances of these two samples are equal at an a-level of. 05. Based on the data and the assumption that the populations are normal, the variances are equal though barely (Fc = 1.983757159 and the F-test = 1.971136824). (Note if this was tested at a different a-level, we may not of accepted the variances as equal and would use a different equation to test the hypothesis). With the use of the F-table, these variances can be tested for equality manually also: To determine if Variances are equal : n 1 = n 1 - 1 Use F table: n 2 = n 2 - 1 Reject Region (Fc): F Fa, n 1, n 2 Test Statistic: Since the variances were tested and found equal at the a-level of.
05. Then we must use the pooled variances (the weighted average of the two sample variances) to calculate the test statistic. Excel did this calculation for me: 28.136666667. It can be done manually using this equation: Pooled variance = Now we perform the t-test under the assumption that is normally distributed. Again, this can be done manually: tc = ta/2, n 1+n 2-2 Utilizing the data analysis tool in Excel, I found the t-statistic for these sample means is 1.892940764 and the tc for a two-tailed test is +/- 2.01063358. Since the t-statistic falls within this range we accept the null hypothesis that the means of the samples are equal.
Based on these samples, the data does not offer evidence of differences in absentees in the workplace between smokers and non-smokers. I would like to note though that the test's conclusion very much depended on the a-significance level. If we had set the hypothesis alternative to m 1 - m 2 0 and did a one-tailed test, the tc would be 1.677224191. Since the Reject Region is t tc, in this instance, we would reject the null hypothesis and thus conclude that the mean of absentees in smoking workers is greater than the mean of absentees in non-smoking workers. I feel the results of this test should not be used as a basis for any decisions or policies. Further studies with greater number of samples would be needed to assess the true means of the populations..