Suppose you are performing a 2-sample, independent sample t-test, and you use the 2-sided rejection points with α = 0.05. For which of the following situations would the actual α be highest when , ?
The error on a linear combination of means is:
Error decreases with a increase in N, so the only real options are A and D, and for this example:
The 2-sample independent sample t-test makes several statistical assumptions. Which of the following assumptions is most likely to cause severe problems if it is substantially violated?
Suppose you somehow knew that the population standard deviations are 10 in two populations, and that the population distributions are normal. You wish to test the null hypothesis that , using the 2-Sample Z-Test of the form:
What minimum sample size N do you need for each group to insure that power is at least 0.95 if α = 0.05 and ?
Power, in general is a function of several factors:
The two sample z-test is of the form:
The null hypothesis is accepted when:
Power then is:
The minimum sample size then is:
You wish to test the null hypothesis that the population proportion p is less than or equal to .50, using the version of the Z-statistic that incorporates the null hypothesis in the denominator. You obtain a sample proportion of , based on a sample size of 100. In this case:
Obtained Value of the Z-statistic | the null hypothesis is rejected | |
---|---|---|
2.7 | true | |
2.4 | true | |
3.14485 | false | |
3.3 | true | |
✓ | 3.0 | true |
2.4 | false | |
3.0 | false | |
3.14485 | true |
So, unless the rejection point was greater than 3 (Φ(3) ≊ 99.87%), the hypothesis would be rejected. Another verification of this is to construct the 95% confidence interval:
Assuming the distribution is reasonably normal, i.e.:
Then the confidence interval is:
You wish to perform a 2-sample matched sample t-test of equality of means on a group of N = 91 people who were all measured on 2 occasions. Unfortunately, you do not have the raw data, i.e., two columns of scores representing the repeated measurements. However, you do have the mean difference, . You also have the variances at time 1 and time 2, and the covariance between the two columns of numbers. These are , and . Using your knowledge of linear combinations, use this information to compute the variance of the difference scores, , and then compute the matched sample t statistic. In this case, you obtain:
T-Statistic Value | Degrees of Freedom | Critical Value of the t Distribution with α = 0.05 | |
---|---|---|---|
7.69408 | 90 | 1.98667 | |
6.99462 | 90 | 1.66196 | |
0.512871 | 90 | 1.98667 | |
✓ | 6.99462 | 90 | 1.98667 |
6.50581 | 90 | 1.98667 | |
5.4893 | 90 | 1.98667 | |
6.99462 | 91 | 1.98638 | |
6.29516 | 90 | 1.98667 |
In general:
So:
The appropriate t-statistic then is:
has 90 degrees of freedom and a value that is looked up in a t-test table.
You perform an experiment design to test whether a persuasive message can effectively change opinion on a political issue. You measure a group of people on two occasions, and use McNemar’s Z-test to assess the null hypothesis that p1 = p2, where p1 and p2 are the proportions of people who said "yes" to the pollster on the two occasions. The data are summarized in a 2 × 2 table, where n10 is the number of people who said "yes" at time 1 but "no" at time 2, n01 is the number of people who said "no" at time 1 and "yes" at time 2. Suppose we have n01 = 70 and n10 = 44. The absolute value of the Z-statistic is:
You observe a sample correlation of 0.46 based on a sample of N = 60.0 independent observations from a bivariate normal distribution. You test the hypothesis that ρ = 0 using the t-statistic. You calculate:
Value of t | Degrees of Freedom | |
---|---|---|
3.94547 | 59 | |
4.34001 | 59 | |
3.55092 | 58 | |
4.67932 | 58 | |
4.34001 | 58 | |
✓ | 3.94547 | 58 |
3.59432 | 58 | |
3.94547 | 60 |
For H0: ρ = 0 there is a special form of the test:
You test the hypothesis that ρ = 0.5 using the Fisher transform. The sample correlation you observe is r = 0.53. The sample size is N = 56. The Z-statistic value is:
The Fisher transform is simply:
The z-statistic for H0: ρ = a can be computed using:
Suppose you have two independent groups of size N = 200. These groups represent random samples from two populations. If 62 people in group 1 and 149 people in group 2 can perform a behavior, construct a 95% confidence interval on p2 - p1, the population difference in the proportions of people who can perform the behavior. The endpoints of the interval are:
The relevant proportions are:
The confidence interval then is:
Suppose you obtain random samples of N1 = 70 males and N2 = 51 females, and obtain a correlation r1 = 0.54 between two variables of for the male participants, and r2 = .40 for the famale participants. If you test the null hypothesis with the standard Z-statistic, what value should you obtain?
Suppose you obtain a random samples of N = 74 individuals, and obtain a correlation r = 0.53 between two variables. Suppose you construct a 90% confidence interval for the population correlation. What are the endpoints of the confidence interval?
The confidence interval on φ(r) is:
The confidince interval on r then is:
The standard F-test for comparing two variances for equality:
Suppose X has a distribution. What is the variance of X?
For a chi-square distribution, , with ν degrees of freedom:
Suppose you take a sample of size 48, and observe a sample variance of 60. The endpoints for the 95% confidence interval on σ2 are:
The values for χ2 are looked up in a table.
You obtain samples of size N1 = 30 and N2 = 40, and observe sample variances of 127 and 29. Test the null hypothesis H0: using the standard F statistic. The results are:
Test Statistic Value | Degrees of Freedom - 1 | Degrees of Freedom - 2 | Null Hypothesis | |
---|---|---|---|---|
4.37931 | 29 | 39 | not rejected | |
4.37931 | 30 | 40 | not rejected | |
5.37931 | 30 | 40 | not rejected | |
✓ | 4.37931 | 29 | 39 | rejected |
4.37931 | 30 | 40 | rejected | |
3.37931 | 29 | 39 | rejected |
When N = 12, and the sample is independent and random from a normal distribution with standard deviation 41.0, the sample standard deviation S has an expected value of 40.0889. We know that the expected value of S2 is σ2 , i.e., 1681.0. Using this information, and a well known formula for the variance of a random variable, the sampling variance of S is:
Suppose you know the population variance, and it is σ2 = 182. You take 3 samples from the (normally distributed) populations, and all of them are of size N = 11. The sample means are 18.63, 21.82, and 33.27. Compute a chi-square statistic for testing the null hypothesis that all 3 populations have the same mean. The value of the statistic is:
χ2 | Degrees of Freedom | |
---|---|---|
✓ | 7.16427 | 2 |
35.8213 | 3 | |
35.8213 | 2 | |
7.16427 | 3 | |
7.16427 | 30 |
Suppose you sample N = 43 independent observations from a normal distribution, and observe a sample variance of S2 = 220.2. You test the null hypothesis that σ2 = 100 with α =. 05.
Observed Value of the χ2 Statistic | Degrees of Freedom | Critical Value of the Test Statistic | |
---|---|---|---|
94.686 | 42.0 | 58.124 | |
✓ | 92.484 | 42.0 | 61.7768 |
101.732 | 42.0 | 61.7768 | |
92.484 | 42.0 | 59.3035 | |
92.484 | 43.0 | 61.7768 | |
184.968 | 42.0 | 61.7768 |
You have the following data from 3 groups:
Group 1 | Group 2 | Group 3 | |
---|---|---|---|
Mean | 11 | 37 | 48 |
Variance | 257 | 287 | 299 |
Sample Size (N) | 18 | 18 | 18 |
You perform a 1-Way Analysis of variance. You obtain the following results:
F-statistic | Degrees of Freedom (numerator) | Degrees of Freedom (denominator) | Critical Value from the F distribution with α =. 05 | |
---|---|---|---|---|
✓ | 23.1246 | 2 | 51 | 3.17880 |
23.1246 | 2 | 51 | 2.78623 | |
23.1246 | 3 | 51 | 3.17880 | |
27.9807 | 2 | 51 | 3.17880 | |
34.6868 | 2 | 51 | 3.17880 | |
20.8121 | 3 | 51 | 3.17880 | |
34.6868 | 3 | 51 | 3.17880 | |
25.437 | 2 | 51 | 3.17880 |
You observe the following results for 3 independent groups. You compute the 1-Way ANOVA for unequal N. The test is performed with α = .05.
Group 1 | Group 2 | Group 3 |
---|---|---|
4 | 1 | 5 |
11 | 7 | 12 |
7 | 3 | 15 |
3 | 9 | |
16 |
SSbetween | SSwithin | dfbetween | dfwithin | SSbetween | SSwithin | Fobserved | Fcritical | |
---|---|---|---|---|---|---|---|---|
✓ | 139.383 | 124.867 | 2 | 9 | 69.6917 | 13.8741 | 5.02316 | 4.25649 |
139.383 | 129.867 | 2 | 9 | 69.6917 | 13.8741 | 5.52547 | 4.25649 | |
135.383 | 124.867 | 2 | 9 | 69.6917 | 13.8741 | 5.02316 | 4.25649 | |
139.383 | 124.867 | 2 | 9 | 69.6917 | 13.8741 | 5.02316 | 5.71471 | |
139.383 | 124.867 | 2 | 9 | 69.6917 | 8.91905 | 5.02316 | 4.25649 | |
139.383 | 124.867 | 2 | 9 | 69.6917 | 13.8741 | 5.02316 | 8.02152 | |
139.383 | 124.867 | 2 | 9 | 46.4611 | 13.8741 | 5.02316 | 4.25649 | |
139.383 | 124.867 | 2 | 9 | 69.6917 | 13.8741 | 1.11626 | 4.25649 |
Suppose statistic A has a χ2 distribution with 90 degrees of freedom, and statistic B has a χ2 distribution with 93 degrees of freedom. If A and B are independent, then what is the distribution of A/B?