Compute the sample mean for the following 5 numbers: 1, 4, 2, 3, 4
Compute the sample variance for the following 5 numbers: 3, 4, 2, 4, 5
Suppose you have 10 numbers and have computed the mean to be 6.0. You then discover that the last number in the data was entered incorrectly. It was entered as 9.0 when it should have been 8.0. If you replace the incorrect value (9.0) with the correct one (8.0), and recompute the mean, you will obtain a new mean of:
A series of numbers that will give the original mean are:
The reason for this is to get the average of and to equal .
from this set can then be replaced with to compute the new average.
Suppose two different scores, each with a minimum of zero and a maximum possible value of 25, are summed to produce a "total raw admission score" that is used to determine university admissions. Suppose that score A has a mean of and a standard deviation of , and score B has a mean of and a standard deviation of . Joseph has a Z-score of 1 on Score A and a Z-score of -1 on Score B, while Marilyn has a Z-score of -1 on Score A and a Z-score of 1 on Score B. Which student will have the higher total raw admission score?
You have 4 numbers: a, b, c, d. If you multiply them all by 4, the mean of the resulting data set will be
In general, if:
Then:
In this particular situation, , so they cancel.
You have 10 numbers with a sample mean of 9.0 and a sample variance of 11.0. You discover that the last number in the list was recorded as 10.0 when it should have been recorded as 14.0. If you correct your error and correctly recompute the sample variance, what value will you obtain?
Consider the following data array:
5 | 3 | 1 | 2 |
5 | 9 | 9 | 8 |
5 | 7 | 15 | 2 |
4 | 7 | 6 | 1 |
Compute:
For square matrices, indices range from 1 to ni and 1 to nj and:
You have a sample of N observations, and you transform them to Z-scores. Assuming the original scores are not all equal, the quantity
will always be equal to:
This derivation relies on a transformation of the variance when dealing with z-scores: (Recall that and .)
You have the following set of data: 1,5,3,7,8. Find the median.
For the median, the elements must be in ascending or descending order, so the set is 1,3,5,7,8. Since the size of the set is odd, the median is simply the middle value.
Which of the following is not true of the median?
Outliers can affect the mean, but the median only takes the number of values into account and the central value.
Widgets are weighed on a scale that is accurate to the nearest 100 grams. A group of 50 widgets is weighed. The heaviest widget is recorded as 9000 grams, the lightest at 4000 grams. The inclusive range is the difference between the upper real limit and lower real limit for the data. What is the inclusive range for these data?
If a widget weighs more than 3950 grams, it will register as 4000. Similiarly, if it weighs less than 9050 it will register as 9000, so the range for x is 3950 < x < 9050.
You have a set of data that have a mean of 58 and a standard deviation of 12. You wish them to have a mean of 67 and a standard deviation of 11, while retaining the shape of the present distribution. What values of a and b in the linear transformation formula Y = aX + b will produce a new set of data with the desired mean and standard deviation?
The scale factor will be:
The shift is found by scaling the mean and comparing it to the needed mean:
Suppose you have evenly spaced numbers, and the distance between adjacent numbers is k. It is well known that, if N = 3, the sample mean of the 3 numbers is the middle value and their sample variance is . Suppose N = 5. Find an expression for the sample variance of the 5 numbers.
First, consider the sum of a sequence of numbers:
The same basic logic produces:
This can be used in the expansion of n elements evenly spaced by k units:
And again, through a similar process:
Consider then the mean:
The variance then is:
ToDo: Finish this derivation.
The probability distribution shown below is
For the following data array, compute .
5 | 4 | 1 | 2 |
5 | 3 | 4 | 9 |
5 | 2 | 20 | 1 |
4 | 4 | 8 | 2 |
For the following data, compute .
7 | 7 | 1 | 2 |
7 | 3 | 7 | 14 |
5 | 2 | 49 | 0 |
4 | 4 | 14 | 5 |
For the following frequency distribution of number of children for psychology faculty members at a major Eastern institution
X | f |
---|---|
3 | 3 |
2 | 6 |
1 | 3 |
0 | 8 |
Compute the sample mean
Group A is composed of a sample of 10 observations, and has a sample mean of 3 and a sample variance of 30. Group B is composed of a sample of 10 observations, and has a sample mean of 5 and a sample variance of 10. If A and B are combined to form one large group, what will be the variance of this combined group?
Which of the following is not true?
Jamal took an exam where the class mean was and the class standard deviation was . What Z-score must Jamal exceed to have a grade that exceeds 75?
In the frequency distribution below, what is the cumulative relative frequency of the value X = 1?
X | f |
---|---|
3 | 7 |
2 | 7 |
1 | 7 |
0 | 4 |
The cumulative relative frequency is:
In a normal distribution with a mean of 60 and a standard deviation of 20, which of the following scores is at the 10th percentile?
From the table in Glass & Hopkins on page 615, r(-1.282) = .1. -1.282(20) + 60 = 34.36.
IQ scores have a distribution that is approximately normal in shape, with a mean of 100 and a standard deviation of 15. What percentage of scores is at or above an IQ of 116?
(116 - 100) / 15 ≊ 1.0666
Group A has a mean of 0 and a standard deviation of 1. Group B has a mean of 0 and a standard deviation of 1.11. The populations have a normal distribution. Consider the proportion of scores at or above 3.0 in Group B, and the proportion of scores at or above 3.0 for Group A? What is the ratio of these proportions?
You have a set of 11 numbers with a mean of 18. If you add an additional number to the original group, and that new number has a value of 23, what will be the mean of the new set of 12 numbers?
(23 - 18) / 12 ≊ 0.417
Consider two columns of numbers, X and Y. The sum of cross-products of deviation scores of the numbers is equal to:
SCP is always equal to which of the following quantities:
The table below, adapted from Glass and Hopkins, Table 5.1, shows the expected value of the range for random samples of size N from a normal distribution, when σ = 1. Suppose heights of men in the general population have a mean of 69 and a standard deviation of 2.9, and have a distribution that is very closely approximated by a normal distribution. What is the expected value of the tallest man in a random sample of size 100?
N | E(Range) |
---|---|
2 | 1.1 |
5 | 2.3 |
10 | 3.1 |
20 | 3.7 |
50 | 4.5 |
100 | 5.0 |
200 | 5.5 |
500 | 6.1 |
1000 | 6.5 |
IQ scores have a distribution that is approximately normal in shape, with a mean of 100 and a standard deviation of 15 in the general population. Assuming a normal distribution is a good approximations, what proportion of the general population has IQ scores betweeen 81. 0 and 107. 0?
Fred took an exam and got a Z-score of 1.76. For which of the following exam metrics will Fred get the highest raw score?
You have 10 numbers with a mean of 11.0. You first add 5.0 to all the numbers, then divide them all by 3.0. What will the mean of the numbers be when you are finished?
When we compare the stem-leaf diagram with a standard frequency histogram plot, we see:
Four distributions of 25 numbers have the identical shape, except that the X distribution has a mean of 100 and a standard deviation of 15, while the Y distribution has a mean of 20 and a standard deviation of 10, the W distribution a mean of 50, and a standard deviation of 20, and the K distribution a mean of 500 and a standard deviation of 100. Which of the following scores has the highest percentile rank in its respective distribution?
After the first statistics exam, the professor posts the grades, and you compute your Z-score. It is +1.4. A day later, the professor announces that he forgot to scale the grades, and that he is linearly rescaling the grades so that the mean changes from 60 to 78, and the standard deviation changes from 10 to 12. What will your Z-score become?
The maximum value of the regression coefficient b in the linear regression equation y = bx + c is
Compute the correlation coefficient for the following data:
1 | 4 |
2 | 5 |
4 | 4 |
5 | 3 |
2 | 6 |
X has a variance of 40. Y has a variance of 40. X and Y have a covariance of 21.6. Find the variance of X - Y.
Given three uncorrelated variables X, Y, and Z, all with means of 15 and variances of 100. Consider the linear combination W = 7X + 11Y + 9Z. What would be its mean?
Given three uncorrelated variables X, Y, and Z, all with means of 15 and variances of 100. Consider the linear combination W = 8X + 12Y - 10Z. What would be its variance?
X has a variance of 40. Y has a variance of 54. X and Y have a covariance of 21.379. Find the correlation ρY,X.
Suppose that you computed the covariance and the variances for 2 variables with N in the denominator of each of the 3 formulas, instead of N - 1, and you computed the correlation coefficient with these "new" definitions of sample variance and covariance. What effect would that have on the value of a positive correlation coefficient?
The correlation ρX,Y between X and Y is 0.67. The variance of X is 105. The variance of Y is 181.0. What is the variance () of the predicted scores Y when Y is predicted from Y X?
In a standard linear regression setup, you are predicting Y from X. The data are r X,Y = 0. 48, s 2 = 154. 0, s 2 = 121. 0, Y ∙ = 13. 0, X ∙ = 22. 0. In the linear regression X Y equation Y = bX + c for predicting Y from X, what are the values of b and c respectively?
Melinda has a Z-score of 1. 0 on a standardized entrance exam used by a large university. The correlation between performance on this exam and first-year GPA at the university is 0. 67. At the university, first-year GPA’s average 2. 8 with a standard deviation of 0. 77. What is the predicted average GPA for students with the same score as Melinda on the entrance exam?
Suppose W = bX + a, where b = σY,X, the covariance between Y and X, and a is an unknown constant. In that case, what is the covariance of W and Y?
Which of the following expressions is a correct formula for the variance of W when W is obtained from X and Y from the formula W = 4X + 3Y?
Suppose a large representative group of students in grade 7 were given a standardized intelligence test (X), and that the test was readministered (Y) in grade 8. On each occasion the mean was 100 and the standard deviation was 15. The correlation in this bivariate normal distribution was .80, and the linear regression error scores had a standard deviation of σE = 9. What percentage of examinees who scored 145 at grade 7 would be expected to do at least as well on the grade 8 test?