For all problems, unless otherwise indicated, assume a Type 1 error rate = .05.

  1. An experiment is conducted in which behaviorally inhibited and behaviorally uninhibited children are exposed to an unfamiliar robot. The researcher quantifies heart rate responses in beats per minute (BPM) during exposure to the robot. The data are as follows:

    Subject Number Group Average Heart Rate (in BPM)
    1Uninhibited66
    2Uninhibited61
    3Uninhibited67
    4Uninhibited60
    5Uninhibited71
    6Uninhibited63
    7Uninhibited67
    8Uninhibited66
    9Inhibited89
    10Inhibited78
    11Inhibited67
    12Inhibited86
    13Inhibited82
    14Inhibited66
    15Inhibited85
    16Inhibited79
    17Inhibited88
    18Inhibited79
    19Inhibited90
    20Inhibited64

    The researcher is initially interested in testing the hypothesis that the inhibited group has more variable response times across subjects than the uninhibited group. (One could also assess variability on a within-subjects basis, but that is not the focus of the present study.) Using hand calculations test the hypothesis that the variability of heart rate within the behaviorally inhibited group is greater than the variability of heart rate within the behaviorally uninhibited group.

    H0 : σ12 < σ22

    A reasonable test when comparing variances is the two-tailed F-test. The F-test will fail to reject a two-tailed H0 iff:

    F n1-1 n2-1 α2 s12s22 F n1-1 n2-1 1-α2

    This particular F-test is one-tailed and so the measure is:

    s12s22 F n1-1 n2-1 1-α
    Subject Group Heart Rate Y_ y-Y_ 2 si2= yi-Yi_ 2 ni-1
    1 Uninhibited 66 65.125 0.766 12.982
    2Uninhibited6117.016
    3Uninhibited673.516
    4Uninhibited6026.266
    5Uninhibited715.641
    6Uninhibited634.516
    7Uninhibited673.516
    8Uninhibited660.766
    9 Inhibited 89 79.427 91.840 84.811
    10Inhibited782.007
    11Inhibited67154.174
    12Inhibited8643.340
    13Inhibited826.674
    14Inhibited66180.007
    15Inhibited8531.174
    16Inhibited790.174
    17Inhibited8873.674
    18Inhibited790.174
    19Inhibited90112.007
    20Inhibited64237.674
    s12s22 12.92884.811 0.153 0.21 F711 .025

    F711 0.01 0.153 , the null hypothesis is rejected.

  2. A study involves testing the effects of three medications on the ability of phobics to approach a feared object. 30 individuals who are phobic of spiders are randomly assigned to receive one of 3 medications (Zoloft, Naltrexone, and Valium) (10 subjects per group). After ingesting the medication, each participant attempts to approach a live spider. The researcher measures the distance in feet of each participant from the spider. The raw data are below:

    Zoloft9115121514131276
    Naltrexone15161212181923201317
    Valium911125131511869
    1. In order to assess whether medication condition has an effect on approach behavior, analyze these data using a one-way between subjects ANOVA. Please do the calculations by hand and show your work. Present the results in an ANOVA source table. Also be sure to indicate the statistical hypotheses (H0 & H1) that you are testing and whether or not you reject the null hypothesis.

      H0 : μi ; μi = μj H0 : μzoloft = μnaltrexone = μvalium H1 : ij μi μj
      Drug Sample Mean SSW MSW SSB MSB MSB MSW
      Y_ y-Y_ 2 yi-Yi_ 2 N-J n Y_-Y__ 2 nj Y_j-Y__ 2 J-1 nj Y_j-Y__ 2 J-1 yi-Yi_ 2 N-J
      Zoloft 9 10.40 1.960 11.622 34.844 135.033 11.619
      110.360
      529.160
      122.560
      1521.160
      1412.960
      136.760
      122.560
      711.560
      619.360
      Naltrexone 15 16.50 2.250 179.211
      160.250
      1220.250
      1220.250
      182.250
      196.250
      2342.250
      2012.250
      1312.250
      170.250
      Valium 9 9.90 0.810 56.011
      111.210
      124.410
      524.010
      139.610
      1526.010
      111.210
      83.610
      615.210
      90.810

      As an ANOVA source table, this is:

      Source of VariationSum of SquaresdfMean SquareF
      Between270.0672135.03311.619
      Within313.802711.622
      Total583.876

      F227 0.999 11.619 3.35 F227 0.95 , the null hypothesis is rejected.

    2. Replicate your results by doing an ANOVA using SAS. Turn in both the SAS program and your output.

      spider_analysis.sas is a simple SAS program to compute ANOVA on spider_data.csv. More interestingly, homework_1.R is a R program to do the same thing.

      The output from R is:

      Response: Response
                Df  Sum Sq Mean Sq F value    Pr(>F)    
      Drug       2 270.067 135.033  11.618 0.0002289 ***
      Residuals 27 313.800  11.622
      
      Signif. codes: 0 '***' 0.001 '**' 0.01 '**' 0.05 '.' 0.1 ' ' 1
  3. One of the undergraduate students in your lab left the results of an ANOVA in your mailbox. Due to some really odd printer problem only some of the cells are legible. After unsuccessfully trying to contact the student you realize that you actually have enough information left to figure out what the missing cells have to be.

    Source of VariationSum of SquaresdfMean SquareF
    Between53.22
    Within10.60
    Total382.65
    1. Fill in the blanks.

      MSB MSW ~ F J-1 N-J MSB = F MSW = 3.2210.60 = 34.132 MSB = SSB J-1 = SSB dfSSB SSB = MSB dfSSB = 34.1325 = 170.66 SST = SSB+ SSW SSW = SST- SSB = 382.65-170.66 = 211.99 MSW = SSW N-J = SSW dfSSW dfSSW = SSW MSW = 211.9910.60 20
      Source of VariationSum of SquaresdfMean SquareF
      Between170.66534.1323.22
      Within211.992010.60
      Total382.65
    2. Indicate whether the results would lead to rejection of the null hypothesis.

      F J-1N-J α = F 520 0.97 3.22 2.71 F 520 0.95

      This F-value would reject H0 at the 95% confidence level.

    3. Assuming that the assumption of homogeneity of population variances is met here, present your best estimate of the variance within the populations.

      When population variances are homogenous, the estimated value of the mean square within is an unbiased estimator of the population variance. For this data, the mean square within is 10.60.

  4. In some statistical computations it's helpful to know the variance of the sample variance: that is Vars^2 . That is, consider the sampling distribution of the sample variance across a large number of random samples. The "variance of the sample variance" is the variance of this distribution and is an indicator of how variable the sample variance is likely to be from sample to sample. Prove:

    Vars^2 = 2σ4n-1

    Begin with a well-known distribution involving the sample variance:

    n-1 s^2 σ2 ~ χn-12

    In order for the two distributions to be the same, their variances must be the same. The distribution of the chi-square is known:

    Var n-1 s^2 σ2 = Var χn-12 = 2n-1

    Constant factors within a variance may be factored out, but their value must be squared. Both the population variance and number of samples are constant in relation to the sample variance:

    n-1 σ2 2 Var s^2 = 2n-1

    The rest is simply algebra:

    Var s^2 = 2n-1σ4 n-12 = 2σ4 n-1