For all problems, unless otherwise indicated, assume a Type 1 error rate = .05.

  1. An emotion researcher regularly uses different emotional stimuli within a given category (e.g., a set of happiness-inducing film clips) to elicit emotional responses among subjects. Let's say that she is interested in the degree to which variability in responses to happiness-inducing stimuli is due to variability among the stimuli themselves. She conducts an experiment in which 50 subjects are randomly assigned to one of 5 stimulus conditions (ni = 10). Subjects in each stimulus condition are shown one film clip designed to elicit happiness. A different film clip is used in each condition. The 5 clips can be considered a random sample from a much larger population of potentially happiness-inducing clips. Self-report responses to each film clip are recorded on a 1 to 100 scale (higher scores indicate greater happiness).

    The data are as follows:

    Clip #151626756606365606534
    Clip #250523359515468516043
    Clip #356533841334964495053
    Clip #466737471514957606854
    Clip #546656053426157644055
    1. Indicate the null and alternative hypotheses.

      H0: The amount of happiness induced by a film clip is a function solely from the viewer watching the clip and not from the clip itself. Essentially that these five clips are drawn from a random set of clips that are all equally happiness inducing.

      If any element of this set of happiness inducing things was shown to everyone in the world the average response would be the same:

      H0 : μ1 = μ2 = μ3 = = μ

      Another way of saying this is that the variability because of differences in the clips is zero:

      H0 : σclip2 = 0

      H1: The amount of happiness induced by watching a file clip is a function of both the viewer and the clip being watched.

      H1 : σclip2 0 σclip2 > 0

      Recall that the variance can't be less than zero.


      Really what is being tested by this experiment is whatever method is being used to pick clips "designed to elicit happiness." The ANOVA doesn't seem the appropriate test however.

      I'm assuming that if I designed a method for picking clips that elicit happiness I would like to prove that it works. I'll design a method of picking clips now that I believe won't work, alternating pictures of kittens and surgery, I can test if that won't work, but proving it won't be a surprising finding.

      All that the ANOVA can do is fail to disprove that it doesn't work. Failing to disprove the null is not the same thing as supporting it. Instead of the ANOVA, a test is needed whose null hypothesis is that the means are unequal.

      A situation where my experimental might genuinely be that the means are unequal is if these five clips were used in another experiment and I am hypothesizing that they were responsible for some of the variability in that experiment. In this situation though the clips are not random, they're fixed.

      I suppose if the researcher has a very large sample of clips that are used to elicit happiness and she is spot checking the whole set by selecting five and testing them, then this experiment makes some sense. She would likely want to do contrasts then if the ANOVA failed.

      Another option would be a set of clips that would be assumed to elicit similar amounts of happiness. Maybe I have five clips of puppies playing and I want to assess how much of the variability comes from the concept of "puppy" as opposed to the specific characteristics of the puppy playing. That makes sense as a research question.

    2. Conduct a one-way ANOVA on these data. You can either use SAS or a hand computation. Indicate the value of your observed F and whether it is statistically significant. Interpret the results.

      For this data, R produces:

                  Df Sum Sq Mean Sq F value  Pr(>F)
      Clip_Number  4 1139.7   284.9  3.2678 0.01954
      Residuals   45 3923.6    87.2

      The null hypothesis is rejected in this situation and the experimenter's belief that the clips are not equal in the amount of happiness they elicit is supported.

    3. Indicate the expected values for the Mean Squares that you used to compute the F ratio of interest in the ANOVA that you conducted in part b.

      For a one-way random effects design:

      EffectE(Mean Square)H0E(Mean Square |H0)
      Random σε +nσRandom2 σRandom2 = 0 σε2
      Within σε2 σε2 σε2
      MSRandom MSWithin ~ F dfRandom dfWithin = F Random-1 N-Random
    4. Based on these sample data, compute an unbiased estimate of the proportion of the total variability in self-report responses in the population that is due to variations in film clip content.

      The proportion of the total variability from the randomness of the factor and not the samples, ρ1, is:

      ρ1 = σRandom2 σε2 +σRandom2

      This quantity requires an unbiased estimate of the variance from the factor:

      σ^Random2 = nσ^Random2 n + σ^ε2 -σ^ε2 n = nσ^Random2 + σ^ε2 -σ^ε2 n = E MSRandom - E MSWithin n ρ^1 = σ^Random2 σ^ε2 +σ^Random2 = MSRandom -MSWithin n MSRandom -MSWithin n + MSWithin = MSRandom -MSWithin MSRandom +n-1 MSWithin

      For this data, R calculates:

      ρ^1 0.1849

      This means that 18.5% of the overall variability comes from a difference in the clips rather than from the subjects.

    5. Why would we be unlikely to follow up this analysis with contrasts testing for differences between the means of the five film clips?

      A contrast of these specific means will only tell if these individual clips are not equivalent in inducing happiness. It is not a generalizable research question.

  2. A pharmaceutical company is interested in comparing the efficacy of three drugs (A, B, and C) for the treatment of panic disorder. In this context, drug can be considered a fixed effect. Four hospitals are recruited for the study. At each hospital, three patients within each of the three drug conditions are treated. The post-treatment scores on an inventory of panic symptoms are as follows (lower scores indicate fewer symptoms and better treatment effects):

    HospitalDrug ADrug BDrug C
    LA General81109106
    9193105
    6795109
    Chicago VA86105111
    75111106
    7995102
    Des Moines Baptist89106115
    95115117
    99102106
    Nashville Centennial106115111
    111117118
    103106114

    Studies on this data is going to be interested in the expected value of a particular Hospital / Drug pair. The expected values (means) are:

    HospitalDrug ADrug BDrug CMarginal
    LA General79.66699.000106.666795.111
    Chicago VA80.00000103.6667106.333396.666
    Des Moines Baptist94.333107.6667107.6667103.222
    Nashville Centennial106.666112.6667114.3333111.222
    Marginal90.166105.750108.750101.555
    1. Let’s assume that these four clinics are the sole customers of the pharmaceutical company. Thus, the company representatives who are supervising the study are not interested in generalizing the results beyond these four clinics.

      1. What would be the expected values of each of the four relevant mean squares?

        For this section a new notation will be used:

        θx2 = xix xi2 x-1 = x x-1 σx2 θxy2 = xyij xy xy ij 2 x-1 y-1

        In this situation the hospitals are fixed factors and the analysis is simply a 4×3 fixed effects ANOVA. The characteristics of the mean squares are:

        EffectE(Mean Square)H0E(Mean Square |H0)F-ratio
        Fixed α σε +αn θα2 αi = 0 i σε2 MSα MSWithin
        Fixed β σε +βn θβ2 βi = 0 i σε2 MSβ MSWithin
        α × β Interaction σε +n θαβ2 αβ ij = 0 ij σε2 MSαβ MSWithin
        Within σε2 σε2 σε2
      2. What would be the null hypotheses for the main effects of drug and of hospital?

        • Main Effect of Hospital — LA General, the Chicago VA, Des Moines Baptist and Nashville Centennial are, on average, equally effective in delivering drug treatments for panic disorders.
        • Main Effect of Drug — Drugs A, B and C are overall equally equally effective in treating panic disorders.
    2. Now, let’s assume a different situation. Let’s say that these four hospitals can be considered a random sample of the extremely large number of hospitals around the world that the company sells to.

      1. What would be the expected values of each of the four relevant mean squares?

        EffectE(Mean Square)H0E(Mean Square |H0)F-ratio
        Fixed α σε +n σαβ2 +αn θα2 αi = 0 i σε2 +n σαβ2 MSα MSαβ
        Random β σε +βn σβ2 βi = 0 i σε2 MSβ MSWithin
        α × β Interaction σε +n σαβ2 αβ ij = 0 ij σε2 MSαβ MSWithin
        Within σε2 σε2 σε2
      2. What would be the null hypotheses for the main effects of drug and of hospital?

        • Main Effect of Hospital — The hospital where a drug treatment is delivered does not affect the effectiveness of drug treatments for panic disorders.
        • Main Effect of Drug — Drugs A, B and C are, overall, equally effective in treating panic disorders.
    3. Let’s assume that the assumption articulated in part b holds (i.e., hospitals as random samples).

      1. Using SAS or other software, conduct an omnibus ANOVA that tests each of the main effects and the interaction. Summarize your results and conclusions.

        It turns out doing this is not easily explained in R, so I did it in SAS in the interests of surviving the end of the semester:

         Source                     DF      Type I SS    Mean Square   F Value   Pr > F
         drug                        2    2617.055556    1308.527778     30.83   <.0001
         hosp                        3    1523.638889     507.879630     11.97   <.0001
         drug*hosp                   6     441.611111      73.601852      1.73   0.1562

        Both the hospital and the drug being used have a significant effect on the outcome of treatment of panic disorders. There is no evidince that a particular hospital does especially well or poorly with a particular drug.

      2. Using SAS or other software, conduct a planned contrast (two-tailed) comparing the average of the A and B drug groups to drug C. Summarize your results and conclusions.

            Tests of Hypotheses Using the Type III MS for drug*hosp as an Error Term
         Contrast                   DF    Contrast SS    Mean Square   F Value   Pr > F
         drug 1&2 vs 3               1    1160.013889    1160.013889     15.76   0.0074

        There is a statistically significant probability that Drug C is less effective than the average of Drugs A and B.