©Richard Lowry, 1999-
All rights reserved.
¶Example 1. Comparative Effects of Two Methods of Hypnotic Induction
X = the score on the index of primary suggestibility Y = the score on the index of hypnotic induction |
Method A | Method BSub- | ject Xa | Ya | Sub- | ject Xb | Yb |
a1 | a2 a3 a4 a5 a6 a7 a8 a9 a10
5 | 10 12 9 23 21 14 18 6 13
20 | 23 30 25 34 40 27 38 24 31
b1 | b2 b3 b4 b5 b6 b7 b8 b9 b10
7 | 12 27 24 18 22 26 21 14 9
19 | 26 33 35 30 31 34 28 23 22 Means | 13.1 | 29.2 |
| 18.0 | 28.1 | |
If you have not already done so, please click here to place a version of the data table into the frame on the left. |
Items to be calculated: SST(Y) SSwg(Y) SSbg(Y) |
Items to be calculated: SST(X) SSwg(X) |
Items to be calculated: SCT SCwg |
1. Calculations for the Dependent Variable Y
The following table shows the values of Y along with the several summary statistics required for the calculation of SST(Y), SSwg(Y), and SSbg(Y):
Ya | Yb
|
20 | 23 30 25 34 40 27 38 24 31
19 | 26 33 35 30 31 34 28 23 22 for total array of data
|
| Click here if you wish to see the details of calculation for this data set. N | 10 | 10 | 20 |
| SST(Y) = 668.5 SSwg(Y) = 662.5 SSbg(Y) = 6.0 .∑Yi
| 292 | 281 | 573 | .∑Yi2
| 8920 | 8165 | 17085 | SS | 393.6 | 268.9 |
| Mean | 29.2 | 28.1 | 28.7 | |
Just as an aside, note the discrepancy between SSbg(Y)=6.0 [0.9% of total variability] and SSwg(Y)=662.5 [99.1% of total variability] This reflects the fact mentioned earlier, that the mean difference between the groups is quite small in comparison with the variability that exists inside the groups. |
The next table has the same structure as the one above, but now what we show are the values of X along with the several summary statistics required for the calculation of SST(X) and SSwg(X). (We need not bother with SSbg(X), since it does not enter into any subsequent calculations.)
Xa | Xb
|
5 | 10 12 9 23 21 14 18 6 13
7 | 12 27 24 18 22 26 21 14 9 for total array of data
|
| Click here if you wish to see the details of calculation for this data set. N | 10 | 10 | 20 |
| SST(X) = 908.9 SSwg(X) = 788.9 .∑Xi
| 131 | 180 | 311 | .∑Xi2
| 2045 | 3700 | 5745 | SS | 328.9 | 460.0 |
| Mean | 13.1 | 18.0 | 15.6 | |
You will recall from Chapter 3 that the raw measure of the covariance between two variables, X and Y, is a quantity known as the sum of co-deviates:
SC = ∑(XiMX)(YiMY) | (XiMX) = deviateX (YiMY) = deviateY (XiMX)(YiMY) = co-deviateXY |
SC | = ∑(XiYi) | (∑Xi)(∑Yi) N |
We will be calculating two separate values of SC: one for the covariance of X and Y within the total array of data, and another for the covariance within the two groups.
The next table shows the cross-products of Xi and Yi for each subject in each of the two groups. (E.g., the first subject in group A had
GroupsA | B |
| For group A: o∑Xai = 131 o∑Yai = 292 For group B: o∑Xbi = 180 o∑Ybi = 281 For total array: o∑XTi = 311 o∑YTi = 573 XaYa | XbYb |
100 | 230 360 225 782 840 378 684 144 403
133 | 312 891 840 540 682 884 588 322 198 for total array of data Sums | 4146 | 5390 | 9536 |
| .∑(XaiYai) | .∑(XbiYbi) | .∑(XTiYTi) | |
SCT | = ∑(XTiYTi) | (∑XTi)(∑YTi) NT |
SCT | = 9536 | (311)(573) 20 | = 625.9 |
For Group A:
| SCwg(a)
| = ∑(XaiYai)
| (∑Xai)(∑Yai) | Na |
SCwg(a) | = 4146 | (131)(292) 10 | = 320.8 |
For Group B:
| SCwg(b)
| = ∑(XbiYbi)
| (∑Xbi)(∑Ybi) | Nb |
SCwg(b) | = 5390 | (180)(281) 10 | = 332.0 |
the sum of which will then yield the within-groups covariance measure of
SCwg | = SCwg(a) + SCwg(b)
|
|
| = 320.8 + 332.0 = 652.8 | |
We begin with a summary of the values of SS and SC obtained so far, as these will be needed in the calculations that follow. Recall that Y is the variable in which we are chiefly interested, and X is the covariate whose effects we are seeking to remove.
X | Y | Covariance
|
SST(X) = 908.9 | SSwg(X) = 788.9
|
SST(Y) = 668.5 | SSwg(Y) = 662.5 SSbg(Y) = 6.0
|
SCT = 625.9 | SCwg = 652.8
|
For handy reference, click here to place a version of | this table into the frame on the left.] |
From Chapter 3 you know that the overall correlation between X and Y (both groups combined) can be calculated as
rT | = |
SCT sqrt[SST(X) x SST(Y)] | ||||||
= |
625.9 sqrt[908.9 x 668.5]
|
| = | +.803 | |
The proportion of the total variability of Y attributable to its covariance with X is accordingly
(rT)2 = (+.803)2 = .645 |
In the first step of this series, we adjust SST(Y) by removing from it this proportion of covariance. Given
668.5 x .645 = 431.2 |
668.5431.2 = 237.3 [tentative] |
[adj]SST(Y) | = SST(Y) |
(SCT)2 SST(X) | "[adj]" = "adjusted"
|
| = 668.5 | (625.9)2 | 908.9
|
| = 237.5 | |
By analogy, the aggregate correlation between X and Y within the two groups can be calculated as
rwg | = |
SCwg sqrt[SSwg(X) x SSwg(Y)] | |||||
= |
652.8 sqrt[788.9 x 662.5]
| = | +.903 | |
The proportion of the within-groups variability of Y attributable to covariance with X is therefore
(rwg)2 = (+.903)2 = .815 |
662.5 x .815 = 539.9 |
662.5539.9 = 122.6 [tentative] |
[adj]SSwg(Y) | = SSwg(Y) |
(SCwg)2 SSwg(X)
|
| = 662.5 | (652.8)2 | 788.9
|
| = 122.3 | |
The adjusted value of SSbg(Y) can then be obtained through simple subtraction as
[adj]SSbg(Y) | = |
[adj]SST(Y)
[adj]SSwg(Y)
|
| = | 237.5 122.3 = 115.2 | |
bwg | = | SCwg SSwg(X)
|
|
| = | 652.8 | 788.9 = +.83 |
| |
MX | MYgroup A | 13.1 | 29.2 | group B | 18.0 | 28.1 | combined | 15.55 | 28.65 | |
[adj]MYa = 29.2 + (2.45x.83) = 31.23
and
|
| [adj]MYb = 28.1 (2.45x.83) = 26.07
| |
This is the conceptual structure for the adjustment of the means of Y for the groups. Once you have the concept, the mechanics of the process can be accomplished somewhat more expeditiously via the following computational formula:
For Group A:
|
| [adj]MYa | = MYa bwg(MXaMXT) |
|
|
| = 29.2 .83(13.115.55) |
|
|
| = 31.23 | |
For Group B:
|
| [adj]MYb | = MYb bwg(MXbMXT) |
|
|
| = 28.1 .83(18.015.55) |
|
|
| = 26.07 | |
As with the corresponding one-way ANOVA, the final step in a one-way analysis of covariance involves the calculation of an
F = | MSeffect MSerror | = | MSbg MSwg | = | SSbg/dfbg SSwg/dfwg |
[adj]dfwg(Y) = NTk1 | for the present example: 2021 = 17 |
dfbg(Y) = k1 | for the present example: 21 = 1 |
F | = | [adj]SSbg(Y)/dfbg(Y) [adj]SSwg(Y)/[adj]dfwg(Y)
|
|
| = | 115.2/1 | 122.3/17
|
|
|
| = | 16.01 [with df=1,17] | |
df denomi- nator | df numerator1 | 2 | 3 | 17 | 4.45 | 8.40 3.59 | 6.11 3.20 | 5.19 |
MYa=29.2 versus MYb=28.1
| |
[adj]MYa=31.23 versus [adj]MYb=26.07 |
The analysis of covariance has the same underlying assumptions as its parent, the analysis of variance. It also has the same robustness with respect to the non-satisfaction of these assumptions, providing that all groups have the same number of subjects. There is, however, one assumption that the analysis of covariance has in addition, by virtue of its co-descent from correlation and regressionnamely, that the slopes of the regression lines for each of the groups considered separately are all approximately the same.
The operative word here is "approximately." Because of random variability, it would rarely happen that two or more samples of bivariate XY values would all end up with precisely the same slope, even though the samples might be drawn from the very same population. And so it is for the slopes of the separate regression lines for our two present samples. They are clearly not precisely the same. The question is, are they close enough to be regarded as reflecting the same underlying relationship between X
Home | Click this link only if the present page does not appear in a frameset headed by the logo Concepts and Applications of Inferential Statistics |