Factorial ANOVA: Independent Samples: 4

Chapter 16.
Two-Way Analysis of Variance for Independent Samples
Part 4

Example 3.

The first difference between the present example and the first two is simply structural. In Examples 1 and 2 we had data arrayed in a matrix of two rows by two columns; now the arrangement is two rows by three columns. The second difference, of greater conceptual importance, is that we will now be analyzing data that derive not from an equal-interval scale of measurement, but from a merely ordinal rating scale. As noted in Chapter 14, the analysis of variance is quite robust—relatively unperturbed when its assumptions are not fully met—providing that the several groups of measures are all of the same size. These assumptions are the same for the two-way independent-samples ANOVA as for the one-way version:

that the scale on which the dependent variable is measured has the properties of an equal interval scale;_T
that the several samples are independently and randomly drawn from the source population(s);_T
that the source population(s) can be reasonably supposed to have a normal distribution; and_T
that the several samples have approximately equal variances.

Although the flexibility afforded by this robustness applies most directly to assumptions 3 and 4, there is also a case to be made for its being extended to assumption 1. The details of that case are more complex than we would want to get into just now. Suffice it to say that it is less compelling than the case that can be made for flexibility with respect to assumptions 3 and 4. Compelling or not, however, you will often find the analysis of variance applied to rating-scale data, and for that reason you should have a sense of what such an analysis looks like. Toward the end of the chapter I will provide a simulation that might help you to reach your own conclusion on this question.

The data examined in this example come from the research project of an undergraduate student, conducted under the supervision of my Vassar colleague Jannay Morrow. The question at issue in the project was whether an individual's level of self-esteem is predictive of how he or she will perceive other persons. As a first step, 72 subjects (college students) were pre-sorted according to their relative self-esteem scores (high, medium, or low) as measured by the Coopersmith Self-Esteem Inventory. Within each of these three groupings by self-esteem level, subjects were then randomly assigned to one or the other of two experimental conditions. For each condition, the subject's task was to read a description of a "target" person. In one of the conditions the description was worded so as to suggest a low level of self-esteem on the part of the target, while in the other it was worded to suggest a high level of self-esteem. After reading the description, subjects were then asked to rate the target according to three questions: (i) how happy is this person? (ii) how similar is this person to you? and (iii) how likely is this person to experience divorce at some point in life? For each question the rating was performed on a 5-point scale, with "1" representing the lowest rating and "5" the highest.

The following table shows the ratings that resulted from the second question: how similar is this person to you? Note that there is a total of rc=6 independent groups, with N_g=12 observations in each group.

raw data		Subject's Self-Esteem
raw data		Low	Medium	High
Target's Self- Esteem	Low	44 35 44 54 24 42	33 34 42 44 12 23	31 33 35 32 33 33
Target's Self- Esteem	High	22 42 23 24 22 23	43 12 13 24 31 14	32 32 34 34 43 34

Once again, we can zip through the computational details of the analysis with a minimum of commentary. (Click here if you would like a printable summary of the raw data and summary values for this example.)

Summary Values from Preliminary Number-Crunching_Q

summary data		Subject's Self-Esteem
summary data		Low	Medium	High	rows
Target's Self- Esteem	Low	N_g1=12 ∑X_g1=45 ∑X²_g1=179	N_g2=12 ∑X_g2=35 ∑X²_g2=113	N_g3=12 ∑X_g3=35 ∑X²_g3=111	N_r1=36 ∑X_r1=115
Target's Self- Esteem	High	N_g4=12 ∑X_g4=30 ∑X²_g4=82	N_g5=12 ∑X_g5=29 ∑X²_g5=87	N_g6=12 ∑X_g6=38 ∑X²_g6=126	N_r2=36 ∑X_r2=97

	columns	N_c1=24 ∑X_c1=75	N_c2=24 ∑X_c2=64	N_c3=24 ∑X_c3=73	N_T=72 ∑X_T=212 ∑X²_T=698

Means and Graph of Group Means_Q

		Subject's Self-Esteem
		Low	Medium	High	rows
Target's Self- Esteem	Low	M_g1=3.8	M_g2=2.9	M_g3=2.9	M_r1=3.2
Target's Self- Esteem	High	M_g4=2.5	M_g5=2.4	M_g6=3.2	M_r2=2.7

	columns	M_c1=3.1	M_c2=2.7	M_c3=3.0	M_T=2.9

Preliminary SS Values_Q

		Columns
		col1	col2	col3
Rows	row1	SS_g1=10.25	SS_g2=10.92	SS_g3=8.92
Rows	row2	SS_g4=7.00	SS_g5=16.92	SS_g6=5.67
					SS_T=73.78

	SS_wg	= SS_g1+SS_g2+SS_g3+SS_g4+SS_g5+SS_g6
		= 10.25 + 10.92 + 8.92 + 7.00 + 16.92 + 5.67
		= 59.67

	SS_bg	= SS_T — SS_wg
		= 73.78 — 59.67
		= 14.11

SS_rows	=	(∑X_r1)² N_r1	+	(∑X_r2)² N_r2	—	(∑X_T)² N_T

	=	(115)² 36	+	(97)² 36	—	(212)² 72

	=	4.50

SS_cols	=	(∑X_c1)² N_c1	+	(∑X_c2)² N_c2	+	(∑X_c3)² N_c3	—	(∑X_T)² N_T

	=	(75)² 24	+	(64)² 24	+	(73)² 24	—	(212)² 72

	=	2.86

	SS_rxc	= SS_bg — SS_rows — SS_cols
		= 14.11 — 4.50 — 2.86
		= 6.75

The following table shows (in red) the values of [null]M_g* for each of the six groups, as calculated by the method described in connection with Example 1. As before, the observed means of the groups (3.8, 2.9, etc.) appear in black. The graphs below the table show the observed group means in comparison with the pattern that would be expected if there were zero interaction between the row and column variables. Here again you can see the makings of an interaction effect.

means
Columns

col1
col2
col3
rows

rows
row1
3.8
3.4
2.9
3.0
2.9
3.3
M_r1=3.2

row2
2.5
2.9
2.4
2.5
3.2
2.8
M_r2=2.7

columns
M_c1=3.1
M_c2=2.7
M_c3=3.0
M_T=2.9

observed
expected

Degrees of Freedom_Q

degrees of freedom	in general	for the present example
Total	df_T = N_T—1	72—1=71
within- groups (error)	df_wg = N_T—rc	72—(2)(3)=66
between- groups	df_bg = rc—1	(2)(3)—1=5
rows	df_rows = r—1	2—1=1
columns	df_cols = c—1	3—1=2
interaction	df_rxc = (r—1)(c—1)	(2—1)(3—1)=2

MS Values_Q


MS_rows	=	SS_rows df_rows	MS_cols	=	SS_cols df_cols	MS_rxc	=	SS_rxc df_rxc

	=	4.50 1		=	2.86 2		=	6.75 2

	=	4.50		=	1.43		=	3.37


MS_error	=	SS_wg df_wg

	=	59.67 66	= 0.90

F-ratios_Q


F_rows	=	MS_rows MS_error	F_cols	=	MS_cols MS_error	F_rxc	=	MS_rxc MS_error

	=	4.50 0.90		=	1.43 0.90		=	3.37 0.90

	=	4.98		=	1.58		=	3.73
with df=1,66			with df=2,66			with df=2,66

Here is the portion of Appendix D that includes the critical values of F for df=1,66 and df=2,66. As you can see, F_rows and F_rxc are both significant beyond the .05 level, while F_cols is non-significant. Thus there is a significant main effect for the row variable and a significant rows-by-columns interaction effect, but no significant main effect for the column variable.

df denomi- nator	df numerator			F_rows=4.98df=1,66 F_cols=1.58df=2,66^T F_rxc=3.73df=2,66^T
	1	2	3
66	3.99 7.04	3.14 4.94	2.74 4.09

The following table shows a plot for each of these dimensions of the analysis, along with a little commentary. Recall that the dependent variable in this research is a subject's rating of the target figure in response to the question: how similar is this person to you?

Rows	The fundamental meaning of the significant row effect is that the difference between the two row means is greater than what could be expected on the basis of mere random variability. In the present instance it indicates that subjects rating the low self-esteem targets saw the targets as more similar to themselves than did subjects rating the high self-esteem targets. Recall that the row means in this example are calculated across all three levels (low, medium, and high) of subjects' self-esteem.
Columns	The fundamental meaning of the non-significant column effect is that the aggregate differences among the three column means are no greater than what might be expected on the basis of random variability. Hence the observed results provide no evidence that similarity ratings of the target figure differ in accordance with the self-esteem level of the subjects doing the ratings. Recall that column means in the example are calculated across both levels (low and high) of the target figure's imputed self-esteem.
Interaction Observed Expected Zero Interaction	As in the earlier examples, the interaction effect refers to the difference between (i) the observed pattern of group means and (ii) the pattern that would be expected if the combined effects of the row and column variables were merely additive. A significant interaction indicates that this difference reflects something more than mere random variability. For the present example, the short of it is that low and high self-esteem target figures are rated by subjects as more or less similar to themselves in accordance with the self-esteem level of the subjects. As evident from the adjacent graph, this is particularly salient when subjects in the lowest self-esteem category are rating a low self-esteem target figure, and when subjects in the highest self-esteem category are rating a high self-esteem target figure. (The fact that low self-esteem target figures, overall, are rated as more similar than high self-esteem targets is a reflection of the significant main effect for the row variable.)

Ah! But here's the complication. Any conclusions concerning "significant" effects for rows, columns, or interaction in such an analysis are meaningful only in the degree that it is legitimate to perform the analysis using data that derive from a merely ordinal scale of measurement. Which is to say: only in the degree that the analysis of variance is "robust" enough to handle this particular non-compliance with its assumptions. We will address this question in Part 5. That final portion of the chapter will also include a summary of step-by-step computational procedures for the two-way independent-samples ANOVA.

End of Chapter 16, Part 4.
Return to Top of Chapter 16, Part 4
Go to Chapter 16, Part 5

Home

Click this link only if the present page does not appear in a frameset headed by the logo Concepts and Applications of Inferential Statistics

means		Columns
means		col1	col2	col3	rows
rows	row1	3.8 3.4	2.9 3.0	2.9 3.3	M_r1=3.2
rows	row2	2.5 2.9	2.4 2.5	3.2 2.8	M_r2=2.7

	columns	M_c1=3.1	M_c2=2.7	M_c3=3.0	M_T=2.9