The Effect of Question's Types and Levels on Students' Academic AchievementRafi' A. Daraghmah --- October, 1997 --- Nablus, Palestine |
|
|
|
|
|
|
|
Chapter FourThe ResultsThe data of this study were analyzed by using Three Way Analysis of Variance (2 x2 x3) to investigate the effect of question type (essay versus objective), question level (RI, RG, UG) and student’s ability levels (high, medium, and low). One Way Analysis of Variance ( 1 x 7 ) was also used to compare between the control and the six experimental groups. The analysis of variance was conducted independently on each sub-test which measured each level of learning: remember an instance, remember a generality and use a generality, and on the total test which included all sub-tests for both types of tests multiple-choice and essay tests. Results of the Remember - an -Instance Multiple-Choice Sub-test (RIM ) On the Remember - an-Instance Multiple-Choice Sub-test (RIM ), general ‘F’ revealed a significant effect for only question levels (P< .0002) and student’s ability levels ( P < .003 ) ( See, Table 1) . Table (1). A Three-Way ANOVA summary table for question types, levels, and student’s ability on RIM sub-test.
The post hoc ANOVA by using a Scheffe test showed at .05 level of significance
that the mean of the experimental group who received questions on Remember
a Generality (RG) level (x= 2.95 ) was higher than the mean of those who
received either Remember an Instance (RI) questions (x =2.62), or Use a
Generality ones (x =2.47). The last two means did not differ significantly
(See, Table 2).
* P > .05
Table (3). Means, standard deviations, and number of students for question types, and student’s ability on a RIM sub-test.
Results of the Remember - a - Generality Multiple-Choice
Sub -test (RGM)
On the Remember - a - Generality Multiple-Choice Sub-test (RGM), general
‘F’ revealed significant differences for question type (P< .0445) and
student’s ability (P<.0008);‘F’ test also revealed a significant interaction
between question types and question levels (P< .0025) which indicated
that groups who received remember instance essay questions performed higher
than those who received remember instance multiple-choice questions, whereas
the performance did not differ for those who received either multiple-choice
or essay questions measuring RG or UG levels (See, Table 4 & 5 ).
The post hoc ANOVA (by using a Scheffe test at (.05) level of significance) did not show significant differences for question type: essay (x= 2.23 ) and multiple-choice one (x = 2.43 ), though the essay type mean was higher than the multiple-choice one ( see, Table 5 ). Table (5). Means, standard deviations, and number of students in each cell for question types, and levels on a RGM sub-test.
On the other hand, Scheffe test showed significant differences at (.05) for student’s ability which indicated that the high student’s ability mean was higher than the low student’s ability one (x = 2.55 versus 1.85 ), whereas, Scheffe test didn’t reveal any significant difference neither between low and medium student’s ability means ( x = 1.85 versus 2.26 ) nor between high and mid student’s ability means ( x = 2.55 versus 2.26 ). See, Table ( 6 ). Table(6). Means and the number of students in each cell for question types, and students’ ability on a RGM sub-test.
Results of the Use - a - Generality Multiple-Choice
Sub-Test ( UGM )
On the Use - a - Generality Multiple-Choice Sub-Test (UGM ), general ‘F’ revealed a significant difference for student’s ability only ( P < .0007 ) (See, Table 7). Table(7). A Three-Way ANOVA summary table for question types, levels, and student’s ability on a UGM sub-test.
The post hoc ANOVA by using Scheffe test showed at .05 level of significance that the mean of high ability students was higher than the low one ( x = 2.37 versus 1.57 ), whereas, there were no significant differences neither between low and medium student’s ability means ( x = 1.57 versus 2.07 ) nor between high and medium student’s ability means (x = 2.37 versus 2.07 ) ( See, Table 8 ). Table (8). Means and number of students in each cell for question types, and student’s ability on a UGM sub-test.
Results of the Remember - an - Instance Essay Sub-Test ( RIE ) On the remember - an - instance essay sub-test ( RIE ), general ‘F’ revealed significant differences for question types (P< .0006), question levels (P< .0256 ), and student’s ability ( P < .0323). ‘F’ test also revealed a significant interaction between question types and question levels (P< .0047), which indicated that remember - a - generality multiple-choice (RGM) group performed better (x = 2.6) than remember - a - generality essay RGE (x= 1.2), and remember - an - instance multiple-choice (RIM) (x = 2.4) performed better than remember - an - instance essay (RIE) (x =1.3), whereas the performance of use - a - generality groups didn’t differ from the multiple-choice type questions to essay ones ( See, Table 9). Table(9). A Three-Way ANOVA summary table for question types, levels, and student’s ability on a RIE sub-test .
In terms of question types, Scheffe test showed at (.05) level of significance that groups who got multiple-choice type questions had higher mean than groups who got essay type questions (x = 2.06 vs.1.21). In terms of question levels, Scheffe test revealed that groups who got remember - a - generality level questions had a higher means (x= 2.6 ) than those who got remember - an - instance (x = 2.4) or use - a - generality (x = 1.1) levels questions. In terms of ability levels, the high ability students’ mean was higher than the low ability students’ one ( x = 1.86 versus 1.09 ) (See, Table 10 ). Table (10). Means, standard deviations, and number of students in each cell for question types, and levels on a RIE sub-test.
At the same time Scheffe test did not reveal any significant difference between medium and low student’s ability ( x=1.61 versus x=1.09) nor between medium and high ability students ( x = 1.61 versus 1.86 ) ( See, Table 11) . Table ( 11). Means, standard deviations and number of students in each cell for question types, and student’s ability on a RIE sub-test.
Results of the Remember - a - Generality Essay
Sub-Test ( RGE )
On the Remember - a - Generality Essay Sub-Test (RGE), general ‘F’ revealed a significant difference for student’s ability only( P < .0001) (See, Table 12 ). Table (12). A Three-Way ANOVA summary table for question types, levels, and student’s ability on a RGE sub-test.
The Post Hoc ANOVA by using a Scheffe test at (.05) level of significance revealed that high ability students performed better than low ability ones (x =3.69 versus 2), whereas the medium ability students performed better than the low ability ones ( x = 3.12 versus 2 ). At the same time, Scheffe test did not reveal any significant difference between the mean of high ability students and the mean of the medium one ( x = 3.69 versus 3.12) (See, Table 13 ). Table (13). Means, standard deviations and number of students in each cell for question types, and student’s ability on a RGE sub-test.
Results of the Use - a - Generality Essay Sub-Test
( UGE )
On the Use - a - Generality Essay Sub-Test ( UGE), general ‘F’ revealed a significant difference for the question levels ( RI, RG, and UG ) only ( P < .0339 ) (See, Table 14). Table (14). A Three-Way ANOVA summary table for question types, levels, and student’s ability on a UGE sub-test.
But the post hoc ANOVA by using Scheffe test at (.05)level of significance did not reveal any significant difference between the means of three question levels ( RI, RG, and UG) ( x = 1.05 vs. 1.45 vs. 1.35 ), though the mean of RG group was higher than the UG or RI ones (See, Table 15). Table ( 15 ). Means, standard deviations, and number of students in each cell for question types, and levels on a UGE sub-test.
Results of the Total Multiple-Choice
Test
On the total multiple-choice test, general ‘F’ revealed a significant difference for the student’s ability only (high, medium , and low ) only ( P < .0001 ) (See, Table 16 ). Table (16). A Three-Way ANOVA summary table for the total multiple- choice test.
The post hoc ANOVA by using Scheffe test at (.05) level of significance revealed significant difference between high student’s ability versus the low ones (x = 7.759 versus x=5.714), and medium student’s ability versus low ones (x = 7.024 versus x =5.714 ). But it failed to reveal a significant difference between the mean of high versus medium ability students ( x = 7.759 versus x = 7.024 ) (See, Table 17 ). Table (17). Means, standard deviations and number of students in each cell for the total multiple-choice test.
Results of the Total Essay Test
On the total essay test, general ‘F’ revealed a significant difference for the student’s ability (high, medium, low ) only ( P< .0004 ) (See, Table 18 ). Table (18). A Three-Way ANOVA summary table for the total essay test.
The post hoc ANOVA by using Scheffe test revealed significant difference between the levels of student’s ability i.e., high versus low (x = 7.017 versus x= 4.286) only. But it did not reveal any significant difference between high versus medium ability students ( x = 7.017 vs. x = 5.817 ) nor between medium versus low ( x = 5.817 versus x = 4.286 ) ( See, Table 19). Table (19). Means standard deviations and number of students in each cell for the total essay test.
The Results of the Overall Test On the overall test, there were
the following results:
1. General ‘F’ didn’t show significant differences between essay versus multiple-choice questions (p >.760), thus, there was no need to use the Scheffe test. By this result, we answered the first question of this study "Which type of questions (essay versus objective ) has more effect on students’ overall learning?", by stating that there were no differences between essay and multiple-choice tests, though the mean of essay test was higher than the mean of multiple-choice (See, Table 20 &20:1 and Figure 1). At the same time, we accepted the first null hypothesis of this study which says "There are no significant differences at ( 0.05 ) a priori level of significance between the essay questions and the objective ones on all levels of learning (RI, RG, UG ). Table (20). A Three-Way ANOVA summary table for the overall learning test.
Table (20:1) shows the mean of each test type.
Figure (1) shows the mean scores of essay and multiple-choice questions on the overall learning test.
2. General ‘F’ didn’t also show significant differences among the three
levels of questions (RI, RG, and UG) (p = .180), thus, there was no need
to use a Scheffe test. By this result, we answered the second question
of this study
Table (20:2) shows the mean scores of each level of questions.
Figure (2) shows the mean differences between the three levels of questions (RI, RG, and UG) on the overall learning test.
3. General ‘F’ showed significant differences among the three levels of student’s ability (high, medium, and low) (p >.0001). The post hoc ANOVA (by using Scheffe test) revealed that there is a significant difference between high and low student’s ability (x = 14.776 versus 10) and between medium and low student’s ability ( x = 12.841 versus 10 ). But it did not reveal any significant difference between high versus medium of student’s ability means (x =14.776 vs. x = 12.841 ). By this result, we answered the third question of this study "Which level of student’s ability (high, medium, or low) has more effect on the students’ overall learning?, by stating that the performance of the high ability students was better than the performance of the medium and low ability ones on the overall learning (See, Table 20:3 and Figure 3 ). At the same time , we rejected the third null hypothesis of this study which says "There are no significant differences at ( 0.05 ) a priori level of significance among the effects of the three levels of the student’s ability ( high, medium, and low ) on all levels of learning. Table (20:3) shows the mean scores of each level
of student’s ability.
Figure (3) shows the performance of the three levels of student’s ability (high, medium, and low) on the overall learning test.
4. In terms of the interaction between types, levels and student’s ability,
general ‘F’ didn’t show any significant interaction between question types
(essay versus multiple-choice) and question levels (RI, RG, and UG) (p
= .4811), nor between question levels (RI, RG, and UG) and student’s ability
(high, medium, and low) (p =.863), or between question types (essay versus
multiple-choice ) and student’s ability (high, medium, and low) (p = .712
). General ‘F’ didn’t also show a significant interaction among question
types (essay versus multiple-choice), question levels (RI, RG, and UG)
and student’s ability (high, medium, and low) (p = .8832 ) simultaneously.
By these results, we answered the fourth, fifth, sixth, and seventh questions
of this study, they are question 4: "Is there an interaction between question
types ( essay versus objective) and question levels (RI, RG, UG) on students’
overall learning ?", question 5 "Is there an interaction between question
levels ( RI, RG, UG ) and student’s ability ( high, medium, low ) on students’
overall learning ?", question 6 "Is there an interaction between question
types ( essay versus objective ) and student’s ability on students’ overall
learning ?", question 7: Is there an interaction between types and levels
of questions and student’s ability on students’ overall learning ?.
Table (20:4) shows the interaction among types, levels of questions and student’s ability.
Results of the Experimental and Control Groups General F’ showed significant differences between the experimental and control groups on the total multiple-choice test (P >.012). But Scheffe test didn’t reveal any significant differences between each experimental group and the control group (See, Table 21). Table (21). Means, standard deviations, and number of students in each cell of the experimental and control groups on the overall multiple-choice test.
General ‘F’, on the other hand, didn’t show any significant differences between the experimental and control groups on the total essay test (p > .2211) (See, Table 22), nor on the overall test (p > .1685). Table (22). Means, standard deviations, and number of students in each cell of the experimental and control groups on the overall essay test.
Thus, there was no need to use Scheffe test for post hoc ANOVA. By these results, we answered the last question of this study which says"Are there significant differences between the performance of the experimental groups which manipulated questions during the experiment compared with the performance of the control group which did not manipulate any questions during the experiment?", by stating that there were no differences among the means of all groups, though the mean scores of (RG) group, then the mean scores of (UG) was highest, whereas the mean scores of the control group was the lowest (See, Table 23 and Figure 4 ). At the same time, we accepted the eighth null hypothesis which says "There is no significant difference between the control groups’ performance compared with the experimental groups’ performance". Table (23). Means, standard deviations, and number of students in each cell of the experimental and control groups on the overall learning.
Figure (4) shows the six experimental groups and the control group on the overall learning test. |