Inter-rater agreement in assigning cognitive demand to Life Sciences examination questions
DOI:
https://doi.org/10.38140/pie.v36i1.3586Keywords:
Assessment, Cognitive demand, Inter-rater agreement, Bloom's taxonomy, ReliabilityAbstract
Taxonomies of cognitive demand are frequently used to ensure that assessment tasks include questions ranging from low to high cognitive demand. This paper investigates inter-rater agreement among four evaluators on the cognitive demand of the South African National Senior Certificate Life Sciences examinations after training, practice and revision. The taxonomy used was based on the cognitive dimension of the Revised Bloom’s Taxonomy, with analysis, evaluation and synthesis combined into one category. Descriptors from the Revised Bloom’s taxonomy were slightly modified to suit Life Sciences. Inter-rater agreement was poor to fair, but pairwise percent agreement did not reach acceptable levels. Evaluators found it difficult to assign cognitive demand to examination items, and constantly referred to the descriptors. We question the usefulness of a taxonomy of cognitive demand when individuals differ in their interpretations of the levels of cognitive demand. The results indicate that standards of Life Sciences examination papers cannot reliably be assessed by evaluating cognitive demand using Bloom’s Taxonomy.