Developing a Criterion-Referenced Test in Environmental Education Using Amplified Objectives: A Rasch (1PL) Analysis
Article Number: e2025579 | Available Online: December 2025 | DOI: 10.22521/edupij.2025.19.579
Safaa Tariq Habeeb , Burhan Ismael Ali
Full text PDF |
420 |
182
Abstract
|
Background/purpose. The current research aims to construct a criterion-referenced test using the amplified objectives for the Environmental Education subject for students of the Educational and Psychological Sciences Departments in the Faculties of Education, based on item response theory (IRT) and the Rasch one-parameter logistic (1PL) model. The academic content was derived from five chapters developed by the Deans' Council of the Ministry of Higher Education and Scientific Research and reviewed by subject instructors. From this, seventy behavioral objectives were formulated, distributed among Bloom's taxonomy levels: remembering (35), understanding (29), and application (8). Materials/methods. Seventy multiple-choice items were developed and evaluated by specialized experts for validity. The research sample included 500 male and female second-stage students from the Universities of Baghdad, Al-Mustansiriya, Al-Iraqiya, Karbala, Anbar, and Diyala for the academic year 2024–2025. Students were selected using a stratified random method with proportional distribution. A pilot test was conducted with 30 students to ensure clarity of test items and instructions; all were found to be clear. Results. The cut-off score was established using the amplified objectives and the contrasting groups method under item response theory (IRT), resulting in a score of 47 (65.28%). Statistical analysis using SPSS and the ICL program confirmed that all items had strong psychometric properties. The results indicated a decrease in the number of proficient students in Environmental Education. |
Conclusion. The test developed in this study has strong psychometric properties and effectively measures student proficiency. The decline in performance suggests a need for further educational and assessment interventions. These calibrated items, aligned to amplified objectives, support diagnostic decisions about mastery and can guide remediation within environmental education courses.
Keywords: Criterion-referenced assessment; amplified objectives; Rasch model; item response theory; environmental education; standard setting
ReferencesAbdel Hamid, J. (1998). Educational evaluation and psychological measurement (3rd ed.). Dar Al-Nahda Al-Arabiya. https://iasj.rdd.edu.iq/journals/uploads/2024/12/07/7c27c74b32e554b6661e7b87d9858644.pdf
Abu Hatab, F., Othman, S. A., & Sadiq, A. (2003). Psychological evaluation (3rd ed.). Anglo-Egyptian Library. https://doi.org/10.26034/cm.jostrans.2019.171
Al-Sharqawi, A. M., Al-Sheikh, S. A.-K., Kazim, A. M., & Abd Al-Salam, N. M. (1996). Contemporary trends in psychological and educational measurement and evaluation. Anglo-Egyptian Library. https://iasj.rdd.edu.iq/journals/uploads/2025/01/02/09eaf25899e6e10962dee76efb07735d.pdf
Allam, S. E. M. (1986). Contemporary developments in psychological and educational measurement. Al-Qabas Commercial Printing Press. https://files.eric.ed.gov/fulltext/EJ1131564.pdf
Allam, S. E. (2001). Diagnostic tests: A criterion in educational, psychological, and training fields (2nd ed.). Dar Al-Fikr Al-Arabi. https://iasj.rdd.edu.iq/journals/uploads/2025/04/24/8dba8f7fc2e02f15a7c55c199875449e.pdf
Allam, S. E. (2005). Unidimensional and multidimensional item response models and their applications in psychological and educational measurement (1st ed.). Dar Al-Fikr Al-Arabi. https://iasj.rdd.edu.iq/journals/uploads/2025/04/24/8dba8f7fc2e02f15a7c55c199875449e.pdf
Gallagher, J. D. (1998). Classroom assessment for teachers. Prentice Hall. https://g.co/kgs/kmSTtyX
Gray, W. M. (1978). A comparison of Piagetian theory and criterion-referenced measurement. Review of Educational Research, 48(2), 223–249. https://doi.org/10.3102/00346543048002223
Habeeb, S. T., & Kadhim, B. (2018). Modern and traditional measurement theories: Principles and applications. https://www.researchgate.net/publication/381957459_nzryty_alqyas_alhdytht_waltqlydyt_mbady_wttbyqat
Hambleton, R. K. (1995). Comparison of classical test theory and item response theory and their applications to test development: Issues and practice. Journal of Educational Measurement, 12(1), 38–47. https://www.researchgate.net/publication/249469819_An_NCME_Instructional_Module_on_Comparison_of_Classical_Test_Theory_and_Item_Response_Theory_and_Their_Applications_to_Test_Development
Harris, D. (1984). Comparison of one-, two-, and three-parameter IRT models. Educational Measurement, Spring Issue. https://www.stat.cmu.edu/~brian/PIER-methods/For%202013-02-26/Readings/harris-article.pdf
Kazem, A. M. (1988). A critical theoretical study on the objective measurement of behavior: The Rasch model. Kuwait Foundation for the Advancement of Sciences. https://www.researchgate.net/publication/375379963_Differential_Item_Functioning_Analysis_of_an_Emotional_Intelligence_Scale_for_Human_Resources_Management_at_Sohar_University
Kubiszyn, T., & Borich, G. (2000). Educational testing and measurement (6th ed.). John Wiley & Sons. https://www.scirp.org/reference/referencespapers?referenceid=2411142
Nunnally, J. C. (1978). Psychological theory (2nd ed.). McGraw-Hill. https://www.scirp.org/reference/referencespapers?referenceid=2615575
Popham, W. J. (1978). Criterion-referenced measurement. Prentice Hall. https://www.amazon.com/Criterion-Referenced-Measurement-W-James-Popham/dp/0131936077
Popham, W. J. (2014). Criterion-referenced measurement: Half a century wasted? Educational Leadership, 71(6), 62–66. https://www.researchgate.net/publication/287234320_Criterion-Referenced_Measurement_Half_a_Century_Wasted