This paper aims to translate for practitioners the principles and methods for evaluating screening measures in education, including benchmark goals and cut points, from our technical manuscript “Evaluation of Diagnostic Systems: The Selection of Students at Risk of Academic Difficulties” (this issue). We offer a brief description of procedures developed over the past 50 years including the receiver operating characteristic (ROC) curves, the area under the ROC curve as a general measure of screener accuracy, and approaches to selecting a specific cut score to indicate risk. We also provide reporting standards to help practitioners evaluate research on screeners supported by best practices and to encourage researchers to attend to key reporting principles, such as using confidence bounds as estimates of precision. We then discuss examples from the literature and emphasize the imprecision of statistical estimates from small samples. Screeners and diagnostic tests, developed and evaluated with care and implemented consistently in schools, can improve educators’ decisions about resource allocation and ultimately improve the delivery of supports to students.

Atkinson, R. C. (1963). A variable sensitivity theory of signal detection. Psychological Review, 70, 91–106.
Coker, D. L., Ritchey, K. D. (2014). Universal screening for writing risk in kindergarten. Assessment for Effective Intervention, 39, 245–256.
Missall, K., Hosp, J. (2013). Goal setting for grade-level standards in Iowa: K–6 reading. Iowa City: The University of Iowa.
Smolkowski, K., Cummings, K. (2015). Evaluation of the DIBELS (6th Edition) diagnostic system for the selection of native and proficient English speakers at risk of reading difficulties. Journal of Psychoeducational Assessment. Advance online publication. doi:10.1177/0734282915589017
Smolkowski, K., Cummings, K. D., Stryker, L. (in press). An introduction to the statistical evaluation of fluency measures with signal detection theory. In Cummings, K. D., Petscher, Y. (Eds.), Fluency metrics in education: Implications for test developers, researchers, and practitioners. New York, NY: Springer.
Tanner, W. P., Swets, J. A. (1954). A decision-making theory of signal detection. Psychological Review, 61, 401–409.