Several states have changed their statewide achievement tests over the past 5 years. These changes may pose difficulties for educators tasked with identifying students in need of additional support. This study evaluated the stability of decision-making accuracy estimates across changes to the statewide achievement test. We analyzed extant data from a large suburban district in Wisconsin in 2014–2015 (N = 2,774) and 2015–2016 (N = 2,882). We estimated the decision-making accuracy of recommendations from the Measures of Academic Progress for predicting risk on a Common Core State Standards aligned test (2014–2015) and a new test based on updated academic standards (2015–2016) in reading and math. Findings suggest that sensitivity and specificity estimates were relatively stable in math. Changes in the criterion measure were associated with decreased sensitivity when predicting performance in reading. These results provide initial support for educators to continue existing screening practices until test vendors or state educational agencies establish cut-scores for predicting risk on the newer test. Using a lower cut-score to establish risk (increasing sensitivity while decreasing specificity) may be prudent in reading. Limitations and directions for future research are discussed.

Baker, D. L., Biancarosa, G., Park, B. J., Bousselot, T., Smith, J.-L., Baker, S. K., . . . Tindal, G. (2015). Validity of CBM measures of oral reading fluency and reading comprehension on high-stakes reading assessments in grades 7 and 8. Reading and Writing, 28, 57105. doi:10.1007/s11145-014-9505-4
Google Scholar | Crossref | ISI
Catts, H. W., Fey, M. E., Zhang, X., Tomblin, J. B. (2001). Estimating the risk of future reading difficulties in kindergarten children: A research-based model and its clinical implementation. Language, Speech, and Hearing Services in Schools, 32, 3850.
Google Scholar | Crossref | Medline | ISI
Catts, H. W., Petscher, Y., Schatschneider, C., Bridges, M. S., Mendoza, K. (2009). Floor effects associated with universal screening and their impact on the early identification of reading disabilities. Journal of Learning Disabilities, 42, 163176.
Google Scholar | SAGE Journals | ISI
Center on Standards, Alignment, Instruction, and Learning . (2016). State activity around adoption and replacement of CCR standards and aligned assessments: 2011-2015. Available from http://www.c-sail.org
Google Scholar
Codding, R. S., Petscher, Y., Truckenmiller, A. (2015). CBM reading, mathematics, and written expression at the secondary level: Examining latent composite relations among indices and unique predictions with a state achievement test. Journal of Educational Psychology, 107, 437450. doi:10.1037/a0037520
Google Scholar | Crossref | Medline
Decker, D. M., Hixson, M. D., Shaw, A., Johnson, G. (2014). Classification accuracy of oral reading fluency and maze in predicting performance on large-scale reading assessments. Psychology in the Schools, 51, 625635. doi:10.1002/pits.21773
Google Scholar | Crossref | ISI
Espin, C., Wallace, T., Lembke, E., Campbell, H., Long, J. D. (2010). Creating a progress-monitoring system in reading for middle-school students: Tracking progress toward meeting high-stakes standards. Learning Disabilities Research & Practice, 25, 6075. doi:10.1111/j.1540-5826.2010.00304.x
Google Scholar | Crossref
Every Student Succeeds Act , Pub. L., 114-95. (2015).
Google Scholar
Ford, J. W., Missall, K. N., Hosp, J. L., Kuhle, J. L. (2016). Comparing two CBM maze selection tools: Considering scoring and interpretive metrics for universal screening. Journal of Applied School Psychology, 32, 329353. doi:10.1080/15377903.2016.1207738
Google Scholar | Crossref
Fuchs, L. S., Fuchs, D., Compton, D. L. (2010). Rethinking response to intervention at middle and high school. School Psychology Review, 39, 2228.
Google Scholar | ISI
Gersten, R., Beckmann, S., Clarke, B., Foegen, A., Marsh, L., Witzel, B. (2009). Assisting students struggling with mathematics: Response to Intervention (RtI) for elementary and middle schools (NCEE 2009-4060). Washington, DC: National Center for Education Evaluation and Regional Assistance, Institute of Education Sciences, U.S. Department of Education. Retrieved from https://ies.ed.gov/ncee/wwc/practiceGuide/2
Google Scholar
Gersten, R., Clarke, B., Jordan, N. C., Newman-Gonchar, R., Haymond, K., Wilkins, C. (2012). Universal screening in mathematics for the primary grades: Beginnings of a research base. Exceptional Children, 78, 423445. doi:10.1177/001440291207800403
Google Scholar | SAGE Journals | ISI
Graham, J. W., Olchowski, A. E., Gilreath, T. D. (2007). How many imputations are really needed? Some practical clarifications of multiple imputation theory. Prevention Science, 8, 206213. doi:10.1007/s11121-007-0070-9
Google Scholar | Crossref | Medline | ISI
Harper, R., Reeves, B. (1999). Reporting of precision of estimates for diagnostic accuracy: A review. British Medical Journal, 318, 13221323. doi:10.1136/bmj.318.7194.1322
Google Scholar | Crossref | Medline
January, S. A., Ardoin, S. P. (2015). Technical adequacy and acceptability of curriculum-based measurement and the measures of academic progress. Assessment for Effective Intervention, 41, 315. doi:10.1177/1534508415579095
Google Scholar | SAGE Journals
Jenkins, J. R., Hudson, R. F., Johnson, E. S. (2007). Screening for service delivery in an RTI framework: Candidate measures. School Psychology Review, 36, 560582.
Google Scholar
Kane, M. T. (2013). The argument-based approach to validation. School Psychology Review, 42, 448457.
Google Scholar
Kilgus, S. P., Methe, S. A., Maggin, D. M., Tomasula, J. L. (2014). Curriculum-based measurement of oral reading (R-CBM): A diagnostic test accuracy meta-analysis of evidence supporting use in universal screening. Journal of School Psychology, 52, 377405. doi:10.1016/j.jsp.2014.06.002
Google Scholar | Crossref | Medline | ISI
Klingbeil, D. A., Nelson, P. M., Van Norman, E. R., Birr, C. (2017). Diagnostic accuracy of multivariate universal screening procedures for reading in upper elementary grades. Remedial and Special Education, 35, 308320. doi:10.1177/0741932517697446
Google Scholar | SAGE Journals
Leblanc, M., Dufore, E., McDougal, J. (2012). Using general outcome measures to predict student performance on state-mandated assessments: An applied approach for establishing predictive cutscores. Journal of Applied School Psychology, 28, 113. doi:10.1080/15377903.2012.643753
Google Scholar | Crossref
López-Ratón, M., Rodriguez-Álvarez, M. X., Suárez, C. C., Sampedro, F. G. (2014). OptimalCutpoints: An R package for selecting optimal cutpoints in diagnostic tests. Journal of Statistical Software, 61, 136.
Google Scholar | Crossref | ISI
Meehl, P. E., Rosen, A. (1955). Antecedent probability and the efficiency of psychometric signs, patterns, or cutting scores. Psychological Bulletin, 52, 194216. doi:10.1037/h0048070
Google Scholar | Crossref | Medline | ISI
National Center for Education Statistics . (2006). School locale definitions. Retrieved from https://nces.ed.gov/surveys/urbaned/definitions.asp
Google Scholar
National Center for Response to Intervention . (2016). Screening tools chart. Retrieved from https://rti4success.org/resources/tools-charts/screening-tools-chart
Google Scholar
Nelson, P. M., Van Norman, E. R., Lackner, S. K. (2016). A comparison of methods to screen middle school students for reading and math difficulties. School Psychology Review, 45, 327342.
Google Scholar | Crossref | ISI
Nelson, P. M., Van Norman, E. R., VanDerHeyden, A. (2016). Reduce, reuse, recycle: The longitudinal value of local cut scores using state test data. Journal of Psychoeducational Assessment, 35, 683694. doi:10.1177/0734282916658567
Google Scholar | SAGE Journals
Nese, J. F., Park, B. J., Alonzo, J., Tindal, G. (2011). Applied curriculum-based measurement as a predictor of high-stakes assessment: Implications for researchers and teachers. The Elementary School Journal, 111, 608624. doi:10.1086/659034
Google Scholar | Crossref | ISI
Northwest Evaluation Association . (2011). Technical manual for measures of academic progress (MAP) and measures of academic progress for primary grades (MPG). Portland, OR: Author.
Google Scholar
Northwest Evaluation Association . (2015). Smarter balanced preliminary performance levels. Portland, OR: Author.
Google Scholar
Petscher, Y., Kim, Y.-S., Foorman, B. R. (2011). The importance of predictive power in early screening assessments. Assessment for Effective Intervention, 36, 158166. doi:10.1177/1534508410396698
Google Scholar | SAGE Journals
R Core Team . (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Available from http://www.R-project.org/
Google Scholar
Shapiro, E. S. (2011). Academic skills problems (4th ed.). New York, NY: Guilford Press.
Google Scholar
Shapiro, E. S., Gebhardt, S. N. (2012). Comparing computer-adaptive and curriculum-based measurement methods of assessment. School Psychology Review, 41, 295305.
Google Scholar
Smarter Balanced Assessment Consortium . (2016). 2014-2015 technical report. Los Angeles, CA: Author.
Google Scholar
Stevenson, N. A., Reed, D. K., Tighe, E. L. (2016). Examining potential bias in screening measures for middle school students by special education and low socioeconomic status subgroups. Psychology in the Schools, 53, 533547. doi:10.1002/pits.21919
Google Scholar | Crossref | ISI
Swets, J. A., Dawes, R. M., Monahan, J. (2000). Psychological science can improve diagnostic decisions. Psychological Science in the Public Interest, 1, 126.
Google Scholar | SAGE Journals
Thum, Y. M., Hauser, C. H. (2015). NWEA 2015 MAP norms for student and school achievement status and growth. Portland, OR: Northwest Evaluation Association.
Google Scholar
VanDerHeyden, A. M. (2011). Technical adequacy of RTI decisions. Exceptional Children, 77, 335350. doi:10.1177/001440291107700305
Google Scholar | SAGE Journals | ISI
VanDerHeyden, A. M., Codding, R. S., Martin, R. (2017). Relative value of common screening measures in mathematics. School Psychology Review, 46, 6587. doi:10.17105/SPR46-1.65-87
Google Scholar | Crossref
Van Norman, E. R., Nelson, P. M., Klingbeil, D. A. (2017). Single measure and gated screening approaches for identifying students at-risk for academic problems: Implications for sensitivity and specificity. School Psychology Quarterly, 32, 405413. doi:10.1037/spq0000177
Google Scholar | Crossref | Medline
Vaughn, S., Fletcher, J. M. (2012). Response to intervention with secondary school students with reading difficulties. Journal of Learning Disabilities, 45, 244256. doi:10.1177/0022219412442157
Google Scholar | SAGE Journals | ISI
Wisconsin Department of Public Instruction . (2016). Wisconsin forward exam: Spring 2016 technical report. Madison: Author.
Google Scholar
Zhou, X., Obuchowski, N. A., McClish, D. (2011). Statistical methods in diagnostic medicine. New York, NY: John Wiley.
Google Scholar | Crossref
View access options

My Account

Welcome
You do not have access to this content.



Chinese Institutions / 中国用户

Click the button below for the full-text content

请点击以下获取该全文

Institutional Access

does not have access to this content.

Purchase Content

24 hours online access to download content

Your Access Options


Purchase

AEI-article-ppv for $15.00

Article available in:

Related Articles