Cognitive or skills diagnosis models are discrete latent variable models developed specifically for the purpose of identifying the presence or absence of multiple fine-grained skills. However, applications of these models typically involve dichotomous or dichotomized data, including data from multiple-choice (MC) assessments that are scored as right or wrong. The dichotomization approach to the analysis of MC data ignores the potential diagnostic information that can be found in the distractors and is therefore deemed diagnostically suboptimal. To maximize the diagnostic value of MC assessments, this article prescribes how MC options should be constructed to make them more cognitively diagnostic and proposes a cognitive diagnosis model for analyzing such data. The article discusses the specification of the proposed model and estimation of its parameters. Moreover, results of a simulation study evaluating the viability of the model and an estimation algorithm are presented. Finally, practical considerations concerning the proposed framework are discussed.

Baker, F.B. (1992). Item response theory: Parameter estimation techniques . New York: Dekker.
Google Scholar
Birenbaum, M. , Tatsuoka, C. , & Xin, T. (2005). Large-scale diagnostic assessment: Comparison of eighth graders' mathematics performance in the United States, Singapore and Israel. Assessment in Education Principles Policy and Practice, 12, 167-181.
Google Scholar | Crossref
Briggs, D. , Alonzo, A. , Schwab, C. , & Wilson, M. (2006). Diagnostic assessment with ordered multiple-choice items. Educational Assessment, 11, 33-63.
Google Scholar | Crossref
Carlin, B.P. , & Louis, T.A. (2000). Bayes and empirical Bayes methods for data analysis . New York: Chapman & Hall .
Google Scholar | Crossref
de la Torre, J. (2006, June). Skills profile comparisons at the state level: An application and extension of cognitive diagnosis modeling in NAEP . Paper presented at the International Meeting of the Psychometric Society, Montreal, Canada.
Google Scholar
de la Torre, J. , & Patz, R. (2005). Making the most of what we have: A practical application of multidimensional item response theory in test scoring. Journal of Educational and Behavioral Statistics, 30, 295-311.
Google Scholar | SAGE Journals | ISI
Doignon, J.P. , & Falmagne, J.C. (1999). Knowledge spaces. New York: Springer-Verlag.
Google Scholar | Crossref
Doornik, J.A. (2002). Object-oriented matrix programming using Ox (Version 3.1). [Computer software]. London: Timberlake Consultants Press.
Google Scholar
Embretson, S. (1984). A general latent trait model for response processes . Psychometrika, 49, 175-186.
Google Scholar | Crossref | ISI
Haertel, E.H. (1989). Using restricted latent class models to map the skill structure of achievement items. Journal of Educational Measurement, 26, 333-352.
Google Scholar | Crossref | ISI
Haertel, E.H. , & Wiley, D.E. (1993). Representations of ability structures: Implications for testing. In N. Frederiksen , R. J. Mislevy , & I. Bejar (Eds.), Test theory for a new generation of tests (pp. 359-384). Hillsdale, NJ: Erlbaum.
Google Scholar
Halford, G.S. , Andrews, G. , Dalton, C. , Boag, C. , & Zielinski, T. (2002). Young children's performance on the balance scale: The influence of relational complexity. Journal of Experimental Child Psychology, 81, 417-445.
Google Scholar | Crossref | Medline | ISI
Jansen, B.R.J. , & van der Maas, H.L.J. (2002). The development of children's rule use on the balance scale task. Journal of Experimental Child Psychology, 81, 383-416.
Google Scholar | Crossref | Medline | ISI
Junker, B.W. , & Sijtsma, K. (2001). Cognitive assessment models with few assumptions, and connections with nonparametric item response theory. Applied Psychological Measurement, 25, 258-272.
Google Scholar | SAGE Journals | ISI
Macready, G.B. , & Dayton, C.M. (1977). The use of probabilistic models in the assessment of mastery. Journal of Educational Statistics, 33, 379-416.
Google Scholar
Mislevy, R. (1995). Probability-based inference in cognitive diagnosis . In P. D. Nichols , S. F. Chipman , & R. L. Brennan (Eds.), Cognitively diagnostic assessment (pp. 43-71). Hillsdale, NJ: Erlbaum.
Google Scholar
National Research Council. (2001). Knowing what students know: The science and design of educational assessment . Washington, DC: National Academies Press.
Google Scholar
National Research Council. (2003). Strategic education research partnership/Committee on a strategic education research partnership; M. S. Donovan, A. K. Wigdor, and C. E. Snow, Editors . Washington, DC: National Academies Press.
Google Scholar
Neyman, J. , & Scott, E.L. (1948). Consistent estimates based on partially consistent observations. Econometrika, 16, 661-679.
Google Scholar | Crossref | ISI
Nitko, A.J. (2001). Educational assessment of students (3rd ed.). Columbus, OH: Merrill Prentice Hall.
Google Scholar
Osterlind, S.J. (1998). Constructing test items: Multiple choice, constructed-response, performance and other formats (2nd ed.). Boston: Kluwer Academic.
Google Scholar
Pellegrino, J.W. , Baxter, G.P. , & Glaser, R. (1999). Addressing the ``two disciplines'' problem: Linking theories of cognition and learning with assessment and instructional practices . In A. Iran-Nejad & P. D. Pearson (Eds.), Review of research in education (pp. 307-353). Washington, DC. American Educational Research Association.
Google Scholar
Sadler, P.M. (1998). Psychometric models of student conceptions in science: Reconciling qualitative studies and distractor-driven assessment instruments. Journal of Research in Science Teaching, 35, 265-296.
Google Scholar | Crossref | ISI
Shultz, T.R. , Mareschal, D. , & Schmidt, W.C. (1994). Modeling cognitive development on balance scale phenomena. Machine Learning, 16, 57-86.
Google Scholar | Crossref | ISI
Siegler, R.S. (1976). Three aspects of cognitive development. Cognitive Psychology, 8, 481-520.
Google Scholar | Crossref | ISI
Siegler, R.S. (1981). Developmental sequences within and between concepts . Monographs of the Society for Research in Child Development , 46 (2, Serial No. 189).
Google Scholar | Crossref | ISI
Stiggins. R.J. (2002). Assessment crisis: The absence of assessment for learning. Phi Delta Kappan, 83, 758-765.
Google Scholar | SAGE Journals | ISI
Tatsuoka, C. (2002). Data-analytic methods for latent partially ordered classification models. Journal of the Royal Statistical Society Series C (Applied Statistics), 51, 337-350.
Google Scholar | Crossref | ISI
Tatsuoka, K.K. (1983). Rule-space: An approach for dealing with misconceptions based on item response theory. Journal of Educational Measurement , 20, 345-354.
Google Scholar | Crossref | ISI
Tatsuoka, K.K. (1990). Toward an integration of item-response theory and cognitive error diagnosis. In N. Frederiksen , R. Glaser , A. Lesgold , & Safto, M. (Eds.), Monitoring skills and knowledge acquisition (pp. 453-488). Hillsdale, NJ: Erlbaum.
Google Scholar
Tatsuoka, K.K. , Corter, J. , & Tatsuoka, C. (2004). Patterns of diagnosed mathematical content and process skills in TIMSS-R across a sample of 20 countries. American Educational Research Journal, 41, 901-906.
Google Scholar | SAGE Journals | ISI
Thissen, D. , & Steinberg, L. (1997). A response model for multiple-choice items. In W. van der Linden & R. Hambleton (Eds.), Handbook of modern item response theory (pp. 51-65). New York: Springer.
Google Scholar | Crossref
van der Maas, H.L.J. , & Jansen, B R. J. (2003). What response times tell of children's behavior on the balance scale task. Journal of Experimental Child Psychology, 85, 141-177.
Google Scholar | Crossref | Medline | ISI
Wainer, H. , Vevea, J.L. , Camacho, F. , Reeve, B. , Rosa, K. , Nelson, L. , et al. (2001). Augmented scores- ``Borrowing strength'' to compute scores based on small number of items. In D. Thissen & H. Wainer (Eds.), Test scoring (pp. 343-387). Mahwah, NJ: Lawrence Erlbaum.
Google Scholar
Wiggins, G. (1998). Educative assessment: Designing assessment to inform and improve performance. San Francisco: Jossey-Bass.
Google Scholar
Wilson, M. , & Sloane, K. (2000). From principles to practice: An embedded assessment system. Applied Measurement in Education, 13, 181-208.
Google Scholar | Crossref | ISI
Access Options

My Account

Welcome
You do not have access to this content.



Chinese Institutions / 中国用户

Click the button below for the full-text content

请点击以下获取该全文

Institutional Access

does not have access to this content.

Purchase Content

24 hours online access to download content

Research off-campus without worrying about access issues. Find out about Lean Library here

Your Access Options


Purchase

APM-article-ppv for $37.50
Single Issue 24 hour E-access for $225.66

Cookies Notification

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Find out more.
Top