Abstract
If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating reasons for misfit, it is helpful to compare item response curves (IRCs) visually to detect outlier items. This is only feasible if the IRF employed is sufficiently flexible to display deviations in shape from the norm. A quasi-parametric IRF that can be made arbitrarily flexible by increasing the number of parameters is proposed for this purpose. To take capitalization on chance into account, the use of Akaike information criterion or Bayesian information criterion goodness of approximation measures is recommended for suggesting the number of parameters to be retained. These measures balance the effect on fit of random error of estimation against systematic error of approximation. Computational aspects are considered and efficacy of the methodology developed is demonstrated.
References
|
Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In Petrox, B. N., Caski, F. (Eds.), Second international symposium on information theory (pp. 267–281). Budapest, Hungary: Akademiai Kiado. Google Scholar | |
|
Baker, F. B. (1992). Item response theory: Parameter estimation techniques. New York, NY: Marcel Dekker. Google Scholar | |
|
Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In Lord, F. M., Novick, M. R. (Eds.), Statistical theories of mental test scores (pp. 399–402). Reading MA: Addison-Wesley. Google Scholar | |
|
Bock, R. D., Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443–459. Google Scholar | Crossref | |
|
Bock, R. D., Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179–197. Google Scholar | Crossref | |
|
Bock, R. D., Moustaki, I. (2007). Item response theory in a general framework. In Rao, C. R., Sinharay, S. (Eds.), Handbook of statistics, volume 26: Psychometrics (pp. 469–514). Amsterdam, The Netherlands: North-Holland. Google Scholar | |
|
Browne, M. W. (2000). Cross-validation methods. Journal of Mathematical Psychology, 44, 108–132. Google Scholar | Crossref | Medline | |
|
Burnham, K. P., Anderson, D. R. (2004). Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods and Research, 33, 261–304. Google Scholar | SAGE Journals | |
|
Cudeck, R., Henly, S. J. (1991). Model selection in covariance structures analysis and the “problem” of sample size: A clarification. Psychological Bulletin, 109, 512–519. Google Scholar | Crossref | Medline | |
|
De Leeuw, J. (1992). Introduction to Akaike (1973) information theory and an extension of the maximum likelihood principle. In Kotz, S., Johnson, N. L. (Eds.), Breakthroughs in statistics (Vol. 1, pp. 599–609). London, England: Springer-Verlag. Google Scholar | Crossref | |
|
Drasgow, F., Levine, M. V., Williams, B., McLaughlin, M. E., Candell, G. L. (1989). Modeling incorrect responses to multiple-choice items with multilinear formula score theory. Applied Psychological Measurement, 13, 285–299. Google Scholar | SAGE Journals | |
|
Duncan, K. A., MacEachern, S. N. (2008). Nonparametric Bayesian modeling for item response. Statistical Modeling, 8, 41–66. Google Scholar | SAGE Journals | |
|
Duncan, K. A., MacEachern, S. N. (2013). Nonparametric Bayesian modeling for item response with a three parameter logistic prior mean. In Edwards, M. C., MacCallum, R. C. (Eds.), Current topics in the theory and application of latent variable methods. New York, NY: Routledge. Google Scholar | |
|
Elphinstone, C. D. (1983). A target distribution model for nonparametric density estimation. Communications in Statistics—Theory and Methods, 12, 161–198. Google Scholar | Crossref | |
|
Elphinstone, C. D. (1985). A method of distribution and density estimation (Unpublished dissertation). University of South Africa, Pretoria, South Africa. Google Scholar | |
|
Hayley, D.C. (1952). Estimation of the dosage mortality relationship when the dose is subject to error. (Technical Report No. 15). Stanford, CA: Stanford University, Applied Mathematics and Statistics Laboratory. Google Scholar | |
|
Hawkins, D. M. (1994). Fitting monotonic polynomials to data. Computational Statistics, 9, 233–247. Google Scholar | |
|
Lee, Y.-S. (2007). A comparison of methods for nonparametric estimation of item characteristic curves for binary items. Applied Psychological Measurement, 31, 121–134. Google Scholar | SAGE Journals | |
|
Lord, F. M., Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison Wesley. Google Scholar | |
|
Meijer, R. R., Baneke, J. J. (2004). Analyzing psychopathology items: A case for nonmetric item response theory modeling. Psychological Methods, 9, 354–368. Google Scholar | Crossref | Medline | |
|
Neyman, J., Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrika, 16, 1–32. Google Scholar | Crossref | |
|
Ramsay, J. O. (1977). Monotonic weighted power transformations to additivity. Psychometrika, 42, 83–109. Google Scholar | Crossref | |
|
Ramsay, J. O. (1991). Kernel smoothing approaches to nonparametric item characteristic curve estimation. Psychometrika, 56, 611–630. Google Scholar | Crossref | |
|
Ramsay, J. O. (2000). TestGraf: A program for the graphical analysis of multiple choice test and questionnaire data [Computer program and manual]. Retrieved from http://www.psych.mcgill.ca/faculty/ramsay/ramsay.html Google Scholar | |
|
Ramsay, J. O., Winsberg, S. (1991). Maximum marginal likelihood estimation for semiparametric item analysis. Psychometrika, 56, 365–379. Google Scholar | Crossref | |
|
Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464. Google Scholar | Crossref | |
|
Sinnott, L. T. (1997). Filtered polynomial density approximations and their application to discriminant analysis (MS Thesis). The Ohio State University, Columbus, OH. Google Scholar | |
|
Thissen, D., Chen, W.-H, Bock, R. D. (2003). Multilog (version 7) [Computer software]. Lincolnwood, IL: Scientific Software International. Google Scholar | |
|
Thissen, D., Orlando, M. (2001). Item response theory for items scored in two categories. In Thissen, D., Wainer, H. (Eds.), Test scoring (pp. 73–140). Mahwah, NJ: Lawrence Erlbaum. Google Scholar | |
|
Woods, C. M., Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spline-based densities. Psychometrika, 71, 281–301. Google Scholar | Crossref | Medline |
