If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating reasons for misfit, it is helpful to compare item response curves (IRCs) visually to detect outlier items. This is only feasible if the IRF employed is sufficiently flexible to display deviations in shape from the norm. A quasi-parametric IRF that can be made arbitrarily flexible by increasing the number of parameters is proposed for this purpose. To take capitalization on chance into account, the use of Akaike information criterion or Bayesian information criterion goodness of approximation measures is recommended for suggesting the number of parameters to be retained. These measures balance the effect on fit of random error of estimation against systematic error of approximation. Computational aspects are considered and efficacy of the methodology developed is demonstrated.

Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In Petrox, B. N., Caski, F. (Eds.), Second international symposium on information theory (pp. 267281). Budapest, Hungary: Akademiai Kiado.
Google Scholar
Baker, F. B. (1992). Item response theory: Parameter estimation techniques. New York, NY: Marcel Dekker.
Google Scholar
Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In Lord, F. M., Novick, M. R. (Eds.), Statistical theories of mental test scores (pp. 399402). Reading MA: Addison-Wesley.
Google Scholar
Bock, R. D., Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443459.
Google Scholar | Crossref
Bock, R. D., Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179197.
Google Scholar | Crossref
Bock, R. D., Moustaki, I. (2007). Item response theory in a general framework. In Rao, C. R., Sinharay, S. (Eds.), Handbook of statistics, volume 26: Psychometrics (pp. 469514). Amsterdam, The Netherlands: North-Holland.
Google Scholar
Browne, M. W. (2000). Cross-validation methods. Journal of Mathematical Psychology, 44, 108132.
Google Scholar | Crossref | Medline
Burnham, K. P., Anderson, D. R. (2004). Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods and Research, 33, 261304.
Google Scholar | SAGE Journals
Cudeck, R., Henly, S. J. (1991). Model selection in covariance structures analysis and the “problem” of sample size: A clarification. Psychological Bulletin, 109, 512519.
Google Scholar | Crossref | Medline
De Leeuw, J. (1992). Introduction to Akaike (1973) information theory and an extension of the maximum likelihood principle. In Kotz, S., Johnson, N. L. (Eds.), Breakthroughs in statistics (Vol. 1, pp. 599609). London, England: Springer-Verlag.
Google Scholar | Crossref
Drasgow, F., Levine, M. V., Williams, B., McLaughlin, M. E., Candell, G. L. (1989). Modeling incorrect responses to multiple-choice items with multilinear formula score theory. Applied Psychological Measurement, 13, 285299.
Google Scholar | SAGE Journals
Duncan, K. A., MacEachern, S. N. (2008). Nonparametric Bayesian modeling for item response. Statistical Modeling, 8, 4166.
Google Scholar | SAGE Journals
Duncan, K. A., MacEachern, S. N. (2013). Nonparametric Bayesian modeling for item response with a three parameter logistic prior mean. In Edwards, M. C., MacCallum, R. C. (Eds.), Current topics in the theory and application of latent variable methods. New York, NY: Routledge.
Google Scholar
Elphinstone, C. D. (1983). A target distribution model for nonparametric density estimation. Communications in Statistics—Theory and Methods, 12, 161198.
Google Scholar | Crossref
Elphinstone, C. D. (1985). A method of distribution and density estimation (Unpublished dissertation). University of South Africa, Pretoria, South Africa.
Google Scholar
Hayley, D.C. (1952). Estimation of the dosage mortality relationship when the dose is subject to error. (Technical Report No. 15). Stanford, CA: Stanford University, Applied Mathematics and Statistics Laboratory.
Google Scholar
Hawkins, D. M. (1994). Fitting monotonic polynomials to data. Computational Statistics, 9, 233247.
Google Scholar
Lee, Y.-S. (2007). A comparison of methods for nonparametric estimation of item characteristic curves for binary items. Applied Psychological Measurement, 31, 121134.
Google Scholar | SAGE Journals
Lord, F. M., Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison Wesley.
Google Scholar
Meijer, R. R., Baneke, J. J. (2004). Analyzing psychopathology items: A case for nonmetric item response theory modeling. Psychological Methods, 9, 354368.
Google Scholar | Crossref | Medline
Neyman, J., Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrika, 16, 132.
Google Scholar | Crossref
Ramsay, J. O. (1977). Monotonic weighted power transformations to additivity. Psychometrika, 42, 83109.
Google Scholar | Crossref
Ramsay, J. O. (1991). Kernel smoothing approaches to nonparametric item characteristic curve estimation. Psychometrika, 56, 611630.
Google Scholar | Crossref
Ramsay, J. O. (2000). TestGraf: A program for the graphical analysis of multiple choice test and questionnaire data [Computer program and manual]. Retrieved from http://www.psych.mcgill.ca/faculty/ramsay/ramsay.html
Google Scholar
Ramsay, J. O., Winsberg, S. (1991). Maximum marginal likelihood estimation for semiparametric item analysis. Psychometrika, 56, 365379.
Google Scholar | Crossref
Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461464.
Google Scholar | Crossref
Sinnott, L. T. (1997). Filtered polynomial density approximations and their application to discriminant analysis (MS Thesis). The Ohio State University, Columbus, OH.
Google Scholar
Thissen, D., Chen, W.-H, Bock, R. D. (2003). Multilog (version 7) [Computer software]. Lincolnwood, IL: Scientific Software International.
Google Scholar
Thissen, D., Orlando, M. (2001). Item response theory for items scored in two categories. In Thissen, D., Wainer, H. (Eds.), Test scoring (pp. 73140). Mahwah, NJ: Lawrence Erlbaum.
Google Scholar
Woods, C. M., Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spline-based densities. Psychometrika, 71, 281301.
Google Scholar | Crossref | Medline
Access Options

My Account

Welcome
You do not have access to this content.



Chinese Institutions / 中国用户

Click the button below for the full-text content

请点击以下获取该全文

Institutional Access

does not have access to this content.

Purchase Content

24 hours online access to download content

Research off-campus without worrying about access issues. Find out about Lean Library here

Your Access Options


Purchase

JEB-article-ppv for $37.50

Cookies Notification

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Find out more.
Top