Dual item response theory (IRT) models, in which both items and individuals have different amounts of measurement error, have been proposed in the literature. Existing developments of these models, however, are feasible only for continuous responses. This article discusses a comprehensive dual modeling approach, based on underlying latent response variables, from which specific models for continuous, graded, and binary responses are obtained. Procedures for (a) calibrating the items, (b) scoring individuals, (c) assessing model appropriateness, and (d) assessing measurement precision are discussed for all the resulting models. Simulation results suggest that the proposal is feasible. A practical illustration is given with an empirical example in the personality domain.
