Abstract
Dual item response theory (IRT) models, in which items and individuals have different amounts of measurement error, have been proposed in the literature. Existing developments of these models, however, apply only to continuous responses. This article discusses a comprehensive dual modeling approach, based on underlying latent response variables, from which specific models for continuous, graded, and binary responses are obtained. Procedures for (a) calibrating the items, (b) scoring individuals, (c) assessing model appropriateness, and (d) assessing measurement precision are discussed for all the resulting models. Simulation results suggest that the proposal is feasible. A practical illustration is given with an empirical example in the personality domain.
