Abstract
When covariates are measured with error, inference based on conventional generalized linear models can yield biased estimates of regression parameters. This problem can potentially be rectified by using generalized linear latent and mixed models (GLLAMMs) that include a measurement model for the relationship between observed and true covariates. However, these models are typically estimated under the assumption that both the true covariates and the measurement errors are normally distributed, although skewed covariate distributions are often observed in practice. In this article, we relax the normality assumption for the true covariates by developing nonparametric maximum likelihood estimation (NPMLE) for GLLAMMs. The methodology is applied to estimating the effect of dietary fibre intake on coronary heart disease. We also assess the performance of regression parameter estimation and of empirical Bayes prediction of the true covariate. Normal as well as skewed covariate distributions are simulated, and inference is performed using both maximum likelihood assuming normality and NPMLE. Both estimators are unbiased and have similar root mean square errors when the true covariate is normal. With a skewed covariate, the conventional estimator is biased but has a smaller mean square error than the NPMLE. NPMLE produces substantially improved empirical Bayes predictions of the true covariate when its distribution is skewed.
