Abstract
Finite mixture densities can be used to model data from populations known or suspected to contain a number of separate subpopulations. Most commonly used are mixture densities with Gaussian (univariate or multivariate) components, but mixtures with other types of component are also increas ingly used to model, for example, survival times. This paper gives a general introduction to the topic which should help when considering the other more specialized papers in this issue.
|
Tarter ME , Lock MD Model-free curve estimation. London: Chapman and Hall, 1993. Google Scholar | |
|
McLachlan GJ , Basford KE Mixture models; inference and applications to clustering. New York: Marcel Dekker, Inc., 1988 . Google Scholar | |
|
McGiffin DC , Galbraith AJ , McLachlan GJ et al. Aortic valve infection-risk factors for death and recurrent endocarditis following aortic valve replacement. Journal of Thoracic Cardiovascular Surgery 1992; 104: 511-20. Google Scholar | Medline | ISI | |
|
Blackstone EH , Naftel DC , Turner ME The decomposition of time-varying hazard into phases, each incorporating a separate stream of concomitant information. Journal of the American Statistical Association 1986; 81: 615-24. Google Scholar | Crossref | ISI | |
|
Pickering RM , Forbes JF A classification of Scottish infants using latent class analysis. Statistics in Medicine 1984; 3: 249-59. Google Scholar | Crossref | Medline | ISI | |
|
Pearson K. Contribution to the mathematical theory of evolution. Philosophical Transactions A 1894; 185: 71-110. Google Scholar | Crossref | |
|
Tan WY , Chang WC Some comparisons of the method of moments and the method of maximum likelihood in estimating parameters of a mixture of two normal densities. Journal of the American Statistical Association 1972; 67: 702-708. Google Scholar | Crossref | ISI | |
|
Charlier Cvl. Researchers into the theory of probability. Lunds Universities Årskrift, Ny foljd 1906; 2.1, No. 5. Google Scholar | |
|
Strömgren B. Tables and diagrams for dissecting a frequency curve into components by the half-invariant method. Skand. Aktuarietidskr 1934; 17: 7-54. Google Scholar | |
|
Rao CR The utilization of multiple measurements in the problems of biological classification . Journal of the Royal Statistical Society, Series B 1948; 10: 159-203. Google Scholar | |
|
Charlier Cvl , Wicksell SD On the dissection of frequency functions. Arkiv. for Matematik, Astronomi och Fysik 1924; 18: no. 6. Google Scholar | |
|
Doetsch G. Die Elimination des Dopplereffekts bei spektroskopischen feinstrukturen and exakte Bestimming der Kompronenten. Zeitschrift für Physik 1928; 49: 705-30. Google Scholar | Crossref | |
|
Tiselius A. , Kabat EA Electrophoretic study of immune sera and purified antibody preparations. Journal of Experimental Medicine 1939; 69: 119-31. Google Scholar | Crossref | Medline | |
|
Harding JP The use of probability paper for the graphical analysis of polymodal frequency distributions. Journal of the Marine Biology Association of the UK 1949; 28: 141-53. Google Scholar | Crossref | ISI | |
|
Hald A. Statistical theory with engineering applications. New York : Wiley, 1952. Google Scholar | |
|
Cassie RM Some uses of probability paper in the analysis of size frequency distributions . Australian Journal of Marine and Freshwater Research 1954; 5: 513-22. Google Scholar | |
|
Bhattacharya CG A simple method of resolution of a distribution into Gaussian components. Biometrics 1967; 23: 115-35. Google Scholar | Crossref | Medline | ISI | |
|
Newcombe S. A generalized theory of the combination of observations so as to obtain the best result. American Journal of Mathematics 1886; 8: 343-66. Google Scholar | Crossref | |
|
Hasselblad V. Estimation of parameters for a mixture of normal distributions. Technometrics 1966; 8: 431-44. Google Scholar | Crossref | ISI | |
|
Hasselblad V. Estimation of finite mixtures of distribution from the exponential family. Journal of the American Statistical Association 1969; 64: 1459-71. Google Scholar | Crossref | ISI | |
|
Wolfe JH A computer program for the maximum likelihood analysis of types. Technical Bulletin, 65-15: San Diego: US Naval Personnel Research Activity, 1965. Google Scholar | |
|
Wolfe JH NORMIX: computational methods for estimating the parameters of multivariate normal mixtures of distributions. Research Memorandum, SRM 68-2. San Diego: US Naval Personnel Research Activity, 1967. Google Scholar | |
|
Wolfe JH Pattern clustering by multivariate mixture analysis. Multivariate Behavioural Research 1970; 5: 329-50. Google Scholar | Crossref | Medline | ISI | |
|
Dempster AP , Laird NM , Rubin DB Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B 1977; 39: 1-38. Google Scholar | |
|
Medgyessi P. Decompositions of superpositions of distribution functions. Budapest: Hungarian Academy of Sciences, 1961. Google Scholar | |
|
Blischke WR Mixtures of distributions, In: International encyclopedia of statistics, Kruskal WH , Tanur JM eds. New York: The Free Press, 1979. Google Scholar | |
|
Thomas Eac. Mathematical models for the clustered firing of single cortical neurones. British Journal of Mathematical and Statistical Psychology 1966; 19: 151-62. Google Scholar | Crossref | Medline | ISI | |
|
Joffe AD Mixed exponential estimation by the method of half moments. Applied Statistics 1964; 13: 91-98. Google Scholar | Crossref | |
|
Mendenhall W. , Hader RJ Estimation of parameters of mixed exponentially distributed failure time distributions from censored life test data. Biometrika 1958; 45: 504-20. Google Scholar | Crossref | ISI | |
|
Kao Jhk. A graphical estimation of mixed Weibull parameters in life-testing electron tubes. Technometrics 1959; 1: 389-407. Google Scholar | Crossref | |
|
Lui KJ , Darrow WW , Rutherford GW A model-based estimate of the mean incubation period for AIDS in homosexual men. Science 1988; 20: 1333-35. Google Scholar | |
|
McLachlan GJ , McGiffin DC. On the role of finite mixture models in survival analysis. Statistical Methods in Medical Research 1994; 2: 211-26. Google Scholar | SAGE Journals | |
|
Green BF A general solution for the latent class model of latent structure analysis . Psychometrika 1951; 16: 151-66. Google Scholar | Crossref | Medline | |
|
Gibson WA Three multivariate models: factor analysis, latent structure analysis and latent profile analysis. Psychometrika 1959 ; 24: 229-52. Google Scholar | Crossref | ISI | |
|
Lazarsfeld PF , Henry NW Latent structure analysis. New York: Houghton Mifflin, 1968. Google Scholar | |
|
Aitkin M. , Anderson D. , Hinde J. Statistical modelling of data on teaching styles. Journal of the Royal Statistical Society, Series A 1981; 144: 419-48. Google Scholar | Crossref | |
|
Cohen AC Estimation in mixture of two normal distributions. Technometrics 1967; 9: 15-28. Google Scholar | Crossref | ISI | |
|
Everitt BS , Hand DJ Finite mixture distributions. London: Chapman and Hall, 1981. Google Scholar | Crossref | |
|
Quandt RE , Ramsey JB Estimating mixtures of normal distributions and switching regressions. Journal of the American Statistical Association 1978; 73: 730-38. Google Scholar | Crossref | ISI | |
|
Lindsay BG , Basak P. Multivariate normal mixtures: a fast consistent method of moments. Journal of the American Statistical Association 1993; 88: 468-76. Google Scholar | Crossref | ISI | |
|
Day NE Estimating the components of a mixture of normal distributions. Biometrika 1969; 56: 463-74. Google Scholar | Crossref | ISI | |
|
Jones PN , McLachlan GJ. Improving the convergence rate of the EM algorithm for a mixture model fitted to grouped truncated data. Journal of Statistical Computation and Simulation 1992; 43: 31-44. Google Scholar | Crossref | |
|
Keifer J. , Wolfowitz J. Consistency of the maximum likelihood estimates in the presence of infinitely many incidental parameters. Annals of Mathematical Statistics 1956; 27: 887-906. Google Scholar | Crossref | |
|
Hathaway RJ A constrained formulation of maximum-likelihood estimation for normal mixture distributions. Annals of Statistics 1985; 13: 795-800. Google Scholar | Crossref | ISI | |
|
Everitt BS Cluster analysis. London: Arnold , 1993. Google Scholar | |
|
Banfield JD , Raferty AE Model-based Gaussian and non-Gaussian clustering . Biometrics 1993; 49: 803-21. Google Scholar | Crossref | ISI | |
|
Behboodian J. Information matrix for a mixture of two normal distributions. Journal of Statistical Computation and Simulation 1972; 1: 295-314. Google Scholar | |
|
Chang WC Confidence interval estimation and transformation of data in a mixture of two multivariate normal distributions with any given large dimension. Technometrics 1979; 21: 351-55. Google Scholar | Crossref | ISI | |
|
Holgersson N. , Jorner U. Decomposition of a mixture of two normal components . Research Report 76-13, University of Uppsala, Sweden, 1976. Google Scholar | |
|
Fowlkes EB Some methods for studying the mixture of two normal (lognormal) distributions . Journal of the American Statistical Association 1979; 74: 561-75. Google Scholar | Crossref | ISI | |
|
Wolfe JH A Monte Carlo study of the sampling distribution of the likelihood ratio for mixtures of multinormal distributions. Technical Bulletin STB 72-2. San Diego: Naval Personnel and Training Research Laboratory , 1971. Google Scholar | |
|
Hernandez Avila A. Problems in cluster analysis. [Thesis]. Oxford, 1979. Google Scholar | |
|
Everitt BS A Monte Carlo investigation of the likelihood ratio test for the number of components in a mixture of normal distributions. Multivariate Behavioural Research 1981; 16: 171-80. Google Scholar | Crossref | Medline | ISI | |
|
Thode HC , Finch SJ , Mendell NR Simulated percentage points for the null distribution of the likelihood ratio test for a mixture of two normals. Biometrics 1989; 44: 1195-1201. Google Scholar | Crossref | ISI | |
|
McLachlan GJ On bootstrapping the likelihood ratio test statistic for the number of components in a normal mixture. Applied Statistics 1987; 36: 318-24. Google Scholar | Crossref | ISI | |
|
Gosh JM , Sen PK On the asymptotic performance of the log-likelihood ratio statistic for the mixture model and related results. In: Proceedings of the Berkeley Conference in Honour of Jerzy Neyman and Jack Keifer, Volume II, Le Cam LM , Olshen RA eds. Monterey: Wadsworth, 1985. Google Scholar | |
|
Self SG , Liang KY Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association 1987; 82: 605-10. Google Scholar | Crossref | ISI | |
|
Kraeplin E. Dementia praecox and paraphenia. Edinburgh: Livingstone, 1919. Google Scholar | |
|
Lewine Rrj. Sex differences in schizophrenia: timing or subtypes? Psychological Bulletin 1981; 90: 432-44. Google Scholar | |
|
Betemps EJ , Buncher CR Birthplace as a risk factor in motor neurone disease and Parkinson's disease . International Journal of Epidemiology 1993 ; 22: 898-904. Google Scholar | Crossref | Medline | ISI | |
|
Fergusson DM , Horwood LJ A latent class model of smoking experimentation in children. Journal of Child Psychology and Psychiatry 1989; 30: 761-73. Google Scholar | Crossref | Medline | ISI | |
|
Everitt BS A finite model for the clustering of mixed mode data. Statistics and Probability Letters 1988; 6: 305-309. Google Scholar | Crossref | ISI | |
|
Lawrence CJ , Krzanowski WJ Mixture separation for mixed-mode data. Statistics and Computing 1995, 6: 85-92. Google Scholar | Crossref | ISI | |
|
Titterington DM Mixture distributions (update). In: Kotz SM ed. Encyclopedia of statistical science (update). New York: Wiley, 1996. Google Scholar | |
|
Ripley BD Neural networks and related models for classification (with discussion). Journal of the Royal Statistical Society, Series B 1994; 56: 409-56. Google Scholar | |
|
Jorgensen MA Influence-based diagnostics for finite mixture models. Biometrics 1990; 46: 1047-58. Google Scholar | Crossref | ISI | |
|
Lindsay BG , Roeder K. Residual diagnostics for mixture models. Journal of the American Statistical Association 1992; 87: 785-94. Google Scholar | Crossref | ISI | |
|
Roeder K. A graphical technique for determining the number of components in a mixture of normals. Journal of the American Statistical Association 1994; 89: 487-95. Google Scholar | Crossref | ISI |
