Impact of Sampling Variability When Estimating the Explained Common Variance

Assessing multidimensionality of a scale or test is a staple of educational and psychological measurement. One approach to evaluate approximate unidimensionality is to fit a bifactor model where the subfactors are determined by substantive theory and estimate the explained common variance (ECV) of the general factor. The ECV says to what extent the explained variance is dominated by the general factor over the specific factors, and has been used, together with other methods and statistics, to determine if a single factor model is sufficient for analyzing a scale or test (Rodriguez et al., 2016). In addition, the individual item-ECV (I-ECV) has been used to assess approximate unidimensionality of individual items (Carnovale et al., 2021; Stucky et al., 2013). However, the ECVand I-ECVare subject to random estimation error which previous studies have not considered. Not accounting for the error in estimation can lead to conclusions regarding the dimensionality of a scale or item that are inaccurate, especially when an estimate of ECVor I-ECV is compared to a pre-specified cut-off value to evaluate unidimensionality. The objective of the present study is to derive standard errors of the estimators of ECV and I-ECV with linear confirmatory factor analysis (CFA) models to enable the assessment of random estimation error and the computation of confidence intervals for the parameters. We use Monte-Carlo simulation to assess the accuracy of the derived standard errors and evaluate the impact of sampling variability on the estimation of the ECV and I-ECV. In a bifactor model for J items, denote Xj, j 1⁄4 1, ..., J , as the observed variable and let G denote the general factor. We define the S subfactors Fs, s2f1,..., Sg, and Js as the set of indicators for each subfactor. Each observed indicator Xj is then defined by the multiple factor model (McDonald, 2013)

where λ Gj and λ sj are the factor loadings, μ j denotes the item mean, and ϵ j is a random error term with variance ψ 2 j . In a bifactor model, G and each F s are independent of each other and the error terms are independent of each other and with G and each F s .
The explained common variance for CFA models is equal to (Rodriguez et al., 2016) ECV ¼ In the literature, it has been suggested that ECV values higher than 0.7 to 0.8 indicate sufficient unidimensionality of the scales to adopt a unidimensional model (Rodriguez et al., 2016). To further evaluate the unidimensionality of a specific item score, another statistic, the explained common variance of an item (I À ECV j ), was suggested and is defined as (Stucky et al., 2013) The ECV and I À ECV j are functions of the factor loadings and subject to sampling variability when estimating the unknown parameters. However, this has not been considered in prior studies and the impact of random estimation error on the ECV and I À ECV j is unknown. Here, we provide a method for estimating the sampling variability of these statistics based on standard asymptotic theory. Assuming that the estimator b θ of the CFA model parameters θ fulfills ffiffi ffi n p ð b θ À θÞ → D N ð0, P Þ as the sample size n tends to infinity, we can approximate the variance of the estimators of ECV and I À ECV j by applying the delta method (Ferguson, 1996) where g is either ECV or I À ECV j , continuously differentiable functions of b θ. Define λ as the vector of all factor loadings and define λ G as the vector of all factor loadings for the general factor. We require the derivatives of the ECV and I À ECV j with respect to the unknown parameters and obtain, for any factor loading λ 2 λ and To assess the accuracy of the standard errors and illustrate the impact of estimation error for ECV and I À ECV j , we simulated data in accordance with a 35-item scale with factor loadings given in Table 3 in Stucky et al. (2013). The bifactor model had one general factor and five independent specific factors that were each measured by 10, 6, 7, 6 and 6 items. All factors were independently generated from standard normal distributions and the residuals were randomly drawn from a normal distribution with mean zero and variances for each item randomly drawn from the U ð0:2,0:6Þ distribution. The true ECV for these data was .615 and the I-ECVs ranged from .171 to .982. We used sample sizes 200, 400, 800 and 1600, and 1000 replications, where the CFA models were estimated using maximum likelihood with the R package lavaan (Rosseel, 2012) and where the R code for computing the ECV and I-ECVs with associated standard errors are available as online Supplementary material.
In Table 1, we present the simulation results for the ECV, showing that the estimators are accurate for all sample sizes. The Monte-Carlo standard errors range from .031 at sample size 200 to 0.011 at sample size 1600 and the asymptotic standard errors based on the delta method with the derivatives from equation (1) are accurate for all sample sizes.
We present the simulation results for the I-ECVs pertaining to the 10 items that measure the first subfactor in Table 2, for sample sizes 200 and 1600. The simulation study included 35 items but we omitted the remaining results since they were highly similar. In Table 2, it can be seen that the sampling variability for I-ECV is generally larger than for the ECV since these statistics are based on the estimated factor loadings of only a single item. Meanwhile, even at sample size 200, the asymptotic standard errors from the delta method using the derivatives from equation (2) are accurate. The results in Table 2 imply that the assessment of item-level unidimensionality with the I-ECV carries with it substantial random error. The largest impact for the first subfactor exists for item 9, where a 95% confidence interval had average length 2 × z 0:975 × 0:069 ≈ 0:270 with sample size 200, which is quite large for a statistic that ranges between 0 and 1. The average length of a 95% confidence interval for the I-ECV of the same item however reduces to 2 × z 0:975 × 0:021 ≈ 0:082 with sample size 1600.
Assessing approximate unidimensionality is commonly done as part of scale development in education and psychology and useful statistics like the ECV help in evaluating unidimensionality. However, this process should be complemented with an assessment of the random error associated with the statistics used. Just like the reporting of reliability coefficients should include standard errors or confidence intervals (Fan & Thompson, 2001), we argue that measures like the ECV and I-ECV should be reported together with an indication of the amount of random error. In this study, we presented a simple solution to assess estimation error for linear factor models and implemented the approach in R for use by interested researchers. Future studies can include the specific results for other statistics commonly used with bifactor models, such as the omega-total and omegahierarchical coefficients.

Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.

Supplemental Material
Supplemental material for this article is available online.