Introduction
Alcohol is a potent teratogen that readily crosses the placenta to disrupt prenatal development (
Mattson et al., 2019). Fetal Alcohol Spectrum Disorder (FASD) is a diagnostic term used to encapsulate the various behavioural, physiological, and neurodevelopmental challenges arising from prenatal alcohol exposure (
Shelton et al., 2018). FASD is a significant public health concern and a leading cause of developmental disability within the western world (
Astley, 2004), affecting approximately 7.7 per 1000 children in the general population (
Lange et al., 2017). Although children with prenatal alcohol exposure may present with a range of physical, neurological, and psychological abnormalities, cognitive deficits are the most consistent feature of FASD, with impairment most frequently observed in working memory, attentional processes, executive functions, visuospatial reasoning, language, and general intelligence (
Coriale et al., 2013;
Shelton et al., 2018). Despite this, the occurrence and severity of cognitive impairment varies significantly between cases, providing an obstacle to the clinical assessment of FASD (
Benz et al., 2009).
FASD is underrecognised globally, with inconsistent assessment approaches and limited access to diagnostic services providing barriers to disease identification (
Hayes et al., 2023). As a consequence, many children with prenatal alcohol exposure may be at risk of receiving insufficient support for the neurodevelopmental challenges associated with FASD. Establishing sophisticated diagnostic considerations for the clinical assessment of FASD is therefore imperative to better equip clinicians with the skillset to disentangle the complex phenotypes associated with prenatal alcohol exposure. International guidelines currently inform clinical practice (
Astley, 2004;
Bower et al., 2016;
Cook et al., 2016), though they are not without criticism (
Hayes et al., 2022;
McLennan & Braunberger, 2017). For instance, stakeholders have identified a disconnect between conceptualisations of cognitive functions described by international diagnostic guidelines and neuropsychological theory underpinning the assessment tools recommended to assess latent cognitive constructs in children with prenatal alcohol exposure (
Hayes et al., 2022;
McLennan & Braunberger, 2017). The result of this divide is the underutilisation of rich psychometric research that could better inform the assessment of FASD-related cognitive deficits. Bridging the gap between psychometric theory and the current clinical guidelines is critical to improving the consistency and accuracy of FASD diagnosis. Such changes may reduce reliance on clinical intuition, allowing for a standardised and systematic approach to patient-centred care.
The Cattel-Horn-Carrol (CHC) theory of human cognition is a psychometric taxonomy that emphasises the covariance between aspects of intelligence and cognitive functionality (
Caemmerer et al., 2020). The CHC model is organised hierarchically, with stratum III representing the general factor of intelligence or ‘
g’, stratum II representing 16 broad cognitive abilities, and stratum I representing over 80 narrow abilities (
Jewsbury et al., 2017;
McGill & Dombrowski, 2019). The CHC model is largely congruent with many existing intelligence tests (
Flanagan & Alfonso, 2017;
Flanagan et al., 2013), including the Wechsler Intelligence Scales for Children (WISC), commonly recommended as a cognitive assessment tool for FASD diagnostic purposes (
Bower et al., 2016;
Cook et al., 2016).
While the fourth edition of the WISC (WISC-IV) is typically interpreted through four primary indices (
Wechsler, 2003), CHC-derived models propose a five-factor structure, with the primary subtests purported to measure fluid reasoning (Gf), visuospatial processing (Gv), crystalised intelligence (Gc), short-term memory (Gsm), and processing speed (Gs) (
Flanagan et al., 2013;
Keith et al., 2006). Despite deviating from the standard WISC-IV interpretation, the invariance of the CHC-derived five-factor structure is supported cross-contextually (
Chen et al., 2009;
Golay et al., 2013;
McGill & Canivez, 2018) and is suggested to retain validity irrespective of neurodevelopmental and intellectual disability (
Weiss et al., 2013). Meanwhile, the fifth edition of the WISC (WISC-V), organises the test into five primary indices representing visual processing (Gv), fluid reasoning (Gf), crystalised intelligence (Gc), short-term memory (Gsm), and processing speed (Gs) CHC abilities (
Chen et al., 2015;
Wechsler, 2014). The invariance of this structure in neurodevelopmentally disordered populations is supported by the WISC-V administration manual, which details the test’s utility in a normative sample that includes a proportion of children with various special education classifications (i.e. Intellectual Disability, Specific Learning Disorder, Autism Spectrum Disorder, Attention-Deficit/ Hyperactivity Disorder and children considered intellectually gifted:
Wechsler, 2014).
Table 1 outlines the proposed congruency between the CHC model and WISC primary indices/subtests.
Since both WISC-IV and WISC-V CHC-derived models assert construct validity among neurodevelopmentally disordered populations, the purported structure of these tests should be retained when examining children with prenatal alcohol exposure, where diverse and complex impairments across cognitive functions are common. To assess this, a robust investigation into the CHC-derived factor structure of the WISC-IV and WISC-V models within children with prenatal alcohol exposure is warranted. Such research will help to better understand whether the current diagnostic paradigm with a CHC theoretical framework is valid, reliable, and useful to clinical practice.
Multidimensional Scaling and the WISC Models
While factor analytic (FA) techniques are typically employed to understand the relationship between the factor structure of hierarchical intelligence tests, they require large, stratified samples that are not generally available in discrete clinical conditions. Multidimensional scaling (MDS) has been proposed as a complementary methodology to explore interrelationships between latent variables in psychometric assessments (Frisby & Kim, 2008;
Joshanloo & Weijers, 2019). MDS techniques are particularly useful within small samples that are atypical in normality and variance. MDS converts multivariate correlational data into Cartesian coordinates that graphically represent the relative proximities between pairs of correlated variables in geometric space (
Davison & Sireci, 2000;
Jaworska & Chupetlovska-Anastasova, 2009). Variable pairs with stronger correlations are graphically represented in closer proximity while weakly correlated variables are more distal in geometric space (
Groenen & Borg, 2013;
Joshanloo & Weijers, 2019).
Guttman’s Structural Model of Intelligence provides an interpretive framework for understanding MDS coordinate configurations derived from intelligence tests (
Adler & Guttman, 1982;
Cohen et al., 2006; L.
Guttman & Levy, 1991). This model graphically partitions the MDS visual output into a circle (radex) or cylinder, depending on whether the solution is two- or multidimensional, respectively (
Cohen et al., 2006;
Guttman & Levy, 1991). Guttman initially proposed that the radex, irrespective of dimensionality, was interpretable along two covarying components, the simplex and the circumplex (
Guttman & Levy, 1991;
Marshalek et al., 1983).
The simplex portrays a linear distribution of variables ordered according to inherent task complexity; the closeness of a task to the centre of the simplex implies increased recruitment of differing task-related cognitive abilities. Abstract tasks requiring a higher level of inferential ability employ a higher number of cognitive skills, and as a result, display a stronger correlation to other cognitive abilities, placing them closer to the centre of the simplex (
Marshalek et al., 1983;
Meyer & Reynolds, 2018). In hierarchical models of intelligence, test components sharing greater common variance with psychometric g are considered higher in complexity and exhibit higher centrality along the simplex, while tasks that exhibit more unique variance are considered less complex and are located toward the periphery of the radex (
Marshalek et al., 1983;
Meyer & Reynolds, 2018). In both the WISC-IV and WISC-V, subtests that measure Gf, Gv, and Gc domains generally exhibit higher correlations to ‘g’ and would be expected to appear closest to the centre of the radex, while subtests that measure Gsm and Gs domains generally present with a lower association to ‘g’, suggesting that these points would appear toward the periphery of the radex (
Chen et al., 2015;
Keith et al., 2006;
Weiss et al., 2013).
The circumplex on the other hand, portrays the clustering of variables around the geometric centre of the radex according to shared characteristics or content (
Davison & Sireci, 2000;
Meyer, 2021). In the context of intelligence tests, subtests that are highly correlated will cluster into a similar region of geometric space. When using statistical analyses such as smallest space analysis, WISC subtest clusters have historically conformed to surface content features such as test administration (numerical, geometric/ pictorial or verbal) or response modality (oral, manual, or pencil) (
Cohen et al., 2006; L.
Guttman & Levy, 1991; R.
Guttman & Greenbaum, 1998). However, a recent MDS investigation using the WISC-V standardisation sample demonstrated a robust two-dimensional MDS solution with subtest clustering around the circumplex consistent with a CHC-factor structure (
Meyer & Reynolds, 2018). This study demonstrated that MDS in conjunction with the radex model could provide a meaningful interpretive framework for replicating FA findings. Such a framework holds particular importance for clinical research where samples are small and may present with abnormal characteristics that violate parametric FA assumptions.
The present study replicated the methodology outlined in
Meyer and Reynolds (2018), applying MDS to examine subtest complexity and clustering patterns of the WISC-IV and WISC-V in a cohort of children with prenatal alcohol exposure. MDS outputs were examined within the context of the radex model and prior literature to provide a preliminary overview of WISC-IV and WISC-V test structure in children with prenatal alcohol exposure. Given the purported validity of the WISC models among neurodevelopmentally impaired populations, we hypothesise that the linear distribution of primary subtests along the simplex will generally correspond to ‘g’ loadings outlined within the existing psychometric literature. We also hypothesise that the clustering of primary subtests around the circumplex will conform to the clinically invariant CHC structure of the WISC-IV and WISC-V (outlined in
Table 1).
Methodology
The present study utilised archival data collected by a tertiary FASD clinic located in Queensland, Australia, between 2014 and 2018. Data were extracted from a de-identified research database containing information recorded in medical records, parallel files, and computer scoring programs as part of the standard clinical care model. The sample included children referred to the clinic due to behavioural problems and/or suspected neurodevelopmental problems, and a background of confirmed prenatal alcohol exposure. Demographic and background information were collected during the detailed clinical intake interview. A standardised assessment of cognition, utilising either the WISC-IV or the WISC-V A&NZ, was administered by a clinical neuropsychologist or clinical psychologist, consistent with recommendations in the Australian guide to Diagnosis of FASD (
Bower et al., 2016). The ten core subtests of the WISC-IV and WISC-V models were administered following the standardised administration procedures and scored using standard scoring procedures.
When required, WISC administration was adjusted in-line with standardised procedures specified in the administration manuals to accommodate attention, language, motor, and other delays. A range of common accommodations were also used when necessary, including regular breaks or compliance-reward activities between subtests (i.e. playing a quick card game). Less common adjustments included the presence of support persons for emotional regulation, adjustable furniture for musculoskeletal difficulties, and adjustments to lighting and ambient sound for sensory issues. The full WISC was administered to all children. However, in accordance with standardised administration and scoring procedures, subtests were awarded a raw score of zero in cases where children engaged with a subtest but could not complete items and no score was recorded when a child could not validly engage with a subtest. Supplementary subtests were administered only when clinically necessary. These data were utilised to inform clinical decision-making, but not included in the current study.
Materials
The WISC-IV and WISC-V A&NZ were standardised on a normative sample of Australian children between the ages of 6:0 and 16:11. The normative sample included a proportion of children from various special education classifications, including those with Intellectual Disability, Specific Learning Disorder (Reading and/or Written Expression), Autism Spectrum Disorder with Language Disorder, Attention-Deficit/ Hyperactivity Disorder and Gifted or Talented children (
Wechsler, 2003,
2014). The WISC-IV is composed of 10 primary subtests, which disperse into four primary indices. This structure is reported to have robust internal reliability and construct validity (
Wechsler, 2003). The WISC-V is comprised of 10 primary subtests, which disperse into five primary indices. This structure is reported within the WISC-V technical and interpretive manual to have robust internal reliability and construct validity (
Wechsler, 2014).
Statistical Analysis
The PROXSCAL function in SPSS version 28.0.1.0 was used for the MDS analysis to examine the primary subtests for each battery. Initial PROXSCAL specifications were configured to replicate
Meyer and Reynolds (2018) process and solution. A Torgerson start was used to generate the initial MDS configuration. Squared Euclidean distances were selected as the proximity measure to generate the dissimilarity matrix for coordinate positioning in multidimensional geometric space (
Meyer & Reynolds, 2018). An iterative approach to data rescaling and dimensionality using both metric (interval) and non-metric (ordinal) proximity transformations was utilised (
Borg & Mair, 2017). To ensure the most robust solution was found, scree plots illustrating dimensionality by raw stress were used to determine the optimal n-dimensional solution, with the maximum number of n-dimensions plotted being the total number of variables used in the analysis minus one (
n-1) (
Jaworska & Chupetlovska-Anastasova, 2009).
Model fit and interpretability are two key considerations for determining dimensional suitability (
Davison & Sireci, 2000). Increasing dimensions in the model typically improves model fit, but sacrifices interpretative parsimony (
Davison & Sireci, 2000;
Groenen & Borg, 2013). Hence, the simplest statistically robust dimensional solution is optimal for interpretation (
Joshanloo & Weijers, 2019). To determine model-fit, ‘goodness of fit’ and ‘badness of fit’ metrics were utilised (
Davison & Sireci, 2000;
Jaworska & Chupetlovska-Anastasova, 2009). Kruskal’s Stress 1, provided by PROXSCAL, was used as the loss function to determine the ‘badness of fit’, with stress 1 ≤.10 indicating optimal model fit, ≤.15 indicating acceptable model fit, and ≤.20 indicating poor model fit (
Kruskal & Wish, 1978). To maintain consistency with
Meyer and Reynolds (2018), stress 1 ≤.15 was selected as the criterion for model-fit. Dispersion Accounted For (DAF) was used to assess ‘goodness of fit’. DAF >.60 was selected as the criterion for acceptable model-fit (
Borg & Mair, 2017).
Results
The WISC-IV sample (
n = 87) consisted of 56 males (64.4%) and 31 females (35.6%), with ages at the time of assessment ranging from 6.92 to 13.17 years (
M = 9.05,
SD = 1.64). The WISC-V sample (
n = 21) consisted of 16 males (76.2%) and 5 females (23.8%) with ages at the time of assessment ranging from 7.25 to 13.80 years (
M = 9.07,
SD = 1.66). A missing value analysis identified approximately 5% missing data across all WISC-IV/V subtests, indicating that the dataset was suitable for analysis. PROXSCAL employs listwise exclusion of cases for missing data. Little’s Missing completely at random (MCAR) assessment was utilised to validate any exclusion. MCAR was not significant (
p > .05) for both WISC-IV and WISC-V samples, indicating that missing data was completely random.
Tables 2 and
3 outline descriptive and demographic statistics for the WISC IV and WISC-V samples.
Multidimensional Scaling Analysis
Scree plots confirmed suitable two- and three-dimensional solutions for both WISC-IV and WISC-V metric and non-metric analyses. All solutions and configurations met the a priori specified model-fit statistics (Stress 1 ≤ .15; DAF >.60). In this instance, three-dimensional non-metric solutions provided better model-fit statistics; however, two-dimensional non-metric solutions were selected for interpretation as they represented a more parsimonious explanation.
WISC-IV: Interpretation of Subtest Positioning in Relation to CHC-Derived Domains
Model fit for the WISC-IV two-dimensional non-metric MDS solution was deemed acceptable according to the a priori criteria (Stress 1 = .09), with 99% dispersion accounted for (DAF = .99). The WISC-IV MDS output revealed a simplex representation of subtests that was inconsistent with expectations of CHC-derived WISC-IV subtest complexity. Subtests representing Gc (VOC, COM), Gsm (LNS), Gv (PC), Gf (BD), and Gs (SS) appeared in the centre of the distribution, indicating greater cognitive complexity of these tasks within the sample. Meanwhile, SIM (Gc), DS (Gsm), MR (Gv), and CD (Gs) appeared toward the periphery of the distribution, indicating comparatively lower complexity of these tasks within the sample.
Subtest clustering around the circumplex was also inconsistent with expectations of CHC-derived subtest interrelations. The sample demonstrated unique clustering for Gf and Gv subtests with BD (Gv subtest) having a close proximity to the Gf subtests (MR and PC). Clustering of Gc (VOC and COM) and Gsm (LNS) subtests was also observed. However, this was not consistent, with SIM (Gc) and DS (Gsm) subtests demonstrating some dissociation from this cluster. Gs subtests (CD and SS) did not cluster together, with CD appearing distant from all other subtests.
Figure 1 below illustrates the two-dimensional non-metric MDS output of WISC-IV subtests.
WISC-V: Interpretation of Subtest Positioning in Relation to CHC-derived Domains
Model fit for the WISC-V two-dimensional non-metric MDS solution was deemed acceptable according to the a priori criteria (Stress 1 = .15), with 98% dispersion accounted for (DAF = .98). The WISC-V MDS output revealed a simplex representation of subtests that was inconsistent with expectations of WISC-V subtest complexity. Subtests representing Gsm (PS, DS), Gf (MR), and Gs (SS) appeared at the centre of the distribution, indicating greater cognitive complexity of these tasks within this sample. Meanwhile, subtests representing Gc (VOC, SIM), Gf (FW), Gs (CD), and Gv (VP, BD) appeared toward the periphery of the distribution, indicating comparatively lower complexity of these tasks within this sample.
Subtest clustering around the circumplex was also inconsistent with expectations of CHC-derived subtest interrelations. The sample demonstrated clustering of subtests related to Gsm (PS, DS), Gf (MR. FW), Gv (BD), and Gc (SIM) domains, indicating relatively poor differentiation between domains. Subtests within each of these domains were also dispersed, indicating relatively weak within-domain subtest relationships. An exception to this were Gs subtests (SS, CD), which demonstrated a close within-domain subtest relationship in a unique sector of the distribution. Gv (VP) and Gc (VOC) subtests exhibited distant relations to all other subtests, suggesting a dissociation of these tasks from all domain clusters.
Figure 2 below illustrates the two-dimensional non-metric MDS output of WISC-V subtests.
Alternative Interpretation: Subtest Positioning in Relation to Task Response Modality
Figure 3 demonstrates the MDS output of WISC-IV subtests partitioned by item response modality. WISC-IV subtests with oral response modality cluster within close proximity in one sector of the distribution (particularly COM, VOC, and LNS), implying a close relationship between verbal expression tasks in children with prenatal alcohol exposure. Meanwhile, subtests with manual (PC, MR, and BD) and pencil/paper (CD and SS) modalities appear in separate sectors of the distribution.
Figure 4 demonstrates the MDS output of WISC-V subtests partitioned by response modality. WISC-V subtests also generally clustered according to oral (DS, VOC and SIM), manual (MR, FW, BD and VP), and pencil/paper (CD and SS) modalities. Interestingly, picture-span (PS), which appeared as the most central and cognitively complex task on the WISC-V output, possesses a dual oral/ manual response modality. Such a finding may reflect alignment with Guttman’s initial explanation of cognitive complexity, where recruitment of more output processes results in heightened task difficulty (L.
Guttman, 1954).
Discussion
The present study offers a novel perspective on the standardised assessment of intellectual ability in a cohort of children with prenatal alcohol exposure. While preliminary, due to sample size and methodology, our results indicate that a CHC-oriented interpretation of the WISC-IV and WISC-V may be unsuitable within this clinical context, with the relative positioning of subtests deviating from expectations of test complexity and structure reported in prior literature.
CHC-derived WISC-IV short-term memory (Gsm) and crystalised intelligence (Gc) subtests clustered in close proximity to each other and were positioned towards the centre of the radex, indicating shared content features and greater cognitive complexity of these tasks within the sample. Clustering between CHC-derived visuospatial processing (Gv) and fluid reasoning (Gf) subtests was also observed in the WISC-IV model, suggesting poor discrimination between these domains. Interestingly, the standard four-factor WISC-IV index framework is structured so that Gf and Gv subtests represent a single perceptual reasoning domain. Our findings indicated that the standard interpretation of the WISC-IV perceptual reasoning index may be more suitable for this clinical cohort than a CHC-derived assessment.
WISC-V subtest clustering indicated general associations between fluid reasoning (Gf) and working memory (Gsm) subtests, crystalised intelligence (Gc) and working memory (Gsm) subtests, and fluid reasoning (Gf) and visuospatial reasoning (Gv) subtests, suggesting interdependencies between these domains within the sample. The within-domain dispersion of subtests within this sample also indicated poor relationships between tasks purported to measure the same domain. The exception to this were processing speed (Gs) tasks, which appeared to have a close within-domain relationship and a general dissocation from other tasks administered to the sample.
Working memory tasks appeared more centrally than typically expected across both WISC-IV and WISC-V configurations (
Chen et al., 2015;
Keith et al., 2006;
Weiss et al., 2013). Contextually, this implies elevated task complexity of Gsm subtests within this sample and may represent a deficit in working memory commonly reported for children exposed to alcohol in utero (
Coriale et al., 2013). Such findings, however, should be interpreted with caution, as collectively, outcomes suggested that the WISC models may be limited in discriminating between CHC domains within this cohort.
While sample-specific characteristics may underpin the observed discrepancies with prior findings and reflect the variable cognitive profile often observed in children with prenatal alcohol exposure (
Akison et al., 2024), considerable debate persists regarding the overall utility of index scores beyond higher-order ‘g’ for understanding intelligence profiles in both normative and clinical populations. Higher-order ‘g’ is often noted as the predominant source of variance within WISC models (
Canivez et al., 2017;
Canivez & Kush, 2013;
Dombrowski et al., 2015;
Watkins, 2006,
2010). On this basis, it has been claimed that WISC assessments psychometrically detect overall IQ more accurately than domain-specific cognitive abilities.
Dombrowski et al. (2018) describe index-level interpretation of the WISC as heavily dependent on the gradual differentiation of cognitive constructs that occurs as children age (
Dombrowski et al., 2018). As neurodevelopmental disorders often limit the maturation and diversification of cognitive functions, it is conceivable that such conditions could impinge upon the efficacy of the WISC in accurately detecting latent cognitive abilities among these populations. However, solely interpretting overall IQ scores, while psychometrically robust, severely diminishes the clinical utility of these tools in determining the extent of domain-specific functional impairment among disordered groups (
Courville et al., 2016;
Fiorello et al., 2007;
Hayes et al., 2022;
McGill, 2016). Furthermore, some evidence supports the use of index-level interpretation above overall IQ in clinical groups, emphasising the importance of individualised assessment in neurodevelopmentally delayed children (
Fiorello et al., 2007). As such, it is crucial to consider alternative interpretation frameworks that may complement and enhance standard psychometric approaches to facilitate a stronger understanding of functional deficits.
The ‘Boston Process Approach’ is a philosophical ideology within clinical neuropsychology that emphasises the importance of the process taken in responding to intelligence tests, particularly within clinical settings, where the extent of an individual’s functional impairment fundamentally defines the interaction between input and output features of cognitive tasks (
Bruno-Golden et al., 2013;
White & Rose, 1997). Hence, understanding the patient’s ability (process-success) or inability (process-failure) to complete a task may provide insight into the practical implications of neurocognitive impairment, offering a richer understanding of how cognitive deficits may manifest in everyday functioning (
Bruno-Golden et al., 2013). In this context, the administration modality of a task (or what Guttman initially referred to as ‘The Facet of Format of Communication’) becomes particularly important, as the ease or difficulty in which an individual responds to each modality may be informative for understanding functional abilities.
Our results (illustrated in
Figures 3 and
4) demonstrated the utility of such an alternative interpretation framework, with WISC-IV and WISC-V MDS outputs illustrating a pattern of subtest clustering that adhered to response modality. Upon interpretation, WISC-IV subtest clustering within this sample may imply an elevated complexity of tasks with an oral modality, suggesting that this clinical group may have difficulties with verbal communication. Meanwhile, a dual-response task was implied to be the most complex within the WISC-V output, adhering to Guttman’s explanation of cognitive complexity, which suggests that the recruitment of more output processes results in heightened task difficulty (L.
Guttman, 1954).
Limitations, Future Directions and Conclusions
Although comparable as methodological approaches for examining the structure of test batteries, MDS and FA compute results differently, where FA techniques statistically derive distinct but correlated categories from intelligence test data, while MDS techniques spatially arrange correlated tasks to geometrically represent inter-item relationships (
Tucker-Drob & Salthouse, 2009). Differences between these techniques can lead to discrepant outcomes, which, in turn, can be further accentuated through the use of largely confirmatory FA approaches, that aim to detect expected structural frameworks rather than representing organically occurring item relationships (
Tucker-Drob & Salthouse, 2009). While MDS circumvents this issue, unveiling variable relationships that are potentially obscured by confirmatory approaches, the subjective nature of output interpretation may limit the consistency and generalisability of such findings. Considering this, MDS and FA techniques may be most effectively applied in tandem to explore subtest relations in intelligence batteries, where a combined approach may better illuminate data structures that would typically go unseen by either technique alone.
To our knowledge, this study is the first attempt to explore the structure of the WISC models in children exposed to alcohol in utero. Our findings provide an opportunity to reconsider the interpretation of standardised psychometric outcomes in the diagnostic assessment of FASD. Through the Boston Process Approach, essential clinical information is obtained by examining how difficulties with behaviour, attention, mood, language, motor control, and executive function influence test engagement. This wealth of understanding is often lost in standardised assessment procedures that focus solely on psychometric outcomes. As such, assessments of neurodevelopmental impairment that account for process-success and process-failure may address conventional barriers to psychometric testing by providing insight into how severe functional deficits limit a child’s ability to interact with standardised assessment tools. This insight could provide a basis for individualised assessment procedures that improve upon diagnostic accuracy and better inform clinical decision-making surrounding outpatient care.
The current findings, while providing unique, clinically meaningful insight, must at present be considered preliminary, given constraints in sample size and methodology. Future endeavours may benefit from combining both FA and MDS techniques to examine structural variance and invariance, particularly within clinical cohorts that differ significantly in developmental trajectories from normative samples used to develop intelligence tests. Regardless, our findings may complement and enrich established interpretive frameworks for FASD assessment and provide an impetus to redirect the current diagnostic protocol towards a paradigm informed by the ‘process approach’, which allows clinicians to define functional cognitive deficits through the observable impact of impairment in everyday behaviour. In combination with discrete outcomes in neurocognitive domains, such a framework may encourage a holistic approach to the diagnosis of FASD that better supports interventional and rehabilitative goals, establishing a patient-centred approach to clinical care.