Encoding
The encoding stage of the RMIE simply involved judging whether a presented picture was real or novel. We included an analysis of these data to indicate children’s ability to do the task, as failure to process stimuli at this level might impact their performance in the retrieval stage. For 60 two-choice items, the binomial theorem indicates that a score of 36 or less correct is no better than chance (50% correct). Accordingly, we excluded four children (2 TD and 2 SLI) who scored at this level, who appeared either not to understand the task or to be non-compliant. Performance was generally high for the remaining children, with mean of 92% (SD = 4.53) correct for the TD group and 87.85% (SD = 4.59) for the SLI group. However, removing cases who performed at chance did not remove the skew. Therefore, we used log error scores (log after subtracting correct score from 101) to remove the skew. The group difference on encoding was tested using a one-way ANCOVA with centralized age and non-verbal ability as covariates. This gave a significant difference, F (1, 52) = 5.64, p = .021, = .09, with the TD children having lower error scores (M = 1.92, SD = 1.01) than the SLI group (M = 2.50, SD = 0.64). Although in absolute terms there was only a small difference between groups in error rates, it seemed advisable to adjust for this in analysis of subsequent recognition scores by using the log error encoding score as a covariate.
Recognition
We excluded the four children (2 each in TD and SLI) who had 36 or fewer items correct at the encoding stage in the analysis of recognition memory. We ran a 2 (groups, TD and SLI) × 2 (intervals, 10 and 60 min delay) × 2 (object type, real and novel items) ANCOVAs (one each for accuracy and RT) with centred covariates of age, non-verbal ability and log errors on encoding.
For accuracy, there was no main effect of group, F (1, 51) = 3.30, p = .07, = .03 (SLI: M = 70.25, SD = 16.33, TD: M = 75.44, SD = 14.14). The main effect of interval was significant, F (1, 51) = 7.39, p = .009, = . 03, with score at 10 min (M = 74.86, SD = 14.37) better than score at 60 min (M = 70.83, SD = 16.29). The main effect of object type was also significant, F (1, 51) = 138.39, p < .001, = .36, with better recognition for real objects (M = 81.33, SD = 13.47) than novel objects (M = 64.36, SD = 12.39). The interval × group interaction, F (1, 51) = 3.70, p = .06, = .01, object type × group interaction, F (1, 51) = .05, p = .82, = .0002 and the group × object type × interval interaction, F (1, 51) = 3.41, p = .07, = .009 were not significant. However, the interval × object type interaction was significant, F (1, 51) = 13.44, p < .001, = .03. A follow-up using paired t-tests with Bonferroni correction (critical p-value of .05/4 = .012) showed that the recognition for real objects did not change between intervals, t (55) = −0.09, p = .929, whereas novel object recognition declined at the 60 min interval compared to 10 min interval, t(55) = 4.52, p < .001. At both intervals real objects were better recognized compared to novel objects: at 10 min, t (55) = 6.73, p < .001 and at 60 min, t (55) = 11.89, p < .001.
Findings on log RT showed a main effect of group, F (1, 51) = 34.03, p < .001,
= .25 with TD performing faster (M = 3.01, SD = .01) than SLI (M = 3.14, SD = .13). Other main effects [interval: F (1,51) = 1.15, p = .29,
= .006 and object type: F(1,51) = 0.12, p = .73,
= .0003] and the interaction effects [group × interval: F(1,51) = 3.29, p = .08,
= .02; object type x group: F(1,51) = 1.22, p = .27,
= .003; interval × object type × group: F(1,51) = 1.14, p = .29,
= .002] were not significant. The object type × interval interaction however was significant, F (1, 51) = 7.09, p = .01,
= .01. We did not analyse this interaction further, because the RT findings contributed only minimally for testing of our proposed hypothesis, and the likelihood of spurious findings is high in exploratory multiway ANOVA (
Cramer et al., 2015).
Discussion
The present findings showing affected incidental encoding is in line with Lukács et al. (under review) who, using a similar task, showed that children with SLI were significantly poorer than TD children on incidentally encoding verbal (word versus non-word) as well as non-verbal (real versus novel) information.
Children with SLI performed similarly to TD children on RMIE on both the recognition sessions (with the discrepancies between groups at encoding controlled for). Further, the absence of a group × interval interaction suggests that children with SLI were just as good as TD (but not superior as predicted) across sessions on recognition memory. The present finding showing intact declarative memory in SLI on an incidental declarative task is in agreement with results of Lukács et al. However, one of the key differences between these studies is that Lukács study used a 24 h delay between recognition (included a period of sleep) which showed enhancement in SLI. We found that both TD and SLI performed better at recognizing real objects than novel objects at short and long intervals, and real objects were retained well between intervals during which the novel objects degraded. This effect could be attributed to the advantage that real objects have over novel objects with regards to labelling and representation. The lack of any interactions with group suggests that children with SLI used similar processing mechanisms to TD children; we did not find the predicted advantage for novel item recognition by this group. We also found that children with SLI responded more slowly than the TD group, which is consistent with general processing slowness reported in SLI (
Leonard et al., 2007).
VPA task
There were no outliers for this task and all children were included in the analysis. A 2 (group: between subject factor) × 2 (interval, 10 and 60 min: within-subject factor) ANOVA was run for accuracy and RT measures separately with centred covariates of age and non-verbal ability. On accuracy, the main effect of group was significant, F (1, 56) = 4.87, p = .03, = .07, with TD (M = 71.94, SD = 24.83) performing better than SLI (M = 56.38, SD = 24.94). The main effect of interval was also significant, F (1, 56) = 32.02, p < .001, = .07, with immediate recall (M = 70.83, SD = 23.69) better than delayed recall (M = 57.50, SD = 26.64). The interaction between group and interval did not reach significance, F (1, 56) = .51, p = .48, = .001.
On RT, the main effect of group was significant, F (1, 56) = 7.49, p = .008, = .08, with the SLI group faster (M = 3623.40, SD = 1125) than the TD group (M = 4081.96, SD = 1255.72). The main effect of interval was also significant, F (1, 56) = 26, p < .001, = .14, with faster responses at longer (M = 3444.59, SD = 1068.81) than shorter delays (M = 4260.77, SD = 1211.74). The interaction between group and interval was significant, F (1, 56) = 7.02, p = .01, = .04. This interaction was followed up using t-tests. At the 10 min recall interval both TD and SLI performed similarly (p = .885), but at the 60 min interval the SLI group was significantly faster than TD (p = .001). Paired t-tests showed that TD children performed at similar speed at long and short intervals (p = .127), whereas those with SLI performed recall after 60 min significantly faster than 10 min interval (p < .001). A correlation close to zero, r (30) = .023, p = .905, between speed and accuracy at 60 min recall interval for SLI data gave no evidence of speed accuracy trade-off.
Discussion
The lower accuracy of children with SLI on the VPA (an intentional declarative task) is similar to findings of
Collisson et al. (2014), who, using a similar task, showed impaired declarative memory in SLI. We suggest the following explanatory perspective for this result. As we had predicted, the children with SLI might have encoded the information (i.e. abstract shape and colour association) less accurately, because the task was intentional. Intentional tasks are bound by capacity limits for information processing (for evidence on processing capacity limits in SLI, see
Bishop, 1994;
Vissers, Koolen, Hermans, Scheper, & Knoors, 2015). That is, on the intentional task, the whole capacity is divided between attending to the stimuli, allotting working mental space to register the association and selecting the appropriate strategy (e.g. rehearsal) for registering (e.g.
Marois & Ivanoff, 2005), therefore, it is possible that the constraints on total capacity in SLI children might have affected their recall in VPA. The exposure duration to register a stimulus was only 3 s in the VPA task, which could have overloaded the processing capacity. Further, note that, unlike RMIE, encoding accuracy was not measured on VPA, so it was not possible to estimate the impact of poor initial encoding on recall. The current task differs from the errorless learning procedure adopted by
Bishop and Hsu (2015), where children could not proceed until the correct response had been selected, with cueing being used if necessary. This richer encoding context may explain why Bishop and Hsu found a typical learning rate in their SLI participants on an intentional declarative task (i.e. novel picture and sound association learning).
In contrast to the impairment in accuracy, children with SLI were faster than their TD peers on VPA. This is a surprising result, given that, as noted above, processing speed is usually reduced in SLI compared to controls, and we found slower responses on other tasks in our study. We considered whether this might reflect impulsive responding that would lead to a speed-error trade-off, but there was no evidence of this. We are aware of one other study with a similar result:
Bavin et al. (2005) reported significantly faster RT for children with SLI compared to TD on pattern recognition and spatial recognition tasks where the participants had to remember and recognize previously seen spatial pattern on the screen. Note that in the same study, children with SLI were slower than TD on motor latency. We can only speculate that children with SLI may have used a different recall strategy that required less time to execute, perhaps relying more on visual imagery than verbal coding.
Type of retrieval effect
Results
In a further analysis, we directly compared retrieval accuracy in the two declarative tasks, RMIE and VPA. The accuracy scores were converted in to percentages so that the scores were on a common scale. Note, however, that the probability of getting an item correct by chance does differ between tasks, so main effects of tasks are not of interest. Our focus, rather, was whether there were interactions with group, which would indicate that the type of declarative task might be differentially difficult for those with SLI. Normality was checked and a 2 (group, TD versus SLI) × 2 (interval, 10 and 60 min) × 2 (retrieval type, recognition and recall) ANOVA was run. The main effect of group, F (1, 52) = 7.45, p = .009, = .06, with TD (M = 74.70, SD = 18.82) performing better than SLI (M = 64.10, SD = 21.04), the main effect of interval, F (1, 52) = 44.13, P = < .001, = .05, with retrieval at 10 min (M = 73.77, SD = 18.60) better than retrieval at 60 min (M = 65.04, SD = 21.66) and main effect of retrieval type, F (1, 52) = 14.23, P = < .001, = .09, with recognition (M = 74.65, SD = 11.09) better than recall (M = 64.16, SD = 25.98) were all significant. Even though, the within-subject factors did not interact significantly with the group: retrieval type × group [F (1, 52) = 2.35, p = .13, = .02], interval × group [F (1, 52) = .73, p = .40, = .0008] and interval × group × retrieval type [F (1, 52) = .04, p = .84, < .0001], the interaction between retrieval type and interval was significant, F (1, 52) = 7.66, p = .008, = .01. Since the prediction was to check if there was a group interaction with any of within-subject factors, we did not report the interaction further.
Discussion
Children with SLI were generally poorer than TD children on declarative tasks when compared across retrieval types and intervals. We were particularly interested in the three-way (group × retrieval type × interval) interaction which was absent, showing that both the groups performed better on recognition type of retrieval compared to recall type of retrieval. This could be attributed to differences in representational requirements between these tasks (Postman et al., 1948). The absence of a retrieval type × interval interaction fails to support
Brown et al.’s (2012) finding that declarative memory examined through recognition memory (but not recall) undergoes off-line consolidation. That is, the information was lost alike on both the retrieval types. However, caution must be applied in comparing the present finding with Brown’s study, since those authors used a 3–4 h retention period whereas our study used 60 min retention.
Concluding remarks
We examined predictions made by the PDH and its compensatory hypothesis and confirmed that children with SLI have impaired procedural memory on a non-verbal SRT task. As predicted children with SLI showed intact (but not enhanced) declarative performance on a recognition memory task that employed incidental encoding of items (i.e. RMIE task), but not on a paired-associate task that employed intentional encoding of items and recall type of retrieval (i.e. VPA task). Furthermore, across retrieval types of declarative tasks, the pattern of performance was comparable in SLI and TD (see
Bishop and Hsu (2015) for similar findings). Finally, the correlation between memory systems did not support a trade-off between memory systems as predicted by the PDH.
The present study has limitations that could be addressed in future studies examining compensation. First, we did not examine consolidation in procedural memory, but only in declarative memory (for studies reporting procedural consolidation deficits in children with SLI, see
Desmottes et al., 2016;
Hedenius et al., 2011). Second, encoding on the VPA task was not measured in the present study. Having a measure of intentional encoding would have resulted in a more comprehensive picture of where in the process the deficit lies in children with SLI. Finally, the findings on type of retrieval effects have to be interpreted with caution, since the tasks differed with respect to encoding and also stimuli use. Nevertheless, the present findings showing intact recognition but affected recall in SLI suggests that future experiments investigating declarative memory in SLI should use recognition paradigms to demonstrate declarative potential.
One limitation of any study looking at correlations between memory tasks concerns test reliability. An index derived from an experimental measure, such as the SRT task or a declarative learning task, is unlikely to be as reliable as a psychometric test. Variation in test results will result from uncontrolled factors, such as effects of guessing in multiple choice tests or fatigue in tasks relying on RT. Inter-correlations between test measures cannot be higher than the test–retest correlation of the test itself, and this is generally unknown for commonly used declarative and procedural measures. The lack of significant correlations between measures from two declarative tests cautions against assuming that different tests are all indexing a unitary system. On the other hand, there will also be influences that may drive spurious correlations between measures. No task gives a pure measure of the procedural or declarative system. Motivation and attentional variations are common confounding factors that may lead to measures being correlated. In our study non-verbal ability scores and age were used as covariates in partial correlation, but there could be other factors such as working memory (
Oberauer, 2010) or attention (assuming SRT and RMIE partly involved attentional processing, see
Jiang & Chun, 2001;
Jiménez, 2003;
Nissen & Bullemer, 1987;
Turk-Browne, Jungé, & Scholl, 2005) that could influence the correlation findings. One conclusion from the current study is that there is a need for more basic research establishing measures of procedural and declarative learning that are reliable and valid enough for assessment of stable individual differences.
Clinical implications
There has been debate about whether explicit or implicit (recruiting declarative and procedural systems respectively) instructions yield better results in grammar intervention of children with SLI. Some language therapy techniques use predominantly explicit approaches (e.g. using shapes and colours to teach grammatical relations), which could capitalize on strengths in their declarative system (for review see
Ebbels, 2013). Our findings, however, cast outside doubts on whether explicit grammar teaching is the best strategy for the following two reasons. First, children with SLI’s declarative memory is at its best only when explored more incidentally, that is, when children with SLI intentionally attempt to process information (i.e. associations in the present study) they are not as effective. Second, grammar relations in natural language are probabilistic, which makes them well suited for learning by the implicit memory system. There is certainly a need for more exploration in this direction. For example, it may be preferable to recruit the system that is readily made for learning grammar, i.e. the procedural system. Clinical experiments that contrast the effectiveness of grammar training in children with SLI using implicit and explicit methods would help us address this important issue.
Open questions for future research
At least two open questions are ripe for future investigation. First, if examining procedural and declarative systems separately does not reveal any trade-off, would it be more informative to engage the systems simultaneously in a single task (such as artificial language learning), where they can compete for learning? If we could find a way to quantify the contribution of each system, we may find more evidence of a trade-off.
Poll et al. (2015) attempted this approach with some success by designing an experiment that engaged both the systems in natural language.
Second, we need studies that explore the nature of procedural deficits in SLI. Lum et al. (2014) reviewed work on studies that examined procedural memory using SRT and showed that sequence learning in SLI got better as a function of age and number of trials in the SRT task. To explore their nature of the procedural deficit further, at least two types of deficit within the procedural sequence learning mechanism could be worth examining. One possibility is an ‘efficiency deficit’ which assumes that the full range of the procedural learning is functional but inefficient. In which case, repeated exposures to procedural specific information (such as less predictable but linked elements in grammar, ‘a’ predicts ‘b’ only if the interelements are ‘x’ and ‘y’) would result in improved learning. Another, a ‘capacity deficit’ where the mechanism is limited in its range. In which case, despite repeated exposures to procedural specific information the learning saturates well before average. Such studies will closely tie the procedural deficits to nature of language being learned because procedural complexity varies both within and across language structures.