Data-driven adjustments for combined use of NGA-East hard-rock ground motion and site amplification models

Model development in the Next Generation Attenuation-East (NGA-East) project included two components developed concurrently and independently: (1) earthquake ground-motion models (GMMs) that predict the median and aleatory variability of various intensity measures conditioned on magnitude and distance, derived for a reference hard-rock site condition with an average shear-wave velocity in the upper 30 m (VS30) = 3000 m/s; and (2) a site amplification model that modifies intensity measures for softer site conditions. We investigate whether these models, when used in tandem, are compatible with ground-motion recordings in central and eastern North America (CENA) using an expanded version of the NGA-East database that includes new events from November 2011 (end date of NGA-East data curation) to April 2022. Following this expansion, the data set has 187 events, 2096 sites, and 16,272 three-component recordings, although the magnitude range remains limited (∼4 to 5.8). We compute residuals using 17 NGA-East GMMs and three data selection criteria that reflect within-CENA regional variations in ground-motion attributes. Mixed-effects regression of the residuals reveals a persistent pattern in which ground motions are overpredicted at short periods (0.01–0.6 s, including peak ground acceleration (PGA)) and underpredicted at longer periods. These misfits are regionally variable, with the Texas–Oklahoma–Kansas region having larger absolute misfits than other parts of CENA. Two factors potentially influencing these misfits are (1) differences in the site amplification models used to adjust the data to the reference condition during NGA-East GMM development relative to CENA amplification models applied since the 2018 National Seismic Hazard Model (NSHM), and (2) potential bias in simulation-based factors used to adjust ground motions from the hard-rock reference condition to a VS30 = 760 m/s condition. We provide adjustment factors and their epistemic uncertainties and discuss implications for applications.


Introduction
In the 2018 and 2023 U.S. Geological Survey (USGS) National Seismic Hazard Model (NSHM; Petersen et al., 2020Petersen et al., , 2023)), ground-motion intensity measures for central and eastern North America (CENA) were evaluated using ground-motion models (GMMs) and site amplification models developed as a part of the Next Generation Attenuation-East (NGA-East) project (Goulet et al., 2021a;Youngs et al., 2021).These GMMs and site amplification models were developed by different teams of investigators and under different organizational frameworks.In the case of GMMs, 17 models and a weighted median (referred to as ''central branch'' below) were recommended by Goulet et al. (2021a) with the aim of capturing epistemic uncertainties related to the overall ground-motion space, including magnitude scaling, distance scaling, and other attributes.These recommended GMMs do not include individually developed ''seed'' GMMs by independent modelers (Pacific Earthquake Engineering Research Center (PEER), 2015; hereafter PEER, 2015), although some of those seed models are considered in the NSHM (Moschetti et al., 2024;Rezaeian et al., 2021), as those GMMs were argued to represent physical features that were not present in the 17 NGA-East models.The GMMs apply for a hard-rock reference site condition defined as having average shear-wave velocity in the upper 30 m (V S30 ) = 3000 m/s and site decay parameter (k 0 ) = 0.006 s (Hashash et al., 2014), which is often used as the reference site condition for applications in which sitespecific site response is applied.The model development was conducted as a Senior Seismic Hazard Analysis Committee (SSHAC) Level 3 project (Budnitz et al., 1997;U.S. Nuclear Regulatory Commission, 2012), which is a formal process involving extensive review and documentation.
Because of the hard-rock reference site condition, development of the seed GMMs required adjustments to recorded ground motions in the NGA-East database (Goulet et al., 2021b), all of which were from softer-than-reference sites (average shear-wave velocities in the upper 30 m, V S30 ;150-2000 m/s).The adjustments occurred relatively early in the project (PEER, 2015), with each seed GMM developer team using their preferred site adjustment models (and initial site V S30 values from Goulet et al., 2014 that were later updated by Parker et al., 2017).The adjusted ground motions were used in GMM development as a constraint on scaling relations (with distance and magnitude), but also to set constant terms in the models that control the overall model amplitudes.Subsequently, the NGA-East Technical Integrators for GMM development applied four main amplification models to adjust data for residual analyses.The purpose of those analyses was to screen GMMs and to thereby limit misfit in the recommended GMMs (Goulet et al., 2018).The amplification factors applied in those data adjustments, as with some of the seed models, employed models for active tectonic regions (e.g.Seyhan and Stewart, 2014).It was later demonstrated by Hassani and Atkinson (2017), Parker et al. (2019), Zalachoris and Rathje (2019), and Boore (2020) that CENA has weaker V S30 scaling (i.e.smaller absolute value slopes), which means that V S30 has less predictive power in CENA than in active regions.
The site amplification models used in the 2018 and 2023 NSHMs for CENA were later developed by an expert panel based on a synthesis of available research (Hashash et al., 2020;Stewart et al., 2020).This synthesis drew heavily upon research products from the NGA-East Geotechnical Working Group (GWG) (Harmon et al., 2019a(Harmon et al., , 2019b;;Parker et al., 2019).The GWG site amplification model development was reviewed extensively but this occurred outside of the NGA-East SSHAC process.The GWG site amplification (F S ) models are intended to represent amplification relative to V S30 = 3000 m/s and k 0 = 0.006 s.The reference condition was not defined relative to the NGA-East hard-rock GMMs; hence, if the GMMs are not centered with respect to the assumed reference condition, there is a potential for bias to propagate through the GWG site amplification model when used with the NGA-East models to predict ground motions at other V S30 values.Because the aim of the GWG was to derive amplification relative to the reference condition, the issue of potential bias was not addressed.
The site amplification model has linear (F lin ) and nonlinear (F nl ) components: The linear component of the model has two components: in which F V describes the amplification relative to a V S30 = 760 m/s reference condition and F 760 describes the amplification for 760 m/s sites relative to 3000 m/s sites.F 760 carries significant parametric uncertainty in k 0 due to the lack of empirical data at the reference site condition (e.g.Atkinson, 2012;Boore and Campbell, 2017) and the assumption that k 0 reflects only material damping (Al Atik et al., 2022).Two terms are used in Equation 2 because they were derived using different procedures.F V is empirically constrained from NGA-East data (Parker et al., 2019), while F 760 is derived from ground-response simulations (Boore and Campbell, 2017;Frankel et al., 1996;Harmon et al., 2019aHarmon et al., , 2019b;;Silva et al., 2015).This two-tier approach was required because it was not possible to empirically derive site amplification relative to 3000 m/s conditions due to the lack of ground-motion recordings at this reference condition.
Because the site adjustments applied during GMM development used a modeling approach different from how the NGA-East GMMs are now applied, this study was undertaken to assess whether the combined use of NGA-East GMMs and site amplification models is compatible with ground-motion data from CENA.We summarize an expanded (relative to NGA-East; Goulet et al., 2021b) CENA data set that is presented in greater detail by Ramos-Sepu´lveda et al. (2023a).We then use residual analyses to assess model performance with respect to independent variables (magnitude, distance, and V S30 ) and overall mean model misfits.The residual analyses consider different NGA-East GMMs (i.e. the 17 models presented by Goulet et al., 2021a) and alternative data selection criteria in consideration of observed regional features.Although the database has grown appreciably, the moment magnitude (M) range remains limited (M \ ;5.8).For this reason, our aim is not to validate scaling relationships over the parameter ranges that control hazard (e.g.close distances and large magnitudes).Rather, we seek to evaluate the need for model adjustments to fit the available data, while leaving source and path scaling as-is.These analyses indicate a consistent pattern of mean misfit (bias) with respect to period, which can be considered in forward applications through simple adjustments of GMM constant terms.We also investigate likely causes for the observed misfits.

CENA ground-motion database
The database used in this research is an expanded version of the NGA-East groundmotion database (Goulet et al., 2021b).The NGA-East portion of the database consists of ground-motion intensity measures and metadata (event locations, magnitude, distance, V S30 ) provided in the electronic supplements to Goulet et al. (2021b).The NGA-East data were merged into a relational database developed for ground-motion studies (see Data Resources and Ramos-Sepu´lveda et al., 2023a for details) and expanded to include events in CENA since November 2011 (date of the latest event in NGA-East).Additional ground motions in the Texas-Oklahoma-Kansas (TOK) region were also considered in a parallel, collaborative component of this project (Li et al., 2023;Zalachoris et al., 2020), but as of this writing those additional data have not been added to the relational database.
The database expansion considered all events with M ø 4 in CENA from November 2011 to April 2022, based on event hypocenter locations east of the boundary between the active tectonic and stable continental regions as provided by Dreiling et al. (2014).This boundary has been recently updated (Moschetti et al., 2024) as shown in Figure 1.All of the events we processed remain within the updated region.This comprised 187 earthquakes at the locations shown in Figure 1.Unprocessed ground motions from these events were downloaded as miniSEED files from the Incorporated Research Institutions for Seismology (IRIS) data center (see ''Data Resources'' section).The number of records was over 73,111, which was considered too large for manual processing procedures as applied in past NGA projects (Goulet et al., 2014;Kishida et al., 2020).Accordingly, we sought an automated or semi-automated alternative, and ultimately decided to use the USGS open-source software gmprocess (Hearne et al., 2019).We introduced options in gmprocess to improve the high-pass corner frequency selection, to facilitate manual review of waveforms, and to resolve other differences between gmprocess and past NGA project processing protocols such as the choice to filter/differentiate/integrate in the time versus frequency domains (Ramos-Sepu´lveda et al., 2023b).
The recent events typically have a greater density of recordings relative to the NGA-East events, which is a consequence of the growth of seismic instrumentation in CENA.The more recently installed instruments, including at prior locations of Temporary Array (TA) stations, have broader usable frequency ranges.Ground-motion intensity measures (peak ground acceleration, PGA; peak ground velocity, PGV; and spectral acceleration, Sa from 0.01 to 10 s) from these events and information on their component-specific usable frequency ranges were uploaded to the relational database.For each of these intensity measures, as-recorded components are provided along with median-, minimum-, and maximum-component horizontal motions (RotD50, RotD00, and RotD100, respectively; Boore, 2010).
Metadata describing the seismic sources, path, and site conditions were assembled.General information such as the name and location of the instrument, hypocenter location, and event origin date and time can be obtained from gmprocess' output.Newly added events include moment magnitudes from moment tensor solutions (Dziewonski et al., 1981;Ekstro¨m et al., 2012;Guy et al., 2015) for 76 out of the 100 events, and estimates with the uncertainty of converted magnitudes (i.e. from events that did not have moment tensor solutions) were provided for the remaining 24 events (U.S. Nuclear Regulatory Commission, 2012; NUREG-2117).Procedures described in Contreras et al. (2022) were used to generate approximate rupture dimensions based on magnitude, event type, hypocenter, and orientation of one or two nodal planes.Contreras et al. (2022) modified a fault surface simulation routine previously presented by Chiou and Youngs (2008).The modifications incorporate a magnitude-area scaling relationship for stable continental regions (Leonard, 2014).Site parameters were derived from shear-wave velocity (V S ) profiles where available and otherwise from the Li et al. (2022) geology-slope proxy for TOK sites and the Parker et al. (2017) geology-slope proxy for other CENA sites.
Figure 2 shows data distribution (stacked histograms) as functions of rupture distance (R rup ), V S30 , and M. The added data are mostly applicable for M \ 5.8, R rup .10 km, and V S30 = 200 to 2000 m/s.The new data significantly increase the number of recordings per event and per station relative to what was available from the original NGA-East database, although it does not extend the magnitude range.Tables presenting source and Figure 3 shows the numbers of available recordings and events before and after the expansion of the NGA-East database as a function of oscillator period.There are more records at all periods, but the increase is particularly pronounced at short periods because many ground-motion networks now have instrument sample rates that are adequate to capture these short-period responses, which had not been the case previously (Goulet et al., 2021b).

Residuals calculations
GMM performance is assessed through residual analyses.We define the residual as the difference between an observation (natural-log intensity measure from a recording) and a GMM estimate of the mean ground motion: where Y ij is the observed intensity measure for event i and station j, m ln,k (M i , R rup,ij ) is the natural-log (ln) mean estimated intensity measure at the reference site with V s = 3000 m/s for magnitude M i and rupture distance R rup,ij from GMM k, and F lin is the CENA-specific linear site amplification model (Stewart et al., 2020; Equation 2) in ln units.For Gulf Coast sites, the path component of the GMM was adjusted for additional anelastic attenuation as given in Equation 10of Goulet et al. (2021a).The residuals are partitioned into the following terms using mixed-effect regression analysis (Bates et al., 2015;R Core Team, 2019): where c k is the overall mean misfit for GMM k, h E,i is the event term for event i, and dW ij is the within-event residual.Note that subscript k is not used with the event term and within-event residual for brevity, although these terms are specific to a GMM.
Residual analyses were performed using the NGA-East central branch GMM and the CENA data set (as described in the previous section) with minimal screening in which M ø 4 events were considered with recordings at distances R rup < 600 km (with some exceptions for the coastal plain (CP) region, as described subsequently).The NGA-East GMMs can be used to estimate intensity measures for distances up to 1500 km, but the 600 km threshold was applied to avoid problems related to biased ground-motion sampling at larger distances.Ground-motion data were not considered beyond their maximum usable period (taken as 80% of the inverse of the high-pass corner frequency).No lowestusable period was applied if the low-pass corner frequency (f cLP ) is 40 Hz or greater since Sa are usually controlled by lower frequencies (Boore and Goulet, 2014;Douglas and Boore, 2011); otherwise the lowest usable period was taken as 1.25/f cLP .The purpose of these analyses was to examine trends in residuals with magnitude or distance, which if present would influence the need for (and magnitude of) adjustment factors for CENA.The regions considered are TOK, CP (Gulf Coast, Mississippi Embayment, and Atlantic Coastal Plain), the remainder of CENA, and combinations thereof (Figure 1).The NGA-East numbers are for M ø 4 events, to be compatible with the adopted data selection criteria.

Regional variations of magnitude and distance scaling
As shown in Figure 1, the CENA database includes a large number of events (146) in the TOK region and a small number in the CP region (8), with the balance being 33 events outside of those regions.The NGA-East project screened out potentially induced events (PIE) (Goulet et al., 2021a), which largely originated in the TOK region.This is consistent with other research suggesting that TOK may have distinct ground-motion magnitude-and distancescaling characteristics (e.g.Moschetti et al., 2019;Zalachoris and Rathje, 2019).Regarding the CP region, the NGA-East GMM development team found that the Gulf Coast portion of CP had higher rates of anelastic attenuation (Goulet et al., 2021a; also later observed by Pezeshk et al., 2021), while the Atlantic Coastal Plain did not.Goulet et al. (2021a) provided adjustment factors for the Gulf Coast to account for this effect.Our CP region groups together both Atlantic and Gulf regions (Figure 1), for which we anticipate potentially distinct path characteristics and differences in site response relative to the rest of CENA due to the presence of relatively deep sediments (Boyd et al., 2023;Chapman and Guo, 2021;Guo and Chapman, 2019;Pratt and Schleicher, 2021;Schleicher and Pratt, 2021).
In consideration of these factors, we examined magnitude-and distance-scaling effects beginning with only the non-TOK/non-CP region to maximize consistency with data selection criterion used during NGA-East GMM development, and then we examine differences for CP and TOK.While events in these regions are strictly incompatible with the originally applied criteria, they are nonetheless important to consider in this study because the combined NGA-East GMM and site factors are applied across CENA in the NSHM.We performed residuals analysis followed by mixed-effects partitioning of the non-TOK/non-CP data set (33 events, 1169 recordings) for the central branch GMM. Figure 4 shows the resulting trends of event terms (h E, i ) with magnitude (gray symbols; the blue symbols are for CP sites and are discussed subsequently).The binned means are positive for some bins and negative for others, but we observe no consistent trend with M for any of the intensity measures.This suggests that the magnitude scaling in the NGA-East GMM is consistent with the expanded CENA database.The short-period event terms are mostly negative for CP events but they fall within the range of the non-TOK/non-CP data.Differences are essentially imperceptible at long periods.
Figure 5 shows the trend of dW ij with respect to R rup (as before, gray symbols are for non-TOK/non-CP sites, blue are for CP sites).The non-TOK/non-CP trend is flat to 600 km, which suggests that the distance scaling of the NGA-East GMM is consistent with the expanded CENA database.For CP sites, at short periods we observe negative binned means at large distances (R rup ø 300 km), which indicates a higher level of anelastic attenuation.For subsequent analyses, we only consider CP data to maximum distances of 300 km to avoid tradeoffs with distance-scaling misfits.
We next examine regional variations between TOK (Figure 1) and the remaining CENA data (including CP) by computing residuals for the full data set and then distinguishing the resulting outcomes by region.Figure 6 shows the resulting trends of event terms with magnitude.For TOK, the binned means of event terms are near zero for M = 4-5.For larger M, the event terms are negative, which has been observed previously (Li, 2022;Zalachoris and Rathje, 2019).Non-TOK event terms, which include both non-TOK/non-CP and CP, are generally positive at short periods, whereas TOK event terms mostly average nearly zero.This indicates that regional variations in ground-motion levels are present.This can be understood by recalling that a single misfit term is computed across all data; accordingly, deviations in mean event term for a particular region indicate different ground-motion levels in that region relative to the overall average.The near-zero mean of event terms for TOK is a consequence of that region dominating the data set (146 of 187 events), whereas the positive mean of short-period event terms for non-TOK events indicates stronger average ground motions than the overall average for the CENA database.
Figure 7 shows the trend of dW ij with respect to R rup .At short periods, we observe a significant upward trend in binned means for TOK events as R rup increases from approximately 10 to 150 km, whereas the non-TOK trends are flat, as before.Neither region has trends with distance for long-period intensity measures (Sa at 1.0 or 5.0 s).These results indicate that the distance attenuation component of the NGA-East central branch GMM is biased in the TOK region, which is not surprising because it was not developed for application to induced earthquakes, which dominate the TOK data set.
In this article, we do not attempt to model the scaling trends and mean model misfits for the TOK region, which is the subject of a parallel effort that has recommended a modified GMM for that region (chapter 5 of Centella et al., 2023).

Validation of V S30 -scaling model
To evaluate the performance of the V S30 -dependent site amplification model (F lin in Equation 1), we partition the dW ij using mixed effects analysis to evaluate site terms (h S ) and remaining residuals (e ij ): The site terms represent the approximate misfits of the model used in the original residuals calculation (Equation 1) for sites in the full data set, after model misfits and event-terms have been subtracted.Figure 8 shows the trend of site terms with V S30 .The results in Figure 8 show no appreciable trend, except for sites with V S30 .1000 m/s, where downward trends are evident for some periods.The trends are similar for TOK sites as shown in chapter 5 of Centella et al. (2023).These results indicate that the V S30 -scaling component of the site amplification model is consistent with the data to a maximum V S30 of 1000 m/s.

Analysis of mean misfits
The data analysis in the prior subsections suggests that the TOK region has distinct features that affect distance attenuation and overall ground-motion levels.Moreover, as shown in Figure 1, TOK has a substantial event concentration (events per area) compared to the rest of CENA, which to some extent produces results that largely reflect TOK attributes.Although less compelling, CP sites also have some different ground-motion features, mainly in relation to large-distance anelastic attenuation.Accordingly, we developed different subsets of the CENA data for analysis of mean misfits: 1. Non-TOK / Non-CP: Events and sites within TOK and the CP are excluded.2. Non-TOK: All non-TOK events are considered, including CP.No TOK events are considered.3. Partial TOK: All non-TOK events are considered.Within TOK, nine events from the 146 are selected (randomly) so that the event density (number of events per area) is consistent with other parts of CENA.
Mixed-effects analyses (Equation 4) were repeated for each of these subsets of data using the central branch NGA-East GMM.This produces three sets of misfits (c k ); as shown in Figure 9, the mean of these sets 61 standard deviation are shown.For reference purposes, the c k term for the complete CENA data set is also shown.The results show a consistent trend of negative misfits at short periods (i.e.models are overpredicting these components of ground motion) and positive misfits at long periods (models are underpredicting).There are modest differences between the three data subsets, with partial TOK producing the largest misfits in terms of absolute value, non-TOK/non-CP the smallest, and non-TOK being an intermediate case.Nonetheless, all of the data subsets show less misfit than the complete CENA data set, indicating that the concentration of data from TOK is contributing the large misfits at short and long periods.
Figure 10 shows c k for the non-TOK data set using all 17 NGA-East GMMs, along with the population weighted mean and mean 6 one weighted standard deviation.The weights applied in these calculations were taken from Goulet et al. (2021a) for the 17 GMMs and were equal for the three data selection criteria.We did not evaluate potential data misfits in relation to magnitude and distance scaling for each of these 17 GMMs as was done for the central branch.It is possible that individual GMMs with large misfits in Figure 10 are influenced by scaling problems.Comparing Figures 9 and 10, it is clear that the uncertainty introduced by alternate GMMs substantially exceeds that from alternative data selection protocols.Although not shown here for brevity, the misfits found when only the NGA-East data are considered are similar to that shown for the non-TOK version of the expanded data set in Figure 9.Moreover, Boore (2020) observed qualitatively similar misfit trends to those reported here, using the NGA-East data set and the Boore (2018) GMM.

Model adjustment factors
We consider the 17 alternative NGA-East GMMs (weighted as given by Goulet et al., 2021a) and three data selection criteria (as given in Analysis of Mean Misfits) to compute   51 misfit terms.Various weighting combinations were considered for data selection, including equal weighting and weighting that gives preference to the non-TOK/non-CP data set.Ultimately, the recommended weights were 0.2 for the non-TOK set, 0.2 for the partial TOK set, and 0.6 for the non-TOK/non-CP set.All sites are considered in the development of the model adjustment factors presented in this section, and modifications for stiff sites with V S30 .1000 m/s are presented subsequently.
Figure 11a shows the resulting weighted mean of the 51 misfit terms (overall m) 6 one weighted standard deviation (s e ).A smoothed version of the misfits is also shown for use in forward applications as a model adjustment factor.The weighted mean misfits from the 51 values was found to be equivalent to the weighted mean misfits obtained using only the single central branch GMM and the three alternative data sets, which confirms that the central branch model is the weighted mean of the 17 alternative NGA-East GMMs. Figure 11b shows the period dependence of the overall standard deviation (across the 51 misfit terms) and the standard deviation from alternative data selection criteria only (s e, data ).The latter standard deviation (s e, data ) is computed using mean misfits from the central branch GMM with the three data sets (between-GMM uncertainties are not included).This standard deviation was found to be nearly identical to those obtained with other single GMMs from the group of 17.
For forward analysis in which the 17 NGA-East GMMs are considered in the logic tree, which is the preferred approach, there is no need to consider the between-GMM uncertainty in the logic tree (to do so would double-count this uncertainty).In this case, the applicable epistemic uncertainty is that labeled as ''alternate data selection'' in Figure 11b (s e, data ).If only the central branch GMM is considered, the central branch model adjustment factors and the larger epistemic uncertainty should be considered.Table 1 provides values of the recommended model adjustment factors after smoothing (same values as in Figure 11a) and standard deviations representing epistemic uncertainties.
To apply the model adjustment factors in forward ground-motion analyses, the factors are simply added to the natural-log mean ground motions as computed from the combined

Stiff site modifications
The site term plots in Figure 8 indicate a downward trend for stiff sites with V S30 .1000 m/s, whereas the mean of site terms is nearly zero with no trend for softer sites.To address this, we fit the V S30 trend of the site terms as follows: where V S30 is in m/s units and b is a coefficient that represents the slope of the site terms between 1000 and 2000 m/s, with the constraint of zero ordinate at 1000 m/s. Figure 12 shows the resulting values of b and the smoothed version recommended for application.
Table 1 provides the smoothed values.The effect of the modification is to reduce, but not eliminate, the recommended model adjustments for stiff sites.

Causes of misfits
The GMM used for residuals analysis in the previous section has three components: a hard-rock GMM (i.e.m ln, k in Equation 3), site amplification from hard-rock to V S30 = 760 m/s (F 760 ), and site amplification from 760 m/s to alternate V S30 values in the range of the data (F V ).Given the lack of trend of site terms with V S30 over the V S30 range containing most of the data (Figure 8), the F V model is unlikely to be the cause of the misfits, which instead are likely associated with some combination of the hard-rock GMM and F 760 .In this section, we assess the likely causes of the misfits.

Differences in NGA-West and NGA-East site amplification models
In this subsection, we investigate the degree to which differences between the site amplification models employed during NGA-East GMM development under the SSHAC process, relative to those now used in forward application, may explain some of the observed misfits.The application of site factors in the development of NGA-East GMMs occurred in two phases: (1) development of seed models (PEER, 2015), where in some cases data were adjusted to the 3000 m/s reference condition for model calibration; and (2) integration of seed models within the process of developing the final recommended GMMs (Goulet et al., 2018).As summarized by Parker et al. (2019) (their Table 1), the 10 NGA-East seed models were developed using a variety of approaches, including simulations to directly estimate ground motions for the reference site condition, ground response simulations to compute site responses for different site conditions (which in turn were used for data adjustments), and two-step data adjustments as reflected by Equation 2, in which the F V term is based on a western US ergodic site amplification model (Seyhan and Stewart, 2014; hereafter SS14) and F 760 is based on CENA-specific simulations.Further information on the model development procedures are given in PEER (2015) and are summarized by Parker et al. (2019) and section 4.4 of Centella et al. (2023).
For the present comparison, we focus on the second (integration) phase of this process, which used SS14 to adjust CENA ground motions to a site condition of V S30 = 760 m/s, followed by adjustments from the 760 m/s condition to the reference (3000 m/s) condition using F 760 .Four F 760 models derived from one-dimensional ground-response simulations for CENA profiles were considered (Atkinson, 2012;Boore, 2015 (two models);and Graizer, 2015).As noted by Goulet et al. (2018), these four F 760 models are similar, particularly in the 1-10 Hz frequency range.
To investigate the potential impact of the different site amplification models, we consider differences between SS14 combined with Atkinson (2012) for site corrections applied during model development and the Stewart et al. (2020) CENA site response model used in forward applications (e.g.Moschetti et al., 2024;Petersen et al., 2020Petersen et al., , 2023)).The assumption made during NGA-East GMM development can be viewed as taking the natural-log mean motion for a given V S30 (m dev ln, k V S30 ð Þ) as follows: where m ln, k 3000 ð Þ is the reference site GMM and F SS14 V and F Ã 760 are the two components of the site amplification model (the * superscript for F 760 indicates that multiple alternative models could be used, although Atkinson (2012) will be used here for demonstration purposes).In contrast, the mean model as applied in the 2018 and 2023 NSHMs is where F Sea20 V and F Sea20 760 are the Stewart et al. (2020) (Sea20) site amplifications.The difference in predicted mean ground motions produced by the different site amplification models can be taken by subtracting Equation 8 from Equation 7as shown in Equation 9: To understand the linkage between the difference in Equation 9with the residuals in Equation 3, it is useful to recognize that (1) the central tendency of the residuals, by definition, is c k and (2) the central tendency of the NGA-East data is Þ is fit to the data using the F SS14 V and F Ã 760 models.Accordingly, Equation 3 can be re-written as where the overbars represent means across the data population.
By substituting 10, we obtain Accordingly, a potentially reasonable hypothesis is that the mean misfits evaluated from residuals in this study may be influenced by the differences between the site amplification models.
Figure 13a and b show mean values of site amplification for the non-TOK data set as derived from the two models (TOK is not included for this analysis due to the relatively strong biases in that region); the F V and F 760 values shown were obtained by exercising the models for each site and then averaging across sites.Considering first F V , the amplification applied during model development ( SS14) is stronger at all periods, but the differences are most pronounced at long period (about 0.15 natural log units).This difference should cause positive misfits (Equation 11), as observed.
Multiple models for F Ã 760 are shown in Figure 13b, which are Atkinson (2012), Boore (2015) (two models), and Grazier (2015).The Boore and Grazier models are shown over the period range considered during the integration phase of NGA-East (Goulet et al., 2018), which are 0.1-1 s and 0.2-5 s, respectively, whereas the Atkinson model used for illustration purposes is shown over the full period range.The mean of F Ã 760 values over the period range 0.2-1 s is closest to Atkinson among the four models (generally within 0.1 ln units).
The differences in the Sea20 and Atkinson (2012) F 760 models in Figure 13a and b are small for T .0.4 s, but at short periods the Atkinson (2012) factors are much lower (about 0.4 natural log units between periods of 0.05 and 0.1 s).These short-period differences are caused by distinctly different shapes of the Sea20 and Atkinson (2012) F 760 models at short periods, likely indicating different implied k 0 values.The relatively low F 760 factors from Atkinson (also observed for Boore, 2015) produce negative misfits, as observed.
Figure 13c compares the differenced site corrections (Equation 9) to the model adjustment factors from Figure 116s e, data .These results demonstrate that the differences in site amplification and the adjustment factors have similar features; for example, the longperiod underprediction (positive factors) appears to be influenced by the much stronger long-period amplification for active crustal regions than for stable continental regions (as contained in the F V terms).Similarly, the short-period overprediction (negative factors) appears to align with differences in F 760 models, in particular the strong peak in F Sea20 760 that is absent in the Atkinson (2012) model.However, the positive adjustment factors at long periods are larger than suggested by the model differences, while the negative factors at short periods are smaller than suggested by these model differences.

Modifications to F 760 for CENA
Since the publication of the Sea20 F 760 factors, additional simulations of site response for sites with V S30 = 760 m/s have been performed by Ilhan et al. (2024) and chapter 7 of Centella et al. (2023).That work considered additional V S profiles, additional material damping formulations, and explicit consideration of the range of k 0 captured in the profiles.Most of the profiles apply for an impedance condition, as defined by Sea20. Figure 14 shows how F 760 factors derived from that work compare to those in Sea20, separated according to whether the profiles represent impedance or gradient conditions.For impedance conditions, the newer results are larger at long periods and smaller at short periods (T \ ;0.015-0.03s) than the mean factors in Sea20.For gradient conditions, the newer results are generally larger at short periods (T \ 0.25 s) and lower at long periods.Since the impedance condition is more typical for firm-ground sites (V S30 = 760 m/s), the differences between impedance models are more important.If a new F 760 model were to be developed that reflected these differences for impedance conditions, it would likely reduce the misfits at both long and short periods.
The F 760 reduction for impedance conditions at short periods is qualitatively consistent with Ktenidou and Abrahamson (2016), who anticipated the potential for overprediction of short-period site response, which they attributed to the NGA-East hard-rock k 0 = 0.006 s being too small.However, the amount of short-period misfits is smaller than anticipated by Ktenidou and Abrahamson (2016).

Misfit attribution
The differences between site amplifications used in NGA-East GMM development relative to those used in application produce ground-motion changes that are generally consistent with the period-dependent pattern of the proposed model adjustments (Figure 13c).While it is difficult to know how much of the overall misfit can be attributed to this effect, it directly impacts multiple seed models and influenced the model integration process that led to the 17 NGA-East GMMs.
As noted previously, we do not anticipate F Sea20 V as appreciably influencing the observed misfits.Some misfit may be from the F Sea20 760 model; at long periods, the newer V S profiles used for 760 m/s sites produce larger amplifications than were found previously for impedance conditions (Figure 14), which if adopted for applications would reduce but not eliminate the misfits.At short periods, the misfits are small and could easily be accounted for with adjustments to k 0 .The portions of the misfits that cannot be attributed to the F Sea20 760 model are likely associated with the hard-rock GMMs.

Conclusions
The USGS NSHM uses hard-rock reference site GMMs from the NGA-East project (Goulet et al., 2021a) and site amplification models recommended by an expert panel (Hashash et al., 2020;Stewart et al., 2020) to estimate ground motions in CENA.Due to asynchronicity in the development of these models, they are not fully compatible with each other.The adjustment factors presented herein allow for compatibility in the joint application of these models.These factors, in effect, adjust the constant terms in the GMMs; the scaling relations (i.e.changes in ground motions with source, path, and site parameters) are unaffected.
Using an expanded CENA data set (relative to that used in the NGA-East project), we examined residuals of the recommended GMMs.While expanded, the range of magnitudes in the database remains limited (M ; 4.0-5.8);hence, our focus was mainly to assess mean model misfit within the range of the data, rather than scaling relationships that are also important for hazard applications.Using the central branch GMM, these residual analyses indicate that for data outside of the TOK region, there is no evidence for bias in the magnitude-and distance-scaling components of the GMM for M ø 4 events and R rup < 600 km, with the exception of faster attenuation in CP regions that manifests at distances .300km.However, persistent period-dependent mean misfits were observed from the data-to-model comparisons for a wide range of alternate NGA-East GMMs and alternate data selection criteria (i.e.excluding data from particular regions).These misfits are toward overprediction at short periods and underprediction at long periods.Different levels of mean misfits were found for the TOK region, the CP region, and the remainder of CENA, with the TOK region having the most distinct characteristics.Model adjustment factors were derived in a manner that accounts for these regional differences by sampling data from the three regions in different ways, which contributes to epistemic uncertainty.
We anticipate that the misfits, which form the basis for proposed model adjustment factors, are associated with both the hard-rock GMMs and the F Sea20 760 factors, although the breakdown of relative contributions among these models is uncertain.For forward applications for commonly encountered site conditions in the range of the F Sea20 V model (200-2000 m/s), whether the misfit arises from the hard-rock GMMs or from the F Sea20 760 factors is inconsequential.For these sites, we recommend applying the model adjustment factors and their uncertainties (Table 1), including the stiff site modification factors (Equation 6) as applicable, to the sum of the hard-rock GMM and site response.The levels of uncertainty to be used depend on how the Probabilistic Seismic Hazard Analysis (PSHA) is conducted, as follows: 1.For PSHA in which all 17 NGA-East GMMs are used (such as in the 2018 and 2023 NSHM), the smoothed model adjustment factors should be used with the epistemic uncertainty given in Table 1 as s e,data .This is the preferred approach because the epistemic uncertainties contained within the 17 GMMs are preserved.
2. For PSHA in which only the central branch GMM is used, the smoothed model adjustment factors should be used with the uncertainty given in Table 1 as s e .
For applications where nonlinear site response is expected (i.e. the F nl term in Equation 1is non-zero), the PGA term that drives the nonlinearity should be modified by the PGA adjustment factor.
For applications in which only the hard-rock GMMs are to be applied, contributions to the bias from the site factors (F S ) should be removed, which we anticipate to be solely related to F Sea20 760 .The amount of this adjustment is uncertain and will depend on how knowledge of F 760 evolves in future work.Increases in F 760 at long periods, and decreases at short periods, which is possible based on recent results shown in Figure 14, would reduce misfits.For these hard-rock applications, we suggest the use of a logic tree in which different fractions of the smoothed model adjustment factors (m in Table 1), including the stiff site modification, are attributed to the hard-rock GMM.Logic-tree branches in which the full adjustments, various percentages of the full adjustments, and no adjustments are applied are recommended.Weights given to these branches would be guided by the degree to which the misfits are believed to be attributed to the F Sea20 760 model, particularly for impedance conditions.

Data resources
Raw ground-motion recordings were retrieved from International Federation of Digital Seismograph Networks (FDSN) data centers of IRIS Data Services using gmprocess (Hearne et al., 2019).IRIS Data Services are funded through the Seismological Facilities for the Advancement of Geoscience (SAGE) Award of the National Science Foundation under Cooperative Support Agreement EAR-1851048.The time series and metadata used in this study are archived in a publicly web-serviced ground-motion relational database (Buckreis et al., 2023).The ground-motion database uses MySQL (by Oracle Corporation, http://www.mysql.com/)as the management system, and an application programming interface (API) was written to facilitate queries using URLs (https://uclageo.com/gm_database/api/index.php, last visited 23 May 2023).Ground-motion records from NGA-East and newly added records are stored in the database under the ''collection_id''=3 and ''user_id''=2, respectively.The specific data resources applied in this project are provided as source, site, and ground-motion flatfile tables by Ramos-Sepu´lveda et al. (2023a).

Figure 1 .
Figure 1.(a) Locations of CENA earthquakes and ground-motion recording stations considered in the present study (Ramos-Sepu ´lveda et al., 2023a).Boundaries of Texas-Oklahoma-Kansas (TOK) and coastal plain (CP) regions (the CP boundary is defined using a minimum sediment depth, from Boyd et al., 2023, of 100 m).(b) Detailed view of Oklahoma where a high event density occurred.

Figure 2 .
Figure 2. Distributions of CENA data set with respect to rupture distance, V S30 , and magnitude, showing differences between NGA-East and added data.

Figure 3 .
Figure3.Numbers of available RotD50 spectral accelerations and events as a function of oscillator period.The NGA-East numbers are for M ø 4 events, to be compatible with the adopted data selection criteria.

Figure 4 .
Figure 4. Trends of event terms with magnitude for non-TOK/non-CP and CP regions.Binned means are for non-TOK/non-CP data and vertical bars through binned means indicate 6 one standard error of the mean.No binned means are shown for the CP region due to the limited number of events.

Figure 5 .
Figure 5. Trends of within-event residuals with distance (R rup ) for non-TOK/non-CP and CP regions.Vertical bars through binned means indicate 6one standard error of the mean.

Figure 6 .
Figure 6.Magnitude-dependence of event terms for TOK and non-TOK regions.Vertical bars through binned means indicate 6one standard error of the mean.

Figure 7 .
Figure 7. Distance-dependence of within-event residuals for full data set.Vertical bars through binned means indicate 6one standard error of the mean.

Figure 8 .
Figure 8. Site terms trends with V S30 for full data set.Vertical bars through binned means indicate 6one standard error of the mean.

Figure 9 .
Figure 9. Period dependence of misfit term c k for NGA-East central branch GMM for alternate data sets.The shaded regions enclose 6one standard error.

Figure 10 .
Figure 10.Period dependence of misfit term c k for 17 NGA-East GMMs for Non-TOK data and weighted mean misfit.

Figure 11 .
Figure 11.(a) Period dependence of model adjustment factor c k and its uncertainty, as derived from 51 misfit terms and (b) overall standard deviations for all 51 misfit terms (three data sets, 17 alternative NGA-East GMMs) and standard deviations solely related to alternate data set selections.

Figure 12 .
Figure 12.Slope of site terms with V S30 between 1000 and 2000 m/s as a function of period.

Figure 13 .
Figure 13.(a) Mean site amplification from F V and F 760 components of the Stewart et al. (2020) model across all CENA sites in the non-TOK data set; (b) Mean site amplification from the F V (SS14) and F 760 (A12, B15, Fea96, G15) models across all CENA sites in the non-TOK data set; and (c) comparison of site response differences (Equations 9 and 11) to the recommended model adjustment factors.A12 = Atkinson, 2012; B15 and Fea96 from Boore, 2015; G15 = Graizer, 2015; SS14 = Seyhan and Stewart (2014).

Figure 14 .
Figure14.Average F 760 values for CENA sites as derived from a weighted combination of the impedance and gradient models as given by Sea20 compared to F 760 derived from recent simulations reported inCentella et al. (2023).

Table 1 .
Recommended natural-log model adjustment factors (smoothed weighted mean of c k values), epistemic uncertainties (expressed in the form of a natural-log standard deviation), and scaling coefficients for modifying model adjustment factors for high V S30 sites (Equation6) PGV: peak ground velocity; PGA: peak ground acceleration.