Abstract
Aim
To evaluate whether adding base excess (BE), alone or in combination with lactate, to the modified Rapid Emergency Medicine score (mREMS) improves 30-day mortality prediction in emergency department (ED) red-zone patients.
Materials and Methods
This observational cohort study included eligible ED red-zone patients presenting between May 2018 and December 2019 (n=1798). Vital signs, Glasgow Coma scale (GCS) scores, and blood gas parameters were collected. Because the relationship between BE and 30-day mortality was non-linear, BE was categorized using a data-driven, mortality-associated classification approach. The resulting categories were weighted using effect sizes from multivariable logistic regression and converted into an ordinal variable (0-3 points), which was integrated into the mREMS framework. Three models were developed: mREMS-Be, mREMS-L, and mREMS-LBe. Model discrimination was evaluated using the area under the curve (AUC) and comparisons were performed using the DeLong test.
Results
Thirty-day mortality was associated with older age, lower systolic blood pressure, lower SpO2, and lower GCS (p<0.05). BE was not predictive when modeled as a continuous variable; however, it demonstrated an independent association with mortality when modeled as an ordinal variable (p=0.022). The AUC of the baseline mREMS was 0.753; it increased to 0.767 with the addition of BE and to 0.782 when BE was combined with lactate. Integration of BE into mREMS resulted in a significant improvement in model discrimination in both the overall and the non-traumatic red-zone patient groups.
Conclusion
Incorporating BE into mREMS provides complementary prognostic value by improving 30-day mortality discrimination among high-acuity ED patients.
Introduction
Early recognition of clinical deterioration in emergency departments (EDs) is critically important for ensuring patient safety and for directing advanced therapeutic resources efficiently (1). Accordingly, Early Warning scores (EWS), which rely on physiological parameters, are widely used as complementary tools to support the rapid and systematic identification of high-risk patients (2). However, it is well established that systems based solely on vital signs may be limited, particularly during the early phases of metabolic derangement when clinical manifestations have not yet emerged (3). For this reason, efforts to integrate metabolic biomarkers into physiologic scoring systems have gained increasing importance with the aim of improving diagnostic accuracy (4).
In this context, blood gas lactate is a powerful metabolic biomarker reflecting tissue hypoperfusion and anaerobic metabolism, and it has been shown to be closely associated with mortality in sepsis, trauma, and various critical illnesses (4, 5). Base excess (BE) in blood gas analysis, on the other hand, reflects the severity of metabolic acidosis and serves as a comprehensive indicator of metabolic stress, particularly in patients with comorbidities or active compensatory mechanisms (6-8).
Among EWS models, the modified Rapid Emergency Medicine score (mREMS) incorporates practical parameters such as age, vital signs and neurological status and is frequently used to predict mortality in the ED (9). Although the current literature does not definitively establish mREMS as the optimal score for ED populations, its component variables and predictive potential make it a useful model (9, 10). While it was initially validated in trauma patients, efforts have been made to broaden its use to general ED cohorts (9-11). Recently, Donoso Calero et al. (12) demonstrated that integrating lactate into mREMS improved its predictive performance in a general ED patient population. Considering the prognostic value of BE, its representation of a metabolic dimension distinct from that of lactate, and the theoretical advantage of combining these biomarkers to achieve a more comprehensive risk assessment, BE emerges as an additional biomarker with potential for integration into EWS models such as mREMS (13-15).
This study evaluates BE-augmented versions of mREMS in a large cohort of red-zone patients from a tertiary ED using a three-level triage system (red-yellow-green) (16). Building on established evidence for lactate integration, the primary objective is to assess the incremental prognostic value of BE, alone and in combination with lactate, for 30-day mortality. Secondary objectives include comparison with the original mREMS and between BE-augmented models. The findings are expected to support earlier identification of high-risk ED patients, more effective use of limited healthcare resources, and inform the development of metabolically augmented mREMS variants that may be suitable for future integration into digital and automated clinical decision-support systems.
Materials and Methods
Study Design and Data Source
This observational cohort study evaluated the 30-day mortality risk among adult ED patients classified as red-zone cases. The study population consisted of eligible red-zone patients who presented to a tertiary training and research hospital in Turkey between 10 May 2018 and 12 December 2019 and for whom all predefined inclusion criteria were met and prospectively recorded study parameters were available. A previous publication from our group utilized only a subset of this dataset comprising 868 patients who met ESI 1-2 criteria; (17), however, the present study includes all eligible red-zone patients who fulfilled the predefined inclusion criteria and had complete data available for analysis (n=1798), and differs from the earlier work in terms of sampling frame, analytical strategy, and study objectives. Accordingly, the current study represents a retrospective secondary analysis of prospectively recorded data.
The study complied with the Declaration of Helsinki. Written informed consent had been obtained during the initial prospective data collection. For the current retrospective analysis of the entire cohort, additional approval was obtained from the Scientific Research Ethics Committee of the University of Health Sciences Türkiye, Trabzon Faculty of Medicine (decision no: 2024/11, date: 05.11.2024), and renewed consent was not required. All data were anonymized prior to analysis and handled in accordance with institutional data confidentiality policies.
Study Population
Patients aged ≥18 years who were classified as red-zone cases in the ED, who had complete vital signs recorded at initial presentation, who had available arterial or venous blood gas analyses were included. Completeness was defined as the availability of all variables required to calculate mREMS at initial presentation. Exclusion criteria were: presentation with cardiac arrest, pregnancy, missing mandatory parameters for mREMS calculation, and duplicate visits. The patient inclusion and exclusion steps, with corresponding numbers, are shown in Figure 1.
Vital parameters were measured by triage nurses, and Glasgow Coma scale (GCS) was assessed by an emergency physician. Blood gas analyses were performed using a Radiometer ABL90 FLEX device integrated within the ED. Thirty-day mortality was verified through the national Death Notification System. Etiological classification (infectious or traumatic) was independently assessed by three emergency medicine specialists, and final categorization was established based on agreement between at least two specialists.
BE Classification and Scoring Method
Given that the association between BE and mortality has been shown in the literature and in logistic regression analyses of the present dataset to be non-linear, the inclusion of BE in the model as a continuous variable was considered inappropriate (8, 18, 19). Therefore, a data-driven approach was adopted to examine how the relationship between BE distribution and mortality risk varied across BE ranges.
Accordingly, BE values in the overall cohort were stratified into quintiles based on their distribution, and 30-day mortality rates were calculated for each quintile. This exploratory assessment indicated a non-linear risk distribution, with higher mortality observed at both markedly negative and markedly positive BE values. Adjacent quintiles exhibiting similar mortality rates were subsequently merged, resulting in four BE categories representing progressively increasing levels of risk.
In weighting the derived BE categories, reliance on crude mortality rates alone was avoided, and an approach accounting for the effects of potential confounders was employed. To this end, BE categories were entered into a multivariable logistic regression model together with established clinical variables, including age, vital signs, and GCS score. The adjusted odds coefficients obtained from this model were used to guide the assignment of ordinal risk points to the BE categories. Based on the observed risk gradient, BE categories were scored from 0 to 3 as follows:
• BE 2.2 to 4.6 mmol/L: 0 point
• BE ≥0.1 to <2.2 mmol/L or >4.6 mmol/L: 1 point
• BE ≥-3.2 to <0.1 mmol/L: 2 points
• BE <-3.2 mmol/L: 3 points
This approach ensured that BE classification and weighting were grounded in the risk structure observed within the study cohort and adjusted for relevant covariates, rather than relying on externally predefined thresholds. However, acknowledging that the prognostic signal of BE may be sensitive to the modeling approach, the maximum BE score was capped at 3 points to avoid disproportionate weighting that could obscure the prognostic contribution of lactate within the composite score.
Following score assignment, BE was treated as an ordinal variable and re-entered into multivariable logistic regression analysis. In this analysis, ordinal BE demonstrated a statistically significant association with 30-day mortality. This finding suggests that the derived BE categories exhibit internal consistency with respect to risk direction. Consequently, BE was integrated into mREMS and mREMS-based models as a complementary metabolic parameter reflecting a non-linear risk pattern.
Statistical Analysis
All primary statistical analyses were conducted using JAMOVI (v. 2.6.44). Distributional characteristics of continuous variables were assessed using the Shapiro-Wilk test in conjunction with histogram and Q-Q plot inspection. Normally distributed variables were summarized as mean ± standard deviation, whereas non-normally distributed variables were summarized as median (interquartile range). Categorical variables were summarized as counts and percentages. Between-group comparisons were performed using the Mann-Whitney U test for continuous variables and χ² test for categorical variables.
To identify independent predictors of 30-day mortality, two multivariable binomial logistic regression models were constructed. In the first model, BE was included as a categorical variable (BE0-3) to estimate category-specific adjusted odds ratios (ORs) relative to the lowest-risk reference group. In the second model, BE was incorporated as an ordinal variable (0-3 points), reflecting a progressively increasing risk of mortality based on adjusted effect sizes observed in the categorical model. This sequential modeling strategy enabled the evaluation of both category-specific effects and monotonic risk behavior for BE, while minimizing assumptions about linearity.
For multivariable models, regression coefficients (β), OR, 95% confidence intervals (CIs), and p values were reported. ORs and corresponding 95% CIs were derived directly from regression coefficients [OR = exp(β), 95% CI = exp(β±1.96 × standard error)]. Model fit was assessed using the Akaike information criterion and McFadden’s R².
Model discrimination was assessed using ROC curves and area under the curve (AUC) values. Optimal cut-off points for each score were determined using the Youden index to provide an objective balance between sensitivity and specificity. Differences in AUC among the four evaluated scores (mREMS, mREMSBe, mREMS-L, and mREMS-LBe) were analyzed using pairwise DeLong tests. A p value<0.05 was considered statistically significant for all analyses.
To address potential overfitting associated with data-driven BE categorization, internal validation of all mREMS-based models was performed using bootstrap resampling with 5.000 iterations. Calibration performance, particularly for BE-containing models, was further evaluated using decile-based calibration plots in the overall study population.
Results
A total of 1,798 red-zone ED patients were included in the study (Figure 1). The most frequent diagnoses in the cohort were pneumonia (23.0%), cerebrovascular disease (14.5%), and multiple trauma (13.7%) (Table 1). The overall population was predominantly elderly, and non-survivors were significantly older than survivors (76 vs. 67 years, p<0.001; Table 2). Although the sex distribution was comparable between groups, respiratory rate and heart rate were higher in the mortality group, whereas systolic blood pressure (SBP), mean arterial pressure (MAP), and SpO2 were lower (all p<0.001). GCS was also significantly lower among non-survivors (Table 2).
Evaluation of blood gas parameters revealed pronounced metabolic impairment in patients who died. BE values were more negative, and although median lactate levels were identical between survivors and non-survivors, non-survivors demonstrated a wider interquartile range and higher upper lactate values (both p<0.001; Table 2).
Distribution of mREMS and its metabolically enhanced variants showed that all scores were significantly higher among non-survivors (all p<0.001; Table 2). Greater score separation between survivors and non-survivors was observed for mREMS-L and mREMS-LBe compared with the baseline mREMS.
In multivariable logistic regression analysis, age, SBP, SpO2, and GCS were identified as independent predictors of 30-day mortality [all p<0.05; Table 3 (a-b)]. BE when entered as a continuous variable was not independently associated with 30-day mortality and therefore did not demonstrate a significant linear effect. However, when modeled as an ordinal variable, BE showed an independent association with mortality (OR per category increase: 1.200; p=0.022; Table 3b). Lactate remained a strong independent predictor across all models (p<0.001).
In the ROC analysis, the baseline mREMS model demonstrated discriminative performance, yielding an AUC of 0.753. Augmentation of mREMS with BE resulted in a modest improvement in discrimination (AUC=0.767), while integration of lactate led to a further increase in AUC to 0.778. The combined mREMS-LBe achieved an AUC of 0.782. Although mREMS-LBe yielded the numerically highest AUC, its performance was not statistically superior to that of mREMS-L (Table 4). Pairwise DeLong tests demonstrated that mREMS-Be, mREMS-L, and mREMS-LBe all performed significantly better than mREMS (p=0.006, 0.002, and 0.002, respectively). While mREMS-LBe also outperformed mREMS-Be (p=0.014), it did not show a statistically significant advantage over mREMS-L (p=0.228). ROC curves for the overall cohort are shown in Figure 2a.
Internal validation using bootstrap resampling demonstrated minimal optimism across all models. Optimism-corrected AUCs were 0.7663 for mREMS-Be and 0.7816 for mREMS-LBe, closely approximating apparent performance. Model accuracy assessed by the Brier score was 0.0976 for mREMS-Be and 0.0966 for mREMS-LBe. Decile-based calibration analyses demonstrated good agreement between predicted and observed 30-day mortality across risk strata in the overall cohort (Figure 3a-b; Table 5). In subgroup analyses, all models demonstrated high discriminative ability in trauma patients (AUCs: 0.873 for mREMS, 0.865 for mREMS-Be, 0.846 for mREMS-L, and 0.837 for mREMS-LBe), however, pairwise DeLong test comparisons revealed no statistically significant differences between the models (all p>0.05; Table 4). In the non-traumatic subgroup, integration of both BE and lactate yielded higher AUC values compared with baseline mREMS (p<0.05 for all relevant DeLong comparisons). Although mREMS-LBe showed the numerically highest AUC in this subgroup as well, it did not show a significant advantage over mREMS-L. ROC curves for the traumatic and non-traumatic subgroups are presented in Figures 2b and 2c, respectively.
Overall, metabolically augmented mREMS models demonstrated improved discrimination in predicting 30-day mortality compared with the baseline score.
Discussion
This study demonstrates that integrating mREMS-Be significantly improves 30-day mortality prediction among critically ill ED patients. The mREMS model enriched with lactate and BE provides superior discrimination compared with mREMS-Be and demonstrates predictive performance comparable to that of the previously validated mREMS-L model in critically ill ED patients not limited to trauma.
Integration of mREMS-L is not a novel concept. A recent multicenter study by Donoso Calero et al. (12) in a mixed ED population showed that mREMS-L significantly improved the performance of the original mREMS. Our results were consistent with those reported in this study. These findings suggest that mREMS, which has demonstrated strong prognostic capability primarily in trauma populations, may also be applicable to non-trauma ED populations when combined with lactate (9, 10, 12).
In non-traumatic etiologies, clinical deterioration may progress more insidiously and vital signs may lag behind early metabolic impairment; consequently, classical EWS models such as mREMS may be less sensitive in the early recognition of clinical deterioration (20-22). In this context, lactate, as a marker of metabolic stress and early hypoperfusion (4, 23) may help mitigate this limitation. As another clinically informative component of blood gas analysis reflecting metabolic reserve, BE has clear theoretical relevance; however, to our knowledge, its integration into an EWS has not been widely evaluated (7, 8, 18, 19, 24). Although mREMS does not represent all EWSs, incorporating BE into mREMS allowed us to explore this question within a pragmatic and commonly used framework. Given lactate’s established prognostic capacity and BE’s ability to reflect metabolic reserve, the combined use of these biomarkers may offer complementary clinical information.
On the other hand, the question of whether BE or lactate better predicts clinical or metabolic deterioration in critically ill patients remains unresolved (25-28). Caputo et al. (26) reported that lactate and BE were equivalent in predicting shock among trauma patients with normal vital signs, whereas other studies have shown context-dependent differences, with lactate or BE demonstrating relative advantages in specific populations (28, 29). Several investigations further suggest that combining lactate and BE may improve the prediction of clinical deterioration compared with either biomarker alone (8, 23). Accordingly, lactate and BE were incorporated into the mREMS framework both standalone and in combination.
Our results show that BE, consistent with the literature, did not exhibit a linear association with mortality, but became prognostically informative when modeled as an ordinal variable (4, 19, 24). This pattern may reflect threshold-dependent depletion of metabolic reserve rather than a uniform linear effect. Accordingly, ordinally scored BE, both alone and combined with lactate, improved prognostic accuracy when integrated into mREMS in ED patients, not limited to trauma. These findings are also consistent with evidence from previous systematic reviews and meta-analyses demonstrating that the integration of metabolic biomarkers into physiology-based scoring systems enhances the prediction of clinical deterioration in critically ill patients, not limited to trauma (20, 21, 30). In the overall cohort and the non-traumatic subgroup, mREMS-LBe demonstrated superior discriminative performance compared with those of the original mREMS and mREMS-Be. The lack of a significant performance advantage over mREMS-L suggests that the observed improvement is largely attributable to the dominant prognostic contribution of lactate (18, 23). In this context, BE appears to function as a complementary metabolic biomarker rather than as an independent determinant of prognostic discrimination, and integration of BE and lactate may represent an option for enhanced risk stratification in general ED populations, pending external validation.
In the trauma cohort, incorporation of BE, lactate, or both biomarkers did not yield a statistically significant improvement in the discriminative performance of mREMS. This finding may reflect limited statistical power due to the relatively small trauma sample size. Nonetheless, prior studies have demonstrated that BE performs comparably to lactate and may be superior in specific clinical contexts for predicting mortality and adverse outcomes in trauma populations (29, 31, 32). Given the established prognostic utility of mREMS in trauma patients, further evaluation of BE integration in larger, adequately powered trauma cohorts is warranted as part of future model refinement efforts.
In non-trauma populations, relatively high mortality rates and the limited ability of conventional vital sign–based parameters to reliably detect early physiological deterioration highlight the increasing importance of integrating metabolically sensitive biomarkers, such as lactate and BE, into physiology-based EWS (20-22,33). However, in the context of the present study, translating this physiological rationale into a statistically robust and clinically applicable modeling strategy required several methodological assumptions, including data-driven categorization of BE. Although this approach was intended to account for the non-linear association between BE and 30-day mortality, the derivation of cohort-specific thresholds may introduce optimism bias and limit model transportability to other ED populations. While bootstrap-based internal validation and calibration analyses demonstrated minimal optimism and acceptable internal stability, these methods do not ensure external validity. Therefore, the performance of BE-augmented mREMS models should be interpreted with caution, and multicenter external validation is warranted before broader clinical application.
Study Limitations
Limitations of this study include the relatively small trauma subgroup, which prevented adequate statistical power; the possibility that arterial-venous differences in blood gas sampling introduced additional variability in BE; and the single-center retrospective design, which may limit generalizability. Thus, validation of the integration of metabolic biomarkers into mREMS requires multicenter, prospective studies.
Conclusion
This study demonstrates that integration of BE into mREMS, whether alone or combined with lactate, is associated with improved discrimination in predicting 30-day mortality among high-acuity ED patients, with BE serving as a complementary metabolic parameter rather than supplanting the established prognostic role of lactate. Following external validation, BE-augmented mREMS models may be considered for integration into automated clinical decision-support systems in acute care settings.


