Abstract
Objective
The objective of this paper is to evaluate the performance of the Systemic Lupus Erythematosus Disease Activity Index (SLEDAI-2K) in detecting clinically meaningful changes in SLE disease activity.
Methods
A longitudinal cohort study was conducted of 334 SLE patients during a 36-month follow-up. At each outpatient visit, disease activity was scored using the Physician Global Assessment (PGA) and SLEDAI-2K. Correlations between PGA and SLEDAI-2K were assessed. A clinically meaningful change in SLE disease activity was defined as a ΔPGA ≥ 0.3 points from baseline. Performance of SLEDAI-2K in detecting a clinically meaningful worsening or improvement was tested using receiver operating characteristic (ROC) analysis.
Results
Adjusted mean PGA and SLEDAI-2K scores presented a high correlation (rho = 0.824, p < 0.0005). In ROC analysis, a SLEDAI-2K variation presented an area under the curve (AUC) of 0.697 (95% confidence interval (CI) (0.628–0.766), p < 0.0005) to detect a clinically meaningful improvement, with a sensitivity of 28.8% for a SLEDAI-2K ≥ 4 reduction. The AUC to detect a clinically meaningful worsening was 0.877 (95% CI (0.822–0.932), p < 0.0005), with a sensitivity of 35.3%.
Conclusions
SLEDAI-2K has a limited ability to detect clinically meaningful changes in SLE disease activity, failing to identify almost two-thirds of cases judged as having a clinically meaningful improvement or worsening. There is a need for more sensitive SLE disease activity measures in clinical practice and research.
Introduction
The assessment of disease activity has a central role in clinical daily practice, research and clinical trials involving patients with systemic lupus erythematosus (SLE). Multiple disease monitoring instruments have been developed, but all present important limitations.
The SLE Disease Activity Index (SLEDAI), a global score index that includes the sum of 24 weighted clinical and laboratory variables, with its SLEDAI-2K and Safety of Estrogens in Lupus Erythematosus National Assessment (SELENA)-SLEDAI versions, is the most widely used SLE disease activity measure.1–3 SLEDAI is used worldwide in daily clinical practice. Moreover, it is the core determinant of the SLE Responder Index (SRI4) used in clinical trials and also to define flares, remission and low disease activity states.4–7
However, the inability to demonstrate clinically meaningful improvements in recent trials, despite a positive clinical impression of efficacy, has raised the debate about the performance of SLEDAI in detecting clinically meaningful changes in SLE disease activity. 8 Each item of the SLEDAI is scored dichotomically, resulting in the attribution of the same numerical weight, irrespective of the severity of change observed. Additionally, potentially severe lupus manifestations, such as hemolytic anemia, pneumonitis or gastrointestinal activity are not scored in SLEDAI. These aspects can be expected to hinder its ability to identify partial, albeit clinically meaningful, improvement or worsening. Such a limitation, if proven, would question the application of this instrument not only in target-driven management of SLE but also in the clinical evaluation of new medications.
The aim of this study is to evaluate the performance of SLEDAI-2K in detecting a clinically meaningful change in SLE disease activity in a clinical practice setting.
Materials and methods
Study design
A longitudinal cohort study was conducted from January 2014 to December 2016, at a tertiary care lupus clinic. Consecutive patients fulfilling the American College of Rheumatology 1997 and/or the Systemic Lupus Erythematosus International Collaborating Clinics 2012 classification criteria were included.9,10 This project adheres to the principles of Declaration of Helsinki, was approved by the local ethics committee, and informed consent was obtained from all patients.
Data collection
At each outpatient visit, disease activity during the previous 30 days was scored according to the Physician Global Assessment (PGA) (0–3 points, on a 10-cm scale) and SLEDAI-2K (0–105), by the same senior rheumatologist (L.S.I.), with extensive experience in SLE management. PGA and SLEDAI-2K were scored blinded for the previous assessments. A difference in PGA ≥ 0.3 points at follow-up visits compared with study baseline was considered a clinically meaningful change in SLE disease activity, both for improvement and worsening. 4
Statistical analysis
Descriptive analysis was performed for demographic and disease activity data at baseline. Correlation between PGA and SLEDAI-2K values was assessed through Spearman's test. SLEDAI-2K– and PGA-adjusted means were determined as the area under the curve (AUC) of its serial values at each visit, divided by the total time length. 11 We also assessed the Spearman's correlation between changes from baseline, observed in SLEDAI-2K and in PGA, at the visits with a clinically meaningful worsening or improvement.
Performance of SLEDAI-2K in predicting clinically meaningful changes in SLE disease activity was assessed through receiver operating characteristic (ROC) curve analysis. For patients with a clinically meaningful change, differences in SLEDAI-2K were calculated from baseline to the first visit at which this change occurred. For patients with clinically stable disease according to PGA over the entire follow-up, the SLEDAI-2K variation between baseline and last visit was used. Sensitivity, specificity, positive and negative predictive values (PPV, NPV) of cut-off variations in SLEDAI-2K ≥ 1 and ≥4 points in identifying a clinically meaningful change in SLE disease activity were calculated. Statistical significance was set at p < 0.05. Analyses were performed with IBM SPSS Statistics, V.24, and with R, V.3.1.2. 12
Results
Demographic and clinical data at baseline
Baseline characteristics of the patients included
SLEDAI-2K: Systemic Lupus Erythematosus Disease Activity Index 2000; PGA: Physician Global Assessment.
Correlation between PGA and SLEDAI-2K
At baseline, PGA and SLEDAI-2K scores were strongly correlated (rho = 0.766, p < 0.0005). The adjusted means of PGA and SLEDAI-2K over the 36-month follow-up were also strongly correlated (rho = 0.824, p < 0.0005). During follow-up, 80 and 51 patients had a clinically meaningful improvement and worsening, respectively, based on a difference in PGA ≥ 0.3 points in follow-up visits compared to study baseline. The clinically meaningful improvement and worsening occurred at a mean time of follow-up of 6.9 ± 7.3 and 14.3 ± 10.7 months, respectively. The disease activity manifestations of the patients with a clinically meaningful change, at baseline and at the clinically meaningful change visit, can be found in Supplementary Tables 1 and 2. The correlation of the variation in SLEDAI-2K and PGA between the baseline and the visit with a clinically meaningful improvement and worsening was 0.398 (p < 0.0005) and 0.463 (p < 0.0001), respectively.
Performance of change in SLEDAI-2K
The ROC curve analysis of the performance of SLEDAI-2K in identifying clinically meaningful changes in disease activity revealed an AUC of 0.697 (95% confidence interval (CI) = 0.628–0.766, p < 0.0005) for improvement, and of 0.877 (95% CI = 0.822–0.932, p < 0.0005) for worsening (Figure 1). Estimated sensitivities, specificities, PPV and NPV are presented in Table 2 and Supplementary Tables 3 and 4, using two different cut-offs of change in SLEDAI-2K: ≥1 and ≥4 points.
ROC Curve analysis of the performance of the SLEDAI-2K to detect a clinically meaningful improvement (a) and worsening (b). Performance of SLEDAI-2K to detect a clinically meaningful change in PGA Sens: Sensitivity (%); Spec: Specificity (%); PPV: Positive predictive value (%); NPV: Negative predictive value (%).
Discussion
We found that SLEDAI-2K scores and their changes over time are generally in good agreement with the clinical estimation of disease activity, as described by PGA. This is in agreement with previous reports.13,14 However, the performance of SLEDAI-2K in identifying cases of significant improvement or aggravation is inadequate for use in clinical practice and research. In fact, a variation in SLEDAI-2K ≥ 4 presents low sensitivity in identifying clinically meaningful changes in disease activity, failing to capture most events of either improvement or worsening. As expected, sensitivity is increased by using a threshold of one point difference, but with an unacceptable decrease in specificity and, especially, PPV.
Previous studies on the performance of SLEDAI have yielded heterogeneous results, depending on the analytical approach, the cut-offs of change and the comparator used for defining clinically meaningful change.13,15–19 To define meaningful change in SLEDAI and PGA, we used the cut-offs adopted in SRI4 (4 and 0.3 points, respectively), an outcome measure that has been extensively used as a primary endpoint in SLE clinical trials. 4 Although a variation in PGA ≥ 0.3 is not a validated cut-off, it is considered to be a clinically meaningful change in SLE disease activity, not only for worsening (as it is incorporated in the SRI), but also for improvement.4,20,21 Importantly, a detailed analysis of the lupus disease activity items in the patients that in our study were identified by this PGA-based approach as presenting an improvement or worsening showed that in all cases there was a corresponding change in objective activity items (Supplementary Tables 1 and 2). Moreover, recently our group published a new SLE-Disease Activity Score (SLE-DAS) and used a PGA variation ≥0.3 points to identify the optimal discriminative variation of SLE-DAS to detect a clinically meaningful change. 21 That SLE-DAS cut-off presented high performance in identifying clinically significant changes in disease activity in an external validation cohort. An alternative approach to identify a clinically meaningful change could be based on treatment change (stop/decrease and start/increase) compared with a previous visit. However, this approach is not without limitations, because changes in treatment also reflect the treating physician’s disease activity judgement, similarly to the PGA. Furthermore, recommended treatment changes do not always reflect an actual disease activity variation as it happens, in the lupus nephritis induction regimen, with the corticosteroid standard tapering protocol, usually started before improvement. 22
The PGA is an inherently subjective measure, but not to the point of being discretionary. Our results support its validity by demonstrating a high correlation between SLEDAI-2K and PGA at each visit, as well as between the mean time-adjusted SLEDAI-2K and PGA. The fact that both were scored always by the same experienced rheumatologist may have added to this consistency. The use of the PGA to assess lupus disease activity in research has received renewed interest and support, not only with its inclusion in SRI4, but also with the development of the Lupus Foundation of America-Rapid Evaluation of Activity, an index based on the assessment of a PGA for each organ system. 23 The SRI4 has been applied in some clinical trials using the SELENA-SLEDAI version, while in others the SLEDAI-2K was included; the major difference is that SLEDAI-2K always scores proteinuria >500 mg per day because of active lupus nephritis, while SELENA-SLEDAI does not score persistent proteinuria, resulting in an inappropriate four-point reduction in cases with persistently active lupus nephritis.
The limited sensitivity of SLEDAI to change seems to be an unavoidable consequence of scoring each disease activity item dichotomically, thus ignoring the severity of abnormalities. For example, thrombocytopenia of 90,000 or 5000/mm 3 is equally scored (one point); lupus arthritis with 2 or 20 active joints is equally scored (four points); and the reduction of proteinuria due to lupus nephritis from 2500 mg per day to 600 mg per day is not reflected in SLEDAI-2K (four points). This obviously hinders the use of SLEDAI to monitor and guide treatment in the individual patient, as well as its validity as a tool to evaluate the efficacy of novel treatment agents or strategies.
The results of our study have major implications in the interpretation of clinical trials applying SRI4 as the primary endpoint. SRI4 defines efficacy as an improvement of ≥4 points in SLEDAI from baseline. The other components (British Isles Lupus Assessment Group and PGA) are used only to validate a patient as a responder, as they are required only to not worsen significantly, in an attempt to increase specificity. Given the low sensitivity of a change in SLEDAI-2K ≥4 to detect clinically meaningful change, it is likely that treatment effects are largely underestimated by SRI4.
The inability of SLEDAI-2K to capture partial responses in disease activity led to the development of the SLEDAI-2K Responder Index 50 (SRI-50), which can capture a ≥50% improvement in disease among lupus patients. 24 Its construct validity, reliability and ability to identify responders have been shown in several studies.24–28 However, the SRI-50 cannot identify worsening of disease activity.
Limitations of our study include its monocentric setting. The use of the PGA as the gold standard for clinically meaningful change in disease activity may be exposed to criticism, but the absence of a more objective alternative, its strong correlation with SLEDAI and its wide acceptance by the clinical and research communities support its use.
The prospective assessment of data at each clinical visit with consistently scored SLEDAI-2K and PGA, the large cohort with long duration of follow-up, and the real-life setting are important strengths of our study. To the best of our knowledge, this study is the first evaluating the performance of SLEDAI in a real-life clinical setting.
In conclusion, we demonstrated that SLEDAI-2K has a low sensitivity in identifying clinically meaningful changes in SLE disease activity. This is a major limitation both in clinical trials and daily practice. Research is needed to develop SLE disease activity measures with improved sensitivity to change.
Supplemental Material
Supplemental material for Performance of SLEDAI-2K to detect a clinically meaningful change in SLE disease activity: a 36–month prospective cohort study of 334 patients
Supplemental Material for Performance of SLEDAI-2K to detect a clinically meaningful change in SLE disease activity: a 36–month prospective cohort study of 334 patients by D Jesus, M Rodrigues, A Matos, C Henriques, J A Pereira da Silva and L S Inês in Lupus
Footnotes
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
