Abstract
Background:
The mechanism(s) of cognitive impairment remains complex, making it difficult to confirm the factors influencing poststroke cognitive impairment (PSCI).
Objective:
This study quantitatively investigated the degree of influence and interactions of clinical indicators of PSCI.
Methods:
Information from 270 patients with PSCI and their Wechsler Adult Intelligence Scale (WAIS-RC) scores, totaling 18 indicators, were retrospectively collected. Correlations between the indicators and WAIS scores were calculated. Multiple linear regression model(MLR), genetic algorithm modified Back-Propagation neural network(GA-BP), logistic regression model (LR), XGBoost model (XGB), and structural equation model were used to analyze the degree of influence of factors on the WAIS and their mediating effects.
Results:
Seven indicators were significantly correlated with the WAIS scores: education, lesion side, aphasia, frontal lobe, temporal lobe, diffuse lesions, and disease course. The MLR showed significant effect of education, lesion side, aphasia, diffuse lesions, and frontal lobe on the WAIS. The GA-BP included five factors: education, aphasia, frontal lobe, temporal lobe, and diffuse lesions. LR predicted that the lesion side contributed more to mild cognitive impairment, while education, lesion side, aphasia, and course of the disease contributed more to severe cognitive impairment. XGB showed that education, side of the lesion, aphasia, and diffuse lesions contributed the most to PSCI. Aphasia plays a significant mediating role in patients with severe PSCI.
Conclusions:
Education, lesion side, aphasia, frontal lobe, and diffuse lesions significantly affected PSCI. Aphasia is a mediating variable between clinical information and the WAIS in patients with severe PSCI.
INTRODUCTION
Post-stroke cognitive impairment (PSCI) refers to the cognitive impairment that occurs within a period after a stroke event and is a subtype of vascular cognitive impairment that includes the subgroups of post-stroke no dementia and post-stroke dementia [1, 2]. Huang et al. summarized previous studies and found that the incidence of PSCI was 38%, with an 18.4% incidence of post-stroke dementia one year after stroke [3]. Additionally, a meta-analysis suggested that one-third of survivors are affected by PSCI [4] and that it is possibly underestimated [5]. The incidence of PSCI, especially post-stroke dementia, increases significantly with time [1, 6]. At present, it is believed that a definite PSCI diagnosis should be made six months after a stroke, before establishing a rehabilitation scheme [1]. With increasing stroke incidence, PSCI has gradually become a serious public health concern. Compared with the age- and sex-matched population, severe and mild stroke patients have advanced impairment of cognition by 25 years and at least 4 years, respectively [7], thus enduring a seriously reduced quality of life.
Cognitive impairment influences cognitive function as well as motor function and stroke recurrence. Lin et al. found that PSCI may directly affect limb training owing to poor cognition [8]. Aphasia is a PSCI dysfunction, usually associated with dominant hemisphere lesions and limb apraxia [9]. Kwon et al. showed that PSCI is a high-risk factor for ischemic stroke recurrence [10], and Oksalaet al. found that PSCI is an important influencing factor for long-term low survival rates after ischemic stroke [11].
Two research groups performed lesion-symptom mapping studies, and Weaver et al. found a strong association of the left frontotemporal lobe, left thalamus, and right parietal lobe with PSCI [12], and Zhao et al. [13] identified the left angular gyrus, left basal ganglia, and white matter around the left basal ganglia as strategic structures for global cognitive impairment after stroke. Additionally, age [14–16], female sex [4, 17], educational level [4, 16], temporal lobe atrophy [4, 7], family history of dementia [4], previous stroke or TIA [18], multiple infarcts, diabetes [15], a trial fibrillation [4, 18], hypertension/blood pressure variability [19], social support [16], and global cerebral atrophy [7] were highly correlated with PSCI. The risk of PSCI could increase in the decades following a stroke [20, 21]. The PSCI risk factors are not sustained during the rehabilitation process, and it is imperative to refine the understanding of PSCI risk factors and translate these findings to improved outcomes [21].
Few studies have focused on the degree of influence of risk factors or used the Wechsler Adult Intelligence Score-Revised for China (WAIS-RC), which maybe an adequate neuropsychological measurement of cognition after stroke [22] and is considered more of a proxy measure of brain reserve than the Montreal Cognitive Assessment [23].
The degree of influence differs for PSCI factors, and the varying degrees to which each factor relates to PSCI should be confirmed for further research on controlling variables and matching demographic information. However, only a few studies have addressed this issue. This study adopted the WAIS-RC score as the dependent variable and used four models based on two data types, including a continuous regression model and a classification model, to explore and quantify the factors affecting cognitive function.
MATERIALS AND METHODS
Data source
This study retrospectively collected the data of post-stroke patients at the China Rehabilitation Research Center from November 2018 to September 2022. This study was approved by the Medical Ethics Committee of China Rehabilitation Research Center (2021-146-1).
Participants
According to the International Statistical Classification of Diseases and Related Health Problems-Tenth Edition (ICD-10), the diagnoses of patients in this study included Sequelae of subarachnoid hemorrhage (I69.0), Sequelae of intracerebral hemorrhage (I69.1), Sequelae of other nontraumatic intracranial hemorrhage (I69.2), Sequelae of cerebral infarction (I69.3), Sequelae of stroke, not specified as hemorrhage or infarction (I69.4), and Sequelae of other and unspecified cerebrovascular diseases (I69.8).
In this study, the inclusion criteria were as follows: 1) aged 20–75 years and a disease course of at least 2 weeks; 2) a definitive diagnosis of stroke; 3) complete availability of required data; and 4) completion of the WAIS-RC within 10 days of hospitalization. The exclusion criteria were as follows: 1) those with disorders of consciousness and 2) those who did not cooperate for other reasons. We identified 270 patients based on the inclusion and exclusion criteria.
Verbal informed consent was obtained before the interview.
Predictors and outcome
The indicators related to cognitive function after stroke included three sections (overall 18 indicators): demographic information, underlying disease, and stroke-related indicators (Supplementary File 1, Part 1).
The outcome indicator was the aggregate score (also called intelligence quotient, IQ) assessed using the WAIS-RC, comprising two dimensions: verbal intelligence quotient and performance intelligence quotient (Supplementary File 1, Part 2). A therapist with experience of >3 years assessed all patients according to the handbook using the WAIS-RC in a fixed office.
Statistical analysis
Data scoring
Age, disease course, and education were transferred to categorical data. Age ranged from 1–3 for 18–44 years, 45–60 years, and >60 years. Education was categorized into three groups, ranging from 1–3 (<12 years, 12–16 years, and >16 years); similarly, disease course ranged from 1–5 for <1 month, 1–3 months, 3–6 months, 6–12 months, and >12 months.
For logistic regression (LR) and XGBoost models (XGB), WAIS score transferred to three categories of > =90, 70–90, and <70, respectively defined as normal, lower than normal, and cognitive disorder levels. The data transformation rules are listed in Supplementary File 2.
Correlation analysis
A correlation analysis between every factor and the WAIS score was first performed for better modeling. All independent variables were categorical, and indicators with a significant correlation (p < 0.05) with the WAIS scores were included in the model using Spearman correlation analysis.
Model construction
In this study, four models were created to determine the degree of contribution of each factor to cognitive function, including a multiple linear regression model (MLR), genetic algorithm modified Back-Propagation neural network (GA-BP), LR, and XGB models. The first two models require a continuous dependent variable, while the other two require categorical variables. The modeling methods are described in Supplementary File 2.
To further explore the interaction of factors, we analyzed the mediating effect, in which we set dysfunctions as mediating variables, other factors as independent variables, and the WAIS IQ as the dependent variable.
SPSS (version 26.0) was used to construct the MLR and LR, MATLAB 2021b to build the GA-BP neural network, Python 3.11 to construct the XGB model, and Amos 26.0 to perform the mediating effect analysis.
RESULTS
Participants
Of the 496 stroke participants, 270 participants who met the inclusion and exclusion criteria were enrolled (Fig. 1). Demographic information, underlying diseases, and complications of 270 patients are listed in Table 1.

Research data source and population. We collected data from 496 patients who underwent the WAIS, of whom 294 were post-stroke patients. Demographic or other required information was missing in 17 patients, and seven patients were out of the age range. A total of 270 patients were enrolled in this study.
Information of 270 patients and correlation coefficients of all factors to WAIS score
aPosterior circulation: brain stem, cerebellum, and occipital lobe. bDiffuse lesions: including subarachnoid hemorrhage and other diffuse lesions. *p < 0.05.
Correlation analysis
After correlation analysis, seven factors (education, lesion side, aphasia, frontal lobe, temporal lobe, diffuse lesions, and disease course) significantly correlated with the IQ score (Table 1) and were included in the models.
MLR
After correlation analysis, the independent variable in the MLR included seven items, and the dependent variable was the WAIS score. The homogeneity of variance between independent and dependent variables was verified. The observations were independent of each other (Durbin-Watson = 1.813). The regression tolerances were all greater than 0.1 with no multicollinearity. The regression model was statistically significant (F = 12.053, p < 0.01, adjusted R2 = 0.223). Of the seven independent variables included in the model, the effects of education, lesion side, aphasia, diffuse lesions, and frontal lobe on the WAIS score were statistically significant (p < 0.05) (Table 2).
Correlation coefficient of variables to WAIS score in MLR model
**p < 0.05, *p < 0.1.
GA-BP neural network
Data were included in the model for training with 15 neurons and fixed seed values to obtain the influence degree values of the seven independent variables on cognitive function (Table 3). Education, aphasia, frontal lobe, temporal lobe, and diffuse lesions had the greatest impact on the WAIS score and the weights of each part (Supplementary File 1, Part 3) (Training Model R2 = 0.404).
Importance of seven variables to WAIS score in GA-BP neural network and XGBoost
*p < 0.05.
LR
Two binary LRs in this study were used to assess the effects of seven independent variables on cognitive function. Both models used the normal WAIS score (level 1) as a reference for comparing the contribution of each indicator at levels 2 and 3. In the model comparing levels 1 and 2 (χ2 = 47.168, p < 0.001), education and lesions substantially influenced WAIS scores. In the model comparing levels 1 and 3 (χ2 = 82.155, p < 0.001), education, lesion side, aphasia, and disease course influenced the WAIS scores primarily (Table 4).
OR values of variables
Model 1 correctly classified 79.5% of participants. The sensitivity and specificity were respectively 84.7% and 59.7%, the positive predictive value was 74.2% and the negative predictive values was 74.0%. Model 2 correctly classified 79.5% of the study participants. The sensitivity and specificity were respectively 85.4% and 67.7%, the positive predictive value was 84.0% and the negative predictive value was 70.0%.
XGB
With all the data included in the training model, the importance of the respective variables was obtained from the model, as shown in Table 3. Education, lesion side, aphasia, and diffuse lesions were the variables that most influenced WAIS scores.
Mediating effect
A structural equation was built to analyze the mediating effect, in which aphasia was set as the mediating variable, the WAIS score as the dependent variable, and the other factors as the independent variables (Fig. 2). There are significant indirect and total effects in this equation. To further explore the mediating effects of factors on WAIS, all patients were divided into two groups: mild to moderate cognitive impairment (WAIS> = 70) group (G1) and severe cognitive impairment (WAIS < 70) group (G2). In G1, only a significant total effect was observed in the equation, and the mediating effect of aphasia was unclear. In G2, the mediating effect of aphasia was significant; however, the indirect and total effects were unclear (Table 5 and Supplementary File 1, Part 4).

Structure equation to analyze mediating effect. Mediating variable: aphasia; independent variables: education, lesion side, frontal lobe, temporal lobe, diffuse lesions, and disease course ‘a,’ ‘b,’ and ‘c’ in the equation represent the effects from factors on aphasia or WAIS score. Indirect effect = a * b; total effect = a * b + c; rate = indirect / total effect.
Indirect and total effects of factors on WAIS score
DISCUSSION
Based on the WAIS, this study used four common prediction models to analyze the prognostic factors of PSCI. The factors significantly impacting WAIS scores in the MLR and LR models, those with more than 0.2 impacting points in the GA-BP neural network and more than 0.1 impacting points in the XGB model, were defined as the influencing factors. Finally, the following four factors considerably influenced the outcome: education, stroke side, aphasia, and diffuse lesions. Furthermore, the frontal lobe and disease course significantly influenced the WAIS score. A high education level, lesions in the right hemisphere, short disease course, no aphasia, lesions not in the frontal or temporal lobes, or diffuse lesions are protective factors against cognitive impairment.
Education [24–29], stroke side [13, 31], aphasia [4, 33], and diffuse brain injury [34, 35] were all indicators that significantly affected the outcomes. In addition, frontal and temporal lobe injuries have a greater impact on outcomes [12, 36]. Cognitive reserve has a significant impact on cognitive impairment after stroke, and the cognitive reserve of patients with high educational levels is relatively good [24, 28]. Weaver et al. showed that left frontal-temporal lobe infarction is a significant risk factor for PSCI, demonstrating the influence of laterality and location on cognitive function [12]. Except for the factors above, lesions in the thalamus, hypertension, diabetes, previous stroke, and other relevant medical histories are also risk factors for PSCI [37]. According to this study, some factors may contribute to PSCI; however, no significant differences were found, and further exploration is needed.
In this study, we used multinomial logistic regression models with 3-categorical variables to analyze the predictive factors, but the p-value of the Test of Parallel Lines was 0.002, indicating that the 3-category regression models were not parallel. The influencing factors of different models may differ substantially. Therefore, the authors divided the model into two groups for analysis: normal mild cognitive impairment (level 1 versus level 2) and normal severe cognitive impairment (level 1 versus level 3). Aphasia, closely related to the lesion location and laterality, is a prominent factor in patients with severe cognitive impairment [32, 33]. Aphasia is a functional impairment after stroke, which significantly impacts cognitive function and assessment [4, 33] and thus may have a masking effect. In clinical practice, differentiating between sensory aphasia and cognitive impairment poses a problem for rehabilitation doctors [4, 21]. Therefore, it is suggested that aphasia should be assessed first in patients with obvious aphasia, followed by a cognitive function assessment and evaluation of their interactions.
The mechanisms of occurrence and rehabilitation of PSCI are complex. The PSCI includes attention, memory, visual space, language, and executive control [1, 9]. Each function can realize a cognitive process by closely cooperating with the others, but the mechanism is unclear [9, 21], which is different from other functional disorders, such as motor disorders, with relatively distinct neural pathways. Analyzing the prognosis of cognitive impairment and establishing a predictive model requires further work.
Limitation
This study had several limitations: 1) Although the WAIS-RC scale has a high degree of discrimination and accuracy in assessing overall cognitive function with overall cognitive function as the outcome of this study, it is not perfect in each subtest. For example, the total score has age correction, but each sub-test does not, and the influencing factors of various dimensions of cognition are not well considered, so there may be deviations; it is suggested to select a specific scale for evaluating a cognitive dimension for influencing factor analysis; 2) the sample size of this study is not enough, and subsequent research should be based on an in-depth analysis of large sample size; 3) approximately 10% of patients had cognitive impairment before the first stroke [4], necessitating the inclusion of a healthy control group in the clinical study; 4) Although prognostic factors can be obtained, the contribution of these factors to the WAIS score is low (R2 = 0.223 in the MLR model). We believe more influential factors for cognitive function prognosis exist, such as the disease severity. Pendlebury et al. confirmed that the incidence of dementia in patients with severe stroke (NIHSS score > 10) was significantly higher than that in patients with mild stroke (NIHSS score < 3) and TIA one year after onset [7]. In the next step, we need a larger sample size, including evaluation indicators of various dimensions and more detailed patient data to clarify the influencing factors of cognitive impairment.
AUTHOR CONTRIBUTIONS
Wenlong Su (Conceptualization; Investigation; Formal analysis; Writing – Original Draft, Review & Editing; Funding acquisition); Hui Li (Conceptualization; Investigation; Formal analysis; Writing – Review & Editing); Hui Dang (Conceptualization; Investigation; Writing – Review & Editing); Kaiyue Han (Conceptualization; Investigation; Writing –Review & Editing); Jiajie Liu (Conceptualization; Formal analysis; Writing – Review & Editing); Tianhao Liu (Conceptualization; Writing – Original Draft, Review & Editing); Ying Liu (Conceptualization; Supervision; Writing – Review & Editing); Zhiqing Tang (Conceptualization; Writing – Review & Editing); Haitao Lu (Conceptualization; Investigation; Funding acquisition; Writing – Review & Editing); Hao Zhang (Conceptualization; Funding acquisition; Supervision).
Footnotes
ACKNOWLEDGMENTS
We would like to thank the Research Program of the China Rehabilitation Research Center, Joint Doctoral Program of the University of Health and Rehabilitation Sciences, Capital Medical University, and Capital’s Funds for Health Improvement and Research. We thank the clinicians and other staff in the Department of Neurological Rehabilitation and Department of Psychology at the China Rehabilitation Research Center for their support with data collection.
FUNDING
This study was funded by the Joint Doctoral Program of the University of Health and Rehabilitation Sciences and Capital Medical University (2020kfdx-010), Research Program of China Rehabilitation Research Center (Grant No. 2021zx-19), and Capital’s Funds for Health Improvement and Research (Grant No. 2020-1-6011).
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
