Abstract
Standardized testing is an integral part of the English and American education systems. However, the use of high-stakes testing has unintended consequences, one of which is test anxiety. Over the last 50 years, increased attention has been directed to developing tools to identify students experiencing test anxiety. However, many test anxiety instruments have been created for research purposes rather than use within school decision-making and lack evidence for interpretation. The purpose of the current study was to support the use of the Multidimensional Test Anxiety Scale (MTAS) in applied settings by using a latent profile analysis to identify respondent groups to support score interpretation. Participants included 918 secondary students in England.
Introduction
Academic anxiety is a comprehensive term for the various types of anxieties that students may experience in the school environment (Cassady, 2010). Test anxiety is one such academic anxiety and is defined as the changes in emotion and physiology resulting from an individual’s perception of the consequences of a test or exam (Zeidner, 1998). Test anxiety occurs when an individual appraises an evaluative situation (i.e., test) as threatening and is often accompanied by worry and heightened physiological reactions. Students who experience test anxiety may perform poorly in exams due in part to the disruptive nature of test anxiety, such as devoting cognitive resources to non–test-related tasks (Angelidis et al., 2019). Test anxiety has been associated with poor academic performance since research into the subject began in the 1950s (Sarason & Mandler, 1952). A meta-analysis conducted by von der Embse et al. (2018) found a negative relationship between test anxiety and exam performance, grade point average (GPA), and standardized tests. Increases in test anxiety were associated with decreased scores across all three (von der Embse et al., 2018).
In America and the United Kingdom, standardized tests, like the National Curriculum Tests and the Florida State Assessments, are used to evaluate students, teachers, schools, and school districts; thus, students are assessed at several points throughout their education. The scores on the standardized tests are used to determine whether students attain a high school diploma and can access post-secondary education (Segool et al., 2014), and they are often used to support or make educational decisions, like retention. A student may experience greater test anxiety from these standardized assessments than from typical classroom exams because of the higher stakes associated with them (Segool et al., 2013).
Given the prominent role of standardized assessments in evaluating student academic performance and facilitating further educational attainment (i.e., university admission), it becomes essential for schools to identify students who may need additional support to reduce the likelihood of high test anxiety. Several test anxiety assessment tools have been developed over the last two decades, including the most widely used assessment, the Test Anxiety Index (TAI: Spielberger, 1980). However, there are several limitations to these tools. First, many test anxiety instruments were created for research purposes without the necessary evidence for use in applied decision-making situations. More specifically, evidence for interpretation (Kane, 2013) is an important consideration in developing an assessment such that assessment scores can meaningfully differentiate between groups. A second limitation is that several frequently used test anxiety scales are not reflective of modern advancements in theoretical conceptualizations of test anxiety.
The Multidimensional Test Anxiety Scale (MTAS; Putwain et al., 2020) was created to address the aforementioned limitations of several of the most frequently used test anxiety scales. The MTAS components included the cognitive aspects of worrying, thoughts around failure, and cognitive interference, and the autonomic aspects of physiological indicators and tension (Lowe et al., 2008; Zeidner & Matthews, 2005). To date, MTAS research has examined the factor structure and relation to distal academic performance and similar student report assessments (Putwain et al., 2020). Additionally, MTAS research has used a new sample to confirm the factor structure, evaluate measurement invariance, determine the internal consistency of the factors, and identify cut scores (von der Embse, Kim, et al., 2021). The present study sought to add evidence for the applied use of the MTAS by determining respondent groups via a latent profile analysis.
There are various methods used to determine cut scores or standards for assessments. The two main types that these methods fall under are person-centered analyses or variable-centered analyses. In a previous study, cut scores of 58 and 60 in MTAS Total score were created for the MTAS compared to a panic and anxiety scale using ROC (von der Embse, Putwain, Francis, et al., 2021). The main advantage of ROC is the fact that it reports the changes in specificity and sensitivity across different cut of scores. However, the freedom to determine thresholds and cut scores is also a weakness. The level of specificity and sensitivity is dependent on the researcher and is thus subjective. Additionally, as a variable-centered analysis, it assumes that the sample is homogenous and focuses on the variables.
This study examined classification thresholds through a latent profile analysis, which is a person-centered analysis based on the principle that responses to items in the assessment form distinct and mutually exclusive subgroups called latent profiles. This method can be used to determine both the number of classification groups formed through the assessment as well as the standards for inclusion in each of these classifications. Thus, it will expand previous research to increase the usability of the MTAS.
Methods
There were 918 students included from an existing dataset, including 217 who self-identified as male and 694 who self-identified as female, while seven declined to disclose their gender. All participants were secondary school students from eight schools in the United Kingdom, two of which were girls’ schools, which may account for the relatively higher number of females in the sample. The mean age of this sample was 15.76 years old. The grades in this sample were between Year 10 and Year 13. The racial/ethnic breakdown of this sample was 3% Asian, 5% Black, 87% White, and 2% multiracial. Additionally, 15% of the sample were eligible for free school meals. There were no missing data in the dataset.
Measures
MTAS
The Multidimensional Test Anxiety Scale (MTAS: Putwain et al., 2020) was developed to measure the multiple components of test anxiety. Factors include cognitive interference, worry, physiological indicators, and tension. The MTAS consists of 16 items and uses a Likert scale from 1 “Strongly Disagree” to 5 “Strongly Agree” with 4 items for each of the four constructs listed above. The MTAS has supportive evidence for internal consistency, factorial validity, predictive validity, measurement invariance, and the creation of cut scores (Putwain et al., 2020; von der Embse et al., 2020). The MTAS has positive relationships with student mental health (rs = .13 to .46) and negative relationships with academic performance (rs = .01 to .41) and well-being (rs = .01 to .41; Putwain et al., 2020).
GAD
Generalized Anxiety Disorder (GAD) was measured using the six-item Generalized Anxiety subscale from the Revised Children’s Anxiety and Depression Scale (RCADS: Chorpita et al., 2005). The subscale consisted of a 4-point scale from 0 (Never) to 3 (Always) whereby a higher score indicated greater anxiety. Subscales from the RCAD have good psychometric properties including convergent validity (rs = .52 to .74 for Generalized Anxiety) and internal consistency (αs = .84 to .88 for GAD) (Donnelly et al., 2018).
Data Collection Procedure
Data were collected from eight secondary schools. Once IRB approval was obtained, the eight schools were enrolled in the study, and consent was solicited from the principal, students, and parents of students under 18 and considered minors. The MTAS was then administered during a “free” period in the students’ timetables when instruction was not being provided. The teachers who administered the MTAS followed a script whereby they informed the students that they were not being tested, but to fill out the questionnaire honestly.
Data Analysis
Reliability
The internal consistency of the four factors of the scale was examined. Cronbach’s alpha and McDonald’s omega were calculated for the different factors as well as the overall scale, with values between .7 and .9, indicating acceptable to exceptional internal consistency (Tavakol & Dennick, 2011).
Latent Profile Analysis
LPA was used to identify patterns of risk among the four factors, cognitive interference (CI), worry (W), physiological indicators (PI) and tension (T), and overall test anxiety (OTA). Based on the literature, the authors predicted that the classifications would be low, moderate, and high (von der Embse et al., 2014). However, to ensure that the ideal number of profiles was selected for the model, a two-profile model was run first, followed by a three-profile, a four-profile, and a five-profile model. The analysis stopped at a five-profile model, given the potential challenges in the interpretation and usability of six or greater profiles. Once a model was chosen, we used a multinomial logistic regression procedure to examine the association between the profile membership and measures of General Anxiety Disorder (GAD), using the R3STEP in Mplus 8.1 (Muthén & Muthén, 2017). Examination of the association between the profile membership to GAD was completed to provide validity evidence to the cluster solutions (3-step approach).
Results
Descriptive Statistics for MTAS Items.
Descriptive Fit Statistics for LPA Models.
The interpretation of the LPA data included fit statistics along with the guiding test anxiety theory and the ultimate goal of identifying profiles that will aid in the interpretation of MTAS scores. The four-profile model exhibited better fit (AIC = 17,925.013, BIC = 18,035.924, sample-adjusted BIC = 17,962.879, LMR-LRT = 186.953, p = .0429, adjusted LMR-LRT = 181.628, p = .0458, bootstrap LRT = 186.953, p = .000). While the two-profile model had the best entropy, a review of the totality of fit statistics suggested the four-profile model to be selected.
Mean Subscale Scores and Sample Percentages for Latent Profiles. Each of the Subscale Scores was out of a Total Score of 20 and each of the Total Scores were out of a Total Score of 80.
The multinomial logistic regression was run with profile 3 (high test anxiety) as the reference group. The unstandardized regression coefficients for the GAD indicated that the students with lower GAD symptoms were more likely to belong to the low test anxiety group (b = −0.578, SE = .09, p < .001) and the average test anxiety group (b = −0.251, SE = .07, p < .001) than the high test anxiety group.
Discussion
Test anxiety has been linked with a number of negative outcomes, including low student performance in high-stakes tests (von der Embse et al., 2018), with significant downstream consequences such as grade retention or denial of university admission. There are a number of standardized tests that students take throughout their education, like the National Curriculum Tests, and these test scores are used to evaluate students’ academic performance as well as the effectiveness of teachers and schools (Segool et al., 2014). Test anxiety impacts how well students do on these exams, so identifying and addressing test anxiety should be a high priority in schools. However, test anxiety must first be reliably identified to facilitate early intervention support. Test anxiety assessment tools are one of the most systematic ways to identify students experiencing test anxiety. The MTAS was developed to address limitations of existing tools; however, additional research was needed to support use in school settings.
Interpretation and use include all the steps between administering an assessment and using its results to make decisions (Kane, 2013). The primary aim of the present study was to increase the usability of the MTAS in schools and other practical settings and provide guidelines for the interpretation of scores. Previous MTAS research identified cut scores using a variable-centered analytical approach with an external criterion (Putwain et al., 2020). This study took a different approach via LPA, which is a person-centered approach. An LPA offers a unique benefit by identifying clusters of responses with common attributes and based on theory (Laursen & Hoff, 2006).
Results indicated that a four-profile model was consistent with the fit statistics and test anxiety theory. Upon examination of the four-profile model, the following descriptions were used including low test anxiety (average total score = 34/80), average test anxiety (average total score = 49/80), above average test anxiety (average total score = 59/80), and high test anxiety (average total score = 71/80). In the 3-step analysis, higher GAD symptoms were associated with a higher likelihood of belonging to the high test anxiety group, compared to the low test anxiety and average test anxiety groups. These four categories may be useful for schools and other stakeholders using the MTAS. The four-profile model may allow school personnel to prioritize service for those students experiencing the high and above average levels of test anxiety. Approximately 20% of the sample endorsed high levels of anxiety which may indicate a substantial number of students in need of intervention support. These students experiencing high levels of test anxiety would likely benefit from individualized support. Additionally, many students (43%) endorsed above average levels of test anxiety and could potentially benefit from brief, small group interventions.
While there are strengths to the current study, there are some limitations. First, the present study utilized an existing dataset. This prevented modifications to how these data were collected, the diversity of the sample, and what types of data were collected. A second limitation included the representativeness of the sample, which may limit the generalization of results. Additional research is needed with a more diverse population of students from across different cultures and countries to examine the measurement invariance of the MTAS. Such a study would increase the population of students for whom the MTAS can be employed. Further research could also examine the number of latent profiles that emerge with different samples of students. It will be important to evaluate the membership within these profiles as potentially and differentially predictive of important distal outcomes such as academic achievement.
Test anxiety is often comorbid with learning disorders and ADHD. Further research is needed to understand the relationship between test anxiety and these disorders and how they may interact with each other. Students with learning disorders and ADHD may already receive support in the school setting, but if they are also experiencing test anxiety, additional intervention may be warranted. Future research should address treatment types and intensity to support the direction of limited resources. The four-profile classification presented in this study separated scores into low, average, above average, and high test anxiety. More research needs to be completed to investigate treatment for test anxiety across these different profiles. Lastly, research could be conducted to evaluate the stability of the profiles and how students may change over time through latent transition analysis. These further analyses would add support to the identified profiles and give additional information to users of the MTAS on the needs of students identified with higher levels of test anxiety. In an educational landscape that is increasingly reliant upon standardized testing for decision-making purposes, understanding test anxiety and its impacts on test performance is crucial. The further development and validation of test anxiety assessment tools will be important to increase usability and, ultimately, decision-making.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
