Abstract
Early diagnosis of children with reading disorders is essential for intervention and academic success. Many children with reading difficulties have a deficit in phonemic awareness. A web-based, group-administered screening vehicle, the Risk Factor Screen for Reading (RiFS-Reading), was developed to quickly identify students who are “At Risk” for reading difficulties by assessing knowledge of phonemic awareness. The goal was to establish the validity of the RiFS-Reading measure. Results supported construct validity through factor analysis and criterion validity by comparisons with performance on well-established measures of phonemic awareness and reading. Phonemic awareness on the RiFS-Reading was independent of other language skills.
Reading and literacy among school-age children is a serious concern in the US, especially since the abrupt school closures in spring 2020 (Domingue et al., 2022). Reading disorders may occur in as many as 1/5 to 2/5 children, many of whom are not diagnosed until second or third grade, wasting valuable time for remediation and instruction (Lyon, 1996, 1998; Siegelman et al., 2022). Successful remediation is much more difficult and costly if intervention is delayed. Therefore, it seems reasonable that early diagnosis of reading disorders could significantly improve children’s lives.
Francis et al. (1996) showed that most school-age children with reading deficits fail to catch up with their peers. A child’s third-grade reading ability can be used to predict overall long-term academic success, as 75% of children with reading disabilities who are not identified before third grade continue to have reading disabilities by ninth grade, and fewer than 2% go on to participate in four-year programs after high school (Hamilton & Glascoe, 2006). Because these children rarely catch up, the consequence of a slow start becomes monumental as the deficits accumulate over time. The effects of failure to acquire early word reading skills range from negative attitudes toward reading to missed opportunities for vocabulary growth and the development of reading comprehension strategies to less practice in reading. The best solution therefore is to allocate resources for early identification of students with deficits in the areas that contribute to reading skills (Torgesen, 1999; Torgesen et al., 2010) and then provide remediation (Ehri et al., 2001).
The most common cause of early reading difficulties has been found to be weaknesses in knowledge of letters and letter sounds and phonological processing, including phonemic awareness (Catts et al., 2002; Milankov et al., 2021; Torgesen, 1999; Torgesen, et al., 2010). Torgeson recommended measuring phonemic awareness based on performance in three categories: sound comparison, phoneme segmentation, and phoneme blending (Torgesen, 1999; Torgesen, et al., 2010). He and his colleagues helped develop a comprehensive measure to assess these skills, namely, the Comprehensive Test of Phonological Processing, Second Edition (CTOPP-2; Wagner et al., 2013). There are also existing screening tests of phonemic awareness, such as the Dynamic Indicators of Basic Early Literacy Skills (DIBELS) (Good et al., 2001), i-Ready (Curriculum Associates, 2019), Measure of Academic Progress (MAP) Assessment (Northwest Evaluation Association, 2016), the Phonological Awareness Skills Test (Kilpatrick, 2016), and the EarlyBird Screener (Gaab & Petscher, 2021); however, they either need to be individually administered, do not fully capture several important constructs, take a long time to administer, and/or are costly.
A novel screening measure has therefore been developed: the Risk Factor Screen for Reading (RiFS-Reading). It differs from other screening tools in that it targets students in kindergarten and first grade and is completely web-based, engaging the students with graphics on a tablet or touch-screen computer. It also can be administered in a small group format, which is significantly more efficient, as teachers and schools often do not have the time or resources to test students individually. Furthermore, results are available almost immediately and provide a cost-effective way to identify children at risk for early reading difficulties. The goal of this research was to determine whether RiFS-Reading is a valid tool to identify students who are at risk for developing reading problems so that they can receive early intervention.
Methods
Objective
The purpose of this study was to examine the reliability and validity of the Risk Factor Screen for Reading (RiFS-Reading) Scale. It was hypothesized that the scale will have (a) construct validity (as measured by item evaluation during exploratory factor analysis), (b) good reliability (as measured by Cronbach’s alpha), (c) convergent validity (as compared to Comprehensive Test of Phonological Processing, Second Edition and Woodcock-Johnson Test of Achievement, Fourth Edition (WJA-IV) Letter-Word Identification test), and (d) discriminant validity (as compared to the Wechsler Individual Achievement Test, Fourth Edition (WIAT-IV) Expressive Vocabulary and Receptive Vocabulary tests).
Procedure
The RiFS-Reading test was administered first followed by the administration of the four tests used to measure validity (i.e., CTOPP-2, WJA Letter-Word Identification, WIAT-IV Receptive Vocabulary, and WIAT-IV Expressive Vocabulary). These measures were administered within four months of the RiFS-Reading, three months of which were during the summer in which no classroom instruction was provided. However, most students received the measures within a month interval of the previous test (M = .94 months, SD = 1.59 months). Parents provided informed consent for their children to participate in the study. This study was overseen by WCG Western IRB. Analyses were conducted using Microsoft Excel and IBM SPSS Statistics, version 29 (IBM Corp, Armonk, NY, USA).
A subset of students (N = 13) were re-administered the RiFS-Reading measure to obtain valid profiles. Four students were re-administered the RiFS-Reading to decrease the time between the RiFS-Reading and the other measures used in this study (i.e., CTOPP-2, WJA-IV Letter-Word Identification, WIAT-IV Receptive Vocabulary, and WIAT-IV Expressive Vocabulary), four students had sections on the RiFS-Reading that “timed out” either due to inattention or internet instability, and five students exhibited behaviors that suggested that they were not attending to the task.
Measures
Risk Factor Screen for Reading Scale
The RiFS-Reading test is conducted on a handheld tablet that is given to each student so they can work independently. The measure was developed for each student to complete individually but can be administered simultaneously to multiple students using touch-screen devices. A small group of students can sit at their desks at the same time with separate tablets. A proctor can walk around to ensure that students are engaged and do not experience technical difficulties. The test was designed to take 10–15 minutes to complete and evaluates students’ understanding of phonics and phonemic awareness. Phonics was measured using two subtests: Letter Recognition, which assesses the student’s knowledge of the letters of the alphabet, and Letter-Sound Association, which assesses the student’s knowledge of the sounds the letters make. The Letter Recognition subtest included 14 items, where the student is prompted to select a specific letter from three options. The Letter-Sound Association subtest included nine items, where the student is prompted to select the letter associated with the sound provided. The letters utilized in the Letter Recognition and Letter-Sound Association subtests were selected based on an analysis of the most frequently utilized letters in words listed in the Concise Oxford Dictionary (Baldick, 1990).
Phonemic awareness can be grouped into three categories: Sound Comparison, Phoneme Segmentation, and Phoneme Blending. Utilizing the deductive method for item development (Boateng et al., 2018), items were generated to correspond to the relevant categories, with items utilizing one-syllable words of no more than three-to-four letters to ensure appropriateness for students as young as five years old. Phonemic awareness was measured using three subtests: (1) Sound Comparison, which assesses the student’s ability to recognize the beginning sound of a word, (2) Phoneme Segmentation, which assesses the ability of the student to break apart a word into its individual sounds, and (3) Phoneme Blending, which assesses the ability of the student to combine sounds into a word. The Sound Comparison subtest included four items, where a picture of an object was presented, and the student was prompted to choose another one-to-two-syllable word that begins with the same sound (e.g., “Find the word that begins with the same sound as pen”). Pictures and sounds were provided for each choice (e.g., van, pot, horse). The Phoneme Segmentation subtest included six items, where the student was presented with the name and picture of an object that consisted of three-to-four letters and one-syllable (e.g., “The word is pot”). The student was then asked to select from three options the group of sounds that represent the word broken down into its component phonemic sounds (e.g., /p/ /ŏ/ /t/, /p/ /ă/ /t/, /p/ /ŭ/ /p/). The Phoneme Blending subtest included six items, where the student was presented with the phonemes that comprise a three-phoneme, one-syllable word (e.g., “What word do these sounds make? /b/ /e/ /d/”). The student was asked to select from three options which word was formed when the phonemes were blended together (e.g., bet, bad, bed).
Two (of the six) items from the Phoneme Segmentation subtest and two (of the six) items from the Phoneme Blending subtest were eliminated prior to conducting the analysis. The last items from these subtests were eliminated due to observations of fatigue as the test progressed as well as occasional technical errors that occurred due to internet and/or programming issues, which occasionally made the test stop or skip those items at random times. The additional items from the Phoneme Segmentation and Phoneme Blending tests were eliminated because some of the choices sounded very similar due to the articulation of the narrator on the test (e.g., pat and pet) and therefore appeared to cause confusion in the students.
On the RiFS-Reading, a composite score was created for Phonics (Letter Recognition and Letter-Sound Association) and for Phonemic Awareness (Sound Comparison, Phoneme Segmentation, and Phoneme Blending). The total number of correctly answered items was summed across each of the relevant subtests for the corresponding composite. A student was identified to be “At Risk” in a given area if their score was less than 75% and “Not At Risk” if their score was 75% or higher. The threshold of 75% was chosen as the cutoff between “At Risk” and “Not At Risk” students to denote below average performance on the test and allow for the possibility of careless mistakes due to the young age of the students.
Comprehensive Test of Phonological Processing Second Edition.
The Phonological Awareness subtests of the Comprehensive Test of Phonological Processing, Second Edition (CTOPP-2; Wagner et al., 2013) were individually administered. This well-validated measure of phonological processing consists of three subtests: Elison, Blending Words, and Sound Matching. The Elison subtest measures the ability to remove phonological segments from spoken words to form other words, the Blending Words subtest measures the ability to synthesize sounds to form words, and the Sound Matching subtest measures the ability to select words with the same initial and final sounds. For students seven years old and older, the Phoneme Isolation test, which measures the child’s ability to isolate individual sounds within words, was used in place of the Sound Matching test, as the test changes at this age to mirror the increased developmental expectations but measures the same underlying skill. The total time to administer all three subtests was approximately 20 minutes.
On the CTOPP-2, scaled scores for each of the subtests (Elison, Blending Words, Sound Matching/Phoneme Isolation) have a mean score of 10 and a standard deviation of 3. The Phonological Awareness Composite score has a mean score of 100 and a standard deviation of 15. The scale authors reported that these subtests’ internal consistency coefficients are .86 or higher, and the Phonological Awareness Composite internal consistency coefficient is .92. Validity of the CTOPP-2 subtests and composites was demonstrated by correlations with other validated measures of phonological awareness. Coefficients range from .47 to .85 for the subtests and from .64 to .78 for the composite (Wagner et al., 2013).
Woodcock-Johnson Tests of Academic Achievement, Fourth Edition Letter-Word Identification Subtest
The Letter-Word Identification subtest of the Woodcock-Johnson Tests of Academic Achievement, Fourth Edition (WJA-IV; Schrank et al., 2014), which measures word identification skills, was individually administered to each participant and took approximately 5 minutes. Initial items require a student to recognize and identify individual letters. Later items require the student to recognize a word from an array and then read words of increasing difficulty in isolation. The WJA-IV Letter-Word Identification was scored based on age and translated into standard scores (M = 100, SD = 15). Letter-Word Identification has a median reliability of .92. Validity of the WJA-IV reading cluster was demonstrated by correlations with other validated measures of reading, including a coefficient of .83 with the Kaufman Test of Academic Achievement, Second Edition Reading Composite and a coefficient of .92 with the Wechsler Individual Achievement Test, Third Edition Basic Reading Composite (Schrank et al., 2014).
Wechsler Individual Achievement Test, Fourth Edition Receptive Vocabulary and Expressive Vocabulary Tests
The Wechsler Individual Achievement Test, Fourth Edition (WIAT-IV; Breaux, 2020) Receptive Vocabulary test is an assessment used to measure comprehension of vocabulary. Students are asked to select the picture from an array of four possibilities that best illustrates the meaning of each word. The Expressive Vocabulary test is a measure used to determine a student’s ability to retrieve words and demonstrate knowledge of vocabulary. Students are presented with a picture, are provided with a definition, and are then asked to say the word that best describes the picture and definition. Assessments were conducted one-on-one and took approximately 5 minutes to administer both subtests. The scale authors reported that internal consistency reliabilities range from .69 to .77 for the Expressive Vocabulary test and .79 to .84 for the Receptive Vocabulary test for the ages tested. Test-retest reliabilities are .84 and .66, respectively (Breaux, 2020).
Data Preparation
Prior to conducting analyses, the data were cleaned per standard procedures. Data were examined using descriptive statistics and histograms to assess for skewness, kurtosis, and outliers. All values were within normal range. Scales did not show evidence of abnormal skewness or kurtosis. Five students had missing values on the CTOPP-2 and/or WIAT-IV Expressive and Receptive Vocabulary scores, so the values were deleted pairwise in the analyses.
Data Analysis Plan
Frequencies of gender, ethnicity, and languages spoken at home were calculated in addition to the mean and standard deviation age in months of the participants in the study. An exploratory factor analysis was then conducted to determine how many factors described the skills measured in the RiFS-Reading screening tool and which items within each factor were similar to each other. The reliability of the measure was assessed using Cronbach’s alpha. Convergent validity was examined by conducting one-way ANOVAs between the “At Risk” and “Not At Risk” groups on the RiFS-Reading and scores on corresponding subtests of a measure of phonemic awareness (i.e., CTOPP-2 Phonological Processing) and a measure of reading (i.e., WJA-IV Letter-Word Identification). Discriminant validity was then examined by comparing scores on the RiFS-Reading to scores on measures of expressive and receptive vocabulary. (See pages 9 through 14 for analyses of reliability and construct, convergent, and discriminant validity). Use of the threshold of p ≤ .05 was employed to determine significance of results. Partial eta-squared values were considered to be small at the .01 level, medium at the .06 level, and large at the .14 level.
Results
Participants
Demographic characteristics.
Scale Validation
Construct Validity
Exploratory Factor Analysis
Initially, the factorability of the correlation matrix of the 35 items of the RiFS-Reading was examined (14 Letter Identification, 9 Letter-Sound Association, 4 Sound Comparison, 4 Phoneme Segmentation, and 4 Phoneme Blending). The exploratory factor analysis revealed that the matrix was non-positive definite such that not all of the eigenvalues were positive. A closer examination revealed that very few of the students were categorized as “At Risk” on the Letter Identification and Letter-Sound Association subscales (two and three, respectively). There was therefore very little variability in the students’ performance, which may have contributed to this characteristic of the matrix. Therefore, the Letter Identification and Letter-Sound Association items were excluded from further analysis.
Pattern Matrix for Exploratory Factor Analysis using Promax Method of Rotation.
Note. Words in parentheses indicate the stimulus items.
Note. h2 refers to the communality of each item.
***p < .001.

Scree plot for exploratory factor analysis.
Intercorrelations between RiFS-Reading Subscales
Intercorrelations between the performance on each of the RiFS-Reading subscales.
Note. N = 50.
*p < .05.
**p < .001.
Reliability
Internal Consistency Indices
The internal consistency of the RiFS-Reading subscales and its subscales were calculated with Cronbach’s alpha coefficients. The internal consistency values were alpha = .82 for the total scale, alpha = .69 for the Sound Comparison subscale, alpha = .76 for the Phoneme Segmenting subscale, and alpha = .80 for the Phoneme Blending subscale, indicating adequate internal consistencies.
Convergent Validity
Comparison to CTOPP-2 Phonological Processing Skills
Mean CTOPP-2 Composite and subtest scores (Mean for Composite = 100, SD = 15; Mean for Subscale = 10, SD = 3) for students “Not At Risk” and “At Risk” on the RiFS-Reading Composite and subscales.
Note. N = 46; CTOPP-2 = Comprehensive Test of Phonological Processing, Second Edition.
**p < .01.
***p ≤ .001.
Simple effects to examine the subscale components of these composite scores were then conducted to determine whether the subscales were measuring similar constructs based on the theoretical underpinnings of the scales. A series of three one-way ANOVAs were conducted to determine if students who were identified as being “At Risk” and “Not At Risk” on the individual subscales of the RiFS-Reading (Phoneme Segmentation, Sound Comparison, and Phoneme Blending) had significantly different scores on their corresponding CTOPP-2 subscales (Elison, Sound Matching/Phoneme Isolation, Blending Words).
The first of these one-way ANOVA tests was conducted to determine if there was a difference in CTOPP-2 Elison scores between students who were identified as being “At Risk” and “Not At Risk” for the Phoneme Segmentation subtest on the RiFS-Reading. Results revealed that there was a statistically significant difference in mean exam scores between the “At Risk” and “Not At Risk” groups. Students who were “At Risk” on the RiFS-Reading (M = 8.27, SD = 1.68) had significantly lower CTOPP-2 Elison scores than students who were “Not At Risk” (M = 10.71, SD = 2.44) (F(1, 45) = 9.49, p = .004). The partial eta-squared was .18 and represents a large effect size.
A second one-way ANOVA test was conducted to determine if there was a difference in CTOPP-2 Sound Matching/Phoneme Isolation scores between students who were identified as being “At Risk” and “Not At Risk” for Sound Comparison on the RiFS-Reading. Results revealed that there was a statistically significant difference in mean exam scores between the “At Risk” and “Not At Risk” groups, whereby students who were “At Risk” on the RiFS-Reading (M = 8.71, SD = 1.38) had significantly lower CTOPP-2 Sound Matching/Phoneme Isolation scores than students who were “Not At Risk” (M = 11.10, SD = 2.46) (F(1, 45) = 6.13, p = .017). The partial eta-squared was .12 and represents a medium effect size.
A third one-way ANOVA test was conducted to determine if there was a difference in CTOPP-2 Blending Words scores between students who were identified as being “At Risk” and “Not At Risk” for Phoneme Blending on the RiFS-Reading. Results revealed that there was not a statistically significant difference in mean exam scores between the “At Risk” (M = 9.45, SD = 3.67) and “Not At Risk” (M = 11.20, SD = 2.64) groups (F(1, 45) = 3.01, p = .064). The partial eta squared was .06 and represents a medium effect size.
Comparison to Reading Skills
A one-way ANOVA test was conducted to determine if performance on the Letter-Word Identification test of the WJA-IV differed between the RiFS-Reading Phonemic Awareness Composite “At Risk” and “Not At Risk” groups. Results revealed a statistically significant difference in mean reading scores between students who were identified as “At Risk” (M = 83.81, SD = 11.67) and “Not At Risk” (M = 97.62, SD = 17.21) using the RiFS-Reading Phonological Awareness Composite score (F(1, 44) = 6.11, p = .017). The partial eta-squared was .12 and represents a medium effect size.
Discriminant Validity
Comparison to Vocabulary Skills
One-way ANOVA tests were conducted to determine if performance on the WIAT-IV Receptive and Expressive Vocabulary tests differed between the RiFS-Reading Phonemic Awareness Composite “At Risk” and “Not At Risk” groups. Results revealed that there was not a significant difference between the receptive vocabulary scores of the RiFS-Reading “At Risk” group (M = 98.46, SD = 9.80) and “Not At Risk” group (M = 99.38, SD = 12.63) (F(1, 43) = .054, p = .82) nor between the expressive vocabulary scores of the RiFS-Reading “At Risk” group (M = 101.07, SD = 7.93) and “Not At Risk” group (M = 103.34, SD = 10.43) (F(1, 43) = .494, p = .49).
Discussion
Research has shown the importance of phonemic awareness in the development of early reading skills (Milankov et al., 2021). Children who possess weak phonemic awareness have trouble accurately sounding out words and have been shown to have trouble catching up if not addressed by third grade (Hernandez, 2011). The purpose of this study was to develop and validate the Risk Factor Screen for Reading (RiFS-Reading) as a tool to quickly identify students who are at risk for early reading difficulties. This differs from other assessments of phonemic awareness because it is web-based, can be group-administered in a short period of time, requires minimal training for the assessors, and provides electronically scored results immediately after completion.
To measure construct validity, exploratory factor analysis was conducted. This revealed a three-factor structure. This corresponds with the three factors considered to be the most important components of phonemic awareness as described by Torgeson (1999). Results revealed that this model accounts for over 60% of the variance, with items loading onto the factors corresponding with their respective subtests at levels of .69–.88, with the exception of two, one of which was .54 and the other which was .41. Although there were two items that cross-loaded onto other factors, their primary loadings were on their intended factors. Furthermore, results showed the RiFS-Reading had adequate reliability as demonstrated by tests of internal consistency of each of the subtests and the total RiFS-Reading Phonemic Awareness Composite scores that revealed alphas between .69 and .82. It is important to note that although many studies have found phonemic awareness tasks to be part of one unified construct, many others (e.g., David et al., 2022; Høien et al., 1995) suggest that the skill involves slightly different abilities, an idea supported by this study.
In order to assess the convergent validity of the RiFS-Reading, students’ scores on the RiFS-Reading were compared to their performance on the CTOPP-2, a well-validated, reliable measure of phonemic awareness. Students who were identified as “At Risk” on the RiFS-Reading were found to score significantly lower on the Phonological Processing Composite and subtests of the CTOPP-2 with the exception of one (i.e., Blending Words), which trended towards statistical significance (p = .06). The CTOPP-2 Composite scores of students who were identified as “At Risk” on the RiFS-Reading fell around the 30th percentile, which is suggestive of weak skills for their age group. In addition, students who were identified as “At Risk” on the RiFS-Reading scored significantly lower than students who were “Not At Risk” and approximately at the 14th percentile on the WJA-IV Letter-Word Identification test, a well-validated measure of reading. This indicates that the RiFS-Reading is predictive of phonemic awareness abilities as well as reading skills. This is consistent with previous research, which has shown that phonemic awareness is highly related to early reading abilities (Wagner & Torgesen, 1987). The combination of these results supports the convergent validity of the RiFS-Reading.
To ensure that the RiFS-Reading is a measure of phonemic awareness and not other language skills, scores on the RiFS-Reading were compared to scores on the WIAT-IV tests of expressive and receptive vocabulary. Results showed that the vocabulary scores of the “At Risk” and “Not At Risk” groups on the RiFS-Reading were not significantly different, suggesting that phonemic awareness as measured by the RiFS-Reading is a distinct factor in the development of reading skills.
Limitations of the Study
While the RiFS-Reading proved successful in quickly predicting the phonemic awareness of students, it is a screening tool that has some limitations. As RiFS-Reading is meant to be administered in 10–15 minutes, the phonemic awareness sections only include four questions, with two incorrect answers yielding an “At Risk” score. Also, as the participants were almost all from upper-middle class areas, nearly all were able to identify the letters and letter sounds, making it difficult to assess the efficacy of the Letter Recognition and Letter-Sound subtests due to limited variability in the students’ performance. It would therefore be beneficial to examine the validity of the Phonics subtests of the RiFS-Reading among students towards the beginning or middle of kindergarten and among students from a variety of backgrounds and educational experiences, as there is likely to be more variability in these groups to evaluate these factors as predictors of early reading success. Future research should also determine the utility of using the RiFS-Reading with students from more diverse and lower socioeconomic backgrounds, as they may have less experience with tablets, potentially making the navigation of the RiFS-Reading more difficult.
Other limitations include the sometimes unreliable WiFi access at schools. Also, while the RiFS-Reading allows for limited proctor training, some training is necessary for administrators to ensure that they are able to address any technical malfunctions. The length of the current test may also cause fatigue in students. This was evident in the Phoneme Blending subtest of the RiFS-Reading where the “At Risk” and “Not At Risk” groups did not have significantly different scores on the CTOPP-2 Blending Words subtest (although the result trended towards significance). As it is the last section, the Phoneme Blending subtest may be less reliable than the other subtests, as young students may be fatigued by this time, and without one-on-one administration, it can be difficult to monitor students’ attention to ensure they perform their best.
Conclusion
In summary, these results show that the RiFS-Reading is a reliable and valid assessment that can be used to identify students who are at risk for early reading problems. Students who exhibit reading problems in kindergarten and first grade run a high risk of never catching up to their peers. Educators acknowledge that deficits in phonics and phonemic awareness are important contributors to students’ reading performance; unfortunately, many schools do not screen for those skills in a meaningful way due to training requirements, the need for one-on-one administration, administration time, long wait times for results, and cost. Those considerations were central in the development of RiFS-Reading. Educators will be able to capitalize on the RiFS-Reading screener results to target students who need more practice in phonemic awareness (even 10–15 minutes three times a week for the entire school year), making the screening effort highly valuable in the context of early education.
Supplemental Material
Supplemental Material - Validation of the Risk Factor Screen for Reading (RiFS-Reading) Screening Tool for the Early Identification of Reading Problems
Supplemental Material for Validation of the Risk Factor Screen for Reading Screening Tool for the Early Identification of Reading Problems by Bennett Kuttler, and Elliot G. Levy in Journal of Education
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
