Abstract
BACKGROUND:
As older adults prefer to live in their homes, it is of paramount importance to examine the adaptation of the environment to the older adults’ capabilities.
OBJECTIVE:
The aim of this study was to develop and evaluate psychometric properties of a scale to measure physical environment problems and barriers in older adults’ homes using an ergonomic approach in Iran.
METHODS:
This mixed-method investigation was conducted in two stages in Yazd, Iran. The primary 71-item version of the questionnaire was developed according to qualitative findings and a thorough review of the literature. Then the psychometric characteristics, including face, content, construct validity, were assessed. Content validity was also assessed using CVI and CVR. Finally, its reliability and construct validity were confirmed by composite reliability (CR), Fornell-Larker matrix, and confirmatory factor analysis (CFA) using Smart PLS software version 3.
RESULTS:
The face validity of the developed scale was acceptable, and the mean scores of CVI and CVR were 0.78 and 0.84, respectively. The preliminary draft of the scale was categorized into seven dimensions. Factor validity and reliability were confirmed by acceptable factor loadings, and desirable realms of composite reliability (>0.7) average variance extracted (>0.5). The cross-loading method and the Fornell-Larker matrix were used to evaluate the divergent validity of the scale, and the results confirmed its acceptable fit.
CONCLUSIONS:
The findings indicated that reliability reached acceptable values, and different aspects of validity were almost confirmed. Accordingly, the questionnaire was to measure physical environment problems and barriers in older adults’ homes; however, it requires further validation for future use in other contexts.
Introduction
Older adults prefer to spend their time at home [1]. In this regard, their tendency to live in their homes with their families and communities is called “aging in place” [2–4]. The home environment plays a vital role in their autonomy, health status, and independence [5]; however, the main problem is that their homes are not designed based on their needs and abilities, and older adult population cannot immediately meet their needs due to their disabilities and limitations [6]. In the aging process, various organs of the body degrade [7]. This degradation can affect the performance and work activities of the older adults [8–10]. Studies have also revealed that more than half of older adults have difficulty doing daily routine activities such as bathing and dressing, and that the rate of disability increases in older age groups [11, 12]. Given that the existing houses are often not suitable for older adults and the disabled, some improvements may reduce their dependence on daily activities, reduce the risk of accidents, and enable them to spend more hours safely and independently at home [13].
Environment adaptation significantly affects a person’s quality of life (QOL), spatial attachment, and a sense of well-being and belonging [14]. Regarding older adults, adaptation is the structural modification of the physical environment to remove barriers so that older adults can access the facilities of their living environment without others’ assistance [15]. One of the main goals of ergonomics is environment adaptation for humans by designing the environment properly and paying attention to human capabilities [16, 17].
Appropriate scales can be used to investigate the problems and barriers of the physical environment for older adults. As such, the scales play a critical role in identifying barriers and planning to eliminate them. Various measurement scales have been developed to study the physical environment of houses, including Evaluation of Older People’s Living Environments (EVOLVE) by Lewis et al. in the United Kingdom [17, 18]. This scale examines the characteristics and physical design of older adults’ living environment, the suitability of the living environment, and its relationship with older adults’ QOL [19]. It addresses all living spaces, including entrance, hallway, hall, living room, kitchen, bathroom, and bedroom, as well as assistive technologies used by their residents. This scale can be used to design, evaluate, and relocate older adults’ living environment [17]. One of the disadvantages of this scale is that it contains a large number of items (about 1900 items in 60 pages), and older adults themselves do not complete it. In other words, it is an object-oriented method that includes a detailed checklist. Physical Fitness Questionnaire for Nursing Homes is another scale evaluating, which contains 12 subscales and 70 items [20]. This scale is also an expert-oriented checklist, in which the adaptation is examined without evaluating the disability of the elderly. In another study by Hrovatin et al. in Slovenia, a questionnaire was developed to assess the environment of older adults’ homes ergonomically [21], which disregarded the relationship between older adults’ limitations and living environment.
The existing scales for assessing the living environment of older adults are primarily checklists addressing cultural and environmental conditions. The way older adults interact with the barriers and constraints in the physical home environment is not usually explored; however, the issue is of paramount significance. Accordingly, in this study, a developed scale was used as a basis for identifying the problems and barriers of the physical environment of older adults’ homes regarding the type of limitations and disabilities of older adults. In other words, when the needs of an environment exceed an individual’s capabilities, an unadaptable situation arises. Therefore, the physical, mental, and cognitive limitations may challenge older adults to meet the environmental needs [22]. The problems of older adults in the home environment are rooted in their decreased abilities; hence, classifying barriers and problems of the physical home environment according to the limitations of older adults can be a new approach in scales to delve into the problems and barriers of the physical environment in older adults’ homes.
Given the significance of identifying barriers and problems in the physical environment of older adults’ homes and the lack of appropriate and valid scales in this field, a suitable and reliable scale is necessary. On the other hand, the adjustment and design of measurement scales should be in accordance with the native culture of the community under study. To this end, the present study aimed to develop and evaluate the psychometric properties of a scale to measure physical environment problems and barriers in older adults’ homes by adopting an ergonomic approach.
Materials and methods
Type of study
In the present study, an instrument was developed based on qualitative results and the literature review and was then employed to collect quantitative data. This study was conducted to develop and evaluate the psychometric properties of a scale to measure physical environment problems and barriers in older adults’ homes by adopting an ergonomic approach. The research setting of this study was the homes of older adults residing in Yazd, Iran, in 2020.
Inclusion and exclusion criteria
Inclusion criteria consisted of age above 60 and residence in Yazd. On the other hand, older adults living in nursing homes, including full- and part-time centers, and older adults suffering from a remarkable disability that made them fully dependent on others, were excluded from the present study.
Scale development
The present study was a mixed-methods study using both qualitative and quantitative phases. In the qualitative phases, semi-structured interviews were conducted to ask the older adults residing in their homes in Yazd about the barriers and problems of the physical environment of homes. This qualitative study adopted the conventional content analysis and encompassed 53 participants, including 36 61–94-year-old older adults and 17 24–58-year-old caregivers. Then, the questionnaire items were developed using the data extracted from interviews and the literature review. In the quantitative phase, the psychometric properties of the questionnaire were assessed. Then some items were merged, or some others were modified. The items formed the first draft of the questionnaire. In the next step, the psychometric evaluation of the scale was performed. Face validity, content validity, and construct validity were used to determine the scale’s validity. In this regard, face validity was first examined using both qualitative and quantitative methods.
Face validity
Face validity was investigated both qualitatively and quantitatively. From the qualitative perspective, face validity addresses whether the appearance of the instrument is associated with what it measures. To investigate qualitative face validity, 10 participants (older adults) meeting the inclusion criteria were asked to comment on the items’ level of difficulty, appropriateness, and ambiguity [23, 24].
To determine face validity quantitatively, the item impact was applied. The scale was submitted to 10 older adults, who were asked to select one of the options from a Likert scale to show the significance of each item. The item impact scores > 1.5 were considered appropriate [25]. The impact score of each item was calculated by the following equation [26].
Content validity addresses the relevance of an instrument’s content to the construct it measures [27]. Both quantitative and qualitative methods were used to assess content validity [28]. Qualitative content validity was assessed by experts on qualitative and instrument development, ergonomists and gerontologists [29]. In the qualitative phase, they evaluated wording, grammar, item allocation, and scaling of the questionnaire. In the quantitative phase, the content validity index (CVI) and the content validity ratio (CVR) were calculated. The clarity, simplicity, and relevancy of each item were assessed by CVI. To determine the content validity ratio, 10 experts were asked to review each phrase based on a 3-point Likert scale (namely necessary, useful but not necessary, and not necessary). In this regard, according to the Lawshe Table, phrases whose content validity ratio was≥0.62 were retained [30].
To determine the content validity index, the method proposed by Waltz and Basel and the opinions of 10 experts on a 4-point Likert scale were used to assess the relevance, simplicity, and clarity criteria for each phrase in the questionnaire. According to Waltz and Basel’s method, items with a score > 0.79, 0.70–0.79, and < 0.70 are appropriate, need to be modified and reviewed, and are unacceptable, respectively. After modifying and reviewing the items with CVI between 0.7 and 0.79, they were re-submitted to the experts, and their content validity index was evaluated once more. The items with scores < 0.7 were deleted. In the next step, the questionnaire was calculated based on the mean, ratio, and content validity index [31].
Construct validity
To perform factor analysis by using the PLS (partial least squares regression) method, sampling was performed on 200 eligible older adults. According to some sources, this number is sufficient to perform confirmatory factor analysis (CFA) [32, 33]. The sampling method was multi-stage, including selecting the municipal areas and older adults using a systematic random method. PLS test is based on variance and can be used to evaluate formative (real) items. The test needs to check the reliability and validity of research constructs and scales. The reliability of the test depends on the accuracy of the measurement and its stability, so it implies two different definitions, including the reliability (stability) and consistency of test scores over time. This means that if a test is performed several times on a respondent, the same score will be obtained in all cases. The second meaning of reliability refers to the similarity of items, indicating to what extent test questions are interrelated. To evaluate the reliability of the constructs, Fornell and Larcker [33, 34] suggested three criteria, including (1) reliability of each item, (2) composite reliability (CR) of each construct, and (3) average variance extracted (AVE).
Regarding the reliability of each item, the factor loading of≥0.6 for each item is defined in the CFA as a good construct indicator. Moreover, the factor loading of the items should be significant, at least at p = 0.01 [31, 32]. Dillon-Goldstein coefficient (pc) is used to evaluate the CR of each construct. Given that PLS uses the subjects’ factor scores for analysis, in contrast to multiple regression, it is necessary to consider the factor loading of each item in calculating the reliability index. However, Cronbach’s alpha coefficient assigns equal weights to the items and shows less reliability; hence, the CR coefficient was used. The acceptable values of CR should be≥0.7 [35, 36].
The third indicator of reliability is AVE. Fornell and Larcker recommended AVE≥0.5, indicating that the construct explains about≥50% of the variance of its indicators [34, 36]. Validity refers to whether the items measure the same concept as desired. To examine the validity or divergent validity of constructs, Chin [37] recommended two criteria. First, the construct items should have the maximum factor loading on their construct, indicating that they should have a small cross-sectional load on other constructs. Gefen and Straub suggested that the factor loading of each item on its construct should be at least 0.1 more than the factor loading of the same item on other constructs [38]. The second criterion is that the AVE of a construct must be greater than the correlation of that construct with other constructs. This indicates that the correlation of that construct with its markers is greater than its correlation with other constructs. According to Tannen et al., the goodness of fit (GOF) index in the PLS model acts as the overall fit index of the model and can be used to check the validity or quality of the PLS in general [39]. The GOF index consists of three absolute types to compare and contrast different groups. The value of this index can be obtained by dividing the absolute GOF by the maximum GOF of the tested model. The GOF of the external model measures the overall quality of the measurement model, and the GOF of the internal model measures the quality of the structural model. This index also acts as the fit index of the LISREL model and ranges between zero and one; values closer to one indicate the appropriate quality of the model [40].
Finally, the stability of composite reliability was used to determine the reliability of the scale. To determine the scale’s reliability, the retest was reviewed over a two-week interval, and the scores obtained in these two stages were compared using the intra-cluster correlation index. If this value is > 0.80, the stability rate is desirable. Sampling was performed on 200 qualified older adults; according to some references, this number is acceptable to perform CFA.
Ethical consideration
The ethics committee at Tabriz University of Medical Sciences, Iran, approved the study protocol (IR.TBZMED.REC.1398.517). In this study, the research objectives and the confidentiality of the participants’ information were observed. Moreover, the participants’ written consent was obtained before they participated in this study. The participants were informed based on the Declaration of Helsinki.
Results
The participants’ mean age (standard deviation) in the qualitative phase of the study was 71 years (1.8), and the mean age (standard deviation) of older adult caregivers was 48 years (2.3). Moreover, 63% of the subjects were male, and 37% were female, and 18% of older adult caregivers were male, and 82% were female. In the quantitative phase of the study (psychometric phase), 200 persons participated in this study, 184 of whom were male, and 216 were female, and the mean age (standard deviation) of older adults participating in this study phase was 74 years (1.4). The participants’ characteristics for this phase are shown in Table 1.
Participants’ characteristics in the quantitative phase of the study
Participants’ characteristics in the quantitative phase of the study
In the qualitative phase of the study, the data obtained from the structured interviews with the older adults and the literature review led to the development of scale. The scale consisted of two parts. The first part included the general demographic information of each individual and the second part includes physical environment problems and barriers in the older adults’ homes. This part consists of 71 Likert-type questions, anchored in three points ranging from ‘0-no difficulty’ to ‘3-very’, which were classified into seven main dimensions (Table 2).
The domain and item in the scale
In evaluating the qualitative face validity, after collecting information about the level of difficulty, the degree of appropriateness and ambiguity was determined by interviewing the participants, and the necessary modifications were made in the questionnaire items.
Quantitative face validity
In this phase, two items (D3, D8) did not get the minimum impact score (1.5) and were removed from the questionnaire.
Qualitative content validity
To evaluate the qualitative content validity, the questionnaire was provided to 10 experts in ergonomics and older adults, and finally, some modifications were made to the questionnaire.
Quantitative content validity
The results of CVR showed that six items (D1, D2, D4, D6, D7 and D8) of the scale scored < 0.62, which were removed from the scale. Using the Waltz and Basel method, the results of CVI showed that 51 items scored > 0.79, which were retained from the scale, and 12 items scored between 0.7 and 0.79, which were provided to experts after modifications and with a score > 0.79, were retained in the scale. In the next step, the CVR and CVI of the scale were calculated based on the mean value. Finally, 63 questions with a mean CVI of 0.78 and a mean CVR of 0.84 were obtained (Table 3).
Content validity ratios and content validity index of the scale
Content validity ratios and content validity index of the scale
Factor loading coefficients and CR were used to evaluate the reliability of the constructs. The evaluation of the preliminary 63-item scale showed that 25 items were removed from the model due to factor loadings < 0.6, and other items (44 items) with factor loadings > 0.6 were included in the scale. After removing the 25 items, factor loadings for the retained 44 items were recalculated, as shown in Table 4. Figure 1 shows the research model with standardized path coefficients.
Construct Validity and Reliability
Construct Validity and Reliability

Research model with standardized path coefficients (evaluation of the measurement model after modification).
The results of the second round of factor analysis showed that all the items had factor loadings > 0.6, and thus these 44 items were approved, and the model had a proper fit (p < 0.01). The results in Table 1 are significant at the 99% confidence level (p < 0.01). CR was used to investigate the scale’s internal correlation. The CR coefficients for the studied constructs were > 0.7, confirming their acceptable reliability. Moreover, the AVE index value for all the constructs was > 0.5 and within an acceptable level, indicating the convergent validity of the model. In general, the measurement model explains the convergent reliability and validity of the questionnaire (Table 5).
Summary of reliability analysis
To evaluate divergent validity, a matrix should be created based on Fornell and Larker so that the values of the original diameter are the square root of the AVE coefficients of each construct, and the smaller values of the original diameter are the coefficients of correlation between each construct with other constructs. Table 6 reveals that the square root of the AVE coefficients is greater than the correlation coefficients between the constructs. These results indicate that the measurement model has acceptable divergent validity.
Factor correlations and the square root of the AVE (divergent validity of constructs)
The results of the divergent validity test were also calculated by using the cross-loading method, as represented in Table 7. It should be noted that the factor loading of each item on its construct in this test should be at least 0.1 above the factor loading of the same item on other constructs.
Results of divergent validity study through cross-factor loading method
Table 7 indicates that the divergent validity was confirmed by the cross-loading method for 44 items for each construct. The results in Table 7 are significant at the 99% confidence level (p < 0.01).
The goodness of fit (GOF) considers both measurement and structural models and is used as a criterion in measuring the model’s overall performance. The index is calculated according to the following formula. Wetzels et al. (2009) proposed three values of 0.01, 0.25, and 0.36 for weak, medium, and strong GOF, respectively. According to Table 8 and the following formula, its mean communality was 0.579, and its mean R2 was 0.388. According to the following formula, the GOF value for the study model was 0.473, indicating a strong and acceptable GOF [41].
Structural model fit criteria and GOF
One of the ways to evaluate the degree of adaptation between the home environment and the characteristics of older adults is to examine the barriers and problems reported by older adults in the home environment, which reflect a lack of adaptation. In this regard, the measurement scales for barriers and problems in older adults’ homes can play a crucial role, and the present study addressed the gap in the literature, i.e. the lack of proper assessment scales in this field.
One of the significant challenges of measurement scales is determining their psychometric properties. Studies have revealed that some of the developed scales are used without checking their validity and reliability. However, the quality of a scale depends on its validity and reliability [39]. Accordingly, the present study aimed to evaluate the psychometric properties of a scale to measure physical environment problems and barriers in older adults’ homes using an ergonomic approach.
In this study, decreased abilities in older adults and their vital role in posing problems in the physical environment of older adults’ homes [17] were considered in the development of the scale. A comprehensive scale was also developed to measure the barriers and problems in the physical environment of older adults’ homes. In this regard, the scale validity was examined by assessing face validity, content validity, and construct validity.
In the present study, the face validity of the questionnaire was determined, and the impact score of each question was calculated according to the opinions of the target group. Questions with an impact score of < 1.5 were eliminated or modified [42]. Lawshe’s method was used to evaluate the content validity of the questionnaire quantitatively. To this end, CVR and CVI coefficients were determined. The results showed that the developed scale had acceptable validity to measure the physical environment barriers and problems in older adults’ homes.
The evaluation of the preliminary 63-item scale showed that the scale had a proper fit by removing one item from the construct of barriers and problems related to vision, two items from the cooling and heating construct, four items from the movement construct, 9 items from the access construct, one item from the hand-grip construct, and two items from the accident construct, as their factor loading values were < 0.6 [32]. Given that the scale items in this study were selected by both the interviews with older adults and a review of literature, the opinions of older adults as end users can be more important [43].
To evaluate the scale, internal Consistency Reliability (CR) was used, the value of which was > 0.7 for the studied constructs, confirming the acceptable reliability of the studied model. Moreover, the AVE index value for all the constructs was > 0.5, indicating the convergent validity of the model. In general, the measurement model explains the convergent reliability and validity of the questionnaire.
To evaluate the divergent validity of the questionnaire, cross-loading and Fornell-Larker methods were used, and the results showed an acceptable fit, confirming the necessary coordination between items and constructs. The GOF of the model was also 0.473, representing that the questionnaire structure generally fits well with the data.
According to the findings of this study, the scale is efficient in measuring the physical environment problems and barriers in older adults’ homes by using an ergonomic approach. It can be used for ergonomic assessments of older adults’ homes with appropriate psychometric properties.
Although the reliability and validity of the measurement scale for older adults living in Yazd were confirmed, and the multidimensional measurement scale of physical barriers and problems was approved by an ergonomic approach, other researchers should address the limitations of this study. The study limitations were as follows: small sample size, inclusion of older adults’ homes in Yazd and not the other cities, cross-sectional nature of the data, older adults’ tendency to underestimate the problems [44], and older adults forgetting some problems.
Accordingly, it is recommended to conduct studies on a larger statistical population from other geographical areas with different cultures and lifestyles to determine the role of culture, lifestyle and the type of architecture in older adults’ problems. On the other hand, given that the present study focused on urban houses, it is recommended to examine rural homes in future studies.
Given the tendency of older adults to underestimate access problems and environmental risk factors in their homes [44] and their lack of understanding of how environmental barriers affect their daily activities [45], it is recommended to solve this problem to some extent by expanding the statistical community or by including their family and caregivers in the evaluation. It should be noted that individuals with disabilities report more home adaptation needs than older adults [43, 44]. Given the limitations of this study and the need to address such limitations in future studies, new items should be added to the developed scale so that its results can be used to develop interventions and improve the physical environment in older adults’ homes.
Home designers usually ignore a decrease in human abilities caused by aging, and this issue, in most cases, causes a lack of physical environment adaptation for older adults. By focusing on home design based on older adults’ abilities, it is possible to make an environment adaptable to older adults’ conditions and help them maximize their capabilities [11]. On the other hand, it is recommended to consider inclusive design principles in home designs from the very beginning to make them suitable for older adults. This is because previous studies have indicated that older adults are usually reluctant or even opposed to making recommendations and taking corrective action in their homes and usually postpone repairs at least until their limitations are aggravated [46].
Conclusion
The present study examined the validity and reliability of the scale to measure physical environment barriers and problems in older adults’ homes using an ergonomic approach, and the findings revealed the acceptable psychometric properties of the scale. Accordingly, this scale can be used as a reliable and valid tool in future studies. Since measuring and evaluating the validity of a scale is a continuous process, and the desired validity is achieved by conducting several studies [47], future studies should examine the validity and the factor structure of this scale.
Footnotes
Acknowledgments
This study was extracted from a doctoral dissertation on ergonomics approved at the Tabriz University of Medical Sciences and financially supported by the Vice-Chancellor for Research. The authors thank the university authorities and older adults and their caregivers for their cooperation in this study.
