Abstract
The objective of this study was to estimate the association between changes in health care expenditures relative to changes in health risk status for employers of all sizes. Repeat health risk assessments (HRAs) were obtained from 50,005 employees and spouses with 2 years of health plan enrollment, and from 37,559 employees and spouses with 3 years of enrollment in employer-sponsored medical coverage. Changes in health care expenditures were measured from the year before completion of the first HRA to the years before and after the completion of the second HRA. Propensity score weighting was used to adjust for those who did not repeat the HRA so results could be extrapolated to the larger population. Propensity score weighted multiple regression analyses were used to estimate the relationship between changes in health care expenditures with changes in risk status for 9 risk categories. Significantly higher health care expenditures were associated with those who moved from low risk to medium or high risk, compared to those who remained low risk. Expenditure reductions estimated for those who improved their health status from high risk to medium or low risk were not statistically significant. This study is unique because of its large sample size, its use of data from a wide range of employer sizes, and its efforts to extend generalizability to those who did not complete both HRAs. These results demonstrate that the potential for short-term health care savings may be greater for programs that help maintain low risk than for programs focused on risk reduction. (Population Health Management 2014;17:297–305)
Introduction
M
With the continued expansion of health management programming, there is an ongoing interest in documenting population health improvement and economic returns from these programmatic investments. Despite several decades of research on these topics, shortcomings in the research methods, small sample sizes, the use of single employer evaluations, a wide range of program designs, and a lack of consistency across studies have hampered conclusions about the effectiveness and/or cost-benefit of these programs. 5 Numerous studies, most being cross-sectional in design, have documented that health risks are associated with higher health care expenditures in employee and spouse populations. 6 –13 Studies also show that health management programs of various designs may result in health improvements for selected health risks (eg, physical activity, nutrition, weight) and/or overall health risk status over time. 14 –19
Evaluations of the economic benefits resulting from health management programs are generally based on the hypothesis that high rates of participation are necessary to generate savings. Generally, it takes 3 to 5 years for such savings to materialize. 3,5,17,20,21 Fewer longitudinal studies have focused on the mechanism and/or timing of cost savings related to reductions of measured health risks and/or in the maintenance of those currently at low risk.
It is well documented in the scientific literature that costs positively correlate with changes in risks—as risks improve, health care costs often decrease and as risks get worse, costs tend to increase. 22 –26 More recently, studies have included larger populations, combined data from several large employers, and have increased the precision of the timing for cost changes following risk changes. 18,19 To date, however, the authors of the present study could find no studies including small and medium-sized employers. In addition, methods have been lacking that would address selection bias related to having highly motivated repeat health risk assessment (HRA) participants. The purpose of this study is to estimate the association between changes in health care expenditures relative to changes in health risk status for employers of all sizes, generalizing to all employees and spouses regardless of whether an HRA was completed.
Methods
Study design and sample selection
This study was a retrospective cohort analysis using individual-level, self-reported data obtained from 2 consecutive annual HRAs. These HRAs were completed by insured employees, spouses, and significant others from across Optum's employer client book of business. Inclusion criteria limited the study sample to those 18 to 63 years of age at their first HRA and to those who had completed their second HRAs between May 2009 and May 2011. The elapsed time between the first and second HRA must have been between 270 and 450 days. Most sample members had medical and pharmacy covered by UnitedHealthcare Insurance Company (84%); another 16% were covered by other health plans. For the first analysis, HRA repeaters were continuously enrolled in their health plan for 2 years, with no monthly gaps from the year prior to the first HRA to the year prior to the second HRA. For the second analysis, HRA repeaters were continuously enrolled for 3 years, from the year before the first HRA was taken to the year after the second HRA was completed. Members with evidence of maternity services were excluded from this analysis. About 80% of employers offered additional health coaching options. To maintain a consistent program intensity across employee/spouse participant populations, those with documented health coaching participation also were excluded (about 20%).
To minimize selection bias that might occur if HRA repeaters were significantly different than those who did not complete 2 HRAs, a cohort of insured employees and spouses who completed 0 or 1 HRA (ie, those who were eligible for but did not complete 2 HRAs) also was selected. Their data were used to generate a propensity score case weight that was applied to those who completed 2 HRAs to make the data from those who completed 2 HRAs better resemble all employees regardless of whether HRAs were completed. Methods for generating the case weight are described in a following section. HRA non-completers/non-repeaters must have otherwise met the same criteria as those with repeat HRAs to be included in the study.
Definition of health risks
The University of Michigan Health Management Research Center HRA was administered online and included questions about lifestyle and behavioral health risks, biometric factors, and health status. 27 The following 9 health risks were included in this study: alcohol, blood pressure (BP), body weight, cholesterol, medication/drug use for relaxation, nutrition, physical activity, stress, and smoking. All survey responses were self-reported; however, values from biometric screenings were utilized if measured within 30 days of the HRA completion date, as occurred in about 10% of respondents. The 9 health risks were dichotomized to denote either higher risk and lower risk groups as follows: alcohol:>14 drinks/wk (men);>7 drinks/wk (women); BP: systolic BP>119 or diastolic BP>79 mm Hg; cholesterol:>199 mg/dL; drug use for relaxation: sometimes/almost every day; physical activity:<1 time/wk; smoking: current; nutrition:<3 servings/day fruit/vegetable/fiber; stress: some/a lot; and weight: body mass index<18.5 or>24.9. Of note, the research team used at-risk categories that included those at moderate and high severity risk levels—an approach commonly used in population health management strategies. Based on the number of risk factors where the respondent was at higher health risk, individuals were subsequently grouped into the following overall risk status levels for the analysis: high risk indicates the individual was at higher risk for 4–9 risks, medium risk indicates he or she was at higher risk for 3 risk factors, and low risk indicates the respondent was at higher risk for 0–2 risk factors. The research team then observed whether sample members moved between these low-, medium-, or high-risk categories over time. These movements were then associated with increases or decreases in health care expenditures by undertaking propensity score weighted statistical analyses that are described in a following section.
Determinants of health care costs
Total allowable health care expenditures were calculated in US dollars for each sample member, including all inpatient, outpatient, professional, and pharmacy costs, for the year prior to the first HRA, the year prior to the second HRA, and the year after the second HRA was completed. Allowed health care costs (including plan payment, member payment, and coordination of benefits) were utilized because allowed health care costs are impacted to a lesser degree by differences in employer health care paid benefit structures, or by changes in benefit structures that may occur over time. All health care costs were adjusted to May 2011 dollars using the medical care services component in the Consumer Price Index.
Covariates
Covariates were included in the weighted regression analyses to estimate relationships between changes in health care expenditures and changes in risk so that the research team could differentiate between the impact of changing risks versus other factors that also may influence health care expenditures. These covariates included measures of demographics, health status, and other characteristics self-reported on the HRA, or were taken from health plan eligibility and claims files. Demographic variables included measures of the respondent's age, sex, race, education level, employee status, and employer size. Age and education were each stratified into 4 groups (age: 18–34, 35–44, 45–54, and 55–63 years; education: high school graduate or less, some college, college graduate, and graduate school). Race was stratified into 5 groups (white, black, Hispanic, Asian, and other). Employer size was stratified into 3 groups by number of employees (small<1000 employees; medium 1000–9999 employees; and large ≥10,000 employees) and employee status was stratified into 2 groups (employee and spouse/significant others).
Covariates measured from health plan membership or claims data included the individual's health plan type, the ratio of paid-to-allowed health care costs covered by the respondents' insurance, an indicator of the existence of any diagnosis or procedure associated with complex medical conditions (see the following description) and the calculated Charlson comorbidity index (CCI). 28 The CCI is a measure of the risk of 1-year all-cause mortality attributable to selected comorbidities that also has been shown to be highly predictive of morbidity and health care expenditures.
Information about health plan type was stratified into 4 groups: point of service (POS), preferred provider organization (PPO), exclusive provider organization (EPO), and other. The ratio of paid-to-allowed health care costs represented the “richness,” or actuarial value, of the subjects' health plan benefit design. The actuarial value was calculated for each employer and assigned to each member. The complex medical condition flag was created to designate whether a member had at least one of the following complex conditions as determined by claims-based diagnoses or procedures during the HRA pre period: cancer, end-stage renal disease, HIV/AIDS, organ transplant, trauma, or mental disorders.
Propensity score weighting for selection bias adjustment
Propensity score weighting was used to adjust for potential selection bias to enhance the generalizability of these findings to the broader population. The propensity score adjustment process uses information about the demographic, socioeconomic, and health status variables already described that could potentially drive HRA completion. This information was used to estimate the underlying probability of completing 2 HRAs for each individual, and then that estimated probability was used to create a weighting variable to be applied to the data from those who completed 2 HRAs to make them better resemble all employees regardless of the number of HRAs that were completed. The value of the weighting variable equals 1/predicted probability of completing 2 HRAs. These weighted analyses reduced threats to external validity that are problematic in other studies that do not adjust for response bias. The utility of propensity score models to adjust for external validity threats are described elsewhere. 29 A practical application of these methods was demonstrated in a recent publication evaluating the associations between health risks and health care expenditures. 13
One can make inferences about the utility of the propensity score approach by assessing the balance of measured member characteristics between HRA repeaters and non-completers/non-repeaters. This can be done by comparing the standardized difference value for each covariate used in the propensity score weighting and observing how balance improves after weighting. Standardized difference values less than 0.10 indicate successful balancing of each variable between those who completed 2 HRAs and those who did not. If most variables have standardized difference values less than 0.10 this helps ensure that the study sample more closely resembles the overall population of HRA completers/non-completers/non-repeaters, thereby increasing generalizability of the study findings when weighted statistical analyses are used. 30
Finally, as a sensitivity analysis for the effectiveness of the propensity score weighting method, the research team also applied a 2-stage residual inclusion method suggested by Terza et al 31 to adjust for selection bias. This method uses a 2-step regression model that adjusts for the impact of both measurable and nonmeasurable variables that may influence the likelihood of completing 2 HRAs. The Terza et al method is similar to the propensity score method in that both begin by using regression analyses to estimate relationships between completing 2 HRAs and the demographic and other variables already mentioned. Rather than creating a case weight for the subsequent health risk/health expenditure regression, however, the Terza et al approach uses the residuals from the HRA completion regression as another independent variable in the risk/cost regression.
Outcome health care expenditure variable
Changes in health care expenditures were calculated by subtracting the individual's baseline expenditures prior to the first HRA from (1) the annual expenditures prior to the second HRA, and (2) from the annual costs after the second HRA for each of the 9 risk status change categories. The resulting cost dfference variables were used as the dependent variables for the weighted risk/cost regressions described next.
Regression modeling
Propensity score weighted multiple regression analyses were performed to measure the independent association between the health care cost differences and each of the 9 risk status change categories, controlling for the impact of all of the covariates mentioned. Generalized linear regression models with a gamma distribution and a log link function were used to assess the association between risk status changes and changes in health care expenditures. 32 Individuals whose total annual expenditures exceeded the 97.5th percentile were excluded from the analyses to avoid outlier values from heavily influencing the results.
Results
Sample characteristics
The study included 50,005 sample members when the focus was on health care expenditure changes between the year before the first HRA was taken and the year before the second HRA was taken. Sample sizes were lower (n=37,559) when focusing on expenditure changes between the year before the first HRA was taken and the year after the second HRA. The unweighted demographics, health status, and other characteristics of sample members for these 2 sets of analyses were similar and are shown in Table 1. The samples used in both analyses were predominantly employees, females, white, and well educated. Their average age was about 44 years. Most members chose POS health plans. Health risk characteristics of sample members are shown in Table 2. For most risk factors there were small changes over time in percentages of sample members at high risk.
CCI, Charlson comorbidity index; EPO, exclusive provider organization; HRA, health risk assessment; POS, point of service; PPO, preferred provider organization.
BMI, body mass index; DBP, diastolic blood pressure; HRA, health risk assessment; SBP, systolic blood pressure.
The results of propensity score weighting for response bias are shown in Table 3. Prior to propensity score weighting, repeat HRA completers were significantly older, more likely to be an employee, to have been offered an HRA incentive, and to be in a small or medium-sized organization. HRA repeaters also were less likely to have a medical plan that was a POS, PPO, or EPO. After propensity score weighting, differences between repeat HRA completers and non-completers/non-repeaters were minimized. Most variables had standardized differences less than 0.10, indicating successful weighting. The many differences between the repeat HRA completers and non-completers/non-repeaters prior to weighting underscore the importance of this adjustment in order to make these results generalizable.
EE, employee; EPO, exclusive provider organization; HRA, health risk assessment POS, point of service; PPO, preferred provider organization.
Tables 4 and 5 show the propensity score weighted changes in health care expenditures associated with each of the 9 risk status change categories, after controlling for demographics, medical plan type, health status, and baseline health risks. Differences in health care costs from the year before the first HRA to the year before the second HRA are shown in Table 4. Differences in health care costs from baseline to the year after the second HRA are shown in Table 5.
Adjusted for age, sex, race, education, all baseline risks, medical plan type, client-based paid/allowed ratio, elapsed time between HRAs, baseline Charlson comorbidity index, complex medical conditions (during T1 and T2), client size, and employee status.
HRA, health risk assessment.
Adjusted for age, sex, race, education, all baseline risks, medical plan type, client-based paid/allowed ratio, elapsed time between HRAs, baseline Charlson comorbidity index, complex medical conditions (during T1 and T2), client size, and employee status.
HRA, health risk assessment.
In both tables, health care expenditures are presented as unadjusted differences (ie, prior to any regression analyses); regression adjusted differences without any weighting; and propensity score weighted regression adjusted differences. Generally, the values are consistent regardless of the use of these approaches. The Terza et al model to adjust for selection bias provided nearly identical results when compared to the propensity score weighted results and thus are not shown but are available from the authors upon request. 31
As shown in Table 4, significantly higher health care expenditures were associated with transitions to higher risk status (ie, moving from low to medium, low to high, or medium to high risk). Significant cost increases also were observed when comparing the year before the first HRA to the year after the second HRA was taken, when considering those who moved from low to medium risk and those who moved from low to high risk (Table 5). Conversely, those who improved their risk status by moving to lower risk categories (ie, from high to medium or high to low risk) demonstrated smaller cost increases compared to those who remained high risk. However, the changes in costs that were associated with reductions in risk were not statistically significant in either analysis. Finally, about 30% of the sample included those who were always in the lowest risk group; members of this group had the lowest increases in costs over time. Overall, 19% improved their risk status, another 18% moved to a higher risk status through the addition of risk factors, 11% remained medium risk, and about 20% remained high risk over the time period (Tables 4 and 5).
Discussion
Sample members in this study of repeat HRA completers were from small, medium, and large employers. The shorter term economic impact of risk status changes indicated significant increases in health care expenditures associated with those who transitioned from low risk to either medium or high risk. These cost increases persisted and increased in the year after the second HRA was completed. Although smaller cost increases were directionally associated with risk status improvement compared to those who remained high risk (ie, moving from high risk to medium or low risk), these cost changes were not significant. Regression adjustment and adjustments for selection bias did not change these conclusions. Thus, these results would indicate that the cost savings potential associated with health management programs may be enhanced with an increased focus on the avoidance of gaining more risks.
This analysis utilized 9 health risks commonly used in the health promotion industry to estimate health status (eg, low, medium, and high risk) in the population studied. 10 –12,15,18,27 In implementing a population health management strategy, however, the research team included both moderate and high severity criteria in defining each of the individual health risks. This approach better stratifies the population for low- and high-level interventions. The study population had access to annual HRAs and a health communications program and demonstrated a small but nonsignificant improvement in the percentage (+0.3 percentage points) of individuals at low risk over time. This net change in health status for the population reflects an overall 19% of the population that improved their risk status, 18% who got worse (ie, added health risks), and 63% who stayed at the same risk level. Edington's natural flow model detailed in Loeppke et al 15 indicates an expectation of a loss of 5 percentage points for those at low risk, with 16% expected to improve their health status, 23% to add health risks, and 61% to stay at the same risk levels. This would indicate that a low intensity health management program such as described in the present study can improve on the natural flow by assisting individuals to maintain their current health status and minimize transitions to higher risk status.
Expanding health risk criteria to include moderate severity better stratifies the population for interventions but lessens the impact on medical expenditures at baseline and in observed changes in medical expenditures with changes in those health risks over time. This may partially explain the results that risk reduction from high-risk status to medium or low risk was associated with smaller cost increases compared to those who remained high risk, but not actual cost decreases. Slowing medical cost increases in the current health economic environment could be considered a win—even without actual cost decreases.
In addition, the impact of gaining and losing health risks over time on health care expenditures has been demonstrated elsewhere. For example, the impact of weight gain on members of employer health plans has been associated with significantly increased health care expenditures, 19,25 whereas weight loss (even in successful weight loss programs) was not associated with health care cost reductions within the next 2 years. 16 In a large study by Carls et al, 19 individual risk changes were tracked for 9 health risks over 3 years. Seven of the individual health risks that were reduced were not associated with lower health care expenditures while alcohol reduction and smoking cessation were associated with significantly increased costs. Haynes et al 26 tracked 5 health risk transitions, confirming that individual risk reductions and overall risk status improvement were not associated with expected cost decreases over the time period. In fact, several of the risks that were reduced were associated with significant cost increases, as health issues may have motivated the changes in lifestyle (eg, smoking cessation, BP reduction).
The timing of health care cost changes relative to risk status changes has less often been the focus of scientific research studies. This study design increased the precision of these calculations by using a baseline year prior to the completion of the first HRA and then considered cost changes from the baseline to the year prior to the second HRA, or from the baseline to the year after the second HRA was completed. The precision of the design confirmed the results that significant increases in health care expenditures were associated with increased risks documented in the year before the second HRA and, furthermore, that these cost increases were significantly sustained in the year after the second HRA. In contrast, although health care expenditure increases were minimized over time with risk reduction documented in the year before the second HRA, these smaller cost increases were not statistically significant either in the year before or after the second HRA.
One of the issues often raised about analyses of relationships between changes in health risks and associated changes in health expenditures is whether the results are influenced by selection bias. If so, the results obtained may not generalize to those people who complete fewer than 2 HRAs. In the present study, only about 2% of the employees and spouses of interest completed 2 HRAs. To address this concern, the research team tested 2 adjustment approaches. One was based on propensity score weighting 29 ; the other was a more recently developed residual inclusion method 31 that is suitable when nonlinear relationships may exist, such as those between risks and costs. Both sets of results showed a similar pattern of findings. The research team urges some caution when interpreting these results because such a large proportion of people had fewer than 2 HRAs, so the adjustments that were applied may not have worked as well as would be the case if many more had completed 2 HRAs.
One of the strengths of this study is the large, diverse sample, which came from multiple employers of all sizes. Smaller employers often depend on the availability of health programs from their health insurance providers but represent a significant portion of the market and are responsible for much of the recent growth in wellness opportunities being presented to employees. 1 To date, most research studies evaluating the impact of health management programs have focused on one or only a few large employers. This study provides an important addition to the health promotion literature in that employees from smaller companies (1–1000) are duly represented; they accounted for 28% of the study sample.
Limitations to consider include that the study results were based primarily on self-reported HRA data. Previous research has established the reliability and validity of HRA data, certainly for use in categorizing individuals into low-risk or at-risk subgroups. 33,34 A second limitation is that the adjustments for selection bias may be imperfect. With only a small proportion of people completing 2 HRAs, it is still possible that some unmeasured variables may influence the results and the ability to generalize to the entire population still may be somewhat limited. A third limitation relates to the relatively high level of education of this study cohort (54% had at least a college degree), which may limit the generalizability of these results to populations from other employers.
A strength of this study is its large sample size of members with repeat HRAs that included employees, spouses, and significant others, associated with a wide range of employer sizes. A second strength is the use of the multivariate modeling, which controlled for several relevant confounding variables. A third strength is the adjustment for selection bias, though, while not likely to be perfect, was at least attempted and seems to be the first of its kind in this body of literature.
The research team found that health risks significantly affected health care expenditures across employers small to large. Furthermore, as health risks increased, health care expenditures significantly increased over time. These results underscore the potential opportunities for cost savings among employers of all sizes by focusing not only on reducing behavioral risks in susceptible individuals but perhaps more importantly by assisting those currently low-risk individuals to maintain their low-risk status over time.
Most insurance providers offer HRAs and other (eg, disease management) program features, thus program opportunities should be available across most employers either as employer-funded health management programs or as a benefit for those with insurance coverage. These results would indicate that efforts to avoid future health risks, along with health improvement opportunities, would be beneficial to employees in improved quality of life and well-being, and to their employers in maximizing the potential of health management programs as one strategy in the containment of health care costs.
Footnotes
Acknowledgments
The authors thank Frank G. Bottone, Jr, PhD for his critical review of this manuscript and editorial assistance.
Author Disclosure Statement
Drs. Musich, Bhattarai, Hawkins, and Ozminkowski, Mr. White, and Mr. Hartley declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: All authors are employees of Optum. Drs. Ozminkowski and Hawkins disclose that their positions with the company afford them stock options with UnitedHealth Group. This study was sponsored by Optum.
