Abstract
Introduction
The COVID-19 pandemic advanced the use of telehealth-facilitated care. However, little is known about how to measure safety of clinical diagnosis made through telehealth-facilitated primary care.
Methods
We used the seven-step Safer Dx Trigger Tool framework to develop an electronic trigger (e-trigger) tool to identify potential missed opportunities for more timely diagnosis during primary care telehealth visits at a large Department of Veterans Affairs facility. We then applied the e-trigger algorithm to electronic health record data related to primary care visits during a 1-year period (1 April 2020–31 March 2021). The algorithm identified patients with unexpected visits within 10 days of an index telemedicine visit and classified such records as e-trigger positive. We then validated the e-trigger's ability to detect missed opportunities in diagnosis using chart reviews based on a structured data collection instrument (the Revised Safer Dx instrument).
Results
We identified 128,761 telehealth visits (32,459 unique patients), of which 434 visits led to subsequent unplanned emergency department (ED), hospital, or primary care visits within 10 days of the index visit. Of these, 116 were excluded for clinical reasons (trauma, injury, or childbirth), leaving 318 visits (240 unique patients) needing further evaluation. From these, 100 records were randomly selected for review, of which four were falsely flagged due to invalid data (visits by non-providers or those incorrectly flagged as completed telehealth visits). Eleven patients had a missed opportunity in diagnosis, yielding a positive predictive value of 11%.
Discussion
Electronic triggers that identify missed opportunities for additional evaluation could help advance the understanding of safety of clinical diagnosis made in telehealth-enabled care. Better measurement can help determine which patients can safely be cared for via telemedicine versus traditional in-person visits.
Introduction
While the COVID-19 pandemic brought significant disruptions in clinical care across the world, it helped to usher in a wave of telemedicine-facilitated care.1–3 Motivated by the need to meet patient care demands, relaxation of telemedicine regulatory requirements, 4 and adjustments in telemedicine reimbursement, many clinics converted part or all their office visits into telemedicine visits within a matter of weeks of the pandemic onset.5–7 Telemedicine via telephone or video visits offered many benefits during a pandemic involving a contagious disease, 8 including physical distancing and elimination of the need for transportation. 9
Despite benefits, anecdotal drawbacks to telemedicine have been noted. Most notably, physical exams are limited to only those components that can be seen or heard during videoconferencing or telephone calls, and subtle symptoms may be missed. While certain provider-directed patient self-exams are possible, 10 these are useful only in certain aspects of the exam and in patients physically and mentally able to follow the instructions provided. Furthermore, technical quality of the video and audio can potentially be suboptimal and patient–provider interactions may be negatively impacted, resulting in poor or interrupted communication or incomplete assessments. 11 Better understanding of the impact of telemedicine on diagnostic accuracy is needed to validate or refute these anecdotes.
Even when not facing a global pandemic, diagnostic errors are common and underreported, 12 and little is known about how this pandemic or use of telemedicine will impact diagnostic errors.13,14 Measurement of diagnostic safety of telemedicine is thus essential. However, diagnostic errors are harder to identify than other types of errors, and multiple mechanisms have been proposed, including the use of electronic triggers, or e-triggers.15–17 E-triggers could offer a mechanism to identify diagnostic errors related to telemedicine use to advance the measurement of diagnostic safety. E-triggers use algorithms to search vast amounts of EHR-based clinical data to identify patients at high risk of experiencing or having experienced a diagnostic error, and they can account for complex clinical exclusion criteria to enhance detection accuracy. 15 However, they have not yet been applied to telemedicine. This application could help understand vulnerabilities related to telediagnosis and help distinguish which patients can be safely cared for in a telemedicine visit. In this pilot study, we evaluated the feasibility of developing and implementing a telemedicine-focused e-trigger to identify instances of telemedicine-related diagnostic errors. This work could lend itself to future efforts to understand contributory causes for telemedicine-related diagnostic errors and whether they are equal to or inferior to in-person visits and in what circumstances. This would allow better guidance on triaging which patients can receive high quality diagnostic care via telemedicine and whether certain patients should be evaluated only in person.
Methods
Setting
We developed an e-trigger that retrospectively identified patients with potential diagnostic errors after telemedicine visits. The study was conducted at a large, multispecialty academic medical campus. Because a large proportion of initial telehealth visits at the facility were performed by primary care, we chose to focus on this specialty. Approval was received from the Baylor College of Medicine Institutional Review Board.
E-trigger development
We developed a telemedicine-based e-trigger aimed at identifying patients experiencing a diagnostic error or delay related to a telemedicine visit. E-trigger criteria were constructed based on discussion with primary care leadership and leaned heavily on prior work to detect diagnostic errors in the primary care setting. 16
E-trigger development followed the seven steps outlined in the Safer Dx Trigger Tools Framework:
15
Results
E-trigger development
After e-trigger criteria were determined conceptually, we applied each criterion individually to the data repository and performed reviews to ensure that expected data were appropriately captured. To validate the accuracy of each criterion, we performed 50 reviews (20 where the subsequent visit was an ED visit, 20 where it was a hospital visit, and 10 where it was an unplanned PCP visit) on e-trigger-flagged records. Throughout this development process, we iteratively refined the criteria to improve the accuracy of the e-trigger. For example, we excluded trauma and injury on the subsequent visits, as these represented new events and were unlikely to be related to the initial visit. While unlikely in our Department of Veterans Affairs data set, we additionally excluded childbirth diagnoses on subsequent visits for similar reasons. We made these adjustments prior to validation. The finalized e-trigger is displayed in Table 1.
Final e-trigger criteria.
E-trigger validation
We validated the full e-trigger algorithm on the cohort of patients seen via a telehealth visit during the study time-period. We found 128,761 total telehealth visits (from 32,459 unique patients) during this timeframe, of which the e-trigger identified 434 visits (307 patients) with an unplanned ED, hospital, or primary care visit. Of these, 116 visits (67 patients) were excluded by diagnostic codes listed in Table 1, leaving the 318 visits (240 patients) flagged for review. From these, we reviewed charts from 100 randomly selected patients to confirm the presence of a delay. Table 2 contains demographics for both the 240 unique patients flagged by the e-trigger and the 100-patient subset reviewed.
Demographics of e-trigger-flagged and reviewed charts.
Of the 100 charts reviewed, 5 had an audio-only telephone visit, and 93 had a video visit. Four records were inappropriately captured by the e-trigger, including two patients with social worker telehealth visits rather than with a PCP and two patients incorrectly identified by the e-trigger as having a telehealth visit despite no actual visit occurring (one where the patient did not connect and another where a face-to-face visit was incorrectly coded as a telehealth visit). Of the 96 remaining patients with telehealth visits with a PCP, 88 were subsequently seen in the emergency department, 6 had subsequent unplanned visits with a PCP, and 2 were directly admitted to the hospital.
Using the Revised Safer Dx Instrument, the chart reviewer identified 11 patients with a missed opportunity for diagnosis (rated six or seven on a seven-point Likert scale on the final item; Table 3). All other records were rated as a low likelihood of a missed opportunity (one of seven). Thus, the e-trigger achieved a PPV of 11% (95% CI: 5.6–18.8%). Scaling this up to the entire 240 patients identified, we would expect to find approximately 26 missed opportunities at the study site during the 1-year period.
Description of missed opportunities identified among records flagged by e-trigger.
Among the 11 missed opportunities identified, 7 involved documented signs and symptoms consistent with an alternative diagnosis than what was made by the treating physician, leading the patient to seek subsequent in-person care at the ED or in another clinic visit where the alternative and more serious diagnosis was made. For example, a patient diagnosed with allergic rhinitis despite documented shortness of breath and fever received a diagnosis of pneumonia at the subsequent visit. Three missed opportunities involved failure to order workup or follow-up on test results in a timely manner. Only one missed opportunity occurred during the subsequent visit where a patient was discharged from the ED despite positive troponins that were reviewed by the PCP. The PCP sent the patient back to the ED where he was subsequently diagnosed with a non-ST elevation myocardial infarction.
The time between the first and second visits ranged between 1 and 10 days, though all but one missed opportunity occurred within 6 days of the initial visit. Reducing the red flag criteria from 10 days for a subsequent visit to 6 days could have increased the positive predictive value from 11% (11 missed opportunities in 100 reviewed records) to 14.5% (10 missed opportunities in 69 records reviewed) at the expense of excluding one missed opportunity.
Discussion
We developed and tested an e-trigger algorithm to detect missed opportunities in diagnoses related to telemedicine visits. The e-trigger algorithm achieved a positive predictive value of 11%, making it capable of detecting telemedicine-related missed opportunities for further study.
Our findings suggest that electronic triggers could improve detection of telehealth-related missed opportunities in diagnosis. We estimated that our e-trigger could identify approximately 26 telehealth-related missed opportunities in diagnosis at the study site during the one-year study timeframe. Because such instances are notoriously difficulty to identify with non-selective chart reviews, malpractice claims, and incident reporting-based techniques, our e-trigger's ability to identify at least one missed opportunity by performing only 10 reviews suggests that finding such instances for learning and improvement purposes is feasible. Finding common patterns from such reviews could lead to solutions to improve diagnosis through telemedicine.
Additional research is needed to compare diagnostic safety to similar in-person visits and understand factors that increase or decrease the risk of telehealth-related diagnostic errors. Efforts are also needed to compare telehealth visits to in-person visits in different clinical situations (e.g. for certain presenting symptoms) to identify whether visits offer similar or different diagnostic accuracy. Application of e-triggers, such as the one developed during this study, can additionally enable tracking of telemedicine-related diagnostic error frequency over time and inform interventions to improve safety of telehealth-based care. Nevertheless, we plan to pursue future work to advance the PPV of telemedicine e-triggers and evaluate their implementation and use. Additional e-triggers that detect diagnostic errors without relying on a subsequent visit also need to be developed.
Several limitations warrant mention. First, our study was performed at a single Veterans Affairs site, and findings may not be generalized to other sites. However, prior work suggests that e-triggers can be applied to multiple types of care settings, often with simple localized customizations, to achieve similar results. 21 Second, our e-trigger evaluated care during a time of transition for health care and may be affected by the novelty of telehealth to most providers. Nevertheless, such e-triggers can evaluate whether rates of missed opportunities improve over time as telehealth familiarity increases. Finally, we did not evaluate the negative predictive value; however, this pilot study was aimed at evaluating the feasibility of developing a telehealth-based e-trigger. Future work will need to evaluate additional aspects of trigger performance and characteristics of patients who experienced a missed opportunity, as well as compare diagnostic performance of telehealth visits to that of in-person visits.
Conclusion
We developed and tested a novel electronic trigger to identify instances of missed opportunities in diagnosis related to telehealth visits. While further testing and implementation are needed, such e-triggers could help advance the understanding of diagnostic safety of telehealth-related care by identifying instances of diagnostic error for further study.
Footnotes
Data availability
The United States Department of Veterans Affairs (VA) places legal restrictions on access to veteran's health care data, which include both identifying data and sensitive patient information. The VA data sets used for this study are not permitted to leave the VA firewall without a Data Use Agreement. However, VA data are made freely available to researchers behind the VA firewall with an approved VA study protocol. All summary data obtained for analysis during this study were included in the manuscript.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: This project was funded by the Baylor College of Medicine Department of Medicine Vice Chair Group for Quality Improvement and Innovations and partially funded by the Houston VA Health Services Research & Development Center for Innovations in Quality, Effectiveness and Safety (CIN 13–413). Dr Singh is additionally supported by the VA National Center for Patient Safety and the Agency for Healthcare Research and Quality (R18HS029347 and R01HS028595).
