Evaluating Multimodal Interfaces for Visually Impaired Users in Autonomous Ridesharing: A Usability Study

Abstract

Transportation accessibility remains a critical challenge for visually impaired individuals, constraining their autonomy and societal participation. Although autonomous vehicles (AVs) hold transformative potential for enhancing mobility, prevailing human-machine interfaces (HMIs) frequently neglect the unique interaction requirements of this population. This study investigates the efficacy of a multimodal HMI explicitly designed to facilitate autonomous ridesharing interactions for visually impaired users. Employing a between-subjects experimental design, we evaluated user trust and satisfaction across six core ridesharing functions under three distinct conditions: (1) visually impaired participants with multimodal (audio-visual) feedback, (2) non-visually impaired participants, and (3) visually impaired participants without audio feedback (N = 24). Our findings demonstrate that audio-enhanced multimodal interfaces bridge the accessibility gap, enabling visually impaired users to attain trust and satisfaction levels statistically comparable to those of non-visually impaired users. Furthermore, the absence of audio feedback significantly degraded navigational confidence, vehicle identification accuracy, and overall user experience (p < .05). These results theoretically validate the significance of auditory cues in AV HMIs, while empirically confirming design principles for universal accessibility. By providing actionable guidelines for inclusive interface design, this work advances equitable mobility solutions and underscores the imperative of user-centered autonomy in next-generation transportation systems.

Keywords

autonomous vehicle rideshare visually impaired inclusive design human machine interaction

Introduction

Transportation represents a fundamental challenge for visually impaired (VI) individuals, with over 25 million Americans experiencing transportation-limiting disabilities due to sensory, cognitive, or motor impairments (Fink et al., 2023). This lack of accessible transportation significantly impacts their independence, workforce participation, and overall quality of life (Flaxman et al., 2021). Fully autonomous vehicles (FAVs) hold immense potential to transform mobility by offering safe, independent travel options for VI individuals. Yet, current human-machine interfaces (HMIs) in these vehicles depend heavily on visual cues, rendering them inaccessible to this population.

Compounding the issue, current policies and interface designs often fail to address the specific accessibility needs articulated by blind and visually impaired people, preventing the realization of this technology’s life-changing mobility benefits (Fink et al., 2021). Despite these challenges, recent advances in inclusive AV interface design demonstrate that effective user interfaces can be developed through targeted evaluation methodologies tailored specifically for persons with visual impairments (Angeleska et al., 2022).

Key research indicates that multimodal feedback systems—integrating auditory, tactile, and visual cues—significantly enhance usability for VI users, yet most existing navigation systems lack necessary flexibility, forcing users to adapt to the system rather than accommodating individual needs (Dourado & Pedrino, 2023). This gap is particularly critical as emerging evidence shows that different explanatory techniques and modality combinations substantially impact cognitive load, situation awareness, trust, and overall user experience in automated systems (Kaufman et al., 2024).

In summary, this study applies human factors methodologies to develop and evaluate a multimodal HMI inclusive interface for autonomous vehicles aimed at enhancing independence and overall experience of VI users. The research focuses on identifying essential features and modalities preferred by VI individuals, developing a prototype that integrates auditory and visual feedback, and conducting comprehensive usability testing to assess the system’s effectiveness. By advancing accessible HMI solutions, this work contributes to the broader goal of making autonomous transportation more inclusive for VI users.

Methods

This study employs a structured three-phase methodology to systematically analyze and optimize travel-related tasks. The initial phase involves a functional decomposition of the complete trip cycle into six critical operational components: (1) identity verification, (2) trip confirmation, (3) standard driving operations, (4) unexpected event resolution, (5) destination arrival protocols, and (6) termination procedures. Building upon this framework, a comprehensive hierarchical task analysis was implemented to delineate fundamental elements and decision points within each functional domain. Subsequently, these analytical insights informed the development of an optimized human-machine interface grounded in user-centered design principles. The final validation phase employed a controlled experimental paradigm to quantitatively assess the efficacy and usability of the proposed interface prototype. The study was approved by the University of Michigan Institutional Review Board (HUM00263286).

Task Analysis

Task1: Pre-drive preparation

Identity Verification

Upon boarding, the autonomous vehicle’s voice module prompts the passenger to fasten their seatbelt for safety. Then, audio and visual cues guide them to confirm their identity by speaking their four-digit ride code (shown in Figure 1).

Figure 1.

Confirm ID process.

Trip Confirmation

The system presents destination and prompts passenger for verbal confirmation. Passenger responds with “yes” or provides correct address. If recognition fails, system offers customer service connection.

Task2: During the Ride

During Routine Driving Tasks

The autonomous driving system uses audio-visual cues to inform passengers about upcoming events and decisions during routine driving. For example, as the vehicle nears an intersection, it announces, “Approaching intersection. Stopping for red light in 3 s” Once stopped, it confirms, “Stopped at red light,” and the HMI shows the estimated red light duration. The system briefly describes intersection details, like traffic and pedestrian activity. Before the light turns green, it states, “Light turning green soon. Checking intersection safety. Preparing to move.” After crossing, it confirms, “Intersection cleared,” and updates the estimated arrival time audibly and visually (shown in Figure 2).

Figure 2.

Traffic lights procedure.

Handling Unexpected Events

In the event of potential traffic issues—such as sudden obstructions, accidents, or hazards—the HMI switches to the special mode designed to handle these situations by informing passengers of real-time dangers and evasive actions (shown in Figure 3).

Figure 3.

Emergency avoidance procedures.

Task3: Reaching Destination

Destination Notification

The system will provide an audible alert 3 min prior to reaching the destination.

Exit Interaction

Upon arriving at the destination, the system will provide disembarking guidance, detailing a safe exit procedure.

Prototype Design

Based on the task analysis results and human-centered design principles, we developed a multimodal in-vehicle HMI prototype. The design incorporates key human factor principles including:

Attention Management (e.g., salience compatibility, multimodal approach).

Perception Optimization (e.g., minimizing information access costs, gain redundancy, proactive updates).

Memory Support (e.g., predictive aids, consistency, knowledge in the world).

The prototype provides real-time feedback on the vehicle’s intentions, communicates the surrounding environment, anticipates driving decisions, and facilitates smooth mode transitions during unexpected events.

Technical Approach

A laptop connects to an external monitor mounted on the back of the front seat, serving as the visual interface to display route status, pre-arrival alerts, and environmental descriptions. Surrounding the screen are physical buttons in yellow (#F4B300) and purple (#B87AFF), each tied to predefined functions. An external speaker on the rear seat enables voice interaction. On the software side, MATLAB is used to develop codes controlling audio playback and touchscreen content, ensuring synchronized or asynchronous real-time display and audio feedback for dynamic multimodal presentation. Lastly, Audio content for the voice module is pre-recorded using OpenAI’s web-based advanced voice generation (Voice: Maple).

Experiment Design

A between-subjects study with 24 participants (12 males, 12 females; Mean age = 27.58 years, SD = 3.07) was conducted to evaluate the prototype’s usability. Post-hoc power analysis revealed adequate power (0.50–0.81) for detecting medium to large effects (ε² = 0.192–0.322).

Independent Variable

Participants were randomly assigned to three interface accessibility conditions (N = 8 participants per condition).

None Visually Impaired (NVI): Participants with normal vision using the multimodal interface.

Visually Impaired (VI): Participants with visual impairment using the multimodal interface.

Visually Impaired without audio (VIX): Participants with visual impairment using the interface without audio feedback.

Visual Impairment Simulation

To simulate visual impairment conditions, we utilized the Cambridge Visual Impairment Simulator Software (Inclusive Design Toolkit, University of Cambridge) to adjust screen displays for normal-sighted participants. The simulated condition applied a moderate blur setting, reducing visual acuity to 0.1 to 0.3 (Snellen chart), consistent with WHO-defined moderate low vision. To ensure experimental consistency, all participants in the visual impairment condition experienced the same moderate blur setting. This approach was chosen because blur effectively replicates common visual challenges while allowing standardized implementation. The selected level represents visual impairments that significantly impact interface usability while still enabling interaction with properly designed elements (shown in Figure 4).

Figure 4.

Visual impairment simulation (Normal vs. Visually Impaired).

Experiment Process

Step 1. Participants arrive at the designated waiting area outside the laboratory. The experimenter explains the task requirements of the experiment and the participants' roles. All participants sign informed consent forms and complete demographic questionnaires. Step 2. Participants are then randomly assigned to an experimental condition and given 3 minutes to familiarize themselves with the visual impairment state through the human-machine interface (HMI). Step3. The formal experiment begins when participants receive an “autonomous vehicle approaching” phone notification with a verification code outside the laboratory and are guided to their seats. Participants then experience simulated autonomous driving, sequentially experiencing the preparation process before the journey, the driving experience during the journey, and the arrival process at the destination (During the driving state, vehicle environmental sounds are played through external speakers, while an experimenter controls the rotation and vibration of the seat using rope traction to simulate vehicle movement). Upon reaching the destination, participants complete questionnaires and participate in interviews conducted by the experimenter. The experiment concludes after the interview is completed (shown in Figure 5).

Figure 5.

Experiment procedure.

Data Analysis

Data were collected through an online questionnaire including demographic information, 1 to 5 Likert-scale questions on trust and satisfaction with the HMI prototype, and open-ended interviews. The data were processed by software SPSS (ver. 26) and MATLAB (ver. 2024a).

For the quantitative data, due to the non-normal distribution of the Likert-scale questions, a nonparametric Independent-Samples Kruskal-Wallis Test was employed to compare group differences across eight dimensions: correct rideshare identification , correct destination confirmation , route and navigation accuracy , location access functionality , audio-visual feedback , call for help functionality , direction upon exiting and overall satisfaction with feedback (statistical significance for each test was determined at α = .05). Post-hoc analyses with Bonferroni adjustments clarified significant pairwise contrasts. For all Kruskal-Wallis tests, effect size was quantified using epsilon-squared (ε²).

Qualitative data from open-ended interviews, focusing on audio-visual feedback preferences, perceived effectiveness, and improvement suggestions were thematically coded and analyzed to complement the quantitative findings and provide deeper insights into user experience with our HMI prototype.

Results

Trust and Satisfaction

The Kruskal-Wallis test revealed significant differences across the three groups in three aspects (shown in Figure 6):

Correct Rideshare (H(2) = 8.755, p = .013, ε² = 0.322): The mean rating was significantly higher in the NVI group (Mean = 4.88) compared to the VIX group (Mean = 3.50, p = .013), while the VI group showed similar ratings to NVI (Mean = 4.25).

Route and Navigation (H(2) = 6.022, p = .049, ε² = 0.192): The NVI group reported higher scores (Mean = 4.75) compared to the VIX group (Mean = 3.63, p = .043), with the VI group (Mean = 4.13) showing no significant difference from either group.

Overall Satisfaction (H(2) = 6.972, p = .031, ε² = 0.237): The NVI group showed the highest satisfaction (Mean = 4.75), significantly different from the VIX group (Mean = 3.88, p = .032), while the VI group’s ratings (Mean = 4.13) did not differ significantly from either group.

No significant differences were found between VI and NVI groups across all measures (8 aspects), suggesting that the multimodal interface effectively supported visually impaired users when audio feedback was available.

Figure 6.

Ratings on key dimensions.

Figure 7 highlights the effectiveness of our multimodal interface for visually impaired users in autonomous ridesharing. With audio feedback, the VI group achieved performance levels comparable to the NVI group across all eight aspects, demonstrating that the interface successfully bridges accessibility gaps. However, the VIX group showed notable declines, particularly in navigation and rideshare identification, underscoring audio feedback’s critical role.

Figure 7.

Evaluation across eight dimensions.

Subjective User Feedback

The open-ended interview revealed several key findings:

94 % participants in the NVI and VI groups (15 out of 16) reported positive experiences, highlighting “clear communication,” “specific directions,” and the effective integration of audio-visual feedback.

Seventy-five percentage participants in the VIX group (6 out of 8) expressed concerns about the interface limitations, primarily focusing on difficulty receiving timely information without audio feedback.

Conclusion

Audio feedback played a critical role in enhancing the experience of visually impaired users. Participants who received multimodal feedback demonstrated higher confidence in key tasks, including correct rideshare, route navigation, and overall satisfaction. In contrast, those without audio cues exhibited decreased confidence and increased frustration, particularly during complex interaction tasks such as location positioning and handling unexpected incidents.

The findings further highlight the effectiveness of multimodal coordinated feedback based on human factors design principles in developing accessible interfaces. By reducing cognitive load and incorporating redundant audio-visual information, the multimodal system successfully addressed the specific needs of visually impaired users, enabling a seamless and inclusive ride-hailing experience. However, the controlled experimental environment differs from real-world AV experiences, and using simulated visual impairment with normal-sighted participants limits generalizability to diverse VI populations. Future research should test with actual VI users in real road conditions. This pilot study was designed as preliminary research to establish proof-of-concept for the multimodal interface design before conducting larger-scale studies with diverse visual impairment populations.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Gerui Xu

References

Angeleska

Aleksovska

Avramov

Sidorenko

Rizov

Jankovic

(2022). Design and evaluation of an inclusive autonomous vehicle user interface developed for persons with visual acuity Loss. Proceedings of the Design Society, 2, 2035–2044. https://doi.org/10.1017/pds.2022.206

Dourado

A. M. B.

Pedrino

E. C.

(2023). Towards interactive customization of multimodal embedded navigation systems for visually impaired people. International Journal of Human-Computer Studies, 176. https://doi.org/10.1016/j.ijhcs.2023.103046

Fink

P. D. S.

Doore

S. A.

Lin

Maring

Zhao

Nygaard

Beals

Corey

R. R.

Perry

R. J.

Freund

Dimitrov

Giudice

N. A.

(2023). The Autonomous Vehicle Assistant (AVA): Emerging technology design supporting blind and visually impaired travelers in autonomous transportation. International Journal of Human-Computer Studies, 179, 103125. https://doi.org/10.1016/j.ijhcs.2023.103125

Fink

P. D. S.

Holz

J. A.

Giudice

N. A.

(2021). Fully autonomous vehicles for people with visual impairment: Policy, accessibility, and future directions. ACM Trans. ACM Transactions on Accessible Computing, 14(3), Article 15. https://doi.org/10.1145/3471934

Flaxman

A. D.

Wittenborn

J. S.

Robalik

Gulia

Gerzoff

R. B.

Lundeen

E. A.

Saaddine

Rein

D. B.

, Vision, & Eye Health Surveillance System Study Group. (2021). Prevalence of visual acuity loss or blindness in the US: A Bayesian Meta-analysis. JAMA Ophthalmol, 139(7), 717–723. https://doi.org/10.1001/jamaophthalmol.2021.0527

Kaufman

Costa

Kimani

(2024). Effects of multimodal explanations for autonomous driving on driving performance, cognitive load, expertise, confidence, and trust. Scientific Reports, 14(1), 13061. https://doi.org/10.1038/s41598-024-62052-9