Abstract
Background
Subjective visual vertical (SVV) test is a key functional assessment tool that provides insights into vestibular imbalance. Mobile virtual reality SVV measurement system (MVR-SVV) has the potential to facilitate simple, low-cost, and reliable measurements.
Objective
This study aimed to verify the reliability and validity of MVR-SVV by comparing its data with the previously established bucket test (bucket-SVV).
Methods
Thirty-eight healthy adults completed both bucket-SVV and MVR-SVV tests. The reliability and validity of MVR-SVV were examined using intraclass correlation coefficients (ICCs), Pearson’s correlation, Bland-Altman plots (BAP), and minimum detectable change (MDC).
Results
BAP results indicated that the limits of agreement for the SVV angles were 1.61 to −1.24°. No fixed errors were identified (p = 0.13), although a small proportional error was detected (y = −0.59x + 0.157, p < 0.001). Pearson’s correlation coefficient between bucket-SVV and MVR-SVV was 0.716 (p < 0.001). Within-day reliability was poor for bucket-SVV, with ICC = 0.33–0.38, but moderate for MVR-SVV, with ICC = 0.70–0.71. Between-day reliability was poor for both methods, with ICC = 0.38 for MVR-SVV and ICC = 0.28 for bucket-SVV. MDC was 1.78° for bucket-SVV and 2.67° for MVR-SVV.
Conclusions
Our findings suggest that MVR-SVV can be used for assessing SVV. Its portability, availability, and reliability make it a valuable tool for clinicians in clinical settings.
Keywords
Introduction
The subjective visual vertical (SVV) test is a key functional assessment tool that provides insights into vestibular imbalance by quantifying a patient’s perception of verticality. 1 SVV tilt has been identified in various diseases and can indicate signs of abnormal tilt, leading to dizziness and impaired postural control.2,3,4 The standard methods for evaluating SVV include the hemispheric dome method 5 and the light bar-in-the-dark method, 6 both of which rely on specialized equipment. Although these standard methods yield valuable diagnostic information, they are often expensive, difficult to set up, and cumbersome to transport. Consequently, this form of SVV measurement is often not feasible in the clinical setting. Zwergal et al. introduced the bucket test to measure SVV. 5 The bucket test can be used as a screening tool for utricular lesions due to its simplicity and reliability. Some studies developed a “virtual SVV system” employing virtual reality (VR) technology to obtain more compact and reliable measurements.7,8,9,10 Although these virtual SVV systems are effective, they cannot completely address the issues of simplicity and high costs because they require specialized VR equipment and computers. Therefore, SVV measurements using smartphones have been reported to be less expensive and more convenient. 11 However, this method has room for improvement, as the smartphone screen frame can serve as a reference for vertical positioning. To overcome these limitations, we previously proposed a simple and affordable measurement method that combines commercially available VR goggles and a smartphone.12,13 This method is characterized by its simplicity, affordability, and accessibility, as it eliminates the need for specialized equipment and enables assessments in various settings. However, despite its demonstrated potential in evaluating older adults and patients with vestibular disorders, its validity relative to conventional SVV tests has yet to be rigorously investigated. Additionally, healthcare providers who use the conventional bucket test as an outpatient screening tool are seeking information on whether MVR-SVV measurements using technology provide comparable results. This study aimed to evaluate the reliability and validity of the mobile virtual reality SVV measurement system (MVR-SVV), which combines VR goggles and smartphones, by comparing its data with those from the previously established bucket test (bucket-SVV). Future directions for the use of MVR-SVV will also be discussed.
Materials and methods
Participants
Thirty-eight young individuals (age = 20.6 ± 0.6 years), free from injury, participated in the study (sex = 21 male, 17 female). None of the participants reported any cognitive conditions, history of neurological disease, or use of medications that interfered with their activities of daily activities. All study procedures adhered to the principles of the Declaration of Helsinki and its subsequent amendments and were approved by the ethics committee of Nagano University of Health and Medicine, Japan (approval number: 2021-1). Written informed consent was obtained from all participants.
Procedures
All participant measurements were completed for both the bucket-SVV and MVR-SVV, with each test consisting of 10 trials. This number was based on previous research showing that averaging approximately 10 repetitions in the bucket method yields a stable estimate of subjective visual vertical. 5 While that study focused on the bucket method, we applied the same number of trials to the MVR-SVV to ensure reliability and enable direct comparison. Each test lasted approximately 5 min, followed by a 10–20 minute break between assessments to reduce potential fatigue effects. The entire testing procedure was completed within 30 min. To evaluate between-day reliability, retests of both the bucket-SVV and MVR-SVV were conducted between 2 and 4 weeks after the initial assessment. The order of all tests was randomized using a computer.
The bucket-SVV setup is depicted in Figure 1. The device was assembled using commercially available opaque buckets.
5
A line of diameter was affixed at the bottom of the bucket using phosphorescent tape (Figure 2(a)). The vertical line of the inner diameter coincides with the 0° point of the protractor affixed to the bottom of the bucket. A weighted string, attached to the 0° mark on the outer bottom of the bucket, swung freely and intersected the protractor, quantifying bucket deviation from the vertical direction in degrees. Participants sat upright with their heads secured using a chin rest to maintain a neutral head position. To ensure reliable SVV measurements, head tilt was minimized throughout the test. They looked into a non-translucent plastic bucket, ensuring their entire field of vision was within the bucket. The bucket was rotated by the researcher at various angles. Participants were then verbally instructed by the researcher to adjust the bucket until they perceived the internal line as straight vertically. Following this adjustment, SVV measurements, expressed in degrees, were recorded from outside the bucket based on the position of the string on the protractor. Configuration for the bucket test of subjective visual vertical. View of the subjective visual vertical test. (a) Diametric line in glow-in-the-dark tape on the inside bottom of the bucket. (b) Measurement screen of mobile virtual reality subjective visual vertical measurement system.

The MVR-SVV comprised a commercially available iPhone, smartphone application “SVV” (Developer: T.K., H.F.), VR goggles for smartphones, and a Bluetooth remote control. After operating the smartphone and entering the SVV app measurement screen, it was placed in the VR goggles. Similar to the bucket-SVV setup, participants sat upright, and their heads were secured by a chin rest (Figure 3). When the participant wore the VR goggles, a luminous bar was displayed, as shown in Figure 2(b), and the participant could rotate and vertically adjust the issuing bar using the left and right buttons on the remote control. The center button on the remote control allowed the participant to proceed to the next trial and repeat the same process to measure 10 SVV values. In this study, the iPhone screen was mirrored by Wi-Fi and monitored on a sub monitor to ensure that the participants operated the device properly. If the goggles or the smartphone built into the goggles are tilted, there may be a difference between the visually perceived vertical position and the measured value. The SVV application recognized device tilt from the built-in sensor of the smartphone and corrected the SVV value by subtracting the deviation. The results displayed the raw data of the angle, tilt angle of the sensor, and the corrected SVV value. In this study, the corrected SVV values were adopted after setting and measuring the head and smartphone as vertically as possible. Configuration for the mobile virtual reality subjective visual vertical measurement system.
Statistical analysis
The initial step involved assessing agreement between the two devices by generating a Bland–Altman plot for the SVV angle of each testing protocol. Differences in SVV measurements between the two methods were plotted against the mean results. A one-sample t-test was employed to compare differences between the two methods. Linear regression was utilized to test the significance of potential proportional bias between the mean difference in SVV angles for bucket-SVV and MVR-SVV and the mean of these two measurements. 14 Pearson’s correlation was employed to test the strength of agreement for validity between bucket-SVV and MVR-SVV. The intraclass correlation coefficient (ICC) was calculated to determine within-day reliability (ICC across 10 trials on the same day) and between-day reliability (ICC between mean values of Day 1 and Day 2). A model with two-way random effects and single-measure intraclass correlation coefficients (ICC6,14) was employed to evaluate reliability. Point estimates of the ICCs were interpreted as follows: excellent (>0.9), good (0.75–0.9), moderate (0.5–0.75), and poor (<0.5). 15 The standard error of measurement (SEM) and minimum detectable change (MDC) were calculated alongside the ICC values to assess measurement reliability and detect meaningful changes in SVV values. Specifically, SEM agreement was computed using the between-day variance obtained from ICC estimation, incorporating both systematic errors and residual variance to reflect measurement reliability (SEM agreement = √σ between measurement + σ residual).16,17 Variance components were obtained using a multilevel mixed-effects model (restricted maximum likelihood estimates).16,17 This calculation follows the methodology described by de Vet et al. 16 The MDC, indicative of the minimum change in individuals detectable beyond measurement error with a confidence of 95%, was determined as 1.96√2SEM. 18 Statistical significance was set at p < 0.05. All statistical analyses were performed using SPSS software (version 25.0; IBM SPSS Inc., Armonk, NY, USA).
Results
Reliability and concurrent validity analysis of subjective visual vertical angle in bucket test and mobile virtual reality SVV measurement system.
SVV: subjective visual vertical; Bucket: bucket test; MVR: mobile virtual reality; SD: standard deviation; Mean Diff: mean difference; CI: confidence interval; ICC: intraclass correlation coefficient; SEM: standard error of the measurement; MDC: minimum detectable change.

Bland–Altman plots for the subjective visual vertical tests.
Discussion
This study compared two clinically available methods for assessing the SVV: bucket-SVV and MVR-SVV. The Bland-Altman analysis revealed proportional errors between the two methods but no fixed errors, with a correlation coefficient of 0.716. MVR-SVV showed higher within-day reliability compared with bucket-SVV, although between-day reliability was low for both methods. However, all the reliability results were higher for MVR-SVV than for bucket-SVV. Importantly, the MVR-SVV demonstrated concurrent validity with the bucket-SVV, suggesting its potential as a portable, cost-effective, and valid tool for assessing SVV.
One of the key strengths of this study is its focus on developing a cost-effective and widely accessible assessment tool. Previous studies have utilized VR systems for SVV measurement, highlighting their potential for accurate assessments of vestibular function.7–10 However, these systems often require costly VR equipment and computers, limiting their practical application in clinical settings. In contrast, the MVR-SVV represents a significant advancement in terms of affordability and portability. Smartphones are ubiquitous, and VR goggles are increasingly affordable, making this system highly practical for routine clinical use. By leveraging commonly available technology, this approach democratizes access to vestibular assessment tools, enabling their use in a broader range of clinical and community settings. Although studies on smartphone and VR goggle combinations for SVV measurement exist, including two by our team, none have rigorously validated this method against traditional SVV tests.12,13,19 This study addresses this gap, establishing the criterion-related validity of MVR-SVV against the bucket-SVV.
The MVR-SVV demonstrates criterion-related validity with bucket-SVV, establishing its value as a tool for assessing SVV. Bucket-SVV, a clinically useful method, has been shown to be comparable to the hemispheric dome method, the standard for SVV assessment. 5 Our findings revealed a significant correlation between MVR-SVV and bucket-SVV, with no fixed errors in the Bland-Altman plot. Proportional errors were identified, indicating that MVR-SVV values tend to overestimate as they diverge from zero. This overestimation may correspond to an underestimation by bucket-SVV due to cognitive bias, as assessors may unconsciously align results closer to zero when evaluating healthy individuals. Despite these proportional errors, they remained within the 95% limits of agreement, supporting the equivalence of MVR-SVV and bucket-SVV.
In addition to its simplicity and affordability, the MVR-SVV benefits from its ability to calculate tilt angles using the iPhone accelerometer. This feature enhances its applicability for assessing head tilt SVV 20 and for monitoring patients with postural control abnormalities, making it particularly valuable in clinical populations with vestibular dysfunction or stroke. By addressing these additional clinical needs, the MVR-SVV extends beyond the capabilities of traditional SVV assessment tools.
MVR-SVV showed higher within-day reliability than bucket-SVV, but both exhibited low between-day reliability. These findings are consistent with those of previous studies. Wengier et al. found that the reliability of SVV between five trials was higher for smartphone tests than for bucket tests. 11 In our study, among 10 trials, MVR-SVV exhibited an ICC of >0.7, whereas bucket-SVV showed an ICC of <0.4. The results of the within-day reliability study support the findings of previous studies. Regarding between-day reliability, Michelson et al. reported that the bucket-SVV had an ICC of 0.15. 8 In our study, bucket-SVV had an ICC of 0.28, and MVR-SVV had an ICC of 0.38. The previous study was retested on the same day with a 30-minute time gap, 8 whereas our study was retested over a period of 2 weeks. The SVV in healthy individuals may fluctuate in both positive and negative directions around 0°, even over short periods, indicating sensitivity to the effect of time. However, the higher reliability of MVR-SVV compared with bucket-SVV, despite the influence of the time period, may suggest the superior accuracy of MVR-SVV. Additionally, this study included only healthy participants, which may have contributed to lower ICC values. As ICC is influenced by variance among individuals, the low variability in SVV measurements within this group may have resulted in artificially lower reliability estimates, despite consistent measurements. Previous studies have reported greater variability in SVV measurements among patients with vestibular disorders or stroke,21,22 which may improve reliability estimates. Considering this, the between-day reliability values of 0.28 for bucket-SVV and 0.38 for MVR-SVV should not be interpreted as inherently poor but rather as a reflection of the limited variability among healthy participants.
The MDC results also revealed that changes of ≤1.78° for bucket-SVV and ≤2.67° for MVR-SVV fall within measurement error, making small SVV variations in healthy individuals statistically insignificant. However, both methods are effective for identifying larger SVV deviations in patients with vestibular disorders or stroke. Gómez et al. reported SVVs of 8.4° at initial testing, decreasing to 3.2° after 2 weeks and 1.4° after 4 weeks in patients with acute vestibular disorders. 22 Similarly, studies on patients with stroke or dizziness reported significantly larger SVV deviations, far exceeding MDC thresholds.4,21,23 These findings highlight the clinical utility of both methods for populations with abnormal SVV.
This study has several limitations. First, the use of the bucket test as a reference, although validated and clinically utilized, is not considered the gold standard. Secondly, individual differences exist in terms of determining the visual vertical, with some participants meticulously adjusting their verticals before deciding, whereas others make quick decisions. Despite this variability, MVR-SVV demonstrated higher reliability, emphasizing its potential as a robust and accessible tool for SVV assessment.
Conclusion
The MVR-SVV demonstrated higher within-day and between-day reliability compared with bucket-SVV. Unlike bucket-SVV, MVR-SVV does not require manufacturing and can be measured using readily available VR goggles, remote controllers, and smartphones. It offers cost-effectiveness and space efficiency compared with the gold standard dome test. Further research should investigate its diagnostic and prognostic accuracy across various populations, including older adults, vestibularly impaired, and patients with stroke. As smartphone-based medical applications become more prevalent, particularly in telemedicine initiatives, 24 the adoption of this system among patient populations could contribute to the advancement of evidence-based medicine and telemedicine.
Footnotes
Acknowledgements
The authors would like to thank the study participants for their time and dedication.
ORCID iDs
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a Grant-in Aid for Early-Career Scientists from JSPS KAKENHI in Japan [grant numbers JP22K17641].
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
