Abstract
In recent years, an increasing number of Patient Reported Experience Measures (PREMs) have been enabled to explore and guide improvements. The aim is to establish a set of questionnaires for assessing hospitals care, share results, and perform a transparent formal cognitive and psychometric validation process for some of these. A set of 23 questionnaires was developed through literature review and discussion with professionals from one Spanish tertiary hospital, and supplemented with Net Promoter Scores (NPS). The hospital piloted the questionnaires, receiving 400,719 responses (November-2022-November-2025). These were described for reference benchmarking. Questionnaire dashboards were enabled to all hospital professionals Intranet. Five of the PREMs underwent content validation:focus groups/interviews in the aforementioned hospital(July-2024-September-2025) and psychometric validation(June–2025-January-2026) involving 11 hospitals from 4 Spanish regions, including an item-bank proposal and scoring. A cultural adaptation process was performed into Catalan and English, involving additional hospitals. The hospital received 400,719 responses from 23 questionnaires (24% overall response-rate). The main average NPS values (range across services) were: adult patients hospitalization/adults referring to children’s hospitalization (77ad/70adch;55–84), emergencies without hospitalization (71/52), external face-to-face consultations (83ad/75ch;38–83). Through five questionnaires tested with focus groups, items were nunaced, deleted or added. The questionnaires proved psychometrically robust. The process per questionnaire involved between 106-1,353 patients per validation wave. The cultural adaptation process was formally completed with minor changes. Fully-validated questionnaires are generated and detailed with data from several patient profiles and without specific units biases enabling wide patient experience analyses. The robust validated questionnaires can inspire other hospitals future value-based healthcare.
Introduction
Electronic Patient Reported Experience Measures are of increasing interest in the participation strategies for hospitals quality improvement. They complement sources like claims, suggestions or complaints documents. Many questionnaires have been developed to generically cover the corresponding assessment in different settings within the hospital care and recently many are appearing to approach the specific aspects from different “care problems” (also named, in value based healthcare, integrated practice units 1 ). Furthermore, measurable involvement projects play an increasing role, with the need for full person-centered tools like ePREMs to work with.
The most widely used questionnaires in hospital settings such as Picker, 2 HCAHPS 3 or coordinated care P3CEQ, 4 as well as many relevant specific and local ones,5,6 play a wide role in achieving to understand patient needs and cover the aforementioned challenges. They’re increasingly used even if, for example, only a small number of them are proposed among ICHOM standard sets. 7 Accompanying shorter versions of these with final Net Promoter Scores (NPS) 8 also enables to work with complementary easy to understand overall scores.
Nevertheless, many of these existing questionnaires have problems to overcome. Some are only available for hospitalization assessment leaving an absence of a robust set covering the wide range of hospital fields with a common philosophy. Many do not have a formal published transparent detailed cognitive or psychometric validation process available or where designed as satisfaction and not as experience questionnaires. Some could be considered too large (>15 items) and some miss some dimensions that might be relevant to the patient like pain control or spirituality need.
PREMs should be used in the context of hospital citizen involvement strategies such as ICE-VH, 9 and dashboards should be freely available to facilitate the daily practiceand integrated decision-making . Achieving this scale-up requires a formally multicentric, randomized, co-created set of questionnaires sharing a common philosophy and available in multiple language – a gap that remains unaddressed in countries like Spain.
The main aim of this study was to share the learnings from developing a set of questionnaires and the formal validation of a subset for their use in value-based-health-care driven tertiary hospitals and show the formal design in clinical and citizen participation environments.
Methods
Development Phase. The First Set of Questionnaires
Set of Questionnaires Answers Distribution in One Tertiary Hospital; Net Promoter Score Values (November 2022- November 2025)
AD: Adult, CH: Children, ICU: Intensive Care Unit; REA: Reanimation Unit, EM: Emergencies, H: Hospitalization.
*with N>50 patients.
**includes hospital pharmacy attended patients.
Patient fatigue burden was dealt compacting a reanimation unit, emergencies before hospitalization, and intensive care unit multi-response conditional questionnaires proposing to be sent together to assess the hospitalization process.
A limitation of the current work is that these extensions will still need to be validated. Also each PREM was programmed limited to be sent a maximum of once every 3 months to each patient impacting response rates. Accessibility and trust to care are a relevant limitation in some of the questionnaires considering the Catalan Framework for the patient experience dimensions. 10 That said, in regards to accessibility to care (general waiting list, not the waiting in the hospital hallway), this was claimed in one of the paediatric interviews, but authors think that it is an outstanding topic that should have its own questionnaire. In addition, patients did not request trust questions and considered the number of items fair, in some way delegating the trust in covering adequately the rest of the experience dimensions. Thus, an option could be working on optional additional items on accessibility in the PREM bank, not fatiguing the patient with questions if they can’t be solved or if they are out of scope.
Questions were deemed unsuitable for neonatology and obstetrics purposes and so specific questionnaires were built. Extractions (blood…) and radio-diagnostics settings questionnaires used were established based on already existing historical paper versions. Transition to primary care questionnaires 11 and end-of-life questionnaires are in development and were not included in the current manuscript. The questionnaire result dashboards were made available to all the professionals on the hospital’s Intranet. We called this process the PREMs democratization to raise more awareness and data-based decisions and actions.
Five of the aforementioned questionnaires entered into a process of formal validation from July 2024 to December 2025 (highlighted at Table 1).
Cognitive Testing Phase. Focus Groups and Interviews
Five patient focus groups were performed in cooperation with the Patient Experience Institute. The invitation process included patients being diversity inclusive (non-native, medical/surgery diseases, sex, age). This qualitative design was proposed as a must to the quantitative psychometric approach. Interviews were not the first choice because of the burden of work and because we trust that the topic was not sensitive enough allowing the value-added of hearing and getting inspired by others. Patients agreed with the items ordered by care pathway even if this meant to combine Likert and dichotomic outcomes. The whole process re-inforced the objective of developing an experience and not a satisfaction Spanish validated questionnaire wording, though, “assess”, “sufficient”, “correct”, “punctuate”, “satisfactory” or “adequate” were words that had to be avoided. The sentence to introduce the questionnaires was related with reporting regarding the experience, trying to even avoid the word “correct”, given that reporting the experience was the only introduction perspective minimizing methodological critiques.
The focus groups on-line were developed from July-2024 to September-2025. The patients included had been treated in each of the settings in the hospital in the last 2 years. Patients were selected by asking for support from the citizen attention unit to the heads of service and direct key known motivated professionals from the hospital. Patients were presented with each of the questions and were requested to assess qualitatively the clarity, pertinence, relevance and wording suggestions per item as per part of an experience questionnaire. After the items were presented, patients were asked if any important topic was not included and if any overall suggestions were to be included. A Parents focus group (for the pediatric scope) was too short in participation, and so was an e-Delphi to try to gather the cognitive debriefing, so additional 8 interviews in August-2025 were performed to achieve enough minimal robust patient feedback.
The Validation Pyschometrics Phase
The following sections detail the work performed: - Setting/sites involved in validation waves: the five questionnaires were administered in two consecutive waves separated by 2 weeks in 11 Spanish hospitals summoned via SMS or mails (centre-dependant) between June-July 2025 (adults) and October-2025-January-2026 (paediatrics); - Sampling/recruitment and sample sizes: each participating centre was instructed to administer questionnaires on a randomly selected day, stratified by care type (hospitalization, emergencies…) encouraging to reach enough patients for a distribution analysis. If additional instructions were required, maximum efforts for a thirty patients minimum per type, and adding days to accomplish it was given as a theoretical value. Centres exceeding 2,000 selected patients had their sample capped at 350, through random selection, to preserve comparability across 11 hospitals of different sizes and regions. - Analysis performed: (1) Exploratory/Confirmatory Factor analyses were performed (EFA-CFA) validating the model questionnaire: This was carried out to re-inforce the coherence of an underlying questionnaire structure and group the items that measured the experience construct, (2) reliability (internal consistency and test-retest) and validity (internal structure, discriminant validity and convergent validity) considering McDonald values because the data include categorical data, (3) Scoring establishment. - Missing-data handling: For the EFA, only patients with complete data were included. Given the high rate of 'not applicable’ responses to spirituality and telemedicine items, these were handled separately and proposed as part of the additional item bank rather than the main psychometric model. - Additional analytic decisions: Non-included “out-of-structure” items are proposed as an additional item bank. The main rationale is that factor analysis should not cause a loss of replicable items on topics of patient interest. Moreover, additional items could be needed for very specific purpose intervention assessments. For example, diversity or trust items can be added. An additional supplementary appendix 9 was added to clarify the minimals required in the framework by April 2026.
The Validation Cultural Adaptation Phase
The five questionnaires in Spanish were culturally adapted as final experience measures into Catalan and English following ISPOR guidelines steps 12 (October-2025 -March-2026). Bilingual healthcare sector professionals (who lived and worked several years in countries where both languages per comparison are official) translated and reached a consensus on discrepancies. The process was repeated with a back-translation and re-consensus. Finally cognitive debriefing interviews (template in the Supplementary Index) with native patients with the cooperation of two totally additional Spanish hospitals as well as from a centre in the United States and from Ireland.
Results
The Development Phase
The initial search identified several Spanish-language adult PREMs covering various hospital settings. Some examples were: emergencies, 13 day-hospital, 14 hospitalization, 15 major ambulatory surgery, 16 face-to-face visits, 17 telematics visits, 18 home hospital 19 or questionnaire to assess children experience. 20 Almost all validations were made in some specific units, unicentric, based on only some profiles of patients, or missed types of validation analyses.
An initial set of 23 questionnaires were available in the hospital before the 5 questionnaires validation process. The average NPS was 64.4 (8.5% of detractors) (Range by setting: 22-99) enabling a full Table 1 The average NPS shown slightly lower consistent values in women as well as higher across adults referring children care. The response rate was 24%
Cognitive Testing Phase
Main Questionnaire Improvement Changes From the Focus Groups and Interviews on the Patient Reported Experience Questionnaires (July 2024 -September 2025)
AD: Adults.
*Both invitations and participation were subject to the hospital’s professionals and patients availability and cooperation . Patients decided for each item if they wanted a dichotomic or a 5-level answer to explain their answer.
The Psychometric Validation Phase
Final Patients Completing PREM in the Eleven Centres Psychometric Validation
Acronyms. CEX: Face-to-Face External consultations; MAS: Major Ambulatory Surgery; C1 to C11 are from 4 regions in Spain: Andalucia, Asturias, Catalonia and Madrid.
np: Did not Participate.
*There was confusion in the coding and call for participation between 2 centers with similar names.and more than 2,000 patients were involved so a randomization was involved to avoid an overweight This table presents the participation once missings were managed (patients included in the factorial analyses).
Psychometric Results of the 5 Patient Reported Experience Mesures Validation With Patients in the Eleven Participating Centres (1/2)
CEX: Face-to-Face External consultations; F1: Factor 1, F2: Factor 2; F3: Factor 3; FG: Focus Group; MAS: Major Ambulatory Surgery; NPS: Net Promoter Score.
*Number of items as per significance given Geomin Rotated Loadings **only performed at one of the hospitals.***logically, no dimension was priviliged in the analysis.
**The correlation was significant and the absolute value was moderate, in a context where both questionnaires are conceptually different (Picker was designed as a continuous psychometric scale).
Psychometric results of the 5 Patient Reported Experience Mesures validation with patients in the eleven participating centres (2/2)
The Validation Cultural Adaptations Phase
The main changes both to Catalan and English were word changes (not sentences) due to linguistic nuances. We could group them in conceptual and semantic equivalence to the clinical context (“body handling” vs “patient handling and transfers”), syntactic and stylistic refinement (“dispuso” vs “pudo disponer”), lexical standardization (“schedule” vs timetable”) or grammatical precision (“Alta Alta” vs “En el momento del alta”). This was similar in English where formality instead of jargon, and ease of comprehension provoked debates of one word against another in the consensus phases between bilingual care professionals as detailed in the Supplementary Index table 6. The feedback from the patients was more scarce even finding final minor ortographic errors, typographical inconsistencies, literal errors or similar but always little inconsistencies. Even if small, all the experience -not satisfaction items had minimal changes respecting the learnings from all the steps of the overall validation process.
In summary, five questionnaires were formally validated and are ready for implementation: F2FCEXAD, HOSPAD, CMAD, EmwoAD, F2FCEXPED. Eighteen additional questionnaires sharing the same wording philoshophy are available upon request. Any language use, and adaptations into new languages can but should be done requesting the process and with free centralised copyright given by the authors.
Discussion
The overall NPS of the first 400,079 hospital answers from the total set of questionnaires initially developed was 64.4 which can serve as a basis when aggregating all type of hospital services and even to establish yearly goal management objectives. Nevertheless, the range of values among settings (types of care) is high, proving that, assessing a whole center with a unique value, is quite simplistic. Values ranged from above 80s in children care, or 70s in hospitalization care, to 60s in emergency settings and 20s in virtual visits, which remains an expectable challenge. Given the response rate above 20% - higher than others reported with SMS systems 21 - and number of available answers, the NPS values could serve as a reference for many tertiary public hospitals. The high response rate can be related with the inclusion of “dealing with too many questions” strategies. The high participation on the whole set of questionnaires endorses the number of items, principles and wording policy of these questionnaires. The wording of items are similar in the full set of questionnaires, thus, given the transparent explained cognitive validation process, it re-inforces the plausible validity of the rest of questionnaires that were not validated by the time of this manuscript and that are available upon request in Spanish and Catalan. Finally, as items are presented following the patient pathway, this should facilitate to process to remember and expose experience logics.
The transparent process of the focus groups is one of the strengths of the cognitive testing part of the validation process. Qualitative approaches to listen directly to patients needs aren’t so frequent in PREMs questionnaire validation processes. The whole set of items after this exercise cover coherently most of the main PREMs dimensions established by different frameworks 22 (treat, information, accessibility, environment…). The items include variability of cross-cultural items from deeper information need analysis to shared-decision-making. Formally, PREMs used in hospitals should have a transparent validation of such a process.
The transparent and psychometric multi-sized-region-centric experience-oriented validation is quite another study strength. Even if factor analysis is questioned in the literature as a tool for validating PREMs,23,24 authors trust that it has a value not to be deleted as an additional analysis for covering the building including content consistency. The addition of a proposal for a validated additional item bank also both compensates the dangers of a lost of items and ensures, keeping scores calculation available, that any assessment need can be added in terms of benchmarking making easy to empower assessable improvement actions.
As mentioned, some settings had already separated options of questionnaires available in Spain. To tour knowledge most existing tools lack validation for online use or high participation rates. Furthermore, few have undergone transparent validation with randomized patients across diverse units, including items on facts that can’t be changed by the hospital or understood by patients, or not proven adaptive to patient fatigue, amongst others. Furthemore, as the main limitation the specific test-retest in our validation was not totally paired (same invited but could not be exactly same respondents).
As explained, discussions occurred during the psychometric and cognitive validation phase to avoid proposing satisfaction instead of experience questions. We decided to leave numerical 1 to 5 with “-” and “+” on non-dichotomic items, instead of providing category labels reducing ceiling effects. This could be seen as following satisfaction-style questionnaires; after working wordings and focus groups, it was seen as the easier comprehension. Neutral intermediate answer was not over-reported and it makes dashboarding-colors easier. Exceptionally, a satisfaction-oriented question was left, for example, because of its’ power for experience decision making: the item asking if you would have liked a virtual visit instead of a face-to-face. Finally, the use of a Likert format and an overall introducing statement inviting to report regarding the “patient experience” was proposed after big debates considering that is is not fully reprehensible as explictely asking for an “agreement” but also because patients participation phases re-approved they felt comfortable with not using specific words (of course each item results can’t be analyzed by quantitative averages but as frequencies analyses).
The current set of generic PREMs can be used in usual practice with frameworks such like MATRICS 25 and as mant specific disease PREMs items overlap with that of generics, centralized strategies such as having dashboards with a common philosophy set of questionnaires enabling selection by ICD-10 diagnoses, seem initially more profitable. Finally, having complete response PREM Likert matrixes will reduce missing answers against questionnaires that split answers depending on the patient consecutive answers or Computer-Assisted Questionnaires.
The set of valid PREMs strategy covers and is aligned with that of value-based healthcare and participation principles given. 26 The main future lines are validating other questionnaires from the basal set, expanding the cultural adaptations, or setting additional questionnaires like an <<end-of-life PREM>> or <<PREMs answered directly by adolescents/children>>.
Conclusion
Hospital care quality must be improved every day as well as the tools to assess this improvement including the patient experience perspective. The results demonstrate the feasibility to develop and share a set of several languages robust and transparent based framework generic validated PREMs across a big spectrum of hospital settings including paediatrics and full-patient involvement, and not limited to a number of services. This shouldinspire integrated Value-Based-Healthcare strategies and taking managerial decision-making based on patients opinions.
Supplemental Material
Supplemental Material - Multicentric Design and Validation of a Set of Democratized Hospital Patient Reported Experience Measures in Spain
Supplemental Material for Multicentric Design and Validation of a Set of Democratized Hospital Patient Reported Experience Measures in Spain by Emmanuel Gimenez, Marta Aguayo, Laura Muñoz, Maria José Rodriguez, Carlos Bezos, Marina Martínez, Montserrat Martínez
Footnotes
Acknowledgements
To all the bilingual people having worked/lived/been attended in the health sector during years/native in both Spain (Catalonia) and English speaking countries including Canada, United States, Ireland and United Kingdom. The authors thank the Institut Català de la Salut as an umbrella institution and all the participating hospitals for the unpaid collaboration. Hospital Universitario Virgen del Rocío de Sevilla (Antonio Cervera), Hospital Universitario Central de Asturias (Bernabe Fernandez), Hospital Universitario Príncipe de Asturias (Madrid) (Marta Macías), Institut Català de la Salut (Margarita Garcia), Hospital Universitari Germans Trias (Irene Jimenez), Hospital Universitari dr.Josep Trueta de Girona (Elisabet Jordà, Pere Rimbau), Hospital Verge de la Cinta de Tortosa (Soledad Lucas), Hospital de Viladecans (Jose Luis Moreno), Hospital Universitari de Bellvitge (Irene Feliu, Silvia Millat), Hospital Universitari Arnau de Vilanova de Lleida (Maria Bonjorn), Hospital Joan XXIII de Tarragona (Neus Camañes), Hospital Universitari Vall d’Hebron Hospital (Melissa Bradbury, Emma Pastó, Anna Oliver-Seguí and Laia Humbert), Benard Health & Science Consulting (Angèle Benard), Marcos Seneca Garcia (Parc Taulí de Sabadell), Institut Gutman (Elena Hernandez i Montserrat Bernabeu); Expertise Valeur en Santé; Coalition priorité cancer au Québec, Canada (Eva Villalba), Trinity St James’s Cancer Institute (Grainne Smith), Juan Diego Gonzalez (West Alton Gloor Medical Clinic, Texas) and Biocat (Nuria Castany). The authors also thank David Romero and Francisco Cidoncha (Information Systems Vall d'Hebron Hospital). Additionally to RateNow for providing the technological platform used to administer the survey waves to patients in the participating hospitals. This system ensured appropriate information to patients and enabled detailed segmentation of responses by hospital, questionnaire and survey wave.
Ethical Considerations
Study presented and approved by the Drug Research Ethics Committee and Medication Projects Committee of Vall d’Hebron University Hospital on November 26, 2021. (PR(AG)584/2023). All methods were carried out in accordance with relevant guidelines and regulations or Declaration of Helsinki.
Author Contributions
The manuscript has been submitted with the consent of the authors who made significant contribution to the concept, design, acquisition, analysis or interpretation of data, as well as provided important intellectual content, approved the final version and agreed to be accountable for the work.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: LM (statistician) works for Databioexp and received a non-conditional funding from RateNow to do the psychometric analysis of the five Patient Reported Experience Measures. RateNow is a company specialised in measuring patient experience through digital surveys delivered to patients’ mobile phones, enabling real-time analysis in a business-intelligence platform powered by artificial intelligence. RateNow technology was used in this study to perform the psychometric validation, sending and building SMS invitations with the questionnaires to patients from all participating hospitals.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Additional Comments
Each care centre questionnaires uses but must be requested as per scientific property, per traceability of use, and possible improvement versions to
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
