Skip to main content

The use of patient-reported outcome measures to improve patient-related outcomes – a systematic review

Abstract

Background

Patient-reported outcome measures (PROMs) provide invaluable information on patients’ health outcomes and can be used to improve patient-related outcomes at the individual, organizational and policy levels. This systematic review aimed to a) identify contemporary applications and synthesize all evidence on the use of PROMs in these contexts and b) to determine characteristics of interventions associated with increased effectiveness.

Methods

Five databases were searched for studies providing quantitative evidence of the impact of PROM interventions. Any study design was permitted. An overall benefit (worsening) in outcome was defined as a statistically significant improvement (deterioration) in either a PROM, patient-reported experience measure or clinical outcome. Study quality was assessed using the Effective Public Healthcare Panacea Project’s Quality Assessment Tool for Quantitative Studies. A narrative synthesis was conducted.

Results

Seventy-six studies of the 11,121 articles identified met the inclusion criteria. At the individual level, 10 (43%) of 23 studies that fed back PROMs to the patient or healthcare provider showed an improvement in outcome. This percentage increased in studies which used PROMs to monitor disease symptoms and linked these to care-pathways: 17 (68%) of 25 studies using this mechanism showed an improvement. Ten (71%) of 14 studies using PROMs to screen for disease found a benefit. The monitoring and screening approach was most effective using PROMs covering cancer-related, depression and gastro-intestinal symptoms. Three studies found that the mere collection of PROMs resulted in improved outcomes. Another three studies used PROMs in decision aids and found improved decision quality.

At the organizational/policy level, none of the 4 studies that used PROMs for benchmarking found a benefit. The three studies that used PROMs for in-depth performance analyses and 1 study in a plan-do-study-act (PDCA) cycle found an improvement in outcome.

Studies employing disease-specific PROMs tended to observe improved outcomes more often. There are concerns regarding the validity of findings, as studies varied from weak to moderate quality.

Conclusions

The use of PROMs at the individual level has matured considerably. Monitoring/screening applications seem promising particularly for diseases for which treatment algorithms rely on the experienced symptom burden by patients. Organizational/policy-level application is in its infancy, and performance evaluation via in-depth analyses and PDCA-cycles may be useful. The findings of this review may aid stakeholders in the development and implementation of PROM-interventions which truly impact patient outcomes.

Background

Patient-reported outcomes measures (PROMs) are considered an invaluable tool to capture information on patients’ health outcomes, including expectations and values. Two types of PROMs exist, namely generic and disease-specific PROMs [1]. Generic PROMs aim to measure a health outcome from an overarching perspective, allowing for comparison between different diseases and a general judgement on the severity. These measures are often multi-dimensional; examples include measures of overall Quality of Life (e.g., EQ-5D) or well-being (e.g., WHO-5) [2, 3]. Disease-specific PROMs aim to measure these concepts, the symptom burden and functional status associated with a disease or a group of diseases [4].

PROMs were introduced to complement clinical outcome measures in studies assessing the (cost-)effectiveness of new clinical interventions. However, their application has broadened, including the role as outcome indicator in clinical practice alongside traditional indicators such as mortality and prevalence/incidence [5]. This movement is adopted by medical science and leading institutions like the Organisation for Economic Co-operation and Development, which conform to the principle that assessing health system performance starts by assessment of patient-related outcomes [6]. It is pragmatic to distinguish three levels of intended use: the individual (micro-), organizational (meso-) and policy (macro-) level [7].

At the micro-level, PROMs are used at the patient-encounter level. Several systematic reviews revealed evidence that using PROMs at the micro-level has a modest beneficial impact on patient-related outcomes [8,9,10,11,12,13,14,15]. The key idea is that a patient fills out a PROM once or multiple times, and the results are fed back to the patient or clinician [15]. Greenhalgh et al. has outlined the underlying theory how PROMs may be useful at this level: the feedback of PROMs may alter the decision-making process, and initiate a change to clinical practice [16]. Several examples exist: firstly, the feedback of PROMs to patient and provider can aid in communicating symptoms which may otherwise remain unnoticed [17, 18]. Another example are novel digital patient-decision systems using PROMs, which develop rapidly parallel to digital technology (e.g., apps, e-portals, and dashboards) [19].

Aggregated PROMs can be used to inform the healthcare system at the organizational (meso-) and health system (macro-) level, respectively. Evidence of the impact of PROMs use at the meso-/macro-level is scarce, and a recent review did not find a clear impact on patient outcomes [8, 20]. The key idea at this level is that aggregated PROMs can guide the (continuous) improvement of healthcare provided by a group of clinicians, hospital or even country [21]. Their role in orthopedic surgery may illustrate their potential. At the meso-level, an orthopedic surgery unit in a hospital may use PROMs to improve local policy on eligibility criteria for surgical treatment, to rationalize pain killing strategies, or to compare performance across surgeons on a monthly basis [22]. At the macro-level, PROMs results according to hospital, region, nation, or otherwise may be presented in a standardized form (both in epidemiological and graphical meaning), inviting for a process of feedback, analysis of drivers, and if possible subsequent improvement [21]. This mechanism is often referred to as benchmarking and is thought to demonstrate performance differences among providers, facilitate more in-depth clinical audits, and inform decision-making, and is a potentially effective method to improve the quality of care [23, 24]. An example which aimed to encourage benchmarking is the NHS-programme in the UK on certain surgical procedures. This program publicly published PROMs for varicose vein, groin hernia, and hip and knee arthroplasty surgery; as of 2017 PROMs are only collected for hip/knee surgery [25]. This program also aimed to incentivize patients to select the assumed best provider, however, available evidence does not support this pathway [21, 26].

We think a contemporary review is warranted because it remains unknown why certain PROMs-interventions are more effective than others [8, 11]. Certain mechanisms underpinning the interventions may contribute to increased effectiveness. For example, a critical step to transform a suboptimal PROM level, i.e. a patient value below a particular threshold, into an improved outcome may be to link this observation to a care pathway. The doctor may receive an alert inviting her/him to check the situation. This approach seems promising in disease areas where symptom monitoring along with treatment tailoring is common practice, e.g., gastroenterology, rheumatology, and oncology [27, 28].

In this systematic review, we aim to identify contemporary evidence of the impact of the use of PROMs at the micro-, meso- and macro-level on patient outcomes. Our second aim is to identify and describe characteristics of the intervention and PROMs used which may contribute to an increased chance for success.

Methods

The present systematic review was registered in PROSPERO under record 2022 CRD42022333400. This review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (2020) when applicable [29].

Data sources and search strategy

The following databases were searched: MEDLINE, Embase, Web of Science Core Collection, Cochrane CENTRAL Register of trials, and Google Scholar from database inception to August 24, 2023 for studies that reported the use of PROMs to improve quality of care. The final search was developed and refined through an iterative process and consisted of 3 blocks, namely: (a) various terms for PROMs, (b) various terms for quality, effectiveness and outcomes, and (c) mechanisms through which PROMs may be used to benefit healthcare (e.g., feedback, monitoring, dashboards and plan-do-check-act (PDCA) cycles) (Supplementary Material 1). A PDCA-cycle is a commonly used framework to guide the continuous improvement of healthcare and services provided [30]. Additional studies were identified by screening the references of included articles.

Study selection

Studies were eligible that (a) provided evidence on the impact of an intervention, (b) using a previously validated PROM, (c) which reported at least one quantitative outcome per the definition described below. Any study design was permitted. Studies were excluded if (a) the full-text could not be retrieved and/or only a conference abstract was available; (b) the study was conducted as a pilot; (c) there was no comparator or pre-intervention comparison; (d) the PROM was used to select patients for another type of intervention; (e) the article was not available in English. Two reviewers (JB and AI) independently screened all titles and abstracts obtained from the search and applied the inclusion criteria to eligible studies. Any disagreements regarding the inclusion of studies was discussed between the two reviewers and were resolved by consensus.

Outcome definition

We defined the potential impact of a PROM-intervention on patient-related outcomes using the Donabedian framework [31]. To evaluate the quality of healthcare or impact of an intervention, contemporary guidelines place emphasis on outcome measures which reflect the impact on the health status of patients [32]. Typically, these outcomes are of quantitative nature and are collected at the patient-level. We discerned three types of outcomes measured based on previous reviews, namely (1) PROMs, (2) patient reported experiences measures (PREMs) and (3) clinical outcomes. Outcome measures were categorized according to the dimensions/items into overarching groups based on the identified studies, e.g., Health-Related Quality of Life (HRQoL), physical functioning, mental functioning, and symptom burden. Similarly, this was done for PREMs (e.g., satisfaction) and clinical outcomes (e.g., readmissions).

A study was judged to have found an overall benefit (or a detriment/harm) if any of the above-mentioned outcomes improved (worsened) up to statistical significance. As patient-related outcomes may be specific to the intended use and medical domain, we did not attribute weight to a specific type of outcome. Studies often contained multiple comparisons through analysis of dimensions or even items separately. This approach inflates testing, increasing the potential of a type I error. Therefore, we required at least 2 subdomain/single-items to reach statistical significance to qualify the impact as a benefit or detriment, unless outcomes were defined as primary outcome a priori.

In accordance with previous reviews, process of care measures (e.g., number of symptoms discussed) were extracted, but were considered to mediate outcomes described above [14].

Data extraction and quality assessment

The following data were extracted from eligible studies by one of the reviewers (JB or AI): authors, country, setting, study design, sample, PROMs used, description of intervention using PROMs, co-interventions, training offered on the intervention and/or interpretation of PROM, all primary and secondary outcome measures and their quantification.

Two reviewers (JB and AH) independently assessed the methodological quality of included studies using the Effective Public Healthcare Panacea Project’s Quality Assessment Tool for Quantitative Studies [33]. The tool was considered the most appropriate for this systematic review as it covers various study designs and public health interventions. Domains assessed using the tool included selection bias, study design, confounders, blinding, data collection methods, and withdrawals and drop-outs. Each domain was rated as 1 (strong), 2 (moderate) or 3 (weak). A global score was calculated, in which strong = no weak ratings, moderate = 1 weak rating, and weak = two or more weak ratings.

Data synthesis

A narrative synthesis was conducted as a formal meta-analysis appeared not possible at an early stage due to the heterogeneity of study designs and outcomes reported. Overall, the synthesis was split up by the micro- and meso-/macro-level. The impact of PROMs interventions was assessed by four possible determinants for increased effectiveness. The applications were categorized into mechanisms applied based on commonalities between PROMs interventions. Subsequently, we captured a broader perspective by determining the impact of PROMs interventions by the medical domain, the type of PROM used in the intervention, and by the separate outcome dimensions used to measure the effect of the intervention. For the latter, we decided to only present those which were measured in at least 3 studies. We discerned studies which used the same PROM outcome as in the intervention from studies which (only) used different outcomes. Finally, for each determinant and outcome dimension, the average quality of studies was calculated.

Results

The PRISMA diagram depicting the selection process is presented in Fig. 1. A total of 18,652 records were identified. After removing duplicates, 11,121 records were screened at title-abstract level, of which 159 were screened at full-text; 57 records were found to be eligible for inclusion [17, 19, 28, 34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88]. Through reference tracking another 21 records were identified [17, 89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108], leading to a total of 78 included studies. Two studies presented outcomes in two separate publications; these were combined resulting in 76 unique studies [17, 74, 75, 87].

Fig. 1
figure 1

PRISMA flowchart of study selection

Study characteristics

An overview of study characteristics, PROMs used, overall study impact and quality is presented in Table 1 (micro-level) and Table 2 (meso-/macro-level). Below we shortly describe the included studies: for a more detailed description of study characteristics refer to Supplementary Material 2, and for extended tables of study characteristics, quality assessment and outcomes extracted refer to Supplementary Material 3.

Table 1 Study characteristics at the micro-level (sorted by medical domain)
Table 2 Study characteristics at the meso-/macro-level (sorted by medical domain)

Micro-level

Sixty-eight out of 76 studies provided evidence on the use of PROMs at the micro-level [17, 19, 28, 34,35,36, 38,39,40,41,42,43,44, 46,47,48, 50,51,52,53,54,55,56, 58, 59, 62,63,64,65,66,67,68,69,70,71, 73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93, 95,96,97,98,99,100,101,102,103,104,105,106,107,108]. Most studies were conducted in the United States (n = 32), and were in the medical domains primary care (n = 17), oncology (n = 19), gastroenterology (n = 5) and orthopedic (trauma) surgery (n = 6). Fifty-five studies used a disease-specific instrument in their intervention, 3 used a generic instrument and 10 a combination. Sixteen studies were of strong quality, 31 were of moderate quality and 21 were of weak quality.

Macro-level

Eight out of 76 studies provided evidence of the use of PROMs at the macro-level [37, 45, 49, 57, 60, 61, 72, 94], and no studies were found at the meso-level. Studies were conducted in various countries. Most studies were conducted in surgical fields (n = 7), of which 3 in both non-surgical and surgical fields; the eighth study was conducted in primary care. Five studies used a disease-specific PROM, 1 used a generic PROM, and 2 used a combination. Four studies were rated as moderate quality, while the other 4 were rated as weak quality.

Impact by determinants and outcome dimensions

Outcome of PROMs interventions by determinants are summarized in Table 3 (micro-level) and Table 4 (meso-/macro-level). Table 5 shows the impact by outcome dimensions. The quality of studies for each determinant generally indicated “moderate” quality, both at the micro- and meso-/macro-level; the exception is highlighted. Six mechanisms were identified at the micro-level, and 3 at the meso-/macro-level.

Table 3 Overall impact by determinants at the micro-level
Table 4 Overall impact by determinants at the meso-/macro-level
Table 5 Impact by outcome dimensions

Impact by mechanism

Micro-level

Feedback of PROMs to patient

One of 5 studies employing feedback of PROMs to patients fed back (raw) scores directly [54], 3 included a graphical display of PROMs scores [55, 78, 85], and 1 combined a narrative report with a graphical display [43]. Studies were conducted in various domains. One (20%) study conducted in head-cancer patients fed back data from a comprehensive inventory of disease-related symptoms and found an improved overall outcome, driven by improved symptoms (pain and activity), mental and physical functioning [54].

Feedback of PROMs to provider

Two of the 18 studies employing feedback of PROMs to providers used (raw) scores in their report [79, 90], 4 included a narrative report [52, 53, 73, 93], 8 included a graphical display [17, 36, 44, 47, 48, 84, 91, 92], and 3 combined a narrative report with a graphical display [34, 41, 89]. Overall, nine (53%) studies found an improvement in outcome [17, 34, 47, 53, 73, 84, 89, 90].

When looking at the information collected, 14 of 18 studies fed back PROMs to patients which covered disease-specific information such as hip functioning, cancer-related, or gastrointestinal symptoms [17, 34, 36, 41, 47, 53, 73, 79, 84, 89,90,91,92,93]. Of these 14 studies, 9 (64%) found an improvement in outcome [17, 34, 47, 53, 73, 79, 84, 89, 90]. Most studies pertained to cancer-related symptoms (n = 8) of which 5 (63%) reported an improvement via various outcome dimensions, including reduced emergency department (ED) visits or readmissions (n = 2), improved physical, mental and social functioning (n = 1), symptoms (depression and cancer-related) (n = 1) or experience with care (n = 1) [17, 47, 79, 84, 89]. The remaining 4 studies fed back PROMs to the provider pertaining to general HRQoL and/or pain, and found no improvement in outcome [44, 48, 52, 55].

Using PROMs to screen for disease or symptoms

Seven studies out of 14 used PROMs to screen for depression [28, 35, 50, 56, 71, 98, 102], and 1 study for oncological symptoms [70], to initiate treatment or a care pathway. Of these, five (63%) studies observed an improved outcome driven by improved symptoms (depression, stress or anxiety) (n = 4), improved mental (n = 2), social (n = 2), and physical functioning (n = 1), and reduced ED visits and readmissions (n = 1) [28, 35, 56, 70, 71]. One study found an outcome deterioration via worsened pain symptoms [50].

Six studies combined the screening for depression with follow-up monitoring to evaluate whether the treatment works, and potentially adjust if treatment was ineffective [38, 59, 74, 83, 88, 105]. Of these, three also incorporated disease-specific information: knee functioning [88], cancer-related [74], and gastro-intestinal symptoms [105]. Five (83%) out of 6 studies found improved outcome particularly via improved symptoms (depression and anxiety) (n = 4) and reduced ED visits (n = 2) [59, 74, 83, 88, 105]. Two of three disease-specific symptoms also improved, except for oncological symptoms [74].

Using PROMs to monitor symptoms

Twelve out of 25 studies used PROMs to identify patients under treatment exceeding predefined thresholds of symptoms and linked these to treatment changes, increased monitoring or care pathways [39, 63, 66, 67, 81, 86, 95, 97, 100, 103, 107, 108]; 10 (83%) found an improved outcome [39, 63, 66, 81, 95, 97, 100, 103, 107, 108]. Seven studies also used PROMs monitor treatment but did not explicitly mention the use of predefined algorithms [40, 42, 69, 82, 99, 101, 104]; 4 (57%) reported an improvement [82, 99, 101, 104]. Six studies incorporated PROMs into the clinical pathway and sent out alerts upon exceeding a threshold without specific guidance to the provider [64, 68, 76, 80, 96, 106], 1 of these also used PROMs to monitor treatment response [106]; three (50%) found an improved outcome [64, 96, 106].

When looking at the information collected, 13 out of 25 studies used PROMs to monitor existing depression symptoms [42, 63, 68, 69, 80, 82, 97, 99,100,101, 106,107,108]. Of these, 10 (77%) found an improved outcome, mostly driven by improved depression symptoms (n = 9) and satisfaction (n = 5) [63, 69, 82, 97, 99,100,101, 106,107,108]. Five studies used PROMs to monitor cancer-related symptoms [64, 67, 76, 103, 104], of which 3 (60%) found various improved outcomes including HRQoL, physical and mental functioning, and satisfaction [64, 103, 104]. Three studies monitored gastro-intestinal symptoms in patients with inflammatory bowel disease and all (100%) found reduced readmissions (n = 2) and improved HRQoL (n = 1) [39, 66, 96]. The remaining 4 studies were conducted in various domains [40, 81, 86, 95], of which two showed improved outcomes. The first monitored surgical recovery in colorectal surgery patients and found improved perception of general health, anxiety and satisfaction. The other used PROMs to guide treatment in children with juvenile idiopathic arthritis and found reduced pain and arthritis activity [81, 95].

No feedback: filling out effect of PROMs

One of 3 studies tested the hypothesis of whether merely filling out alcohol abuse PROMs would reduce alcohol use by a direct measurement effect [51]. Similarly, another study collected PROMs weekly in patients with eczema without any additional interventions [62]. The third study collected PROMs daily after surgery via an app; patients could always contact their provider via the e-portal [46]. All (100%) studies reported improved outcome due to improved symptoms (depression and alcohol dependency) (n = 2) and improved HRQoL (n = 1).

PROMs in decision-aids

In three studies a one-time PROM was used in a decision-aid along an education component to help with treatment choice (surgical vs. conservative) in patients with knee osteoarthritis [19, 58, 77]. All studies (100%) found an improvement in shared-decision making, while 1 of these only found this effect in females [58].

Meso-/macro-level

PROMs in benchmarking

Three benchmarking studies used case-mix adjusted PROM scores [37, 49, 57], while the fourth used unadjusted scores [94]. Three studies presented performance reports to the provider, which included PROM scores and how they compared to peer providers [37, 49, 94]; in 2 studies complication rates were also presented [37, 49]. The other study evaluated a nationwide PROMs collection program, which provided both patients and providers the option to check providers' PROMs outcomes [57]. All studies were of weak quality, and did not find an improvement in outcome; 1 study even reported a potential worsening [49].

PROMs in in-depth analysis of data

Three studies used PROM data in combination with guidelines, teaching and protocols to improve pain management in various surgical and non-surgical departments [45, 60, 72]. One of these studies also used a feedback loop by a department representative to evaluate and provide advice on the implemented initiatives [45]. The two other studies pertained to the same quality initiative aimed to reduce the pain of patients admitted to hospitals but were conducted in different developing countries/departments [60, 72]. All 3 (100%) studies found an improvement in outcome due to reduced pain (n = 3) and nausea (n = 2) symptoms in particular.

PROMs in PDCA-cycles

One study conducted a PDCA-cycle where they introduced an improved total knee implant and changed their surgical technique, guided by and evaluated with PROMs scores [61]: an overall improvement in outcome (HRQoL) was observed.

Impact by medical domain

Micro-level

At the micro-level, the medical domains in which PROM interventions were conducted which seemed to be consistently associated with improved outcome were orthopedic (trauma) surgery (n = 6 studies, 100% effective), gastroenterology (n = 5, 80%), oncology (n = 19, 68%), and primary care (n = 17, 59%). Less effective seemed cardiology (n = 4, 50%) and rheumatology (n = 4, 50%). Limited evidence was available for other domains.

Meso-/macro-level

Interventions conducted in orthopedics, primary care, and urology were not found to be related to improved outcome. Four studies covered various internal and surgical departments, of which 3 (75%) showed improved outcome.

Impact by type of PROM used in intervention

Micro-level

Most studies used a disease-specific PROM, which showed the highest percentage of improved outcomes (n = 55 studies, 71% effective). Generic PROMs or a combination of both showed an overall lower percentage (n = 13, 38%). While disease-specific PROMs were used in all mechanisms, generic PROMs were used in studies employing the “feedback” mechanism (n = 10), “decision-aids” (n = 2), and once (combined with a disease-specific PROM) in “screening”.

Meso-/macro-level

According to the type of PROM (disease-specific vs. generic) no specific pattern was observed.

Impact by outcome dimensions

Micro-level

In this section, we describe the impact of the PROMs-interventions on the outcome dimensions (PROMs, PREMs or clinical outcomes), regardless of the mechanism or other determinants.

Regarding PROMs, studies often showed an improvement in general health perceptions (n = 8 studies, 75% effective), decision-readiness and conflict (n = 4, 75%) and symptoms overall (n = 46, 57%). Particularly depression was evaluated often (n = 25), and improved in 57% of studies. The percentage decreased for HRQoL (n = 29, 38%) and physical and mental functioning domains.

Regarding PREMs, satisfaction was most often studied (n = 23), and improved in less than half of studies (43%). Patient-activation and experience with care tended to improve slightly more often (n = 7, 57%, for both outcomes).

As for clinical outcomes, twelve studies analyzed emergency department visits, of which 58% found an improvement. Fewer studies observed a positive effect on complications (n = 8, 13%) and (re)admissions (n = 17, 29%), and no studies observed an effect on survival (n = 5, 0%).

Studies which used a different outcome than the PROM in the intervention more often had an improved overall outcome (n = 36, 72%), compared to those which did not (n = 32, 56%).

Meso-/macro-level

With regard to PROMs, symptoms showed improved most often, which mostly pertained to pain (n = 5, 60%). HRQoL was also measured in 5 studies, however, improved in less studies (40%). Other domains and outcomes were studied in only a few studies, and showed no improvement.

Discussion

In this systematic review, evidence on the use of PROMs to improve patient-related outcomes at the micro- (68 studies) and meso-/macro- [8] levels was collected and analyzed. Moreover, determinants for increased effectiveness were elucidated.

At the micro-level, 44% of studies employing direct feedback of PROMs to the provider and/or patient resulted in improved patient outcomes, which is in line with previous reviews [8,9,10,11,12,13,14,15]. A contemporary development was to use PROMs to screen for disease or to monitor existing disease. These studies linked the PROMs scores to care pathways or treatment adaptations, and approximately 70% of studies found improved outcomes. This approach was particularly effective for depression, oncological and gastroenterological disease. A novel application was to use PROMs to inform patients considering knee arthroplasty, which generally resulted in improved decision-quality. At the meso-/macro-level, current evidence does not support using PROMs in benchmarking. The scarce evidence available suggests, however, that PROMs might be of value in an in-depth analysis of the performance of departments and hospitals and PDCA-cycles. At both the micro- and meso-/macro-level, studies more often employed disease-specific PROMs, which – in comparison with studies which employed generic PROMs – found improved outcomes more often.

The evidence at all levels was of moderate quality at best, which raises concerns regarding the validity of the findings.

Micro-level

Providing feedback on the PROM scores to patients or providers is generally thought to benefit outcomes via improved patient-healthcare professional communication and identification of problematic symptoms [16]. This application is often used in patients with chronic disease who have multiple visits to their doctor, which in our review included diabetes, gastrointestinal disease, oncology, orthopedics, transplantation care; most evidence was available for oncology [8, 27]. For example, two studies applied a tailored symptom inventory for head-neck cancer patients and found a positive impact on PROMs [47, 54]. The effectiveness may be because this group presumably experiences a number of severe physical symptoms (e.g., problems with swallowing) which, if timely detected, are sensitive to treatment.

The application of PROMs to improve patient outcomes seems particularly effective if a deviation from the acceptable threshold occurs and can be linked to a recognizable action by the clinician, such as referral or treatment adaptation. This mechanism was effective in several studies in the medical domains, including depression, oncology and gastrointestinal care. For example, monitoring patients with diagnosed diseases such as inflammatory bowel disease or screening for disease with an expected high burden in the studied population such as post-partum depression may be beneficial [28, 39]. The purpose and goal of the tool may be clearer for both patient and provider, which could increase its effectiveness.

Various reasons may underlie decreased effectiveness of PROM-interventions. Firstly, a general trend was observed that studies utilizing generic PROMs found less positive effect overall, and these studies mostly did not link a generic PROM to a care pathway (such as “screening” or “monitoring”). Generic PROMs may provide insufficient insight into treatable or modifiable factors related to the studied population. However, it should be noted, one of the identified decision-aids successfully employed only a generic measure in patients considering knee arthroplasty [19]. Combined, we believe this underlines the fact that the choice of PROM in the intervention should be driven by the intended use. Secondly, the measured outcome may play a role: PROM interventions tended to have a more pronounced impact on general health perceptions and symptom burden, but less so on certain outcomes such as HRQoL in general or survival. Other reasons for failure may include patients’ resistance to discussing symptoms, time constraints in clinical practice and lack of provider continuity, and implementation hurdles through lack of knowledge [16].

The evaluation of interventions based on systematic PROM feedback appears to be a challenge. Firstly, the definition of 'control' treatment: about a third of the studies collected PROMs in the control group, unconnected to feedback or another intervention. This may decrease the difference as the collection of PROMs itself may induce beneficial effects as observed in 3 studies [46, 51, 62]. These findings suggest a Hawthorne-like effect through the completion of PROMs alone [51, 109]. The patient’s self-knowledge and awareness are increased, and filling out the questionnaire may increase their empowerment to take a more active role in their healthcare [34]. We expected this effect to be relatively limited, as approximately half of studies used a different outcome measure than the PROM in the intervention and generally found an improvement. Secondly, most studies did not measure intervention compliance making it impossible to know to what extent (and how) patients or providers used the PROM interventions. Thirdly, PROMs are generally part of a more complex intervention with multiple facets (e.g., patient education), and it is impossible to isolate the exact role of the PROM in the intervention. However, we believe this is also one of the key roles of PROMs in contemporary medicine; they can enhance interventions by offering important insight into patient outcomes.

Meso-/macro-level

The 4 studies which evaluated PROM benchmarking did not find a benefit. Multiple reasons for the intervention not being successful have been suggested. Boyce et al. noted that PROMs have not been developed nor validated as performance measures, and the choice of PROM may play a role in the usability of the provided feedback [37]. It is possible that inter-provider comparisons do not inherently motivate professionals to initiate additional audits and research activities or professionals may lack the knowledge to undertake such initiatives. The included studies do not describe how the data was (or wasn’t) used in a feedback process of change. Kumar et al. suggested that further improvement might be prevented when the quality of care is already high [49]. The quality of the benchmarking process is also dependent on adequate case-mix variable selection, which is time-consuming and costly [110, 111]. A lack of educational support could also play a role, and it may be useful to provide examples of successes and failures with using PROMs data [112]. Finally, aggregated PROMs are used extensively in research aimed at improving quality care through, e.g. identifying subgroups at risk for poorer outcomes. These studies presumably have a large impact on national clinical guidelines, however, to our knowledge, the impact is hardly reported in peer-reviewed literature. The same applies to quality benchmarking under the supervision of professional organisations: this information is discussed with hospital groups and individuals but is generally not published.

Some examples, however, were found for in-depth analysis and PDCA-cycles with the aim to initiate quality improvements. A PDCA-cycle provides a structured and iterative approach to test changes aimed at improving the quality of systems [113]. Four studies were found that exploited these types of methods using PROMs data, all finding a benefit on patient outcomes. Zaslansky et al. suggested that the success could be attributable to the relatively low starting performance of partaking departments [60]. A commonality among these studies is the clear definition of the goal, an action plan, and feedback on the intervention along the way; all potential items which might facilitate the success of a quality improvement initiative, also highlighted by a Cochrane review [114].

Strengths and limitations

The major strength of this review is the broad search strategy, including the added value of PROMs at the micro-, meso- and macro-level. Several limitations must be acknowledged. Non-peer-reviewed literature (e.g., registry reports), which may be an important source of information on the use of PROMs as quality improvement tool, was excluded. However, this was not deemed feasible because these documents are often published in non-English languages and generally do not report clear evidence of an impact, such as a before-after comparison. Meta-analysis and estimating the effect sizes were not possible due to the heterogeneity of outcomes. PROM scores were variably reported as total score and/or by dimension, limiting the synthesis on the impact of PROMs-interventions by outcome dimensions.

Conclusion

This systematic review provides a comprehensive overview of novel applications of PROMs which aim improve patient outcomes, and determinants for increased effectiveness. The effectiveness appears to relate to the underlying mechanism, type of PROM used and outcome studied. At the micro-level, for example, PROMs feedback to patient or provider was positively associated with patient outcomes in approximately half of studies. Contemporary studies went a step further and linked PROMs scores to care pathways in for example depression, oncological and gastrointestinal care, which resulted in improved outcomes in a higher percentage of studies. At the meso-/macro-level evidence was limited, and evidence did not suggest a benefit of using PROMs for benchmarking. Promising applications included in-depth analysis and PDCA-cycles using PROMs data. With the increasing use of PROMs in routine clinical care, these findings may help in designing applications which truly impact patient outcomes. As the quality of studies was moderate at best raising concerns regarding the validity of findings, rigorously designed studies should be conducted on testing these applications.

Availability of data and materials

All data generated or analysed during this study are included in this published article.

Abbreviations

PROMs:

Patient-reported outcome measures

PDCA cycle:

Plan-do-study-act cycle

PRISMA:

Preferred Reporting Items for Systematic Reviews and Meta-Analyses

PREMs:

Patient reported experiences measures

HRQoL:

Health-Related Quality of Life

ED:

Emergency department visits

References

  1. Brooks R. EuroQol: the current state of play. Health Policy. 1996;37(1):53–72.

    Article  CAS  PubMed  Google Scholar 

  2. Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167–76.

    Article  PubMed  Google Scholar 

  3. Devlin NJ, Brooks R. EQ-5D and the EuroQol Group: Past, Present and Future. Appl Health Econ Health Policy. 2017;15(2):127–37.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Meadows KA. Patient-reported outcome measures: an overview. Br J Community Nurs. 2011;16(3):146–51.

    Article  PubMed  Google Scholar 

  5. Murray CJ, Frenk J. A framework for assessing the performance of health systems. Bull World Health Organ. 2000;78(6):717–31.

    CAS  PubMed  PubMed Central  Google Scholar 

  6. The Organisation for Economic Co-operation and Development (OECD) Health at a Glance 2019.

  7. Alberta PROMs and EQ-5D Research and Support Unit (APERSU). Enhancing the Use of Patient-reported Outcome Measures (PROMs) in the Healthcare System in Alberta. Link: https://apersu.ca/wp-content/uploads/2020/09/APERSU-PROMs-White-Paper.pdf. 2020.

  8. Boyce MB, Browne JP. Does providing feedback on patient-reported outcomes to healthcare professionals result in better outcomes for patients? A systematic review. Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation. 2013;22(9):2265–78.

    Article  PubMed  Google Scholar 

  9. Espallargues M, Valderas JM, Alonso J. Provision of feedback on perceived health status to health care professionals: a systematic review of its impact. Med Care. 2000;38(2):175–86.

    Article  CAS  PubMed  Google Scholar 

  10. Gilbody SM, House AO, Sheldon T. Routine administration of Health Related Quality of Life (HRQoL) and needs assessment instruments to improve psychological outcome–a systematic review. Psychol Med. 2002;32(8):1345–56.

    Article  CAS  PubMed  Google Scholar 

  11. Gilbody SM, House AO, Sheldon TA. Routinely administered questionnaires for depression and anxiety: systematic review. BMJ. 2001;322(7283):406–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Greenhalgh J, Meadows K. The effectiveness of the use of patient-based measures of health in routine practice in improving the process and outcomes of patient care: a literature review. J Eval Clin Pract. 1999;5(4):401–16.

    Article  CAS  PubMed  Google Scholar 

  13. Ishaque S, Karnon J, Chen G, Nair R, Salter AB. A systematic review of randomised controlled trials evaluating the use of patient-reported outcome measures (PROMs). Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation. 2019;28(3):567–92.

    Article  CAS  PubMed  Google Scholar 

  14. Valderas JM, Kotzeva A, Espallargues M, Guyatt G, Ferrans CE, Halyard MY, et al. The impact of measuring patient-reported outcomes in clinical practice: a systematic review of the literature. Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation. 2008;17(2):179–93.

    Article  CAS  PubMed  Google Scholar 

  15. Gibbons C, Porter I, Gonçalves-Bradley DC, Stoilov S, Ricci-Cabello I, Tsangaris E, et al. Routine provision of feedback from patient-reported outcome measurements to healthcare providers and patients in clinical practice. The Cochrane database of systematic reviews. 2021;10(10):CD011589.

  16. Greenhalgh J, Gooding K, Gibbons E, Dalkin S, Wright J, Valderas J, Black N. How do patient reported outcome measures (PROMs) support clinician-patient communication and patient care? A realist synthesis. Journal of patient-reported outcomes. 2018;2:42.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Velikova G, Booth L, Smith AB, Brown PM, Lynch P, Brown JM, Selby PJ. Measuring quality of life in routine oncology practice improves communication and patient well-being: a randomized controlled trial. J Clin Oncol. 2004;22(4):714–24.

    Article  PubMed  Google Scholar 

  18. Watson L, Link C, Qi S, DeIure A, Chmielewski L, Hildebrand A, et al. Testing a modified electronic version of the Edmonton symptom assessment system-revised for remote online completion with ambulatory cancer patients in Alberta. Canada Digit Health. 2023;9:20552076231191000.

    Article  PubMed  Google Scholar 

  19. Bansback N, Trenaman L, MacDonald KV, Durand D, Hawker G, Johnson JA, et al. An online individualised patient decision aid improves the quality of decisions in patients considering total knee arthroplasty in routine care: A randomized controlled trial. Osteoarthr Cartil Open. 2022;4(3): 100286.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Dorr MC, van Hof KS, Jelsma JGM, Dronkers EAC, Baatenburg de Jong RJ, Offerman MPJ, de Bruijne MC. Quality improvements of healthcare trajectories by learning from aggregated patient-reported outcomes: a mixed-methods systematic literature review. Health Res Policy Syst. 2022;20(1):90.

  21. Greenhalgh J, Dalkin S, Gibbons E, Wright J, Valderas JM, Meads D, Black N. How do aggregated patient-reported outcome measures data stimulate health care improvement? A realist synthesis. J Health Serv Res Policy. 2018;23(1):57–65.

    Article  PubMed  Google Scholar 

  22. Al Sayah F, Lahtinen M, Bonsel GJ, Ohinmaa A, Johnson JA. A multi-level approach for the use of routinely collected patient-reported outcome measures (PROMs) data in healthcare systems. Journal of patient-reported outcomes. 2021;5(Suppl 2):98.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Ellis J. All inclusive benchmarking. J Nurs Manag. 2006;14(5):377–83.

    Article  PubMed  Google Scholar 

  24. Willmington C, Belardi P, Murante AM, Vainieri M. The contribution of benchmarking to quality improvement in healthcare. A systematic literature review. BMC health services research. 2022;22(1):139.

  25. Valderas JM, Fitzpatrick R, Roland M. Using health status to measure NHS performance: another step into the dark for the health reform in England. BMJ Qual Saf. 2012;21(4):352–3.

    Article  CAS  PubMed  Google Scholar 

  26. Totten AM, Wagner J, Tiwari A, O'Haire C, Griffin J, Walker M. Closing the quality gap: revisiting the state of the science (vol. 5: public reporting as a quality improvement strategy). Evid Rep Technol Assess (Full Rep). 2012(208.5):1–645.

  27. Chen J, Ou L, Hollis SJ. A systematic review of the impact of routine collection of patient reported outcome measures on patients, providers and health organisations in an oncologic setting. BMC Health Serv Res. 2013;13:211.

    Article  PubMed  PubMed Central  Google Scholar 

  28. van der Zee-van den Berg AI, Boere-Boonekamp MM, Groothuis-Oudshoorn CGM, MJ IJ, Haasnoot-Smallegange RME, Reijneveld SA. Post-Up Study: Postpartum Depression Screening in Well-Child Care and Maternal Outcomes. Pediatrics. 2017;140(4).

  29. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021;372: n71.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Roberts AYK, editor. Evidence-based practice manual: research and outcome measures in health and human services: New York. NY: Oxford University Press; 2004.

    Google Scholar 

  31. Donabedian A. Evaluating the quality of medical care. 1966. Milbank Q. 2005;83(4):691–729.

  32. Agency for Healthcare Research and Quality R, MD. Six Domains of Healthcare Quality. Content last reviewed December 2022. [Available from: https://www.ahrq.gov/talkingquality/measures/six-domains.html.

  33. Armijo-Olivo S, Stiles CR, Hagen NA, Biondo PD, Cummings GG. Assessment of study quality for systematic reviews: a comparison of the Cochrane Collaboration Risk of Bias Tool and the Effective Public Health Practice Project Quality Assessment Tool: methodological research. J Eval Clin Pract. 2012;18(1):12–8.

    Article  PubMed  Google Scholar 

  34. Ackermans L, Hageman MG, Bos AH, Haverkamp D, Scholtes VAB, Poolman RW. Feedback to Patients About Patient-reported Outcomes Does Not Improve Empowerment or Satisfaction. Clin Orthop Relat Res. 2018;476(4):716–22.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Allen J, Annells M, Nunn R, Petrie E, Clark E, Lang L, Robins A. Evaluation of effectiveness and satisfaction outcomes of a mental health screening and referral clinical pathway for community nursing care. J Psychiatr Ment Health Nurs. 2011;18(5):375–85.

    Article  CAS  PubMed  Google Scholar 

  36. Almario CV, Chey WD, Khanna D, Mosadeghi S, Ahmed S, Afghani E, et al. Impact of National Institutes of Health Gastrointestinal PROMIS Measures in Clinical Practice: Results of a Multicenter Controlled Trial. Am J Gastroenterol. 2016;111(11):1546–56.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Boyce MB, Browne JP. The effectiveness of providing peer benchmarked feedback to hip replacement surgeons based on patient-reported outcome measures–results from the PROFILE (Patient-Reported Outcomes: Feedback Interpretation and Learning Experiment) trial: a cluster randomised controlled study. BMJ Open. 2015;5(7): e008325.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Buxton JA, Chandler-Altendorf A, Puente AE. A novel collaborative practice model for treatment of mental illness in indigent and uninsured patients. Am J Health Syst Pharm. 2012;69(12):1054–62.

    Article  PubMed  Google Scholar 

  39. de Jong MJ, van der Meulen-de Jong AE, Romberg-Camps MJ, Becx MC, Maljaars JP, Cilissen M, et al. Telemedicine for management of inflammatory bowel disease (myIBDcoach): a pragmatic, multicentre, randomised controlled trial. Lancet. 2017;390(10098):959–68.

    Article  PubMed  Google Scholar 

  40. Epstein JN, Rabiner D, Johnson DE, Fitzgerald DP, Chrisman A, Erkanli A, et al. Improving attention-deficit/hyperactivity disorder treatment outcomes through use of a collaborative consultation treatment service by community-based pediatricians: a cluster randomized trial. Arch Pediatr Adolesc Med. 2007;161(9):835–40.

    Article  PubMed  Google Scholar 

  41. Fihn SD, McDonell MB, Diehr P, Anderson SM, Bradley KA, Au DH, et al. Effects of sustained audit/feedback on self-reported health status of primary care patients. Am J Med. 2004;116(4):241–8.

    Article  PubMed  Google Scholar 

  42. Girgis A, Breen S, Stacey F, Lecathelinais C. Impact of two supportive care interventions on anxiety, depression, quality of life, and unmet needs in patients with nonlocalized breast and colorectal cancers. J Clin Oncol. 2009;27(36):6180–90.

    Article  PubMed  Google Scholar 

  43. Gossec L, Cantagrel A, Soubrier M, Berthelot JM, Joubert JM, Combe B, et al. An e-health interactive self-assessment website (Sanoia(®)) in rheumatoid arthritis. A 12-month randomized controlled trial in 320 patients. Joint Bone Spine. 2018;85(6):709–14.

  44. Hadjistavropoulos T, MacNab Y, Lints-Martindale A, Martin R, Hadjistavropoulos H. Does routine pain assessment result in better care? Pain Res Manag. 2009;14(3):211–6.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Haller G, Agoritsas T, Luthy C, Piguet V, Griesser AC, Perneger T. Collaborative quality improvement to manage pain in acute care hospitals. Pain Med. 2011;12(1):138–47.

    Article  PubMed  Google Scholar 

  46. Jaensson M, Dahlberg K, Eriksson M, Nilsson U. Evaluation of postoperative recovery in day surgery patients using a mobile phone application: a multicentre randomized trial. Br J Anaesth. 2017;119(5):1030–8.

    Article  CAS  PubMed  Google Scholar 

  47. Kjaer T, Dalton SO, Andersen E, Karlsen R, Nielsen AL, Hansen MK, et al. A controlled study of use of patient-reported outcomes to improve assessment of late effects after treatment for head-and-neck cancer. Radiother Oncol. 2016;119(2):221–8.

    Article  PubMed  Google Scholar 

  48. Kroenke K, Talib TL, Stump TE, Kean J, Haggstrom DA, DeChant P, et al. Incorporating PROMIS Symptom Measures into Primary Care Practice-a Randomized Clinical Trial. J Gen Intern Med. 2018;33(8):1245–52.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Kumar RM, Fergusson DA, Lavallée LT, Cagiannos I, Morash C, Horrigan M, et al. Performance Feedback May Not Improve Radical Prostatectomy Outcomes: The Surgical Report Card (SuRep) Study. J Urol. 2021;206(2):346–53.

    Article  PubMed  Google Scholar 

  50. Mallen CD, Nicholl BI, Lewis M, Bartlam B, Green D, Jowett S, et al. The effects of implementing a point-of-care electronic template to prompt routine anxiety and depression screening in patients consulting for osteoarthritis (the Primary Care Osteoarthritis Trial): A cluster randomised trial in primary care. PLoS Med. 2017;14(4): e1002273.

    Article  PubMed  PubMed Central  Google Scholar 

  51. McCambridge J, Day M. Randomized controlled trial of the effects of completing the Alcohol Use Disorders Identification Test questionnaire on self-reported hazardous drinking. Addiction. 2008;103(2):241–8.

    Article  PubMed  Google Scholar 

  52. Reiber GE, Au D, McDonell M, Fihn SD. Diabetes quality improvement in Department of Veterans Affairs Ambulatory Care Clinics: a group-randomized clinical trial. Diabetes Care. 2004;27(Suppl 2):B61–8.

    Article  PubMed  Google Scholar 

  53. Richardson J, Chan D, Risdon K, Giles C, Mulveney S, Cripps D. Does monitoring change in function in community-dwelling older adults alter outcome? A randomized controlled trial Clin Rehabil. 2008;22(12):1061–70.

    PubMed  Google Scholar 

  54. Rogers SN, Allmark C, Bekiroglu F, Edwards RT, Fabbroni G, Flavel R, et al. Improving quality of life through the routine use of the patient concerns inventory for head and neck cancer patients: main results of a cluster preference randomised controlled trial. Eur Arch Otorhinolaryngol. 2021;278(9):3435–49.

    Article  PubMed  Google Scholar 

  55. Santana MJ, Feeny D, Johnson JA, McAlister FA, Kim D, Weinkauf J, Lien DC. Assessing the use of health-related quality of life measures in the routine clinical care of lung-transplant patients. Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation. 2010;19(3):371–9.

    Article  PubMed  Google Scholar 

  56. Shyu YI, Liang J, Tseng MY, Li HJ, Wu CC, Cheng HS, et al. Comprehensive and subacute care interventions improve health-related quality of life for older patients after surgery for hip fracture: a randomised controlled trial. Int J Nurs Stud. 2013;50(8):1013–24.

    Article  PubMed  Google Scholar 

  57. Varagunam M, Hutchings A, Neuburger J, Black N. Impact on hospital performance of introducing routine patient reported outcome measures in surgery. J Health Serv Res Policy. 2014;19(2):77–84.

    Article  PubMed  Google Scholar 

  58. Volkmann ER, FitzGerald JD. Reducing gender disparities in post-total knee arthroplasty expectations through a decision aid. BMC Musculoskelet Disord. 2015;16(1):16.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Wu S, Ell K, Jin H, Vidyanti I, Chou CP, Lee PJ, et al. Comparative Effectiveness of a Technology-Facilitated Depression Care Management Model in Safety-Net Primary Care Patients With Type 2 Diabetes: 6-Month Outcomes of a Large Clinical Trial. J Med Internet Res. 2018;20(4): e147.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Zaslansky R, Chapman CR, Baumbach P, Bytyqi A, Castro Lopes JM, Chetty S, et al. Improving perioperative pain management: a preintervention and postintervention study in 7 developing countries. Pain Rep. 2019;4(1): e705.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Partridge T, Carluke I, Emmerson K, Partington P, Reed M. Improving patient reported outcome measures (PROMs) in total knee replacement by changing implant and preserving the infrapatella fatpad: a quality improvement project. BMJ quality improvement reports. 2016;5(1).

  62. Baker A, Mitchell EJ, Partlett C, Thomas KS. Evaluating the effect of weekly patient-reported symptom monitoring on trial outcomes: results of the Eczema Monitoring Online randomized controlled trial. Br J Dermatol. 2023;189(2):180–7.

    Article  PubMed  Google Scholar 

  63. Balestrieri M, Sisti D, Rocchi M, Rucci P, Simon G, Araya R, de Girolamo G. Effectiveness of clinical decision support systems and telemedicine on outcomes of depression: a cluster randomized trial in general practice. Fam Pract. 2020;37(6):731–7.

    Article  PubMed  Google Scholar 

  64. Basch E, Deal AM, Kris MG, Scher HI, Hudis CA, Sabbatini P, et al. Symptom Monitoring With Patient-Reported Outcomes During Routine Cancer Treatment: A Randomized Controlled Trial. J Clin Oncol. 2016;34(6):557–65.

    Article  CAS  PubMed  Google Scholar 

  65. Berdal G, Sand-Svartrud AL, Linge AD, Aasvold AM, Tennebø K, Eppeland SG, et al. Bridging gaps across levels of care in rehabilitation of patients with rheumatic and musculoskeletal diseases: Results from a stepped-wedge cluster randomized controlled trial. Clin Rehabil. 2023;37(9):1153–77.

    Article  PubMed  PubMed Central  Google Scholar 

  66. Berinstein JA, Cohen-Mekelburg SA, Greenberg GM, Wray D, Berry SK, Saini SD, et al. A Care Coordination Intervention Improves Symptoms But Not Charges in High-Risk Patients With Inflammatory Bowel Disease. Clin Gastroenterol Hepatol. 2022;20(5):1029–38 e9.

  67. Cooley ME, Mazzola E, Xiong N, Hong F, Lobach DF, Braun IM, et al. Clinical Decision Support for Symptom Management in Lung Cancer Patients: A Group RCT. J Pain Symptom Manage. 2022;63(4):572–80.

    Article  PubMed  Google Scholar 

  68. Dhingra L, Schiller R, Teets R, Nosal S, Dieckmann NF, Ginzburg R, et al. Pain Management in Primary Care: A Randomized Controlled Trial of a Computerized Decision Support Tool. Am J Med. 2021;134(12):1546–54.

    Article  PubMed  Google Scholar 

  69. Dobscha SK, Corson K, Hickam DH, Perrin NA, Kraemer DF, Gerrity MS. Depression decision support in primary care: a cluster randomized trial. Ann Intern Med. 2006;145(7):477–87.

    Article  PubMed  Google Scholar 

  70. Ferrell B, Chung V, Hughes MT, Koczywas M, Azad NS, Ruel NH, et al. A Palliative Care Intervention for Patients on Phase 1 Studies. J Palliat Med. 2021;24(6):846–56.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Fortmann AL, Walker C, Barger K, Robacker M, Morrisey R, Ortwine K, et al. Care Team Integration in Primary Care Improves One-Year Clinical and Financial Outcomes in Diabetes: A Case for Value-Based Care. Popul Health Manag. 2020;23(6):467–75.

    Article  PubMed  PubMed Central  Google Scholar 

  72. Garduño-López AL, Acosta Nava VM, Castro Garcés L, Rascón-Martínez DM, Cuellar-Guzmán LF, Flores-Villanueva ME, et al. Towards Better Perioperative Pain Management in Mexico: A Study in a Network of Hospitals Using Quality Improvement Methods from PAIN OUT. J Pain Res. 2021;14:415–30.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Holm I, Pripp AH, Risberg MA. The Active with OsteoArthritis (AktivA) Physiotherapy Implementation Model: A Patient Education, Supervised Exercise and Self-Management Program for Patients with Mild to Moderate Osteoarthritis of the Knee or Hip Joint. A National Register Study with a Two-Year Follow-Up. J Clin Med. 2020;9(10).

  74. Howell D, Li M, Sutradhar R, Gu S, Iqbal J, O’Brien MA, et al. Integration of patient-reported outcomes (PROs) for personalized symptom management in “real-world” oncology practices: a population-based cohort comparison study of impact on healthcare utilization. Support Care Cancer. 2020;28(10):4933–42.

    Article  PubMed  Google Scholar 

  75. Howell D, Rosberger Z, Mayer C, Faria R, Hamel M, Snider A, et al. Personalized symptom management: a quality improvement collaborative for implementation of patient reported outcomes (PROs) in “real-world” oncology multisite practices. Journal of patient-reported outcomes. 2020;4(1):47.

    Article  PubMed  PubMed Central  Google Scholar 

  76. Livanainen S, Ravichandra R, Jekunen A, Arokoski R, Mentu S, Lang L, et al. ePRO symptom follow-up of colorectal cancer patients receiving oxaliplatin-based adjuvant chemotherapy is feasible and enhances the quality of patient care: a prospective multicenter study. J Cancer Res Clin Oncol. 2023;149(10):6875–82.

    Article  Google Scholar 

  77. Jayakumar P, Moore MG, Furlough KA, Uhler LM, Andrawis JP, Koenig KM, et al. Comparison of an Artificial Intelligence-Enabled Patient Decision Aid vs Educational Material on Decision Quality, Shared Decision-Making, Patient Experience, and Functional Outcomes in Adults With Knee Osteoarthritis: A Randomized Clinical Trial. JAMA Netw Open. 2021;4(2): e2037107.

    Article  PubMed  PubMed Central  Google Scholar 

  78. Ngo E, Truong MB, Wright D, Nordeng H. Impact of a Mobile Application for Tracking Nausea and Vomiting During Pregnancy (NVP) on NVP Symptoms, Quality of Life, and Decisional Conflict Regarding NVP Treatments: MinSafeStart Randomized Controlled Trial. JMIR Mhealth Uhealth. 2022;10(7): e36226.

    Article  PubMed  PubMed Central  Google Scholar 

  79. Patel MI, Kapphahn K, Dewland M, Aguilar V, Sanchez B, Sisay E, et al. Effect of a Community Health Worker Intervention on Acute Care Use, Advance Care Planning, and Patient-Reported Outcomes Among Adults With Advanced Stages of Cancer: A Randomized Clinical Trial. JAMA Oncol. 2022;8(8):1139–48.

    PubMed  PubMed Central  Google Scholar 

  80. Pérez JC, Fernández O, Cáceres C, Carrasco ÁE, Moessner M, Bauer S, et al. An Adjunctive Internet-Based Intervention to Enhance Treatment for Depression in Adults: Randomized Controlled Trial. JMIR Ment Health. 2021;8(12): e26814.

    Article  PubMed  PubMed Central  Google Scholar 

  81. Pooni A, Brar MS, Anpalagan T, Schmocker S, Rashid S, Goldstein R, et al. Home to Stay: A Randomized Controlled Trial Evaluating the Effect of a Postdischarge Mobile App to Reduce 30-Day Readmission Following Elective Colorectal Surgery. Ann Surg. 2023;277(5):e1056–62.

    Article  PubMed  Google Scholar 

  82. Price S, Hamann HA, Halaby L, Trejo J, Rogers FC, Weihs K. Collaborative depression care sensitive to the needs of underserved patients with cancer: Feasibility, acceptability and outcomes. J Psychosoc Oncol. 2023:1–23.

  83. Rollman BL, Anderson AM, Rothenberger SD, Abebe KZ, Ramani R, Muldoon MF, et al. Efficacy of Blended Collaborative Care for Patients With Heart Failure and Comorbid Depression: A Randomized Clinical Trial. JAMA Intern Med. 2021;181(10):1369–80.

    Article  PubMed  PubMed Central  Google Scholar 

  84. Skovlund PC, Vind Thaysen H, Schmidt H, Alsner J, Hjollund NH, Lomborg K, Nielsen BK. Effect of patient-reported outcomes as a dialogue-based tool in cancer consultations on patient self-management and health-related quality of life: a clinical, controlled trial. Acta Oncol. 2021;60(12):1668–77.

    Article  PubMed  Google Scholar 

  85. Steele Gray C, Chau E, Tahsin F, Harvey S, Loganathan M, McKinstry B, et al. Assessing the Implementation and Effectiveness of the Electronic Patient-Reported Outcome Tool for Older Adults With Complex Care Needs: Mixed Methods Study. J Med Internet Res. 2021;23(12): e29071.

    Article  PubMed  PubMed Central  Google Scholar 

  86. Tirelli F, Xiao R, Brandon TG, Burnham JM, Chang JC, Weiss PF. Determinants of disease activity change over time in Enthesitis related arthritis: effect of structured outcome monitoring and clinical decision support. Pediatr Rheumatol Online J. 2020;18(1):79.

    Article  PubMed  PubMed Central  Google Scholar 

  87. Velikova G, Keding A, Harley C, Cocks K, Booth L, Smith AB, et al. Patients report improvements in continuity of care when quality of life assessments are used routinely in oncology practice: secondary outcomes of a randomised controlled trial. Eur J Cancer. 2010;46(13):2381–8.

    Article  PubMed  Google Scholar 

  88. Wylde V, Bertram W, Sanderson E, Noble S, Howells N, Peters TJ, et al. The STAR care pathway for patients with pain at 3 months after total knee replacement: a multicentre, pragmatic, randomised, controlled trial. Lancet Rheumatol. 2022;4(3):e188–97.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  89. Barbera L, Sutradhar R, Howell D, Sussman J, Seow H, Dudgeon D, et al. Does routine symptom screening with ESAS decrease ED visits in breast cancer patients undergoing adjuvant chemotherapy? Support Care Cancer. 2015;23(10):3025–32.

  90. de Wit M, Delemarre-van de Waal HA, Bokma JA, Haasnoot K, Houdijk MC, Gemke RJ, Snoek FJ. Monitoring and discussing health-related quality of life in adolescents with type 1 diabetes improve psychosocial well-being: a randomized controlled trial. Diabetes Care. 2008;31(8):1521–6.

  91. Detmar SB, Muller MJ, Schornagel JH, Wever LD, Aaronson NK. Health-related quality-of-life assessments and patient-physician communication: a randomized controlled trial. JAMA. 2002;288(23):3027–34.

    Article  PubMed  Google Scholar 

  92. Hilarius DL, Kloeg PH, Gundy CM, Aaronson NK. Use of health-related quality-of-life assessments in daily clinical oncology nursing practice: a community hospital-based intervention study. Cancer. 2008;113(3):628–37.

    Article  PubMed  Google Scholar 

  93. Rosenbloom SK, Victorson DE, Hahn EA, Peterman AH, Cella D. Assessment is not enough: a randomized controlled trial of the effects of HRQL assessment on quality of life and satisfaction in oncology clinical practice. Psychooncology. 2007;16(12):1069–79.

    Article  PubMed  Google Scholar 

  94. Weingarten SR, Kim CS, Stone EG, Kristopaitis RJ, Pelter M, Sandhu M. Can peer-comparison feedback improve patient functional status? Am J Manag Care. 2000;6(1):35–9.

    CAS  PubMed  Google Scholar 

  95. Buckley L, Ware E, Kreher G, Wiater L, Mehta J, Burnham JM. Outcome Monitoring and Clinical Decision Support in Polyarticular Juvenile Idiopathic Arthritis. J Rheumatol. 2020;47(2):273–81.

    Article  PubMed  Google Scholar 

  96. Cross RK, Langenberg P, Regueiro M, Schwartz DA, Tracy JK, Collins JF, et al. A Randomized Controlled Trial of TELEmedicine for Patients with Inflammatory Bowel Disease (TELE-IBD). Am J Gastroenterol. 2019;114(3):472–82.

    Article  PubMed  Google Scholar 

  97. Davidson KW, Rieckmann N, Clemow L, Schwartz JE, Shimbo D, Medina V, et al. Enhanced depression care for patients with acute coronary syndrome and persistent depressive symptoms: coronary psychosocial evaluation studies randomized controlled trial. Arch Intern Med. 2010;170(7):600–8.

    Article  PubMed  PubMed Central  Google Scholar 

  98. Frasure-Smith N, Lespérance F, Prince RH, Verrier P, Garber RA, Juneau M, et al. Randomised trial of home-based psychosocial nursing intervention for patients recovering from myocardial infarction. Lancet. 1997;350(9076):473–9.

    Article  CAS  PubMed  Google Scholar 

  99. Katon W, Robinson P, Von Korff M, Lin E, Bush T, Ludman E, et al. A multifaceted intervention to improve treatment of depression in primary care. Arch Gen Psychiatry. 1996;53(10):924–32.

    Article  CAS  PubMed  Google Scholar 

  100. Katon WJ, Von Korff M, Lin EH, Simon G, Ludman E, Russo J, et al. The Pathways Study: a randomized trial of collaborative care in patients with diabetes and depression. Arch Gen Psychiatry. 2004;61(10):1042–9.

    Article  PubMed  Google Scholar 

  101. Katzelnick DJ, Simon GE, Pearson SD, Manning WG, Helstad CP, Henk HJ, et al. Randomized trial of a depression management program in high utilizers of medical care. Arch Fam Med. 2000;9(4):345–51.

    Article  CAS  PubMed  Google Scholar 

  102. Kronish IM, Moise N, Cheung YK, Clarke GN, Dolor RJ, Duer-Hefele J, et al. Effect of Depression Screening After Acute Coronary Syndromes on Quality of Life: The CODIACS-QoL Randomized Clinical Trial. JAMA Intern Med. 2020;180(1):45–53.

    Article  PubMed  Google Scholar 

  103. Maguire R, McCann L, Kotronoulas G, Kearney N, Ream E, Armes J, et al. Real time remote symptom monitoring during chemotherapy for cancer: European multicentre randomised controlled trial (eSMART). BMJ. 2021;374: n1647.

    Article  PubMed  PubMed Central  Google Scholar 

  104. Patel MI, Ramirez D, Agajanian R, Agajanian H, Bhattacharya J, Bundorf KM. Lay Health Worker-Led Cancer Symptom Screening Intervention and the Effect on Patient-Reported Satisfaction, Health Status, Health Care Use, and Total Costs: Results From a Tri-Part Collaboration. JCO Oncol Pract. 2020;16(1):e19–28.

    Article  PubMed  Google Scholar 

  105. Regueiro M, Click B, Anderson A, Shrank W, Kogan J, McAnallen S, Szigethy E. Reduced Unplanned Care and Disease Activity and Increased Quality of Life After Patient Enrollment in an Inflammatory Bowel Disease Medical Home. Clin Gastroenterol Hepatol. 2018;16(11):1777–85.

    Article  PubMed  PubMed Central  Google Scholar 

  106. Sharpe M, Walker J, Holm Hansen C, Martin P, Symeonides S, Gourley C, et al. Integrated collaborative care for comorbid major depression in patients with cancer (SMaRT Oncology-2): a multicentre randomised controlled effectiveness trial. Lancet. 2014;384(9948):1099–108.

    Article  PubMed  Google Scholar 

  107. Simon GE, Ralston JD, Savarino J, Pabiniak C, Wentzel C, Operskalski BH. Randomized trial of depression follow-up care by online messaging. J Gen Intern Med. 2011;26(7):698–704.

    Article  PubMed  PubMed Central  Google Scholar 

  108. Unützer J, Katon W, Callahan CM, Williams JW Jr, Hunkeler E, Harpole L, et al. Collaborative care management of late-life depression in the primary care setting: a randomized controlled trial. JAMA. 2002;288(22):2836–45.

    Article  PubMed  Google Scholar 

  109. Sandberg T, Conner M. A mere measurement effect for anticipated regret: impacts on cervical screening attendance. Br J Soc Psychol. 2009;48(Pt 2):221–36.

    Article  PubMed  Google Scholar 

  110. van Anne-Margreet D, Hester FL, Johan PM, Ewout WS. Random variation and rankability of hospitals using outcome indicators. BMJ Qual Saf. 2011;20(10):869–74.

    Article  Google Scholar 

  111. Goldstein H, Spiegelhalter DJ. League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance. J R Stat Soc Ser A Stat Soc. 2018;159(3):385–409.

    Article  Google Scholar 

  112. Marshall DA, Jin X, Pittman LB, Smith CJ. The use of patient-reported outcome measures in hip and knee arthroplasty in Alberta. Journal of patient-reported outcomes. 2021;5(2):87.

    Article  PubMed  PubMed Central  Google Scholar 

  113. Taylor MJ, McNicholas C, Nicolay C, Darzi A, Bell D, Reed JE. Systematic review of the application of the plan-do-study-act method to improve quality in healthcare. BMJ Qual Saf. 2014;23(4):290–8.

    Article  PubMed  Google Scholar 

  114. Ivers N, Jamtvedt G, Flottorp S, Young JM, Odgaard-Jensen J, French SD, et al. Audit and feedback: effects on professional practice and healthcare outcomes. The Cochrane database of systematic reviews. 2012(6):Cd000259.

Download references

Acknowledgements

We would like to thank dr. Wichor Bramer (biomedical information specialist at Erasmus Medical Center) for his help in developing the search strategy.

Funding

This work was funded by a PhD grant (PHD-287) provided by the EuroQol Research Foundation. The funder had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Author information

Authors and Affiliations

Authors

Contributions

Concept and design: JB, GB, HP; Literature search: JB, AI; Quality assessment: JB, AH; Interpretation of results: JB, AI, AH; Drafting of the manuscript: JB; Critical revision of the paper for important intellectual content: all authors; Obtaining funding: JB, HP, GB; Supervision: HP, GB.

Corresponding author

Correspondence to Joshua M. Bonsel.

Ethics declarations

Ethics approval and consent to participate

Given the study design (systematic review) ethics approval was not required nor sought. This study was registered prospectively in PROSPERO under record 2022 CRD42022333400.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bonsel, J.M., Itiola, A.J., Huberts, A.S. et al. The use of patient-reported outcome measures to improve patient-related outcomes – a systematic review. Health Qual Life Outcomes 22, 101 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12955-024-02312-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12955-024-02312-4

Keywords