15Patient-Reported Outcomes and Quality of Life
Moving beyond the audiogram to disease-specific questionnaires that capture what hearing restoration means to patients.
FWhy the audiogram is not the whole story
Ossiculoplasty is judged, in most papers and most operating notes, by a single number: the air–bone gapon the postoperative audiogram. That number is objective, reproducible and easy to compare, and it will always have its place. But it answers only one question — how much the reconstructed chain conducts sound — and it is silent on the question the patient actually cares about: has my life got better?A gap closed to an immaculate 15 dB is a technical triumph, yet it may buy almost nothing for a patient whose other ear hears well, whose real complaint was a discharging, smelly ear that kept them out of the swimming pool, or whose disability is dominated by tinnitus the audiogram never records.
This is the gap that patient-reported outcome measures (PROMs) are designed to fill. A PROM is a validated questionnaire completed by the patient, without a clinician interpreting on their behalf, that quantifies how a condition or its treatment affects symptoms, function and well-being. The appeal is obvious: hearing is fundamentally a communicationability embedded in social life, and two patients with identical audiograms can be worlds apart in how handicapped they feel. The instruments fall into two broad families — generic measures that score benefit across any intervention, and disease-specificmeasures built for chronic ear disease — and choosing well matters. The explorer below lays out the main options used around middle-ear surgery.
Notice the distinction the explorer draws. A generic benefit measure asks “how much did this operation change your life?” and can compare an ossiculoplasty against a rhinoplasty or a cochlear implant. A disease-specific measure asks detailed questions about thisdisease — the discharge, the hearing, the worry — and is therefore more sensitive to change in chronic ear disease, but cannot be compared across unrelated operations. Good outcome reporting increasingly pairs an audiometric result with at least one PROM, because the two answer different questions.
FGeneric benefit: the Glasgow Benefit Inventory
The Glasgow Benefit Inventory (GBI), introduced by Robinson, Gatehouse and Browning in 1996, is the workhorse generic PROM of otorhinolaryngology [1996]. It is an 18-item, post-intervention questionnaire: the patient is asked, after surgery, how each aspect of their life has changed as a result. Responses are averaged onto a scale running from −100 (maximum harm) through 0 (no change) to +100 (maximum benefit), with subscales for general health, social support and physical health. Because it measures change attributable to an intervention on a common metric, it lets a department compare the benefit of its ossiculoplasties against any other operation it performs.
Two features are worth understanding. First, the GBI is a retrospective changeinstrument — it is given only once, after surgery, and asks the patient to judge the change themselves, so there is no preoperative baseline to subtract. That makes it quick and avoids the problem of a patient’s internal yardstick shifting over time, but it relies on accurate recall. Second, being generic, it is relatively insensitive to the specific burdens of chronic ear disease; a patient cured of a foul-smelling discharge and one whose hearing improved by 10 dB may both report modest positive GBI scores without the instrument distinguishing why [1996]. For a hearing-restoration operation embedded in chronic suppurative disease, that lack of granularity is the reason disease-specific tools were developed.
TDisease-specific instruments for chronic ear disease
Three disease-specific PROMs dominate the chronic otitis media (COM) literature, and a trainee should recognise all three. The Chronic Otitis Media Outcome Test 15 (COMOT-15), developed by Baumann and colleagues in 2009, distils 15 items into three subscales — ear symptoms, hearing function and mental health — plus single questions on overall impact and on doctor visits. It showed excellent internal consistency (Cronbach’s α 0.89–0.91) and good retest reliability, and it discriminates COM patients from healthy controls [2009]. Its explicit hearing-function subscale makes it attractive when the hearing dimension is the focus of counselling.
The Chronic Otitis Media Questionnaire 12 (COMQ-12), from Phillips, Yung and Haggard in 2014, is shorter still — 12 items, each scored 0–5, giving a total of 0–60 where a lower score means better quality of life. Its items cover symptom severity, work and lifestyle impact, use of health services and general discomfort [2014]. It was designed specifically for active (discharging) COM and has proved highly responsive to surgery, which has made it a popular before-and-after audit tool. The Zurich Chronic Middle Ear Inventory (ZCMEI-21), from Bächinger and colleagues in 2016, is the most comprehensive: 21 items across ear symptoms, hearing function, mental and emotional impact, and use of medical resources, scored 0–84 with a lower total again meaning better quality of life. It too has excellent internal consistency, and its total score correlates strongly with a single direct quality-of-life question [2016].
| Instrument | Items | Scoring | Distinctive feature |
|---|---|---|---|
| GBI | 18 | −100 to +100; higher = benefit | Generic; compares across operations |
| COMOT-15 | 15 | 0–100; lower = better | Explicit hearing-function subscale |
| COMQ-12 | 12 | 0–60; lower = better | Most responsive to surgery; for active COM |
| ZCMEI-21 | 21 | 0–84; lower = better | Most comprehensive; has a published MCID |
A recurring trap is the direction of scoring: on the GBI a higher score is good, whereas on all three disease-specific COM instruments a higher score means worse quality of life, so improvement shows as a fall in the total. Misreading the direction is one of the commonest errors in interpreting these scales.
TDoes surgery move the patient-reported needle?
A PROM is only useful if it actually changes when the patient gets better. Prospective studies that administer these questionnaires before and after surgery for chronic otitis media consistently show that disease-specific quality of life improves markedly— and importantly, that this improvement is driven by more than the audiogram. Eradicating discharge, removing cholesteatoma, restoring a dry and safe ear, and improving hearing together lift the score. Across these series, the COMQ-12 has emerged as the most responsive of the disease-specific tools, meaning it shows the largest and most reliable change for a given clinical improvement [2014]. The chart below shows the typical magnitude of that change.
Two cautions temper this encouraging picture. First, much of the benefit in active COM comes from control of disease— a dry, safe ear — rather than from hearing gain alone, so a falling COMQ-12 after tympanomastoid surgery should not be read as proof that the ossiculoplasty component succeeded; the audiogram still does that job. Second, generic health-related quality of life (for example the EQ-5D) often barely moves after middle-ear surgery even when disease-specific and hearing-specific scores improve substantially, because the burden of a chronic ear sits in domains a generic instrument scarcely samples. The lesson is to match the instrument to the question: use a disease-specific PROM to detect benefit in ear disease, and do not be surprised when a blunt generic measure stays flat.
CStatistical versus clinical significance: the MCID
A change in a PROM score can be statistically significant— unlikely to be due to chance — and yet be far too small for any patient to notice. The bridge between the two is the minimal clinically important difference (MCID): the smallest change in score that patients themselves regard as meaningful. Without it, a study can report a “significant” improvement that is clinically trivial, and a clinician has no principled way to tell a worthwhile result from statistical noise.
For the ZCMEI-21 the MCID has been established empirically. Using an anchor-based method against patients’ own global rating of change in a longitudinal surgical cohort, the MCID was estimated at about 5 points (mean preoperative score 28.6 falling to 21.8 after surgery) [2020]. The practical rule that follows is that a ZCMEI-21 improvement of roughly 5 points or more is likely to be felt by the patient, whereas a smaller change — however tidy the p-value — probably is not. The interpreter below puts that into practice: set a before-and-after score and see whether the change clears the threshold.
Anchoring interpretation to the MCID also disciplines how we read the literature. A paper that reports a statistically significant PROM change but a magnitude below the MCID has, in clinical terms, shown very little; one that reports a change comfortably above it has shown a benefit patients can feel. When you appraise an ossiculoplasty or tympanomastoid series, look for the magnitude of PROM change against its MCID, not merely the p-value [2020].
CUsing PROMs in practice and counselling
PROMs are not only research instruments; they have a place at the bedside. Administering a disease-specific questionnaire beforesurgery captures a baseline of what most troubles this particular patient — discharge, hearing, social withdrawal or worry — and that profile should shape both the operative plan and the consent conversation. A patient whose ZCMEI-21 burden lies almost entirely in the ear-symptom and mental-health domains, with little hearing complaint, is telling you that a dry, safe ear matters more to them than the last few decibels of gap closure, and the consent discussion should be framed accordingly.
This connects directly to the older insight that technical success and patient benefit are not the same thing. Smyth and Patterson showed decades ago that patients and surgeons frequently disagree on whether middle-ear reconstruction was worthwhile, and derived the Belfast rule of thumb — that a unilateral reconstruction is worthwhile when the operated ear reaches 30 dB or comes within 15 dB of the other ear — precisely because monaural gap closure overstates benefit when the contralateral ear hears well [1985]. Modern disease-specific PROMs extend that idea: they let the patient, not the audiogram, define the endpoint, and they capture the dimensions — discharge, water precautions, embarrassment, anxiety — that a pure-tone average can never reach.
In summary, the audiogram measures the operation; the PROM measures the patient. For chronic ear disease, choose a validated disease-specific instrument (COMQ-12, COMOT-15 or ZCMEI-21), administer it before and after, respect the direction of scoring, and judge the change against its MCID rather than its p-value. Pair it with the audiometric result and a binaural sanity check, and the report finally answers both questions that matter: did the chain conduct, and did the patient’s life improve [2014, 1985].
What is the most accurate interpretation of these results?
Why might a patient-reported outcome measure capture benefit from ossiculoplasty that the air-bone gap does not?
The Glasgow Benefit Inventory (GBI) differs from the disease-specific chronic otitis media questionnaires in which key respect?
A trainee records that a patient's COMQ-12 total rose from 18 preoperatively to 40 postoperatively and concludes the operation improved quality of life. What is wrong with this conclusion?
A study reports a statistically significant fall of 2 points in the ZCMEI-21 after a new ossiculoplasty technique. How should this be interpreted given a published MCID of about 5 points?