Ossiculoplasty Atlas
Ossiculoplasty Atlas · Outcomes, Prognosis & Complications · Module 15

15Patient-Reported Outcomes and Quality of Life

Moving beyond the audiogram to disease-specific questionnaires that capture what hearing restoration means to patients.

FWhy the audiogram is not the whole story

Ossiculoplasty is judged, in most papers and most operating notes, by a single number: the air–bone gapon the postoperative audiogram. That number is objective, reproducible and easy to compare, and it will always have its place. But it answers only one question — how much the reconstructed chain conducts sound — and it is silent on the question the patient actually cares about: has my life got better?A gap closed to an immaculate 15 dB is a technical triumph, yet it may buy almost nothing for a patient whose other ear hears well, whose real complaint was a discharging, smelly ear that kept them out of the swimming pool, or whose disability is dominated by tinnitus the audiogram never records.

This is the gap that patient-reported outcome measures (PROMs) are designed to fill. A PROM is a validated questionnaire completed by the patient, without a clinician interpreting on their behalf, that quantifies how a condition or its treatment affects symptoms, function and well-being. The appeal is obvious: hearing is fundamentally a communicationability embedded in social life, and two patients with identical audiograms can be worlds apart in how handicapped they feel. The instruments fall into two broad families — generic measures that score benefit across any intervention, and disease-specificmeasures built for chronic ear disease — and choosing well matters. The explorer below lays out the main options used around middle-ear surgery.

Choosing a patient-reported outcome measure for the middle ear

Instrument

Chronic Otitis Media Questionnaire 12

Items12TypeDisease-specificScore range0 to 60DirectionLower = better quality of life
Symptom severityWork and lifestyle impactHealth-service useGeneral discomfort

Best for: Active (discharging) chronic otitis media. The most responsive disease-specific measure to surgery, so good for before-and-after audit.

Generic benefit measures (GBI) score change after any operation on a −100 to +100 scale and let you compare procedures; disease-specific measures (COMQ-12, COMOT-15, ZCMEI-21) are scored so that a lower total means better quality of life and are completed before and after surgery. Item counts, domains and scoring from the validation papers (Robinson 1996; Phillips 2014; Baumann 2009; Bachinger 2016; Nakas 2020). Verified via PubMed.

Notice the distinction the explorer draws. A generic benefit measure asks “how much did this operation change your life?” and can compare an ossiculoplasty against a rhinoplasty or a cochlear implant. A disease-specific measure asks detailed questions about thisdisease — the discharge, the hearing, the worry — and is therefore more sensitive to change in chronic ear disease, but cannot be compared across unrelated operations. Good outcome reporting increasingly pairs an audiometric result with at least one PROM, because the two answer different questions.

FGeneric benefit: the Glasgow Benefit Inventory

The Glasgow Benefit Inventory (GBI), introduced by Robinson, Gatehouse and Browning in 1996, is the workhorse generic PROM of otorhinolaryngology [1996]. It is an 18-item, post-intervention questionnaire: the patient is asked, after surgery, how each aspect of their life has changed as a result. Responses are averaged onto a scale running from −100 (maximum harm) through 0 (no change) to +100 (maximum benefit), with subscales for general health, social support and physical health. Because it measures change attributable to an intervention on a common metric, it lets a department compare the benefit of its ossiculoplasties against any other operation it performs.

Two features are worth understanding. First, the GBI is a retrospective changeinstrument — it is given only once, after surgery, and asks the patient to judge the change themselves, so there is no preoperative baseline to subtract. That makes it quick and avoids the problem of a patient’s internal yardstick shifting over time, but it relies on accurate recall. Second, being generic, it is relatively insensitive to the specific burdens of chronic ear disease; a patient cured of a foul-smelling discharge and one whose hearing improved by 10 dB may both report modest positive GBI scores without the instrument distinguishing why [1996]. For a hearing-restoration operation embedded in chronic suppurative disease, that lack of granularity is the reason disease-specific tools were developed.

TDisease-specific instruments for chronic ear disease

Three disease-specific PROMs dominate the chronic otitis media (COM) literature, and a trainee should recognise all three. The Chronic Otitis Media Outcome Test 15 (COMOT-15), developed by Baumann and colleagues in 2009, distils 15 items into three subscales — ear symptoms, hearing function and mental health — plus single questions on overall impact and on doctor visits. It showed excellent internal consistency (Cronbach’s α 0.89–0.91) and good retest reliability, and it discriminates COM patients from healthy controls [2009]. Its explicit hearing-function subscale makes it attractive when the hearing dimension is the focus of counselling.

The Chronic Otitis Media Questionnaire 12 (COMQ-12), from Phillips, Yung and Haggard in 2014, is shorter still — 12 items, each scored 0–5, giving a total of 0–60 where a lower score means better quality of life. Its items cover symptom severity, work and lifestyle impact, use of health services and general discomfort [2014]. It was designed specifically for active (discharging) COM and has proved highly responsive to surgery, which has made it a popular before-and-after audit tool. The Zurich Chronic Middle Ear Inventory (ZCMEI-21), from Bächinger and colleagues in 2016, is the most comprehensive: 21 items across ear symptoms, hearing function, mental and emotional impact, and use of medical resources, scored 0–84 with a lower total again meaning better quality of life. It too has excellent internal consistency, and its total score correlates strongly with a single direct quality-of-life question [2016].

InstrumentItemsScoringDistinctive feature
GBI18−100 to +100; higher = benefitGeneric; compares across operations
COMOT-15150–100; lower = betterExplicit hearing-function subscale
COMQ-12120–60; lower = betterMost responsive to surgery; for active COM
ZCMEI-21210–84; lower = betterMost comprehensive; has a published MCID

A recurring trap is the direction of scoring: on the GBI a higher score is good, whereas on all three disease-specific COM instruments a higher score means worse quality of life, so improvement shows as a fall in the total. Misreading the direction is one of the commonest errors in interpreting these scales.

TDoes surgery move the patient-reported needle?

A PROM is only useful if it actually changes when the patient gets better. Prospective studies that administer these questionnaires before and after surgery for chronic otitis media consistently show that disease-specific quality of life improves markedly— and importantly, that this improvement is driven by more than the audiogram. Eradicating discharge, removing cholesteatoma, restoring a dry and safe ear, and improving hearing together lift the score. Across these series, the COMQ-12 has emerged as the most responsive of the disease-specific tools, meaning it shows the largest and most reliable change for a given clinical improvement [2014]. The chart below shows the typical magnitude of that change.

COMQ-12 total score falls after middle-ear surgery (lower is better)

08152330COMQ-12 total (0-60)PreoperativePostoperative
Time pointPostoperativeCOMQ-12 total score7 pts

COMQ-12 scores each of 12 items 0-5 (total 0-60); a lower total means better disease-specific quality of life. Prospective series report the overall average falling from about 25 preoperatively to about 7 after surgery (p<0.001), and the COMQ-12 is the most responsive of the disease-specific instruments to surgical intervention (Phillips 2014; postoperative responsiveness data). Illustrative cohort means following the cited pattern, not one patient. Verified via PubMed.

Two cautions temper this encouraging picture. First, much of the benefit in active COM comes from control of disease— a dry, safe ear — rather than from hearing gain alone, so a falling COMQ-12 after tympanomastoid surgery should not be read as proof that the ossiculoplasty component succeeded; the audiogram still does that job. Second, generic health-related quality of life (for example the EQ-5D) often barely moves after middle-ear surgery even when disease-specific and hearing-specific scores improve substantially, because the burden of a chronic ear sits in domains a generic instrument scarcely samples. The lesson is to match the instrument to the question: use a disease-specific PROM to detect benefit in ear disease, and do not be surprised when a blunt generic measure stays flat.

CStatistical versus clinical significance: the MCID

A change in a PROM score can be statistically significant— unlikely to be due to chance — and yet be far too small for any patient to notice. The bridge between the two is the minimal clinically important difference (MCID): the smallest change in score that patients themselves regard as meaningful. Without it, a study can report a “significant” improvement that is clinically trivial, and a clinician has no principled way to tell a worthwhile result from statistical noise.

For the ZCMEI-21 the MCID has been established empirically. Using an anchor-based method against patients’ own global rating of change in a longitudinal surgical cohort, the MCID was estimated at about 5 points (mean preoperative score 28.6 falling to 21.8 after surgery) [2020]. The practical rule that follows is that a ZCMEI-21 improvement of roughly 5 points or more is likely to be felt by the patient, whereas a smaller change — however tidy the p-value — probably is not. The interpreter below puts that into practice: set a before-and-after score and see whether the change clears the threshold.

Is the change big enough to matter? ZCMEI-21 and the MCID

Change (pre − post)+7 ptsMCID threshold5 ptsVerdictMeets MCID

Clinically meaningful improvement

The ZCMEI-21 ranges 0–84, where a lower total means better quality of life, so an improvement is a fall in score. A change of about 5 points or more is the minimal clinically important difference (MCID): below it, a statistically detectable change is unlikely to be felt by the patient (Nakas 2020; mean preoperative 28.6, postoperative 21.8). Illustrative tool, not a substitute for the full questionnaire. Verified via PubMed.

Anchoring interpretation to the MCID also disciplines how we read the literature. A paper that reports a statistically significant PROM change but a magnitude below the MCID has, in clinical terms, shown very little; one that reports a change comfortably above it has shown a benefit patients can feel. When you appraise an ossiculoplasty or tympanomastoid series, look for the magnitude of PROM change against its MCID, not merely the p-value [2020].

CUsing PROMs in practice and counselling

PROMs are not only research instruments; they have a place at the bedside. Administering a disease-specific questionnaire beforesurgery captures a baseline of what most troubles this particular patient — discharge, hearing, social withdrawal or worry — and that profile should shape both the operative plan and the consent conversation. A patient whose ZCMEI-21 burden lies almost entirely in the ear-symptom and mental-health domains, with little hearing complaint, is telling you that a dry, safe ear matters more to them than the last few decibels of gap closure, and the consent discussion should be framed accordingly.

This connects directly to the older insight that technical success and patient benefit are not the same thing. Smyth and Patterson showed decades ago that patients and surgeons frequently disagree on whether middle-ear reconstruction was worthwhile, and derived the Belfast rule of thumb — that a unilateral reconstruction is worthwhile when the operated ear reaches 30 dB or comes within 15 dB of the other ear — precisely because monaural gap closure overstates benefit when the contralateral ear hears well [1985]. Modern disease-specific PROMs extend that idea: they let the patient, not the audiogram, define the endpoint, and they capture the dimensions — discharge, water precautions, embarrassment, anxiety — that a pure-tone average can never reach.

In summary, the audiogram measures the operation; the PROM measures the patient. For chronic ear disease, choose a validated disease-specific instrument (COMQ-12, COMOT-15 or ZCMEI-21), administer it before and after, respect the direction of scoring, and judge the change against its MCID rather than its p-value. Pair it with the audiometric result and a binaural sanity check, and the report finally answers both questions that matter: did the chain conduct, and did the patient’s life improve [2014, 1985].

Case 8.15
A 42-year-old woman with right-sided active chronic otitis media undergoes a canal-wall-up tympanomastoidectomy with PORP ossiculoplasty. Three months later her ear is dry and safe, but her four-frequency air-bone gap has only improved from 32 dB to 26 dB, which the trainee records as a disappointing audiometric result. Her preoperative ZCMEI-21 score was 38 and her postoperative score is 18; she volunteers that she is delighted because the constant discharge and embarrassment have gone and she can swim again. The trainee asks how to reconcile the modest audiogram with the large fall in the questionnaire score.

What is the most accurate interpretation of these results?

Self-assessment - Patient-Reported Outcomes and Quality of Life4 questions
Question 1 · Foundation

Why might a patient-reported outcome measure capture benefit from ossiculoplasty that the air-bone gap does not?

Question 2 · Foundation

The Glasgow Benefit Inventory (GBI) differs from the disease-specific chronic otitis media questionnaires in which key respect?

Question 3 · Trainee

A trainee records that a patient's COMQ-12 total rose from 18 preoperatively to 40 postoperatively and concludes the operation improved quality of life. What is wrong with this conclusion?

Question 4 · Clinician

A study reports a statistically significant fall of 2 points in the ZCMEI-21 after a new ossiculoplasty technique. How should this be interpreted given a published MCID of about 5 points?

Tracked locally in your browser — see /progress for the dashboard.