Rahul Banerjee, MD, discusses how to navigate the obstacles in interpreting Kaplan-Meier curves.
Compared with other eponymous curves, like Frank-Starling, Kaplan-Meier (K-M) curves are rarely mentioned in medical school. Nevertheless, you’ve no doubt come across them in articles, presentations, drug inserts, and more. As initially described by Edward L. Kaplan, PhD; and Paul Meier, PhD, 63 years ago, these curves summarize time-to-event probabilities in datasets of patients who receive different amounts of follow-up.1
Probably you understand the gist of K-M curves (if not, see references 2 and 3). What follows is a primer that trainees like you and me can use to understand the anatomy of K-M curves. This article does not delve into the assumptions or mathematical intricacies that underlie the curves but focuses instead on essential concepts.
The y-axis shows the probability—0% to 100% or 0.0 to 1.0—of event-free survival with regard to the event of interest. However, the event of interest relative to survival varies (see Table). Overall survival (OS), or length of time to patient death from any cause, is always relevant. When considering progression-free survival (PFS), disease-free survival (DFS), and relapse-free survival (RFS), ask yourself whether the event is meaningful to the patient. It probably is in an aggressive malignancy but not for in indolent chronic condition.
Certain modifications to these measurements warrant discussion. Time to next treatment (TTNT) generally exceeds PFS because not every patient with progressive disease requires or desires a new therapy immediately.4 Conversely, when patients receive additional therapy after suboptimal responses to chemotherapy but don't strictly meet the criteria for progressive disease (eg. as with Hodgkin lymphoma), modified PFS, which combines elements of TTNT and true PFS, has been used.5 In certain cases, unique complications may be added to RFS as events of interest, such as graft-vs-host disease-free/relapse-free survival (GRFS) in studies of allogenic stem cell transplantation.6
Finally, with PFS2(time to second objective disease progression), the emphasis is on PFS after next line of therapy (beyond line being studied) is initiated.
For instance, the KEYNOTE-024 study of pembrolizumab plus chemotherapy versus chemotherapy alone was more accurately a study of “pembrolizumab now” (ie, combination therapy in first line vs “pembrolizumab later” (ie, saving check-point inhibition for relapse). In this case, PFS2 shows whether initial treatment with pembrolizumab offers an advantage over waiting until time of relapse.7
There are 3 questions you should ask while reviewing the x-axis:
Examining the Curve (or Curves)
You are probably familiar with median survival, which is either reported with a confidence interval or summarized as "not reached" if less than 50% of patients had events of interest during the follow-up period. Likewise, you’ve learned to look at the right side of the K-M curve for a plateau showing that beyond a certain point events of interest no longer occur and thus suggesting a cure. But keep in mind 2 other questions.
First, how are patients censored? All longitudinal data-sets have a time cutoff, either a data lock date or a patient’s last clinic follow-up before the present day. Patients who don’t have an event of interest before the cutoff date are censored. But what if they are censored for other reasons, like study discontinuation or premature loss to followup due to toxicity? As others have noted, such censoring can affect the results of studies in oncology.10,11 When you see icons on the K-M curve indicating censoring, make sure you check how the term is defined.
Second, if there are multiple curves on the same plot, when do they intersect and when do they diverge or converge? You often see curves intersecting when a high-risk, high-reward strategy is being compared against a lowrisk, low-reward one, such as aggressive frontline chemotherapy versus milder regimens. If a pair of curves converge or diverge as you move toward the right, you can surmise that the dĭerence in benefit begins to emerge or dissipate with extended follow-up. All of which may be relevant to you and your patients during clinical decisionmaking. Of note, curves that cross make it more difficult to test a hypothesis using proportionalhazards regression.12,13
If you’re interested in learning more, consider taking a biostatistical course. Several professional societies also host workshops for oncology fellows that cover these concepts in more detail, including the American Society of Clinical Oncology/American Association for Clinical Research Vail Workshop and the American Society for Transplantation and Cellular Therapy Clinical Research Training Course.
A final word of warning: A K-M curve is only as good as the data that goes into it. Beware of selection bias and confounding that might affect how you interpret the curve. For example, in the curve above, it would appear that ventilators are associated with worse survival among hospitalized patients with COVID-19, but patient acuity is clearly the underlying confounder driving what you see. K-M curves convey a wealth of information beyond the hazard ratio, but only if you Nnow how to interpret them.