|Articles|December 9, 2016

IBM's Watson Achieves High Concordance in Tumor Board Test

The artificial intelligence computer program Watson for Oncology (WFO) achieved a high degree of concordance with tumor board recommendations in a double-blinded validation study in Bengaluru, India, according to results presented at the 2016 San Antonio Breast Cancer Symposium (SABCS).

SP Somashekhar, MBBS, MS, MCH, FRCS

In the study of cases involving 638 patients with breast cancer treated at Manipal Comprehensive Cancer Center, 90% of WFO’s recommendations for standard treatment (REC) or consideration (FC) were concordant with the recommendations of the tumor board. A group of 12-15 oncologists met weekly to review cases, entered data into the WFO system, and then analyzed the degree of concordance between WFO’s recommendations and those of the tumor board, as well as the time it took the oncologists to generate their recommendations.

The degree of concordance varied according to the type of breast cancer, lead study author SP Somashekhar, MBBS, MS, MCH, FRCS, said in his presentation at SABCS. WFO recommendations were concordant nearly 80% of the time in nonmetastatic disease, but only 45% of the time in metastatic cases. In cases of triple-negative breast cancer, WFO agreed with physicians 68% of the time, but in HER2/neu-negative cases, WFO’s recommendations matched the physicians’ recommendations only 35% of the time.

In cases of discordance between WFO’s recommendations and those of the tumor board, tumor board decisions were changed 63% of the time (n = 100) following review.

The study’s authors concluded that the broader divergence between WFO’s recommendations and those of the tumor board could be attributed to the greater number of treatment options available for patients with HER2/neu-negative breast cancer.

“Including HER2/neu cases opens up many more treatments and variables for consideration,” Somashekhar, chairman of the Manipal Comprehensive Cancer Center, explained. “This increases demands on human thinking capacity. More complicated cases led to more divergent opinions on the recommended treatment.”

Physicians took longer to weigh the available treatment options and come to a recommendation compared to WFO, although the doctors were able to work faster as they gained familiarity with cases. Somashekhar said it took doctors an average of 20 minutes initially. As they improved, the mean time dropped to about 12 minutes. By comparison, WFO achieved a median time of 40 seconds to capture and analyze data and give a treatment recommendation.

The study authors said that although WFO recommendations often led the tumor board to reconsider their decisions, the computer program remains a support tool for physicians and cannot replace the “human touch” needed to act upon the many factors of patient engagement that go beyond data analysis.

“We are dealing with human beings, and the context and preferences of each individual patient, the patient—physician relationship, and the human touch and empathy are very important,” Somashekhar said. “It’s always going to be the decision of the treating oncologist and patient to determine what is truly the best option for the patient.”

One reason why physicians do not have to worry about Watson replacing them is that they can perform better at 1-on-1 assessments. For example, whereas in metastatic disease, Watson tended to recommend conservatively based on best available evidence, physicians were more likely to select an aggressive chemo regimen to achieve a high level of response, Somashekhar said. This explains some of the discordance in recommendations, he added.

In the study, WFO analyzed >100 patient attributes for breast cancer and provided a ranking of treatment options according to REC, FC, and “not recommended” (N-REC). The recommendations were backed by data from recent trials, and oncologists were able to click on options listed by Watson to find out more about the recommendations and the reasons for them. The cases were at most 3 years old.

Somashekhar said the study was not designed to evaluate why differences in recommendations occurred, the inferiority or superiority of recommendations, or the impact of WFO on workflow. He said WFO, developed by IBM, is a promising tool that warrants consideration in a variety of other clinical settings and study designs.

Doctors at Memorial Sloan Kettering Cancer Center (MSK) helped to program WFO to enable it to make recommendations on cancer treatment. The system extracts and assesses large amounts of structured and unstructured data from medical records through natural language processing and machine learning. In addition to breast tumors, it is also capable of making recommendations for lung and colorectal cancers.

WFO’s concordance with MSK oncologists’ opinions has been tested in 2 previous studies, showing agreement 90% of the time in one and 50% of the time in another. Doctors in Thailand have been using the system for more than a year, and IBM announced this past summer that it was expanding the program to China, where it was expected to be of high value to doctors in rural centers who don’t have access to resources available to doctors in centralized clinics.

Watson, which has been in use in the Manipal hospital system for 6 months, has proven valuable in controlling cancer clinic costs, because it helps to eliminate bias and errors. “This is something that would ensure that we arrive at the right decision first,” Somashekhar said.

Somashekhar SP. Double blinded validation study to assess performance of IBM artificial intelligence platform Watson for oncology in comparison with Manipal multidisciplinary tumor board—first study of 638 breast cancer cases. Presented at: San Antonio Breast Cancer Symposium, Friday, Dec. 9, 2016; San Antonio, TX. Abstract S6-07

<<<

Stay up to date on the most recent and practice-changing oncology data

Related to this article

Cancer | Image Credit: © catalin - stock.adobe.com

July 11th, 2026

Revisit Every OncLive On Air Episode From June 2026

In case you missed any, check out our recap of the episodes of OncLive On Air that aired in June 2026.

July 10th, 2026

FDA Flashback: Breast Cancer Decisions and News From June 2026

Read a refresh of breast cancer FDA news from June 2026, including several practice-changing approvals and the granting of priority review to a SERD.

RAS G12D–Mutant Metastatic PDAC © stock.adobe.com.

July 9th, 2026

Zoldonrasib Combinations Show Compelling Antitumor Activity in RAS G12D–Mutant Metastatic PDAC

Phase 1/2 data from 2 trials showed high ORRs with zoldonrasib-based regimens in first-line and previously treated RAS G12D metastatic pancreatic ductal adenocarcinoma.

EHA 2026 | Image Credit: © Curie - stock.adobe.com

July 9th, 2026

Hematologists Detail Practice-Shaping Data That Emerged at EHA 2026 (VIDEO)

Hematologic oncology experts outline the top data and themes to emerge from the 2026 EHA Congress

Embolization-Eligible HCC © stock.adobe.com.

July 8th, 2026

STRIDE Plus TACE With/Without Lenvatinib Improves Responses in Embolization-Eligible HCC

In EMERALD-3, STRIDE with or without lenvatinib plus TACE improved PFS, ORR, and DOR vs TACE alone by RECIST 1.1 and mRECIST criteria in HCC.

MRD in CRC Liver Metastases © stock.adobe.com

July 8th, 2026

ctDNA MRD Status Predicts OS Benefit With Adjuvant Chemotherapy in Resected CRC Liver Metastases

Updated data from the GALAXY study showed an association between postsurgical MRD positivity per Signatera and OS benefit with adjuvant chemotherapy.

Domvanalimab/Zimberelimab Plus Chemotherapy in HER2-Negative Gastroesophageal Adenocarcinoma | Image Credit: © Image by Ashling Wahner & MJH Life Sciences Using AI

July 7th, 2026

Domvanalimab/Zimberelimab Plus Chemotherapy Fails to Improve OS in HER2-Negative Gastroesophageal Adenocarcinoma

First-line domvanalimab plus zimberelimab and chemotherapy did not improve OS vs nivolumab plus chemotherapy in HER2-negative gastroesophageal cancer.

FOLFOX Plus Atezolizumab in Stage III dMMR Colon Cancer: ©Image by Ashling Wahner & MJH Life Sciences Using AI

July 7th, 2026

Extended FOLFOX, Atezolizumab Duration Linked With Improved DFS in Stage III dMMR Colon Cancer

Longer duration of adjuvant FOLFOX and atezolizumab was associated with improved DFS in a retrospective analysis of the ATOMIC trial.

IBM's Watson Achieves High Concordance in Tumor Board Test

Related to this article

Trending on OncLive

FDA Approves Pembrolizumab Plus Enfortumab Vedotin in MIBC

Revisit Every OncLive On Air Episode From June 2026

FDA Flashback: Breast Cancer Decisions and News From June 2026

Risvutatug Rezetecan Shows OS Benefit Over Topotecan in Relapsed SCLC

FDA Approves Subcutaneous Isatuximab Delivered Via On-Body Delivery System for Multiple Myeloma