Scientists at Dana-Farber Cancer Institute have demonstrated that an artificial intelligence tool can perform as well as human reviewers – and much more rapidly – in extracting clinical information regarding changes in tumors from unstructured radiology reports for patients with lung cancer.
The AI tool performed comparably to trained human “curators” in detecting the presence of cancer; and whether it was responding to treatment interventions, stable or worsening.
The goal of the study, said corresponding author Kenneth Kehl, MD, MPH, a medical oncologist and faculty member of the Population Sciences Department at Dana-Faber, was to determine whether artificial intelligence tools can extract the most high-value cancer outcomes from radiology reports, which are a ubiquitous but unstructured data source.
Kehl noted that electronic health records now collect vast amounts of information on thousands of patients seen at a center like Dana-Farber. However, unless the patients are enrolled in clinical trials, information about their outcomes, such as whether their cancers grow or shrink in response to treatment, is recorded only in the text of the medical record. Historically, this unstructured information is not amenable to computational analysis and therefore could not be used for research into the effectiveness of treatment.
Because of studies like the Profile initiative at Dana-Farber/Brigham and Women’s Cancer Center, which analyzes patient tumor samples and creates profiles that reveal genomic variants that may predict responsiveness to treatments, Dana-Farber researchers have accumulated a wealth of molecular information about patients’ cancers. “But it can be difficult to apply this information to understand what molecular patterns predict benefit from treatments without intensive review of patients’ medical records to measure their outcomes. This is a critical barrier to realizing the full potential of precision medicine,” said Kehl.
For the current study, Kehl and colleagues obtained over 14,000 imaging reports for 1,112 patients and manually reviewed records using the “PRISSMM” framework. PRISSMM is a phenomic data standard developed at Dana-Farber that takes unstructured data from text reports in electronic health records and structures them so that they can be readily analyzed. PRISSMM structures data pertaining to a patient’s pathology, radiology/imaging, signs/symptoms, molecular markers, and a medical oncologist’s assessment to create a portrait of the cancer patient journey.
Human reviewers analyzed the imaging text reports and noted whether cancer was present and, if so, whether it was worsening or improving, and if the cancer had spread to specific body sites. These reports were then used to train a computational “deep learning” model to recognize these outcomes from the text reports. “Our hypothesis was that deep learning algorithms could use routinely generated radiology text reports to identify the presence of cancer and changes in its extent over time,” the authors wrote.
The researchers compared human and computer measurements of outcomes such as disease-free survival, progression-free survival, and time to improvement or response, and found that the AI algorithm could replicate human assessment of these outcomes. The deep learning algorithms were then applied to annotate another 15,000 reports for 1,294 patients whose records had not been manually reviewed. The authors found that computer outcome measurements among these patients predicted survival with similar accuracy to human assessments among the manually reviewed patients.
The human curators were able to annotate imaging reports for about three patients per hour, a rate at which one curator would need about six months to annotate all of the nearly 30,000 imaging reports for the patients in the cohort. By contrast, the artificial intelligence model that the researchers developed could annotate the imaging reports for the cohort in about 10 minutes, the researchers said in a report in JAMA Oncology.
“To create a true learning health system for oncology and to facilitate delivery of precision medicine at scale, methods are needed to accelerate curation of cancer-related outcomes from electronic health records,” said the authors of the publication. If applied widely, the investigators said, “this technique could substantially accelerate efforts to use real-world data from all patients with cancer to generate evidence regarding effectiveness of treatment approaches.” Next steps will include testing this approach on EHR data from other cancer centers and using the data to discover which treatments work best for which patients.
The senior author of study is Deborah Schrag, MD, MPH, Chief of Division of Population Sciences at Dana-Farber and a medical oncologist.
The research was supported by grants from the National Cancer Institute (K05CA169384, RO1CA203636, and U01CA209414); the Claude Dauphin Philanthropic Research Fund, and the Simeon J. Fortin Foundation.
Kenneth Kehl reports serving as a consultant to Aetion, Inc. Conflicts reported by Deborah Schrag include serving as a consultant/advisor to Cureus and Pfizer, receiving research funding from the American Association of Cancer Research; serving as an editor and receiving personal fees from JAMA outside the related work; and holding equity in mixed biotech funds including QQQ. She also reports holding a trademark and having financial interest in the PRISSMM phenomic data standard.