New research on 175,000 women - the largest NHS study to date - on the use of AI in breast cancer screening shows that AI detected more cases of invasive cancer, more cases overall, had fewer false positives, and recalled fewer women having their first scan than humans did. For one part of the study, AI reduced the time spent reading scans by almost a third.
The research was conducted by Imperial College London, Google, the universities of Cambridge and Surrey, NHS Trusts at Cambridge University Hospitals, Imperial College Healthcare, the Royal Marsden, the Royal Surrey and St George's University Hospitals, and the AIMS public engagement group.
One case every 10 minutes
Breast cancer is the most common cancer in the UK, with one woman diagnosed every 10 minutes.
Yet there is a 29 per cent shortfall of clinical radiologists – almost 2,000 – and this is predicted to rise to 39 per cent by 2029. This research demonstrates the potential for more women to be diagnosed and treated sooner, while reducing radiologists' workload.
In the UK, breast cancer scans are assessed by two readers, usually specialist radiologists. Each reads the scan separately, with the second reader either knowing or not knowing the first reader's decision.
This research looked at how two human readers compared to one human reader plus one AI reader, using AI software developed by Google. The findings are published in two linked papers out today in Nature Cancer.
This is the closest AI has ever come to helping reduce breast cancer deaths within the NHS, so the potential for the NHS to take this forward is significant, particularly in light of the National Cancer Plan for England's recognition that 'there are few clearer signs of the failure of the status quo than our inadequate cancer outcomes' and its appetite to embrace new technologies to address that. Dr Hutan Ashrafian Institute of Global Health Innovation, author on both papers
There were three parts to the study.
The first was a retrospective study of 125,000 women, aged 50 to 70, from five NHS screening services. They were screened in 2015-16 over a 39-month follow-up period. The final analysis included 115,973 breast cancer scans.
AI as the second reader achieved better results than the first human reader, with the cancer detection rate (CDR) rising from 7.54 (human) to 9.33 (AI) per 1,000 women.
AI also identified more invasive cancers, significantly reduced false positives, and detected 25 per cent of interval cancers (cancers detected between healthy scans).
AI performed particularly well for first screens – with 39.3 per cent fewer recalls and a 8.8 per cent higher CDR.
With AI, the amount of time taken to read a scan reduced by almost a third (32.1 per cent or 195,983 vs 288,616 reads), representing a significant reduction in workload.
The second part of the study looked at 9,266 current cases at two screening services at 12 sites in London.
During the first two weeks, AI had a higher recall rate than the humans did and was above the target recall rate for the study, so the researchers adjusted the criteria. Despite this, AI continued to have a higher recall rate.
Big time savings
However, across both sites, the average time for AI to complete a read was 17.7 mins compared to 2.08 days for the first human reader, a significant time saving.
First use of AI in arbitration
The third part of the study looked at the use of AI in arbitration in 50,000 women.
Arbitration is when the first and second readers don't agree on the diagnosis and a third reader analyses the scan and makes a final decision. This is the first time AI has been used in this scenario. It found that AI fared comparably to the humans.
AI had a higher arbitration rate but, on balance, reduced the overall screening workload. The researchers suggest that further development of the AI tool could potentially lead to the detection of cancers earlier than with two human readers.
AI has the potential to transform how the NHS prevents, detects and treats diseases like cancer. These findings highlight how AI can support clinicians to identify more cancers earlier, reduce errors and deliver higher quality care to patients. Lord Ara Darzi Director, Institute of Global Health Innovation, author on both papers
Dr Susan Thomas, Clinical Director at Google and an author on both papers, added:
"Early detection is our most powerful tool in the fight against breast cancer, and these findings mark a genuine turning point.
"This is the first time that we've been able to rigorously test doctors and AI working alongside each other in a clinical setting.
"These findings have the potential to support the transformation of the NHS, and the experiences of the people on both sides of the scan, bringing us one step closer to a future where this technology strengthens entire healthcare systems and, ultimately, saves lives."
Professor Deborah Cunningham, a consultant radiologist at Imperial College Healthcare NHS Trust and another author, added:
"This study provides good evidence for the potential use of AI in the real world of screening mammography where staffing is particularly difficult. Its introduction could provide support for the successful NHS breast screening programme, reducing breast cancer mortality.
"The time saved will free up radiologists to perform more hands-on tasks such as needle biopsy, an essential part of the cancer diagnostic pathway. This should not be regarded as a threat to radiologists' livelihood, rather an opportunity to allow us to spend more time deploying our skills and working with colleagues and patients to improve cancer diagnosis and outcomes."
Professor Fiona Gilbert an academic radiologist at the University of Cambridge and another author, added:
"The work on arbitration (when the AI has flagged an abnormality and the human review is uncertain whether or not to recall the woman) is very important and will help inform the UK prospective EDITH trial (the largest international study comparing different AI tools in mammography machines across 30 sites) on how best to deal with those cases where a human believes the case to be negative but the AI tool is calling case positive.
The work was supported by the NIHR Imperial Biomedical Research Centre, a translational research partnership between Imperial College Healthcare NHS Trust and Imperial College London, and funded through the NHS AI in Health and Care Award in partnership with the NIHR.