AI-Initiated Recalls After Screening Mammography Demonstrate Higher PPV for Breast Cancer

News
Article

While recalls initiated by one of two reviewing radiologists after screening mammography were nearly 10 percent higher than recalls initiated by an AI software, the AI-initiated recalls had an 85 percent higher positive predictive value for breast cancer, according to a new study.

Artificial intelligence (AI) may significantly bolster the effectiveness of recalls after women have undergone screening mammography exams, according to new research involving nearly 55,000 women.

For the study, recently published in Radiology, researchers compared mammography recalls and the positive predictive value (PPV) of those recalls for two reviewing radiologists and an AI computer-aided detection (CAD) software (Insight MMG AI version 1.1.6, Lunit). There were 1,348 recall cases and 5,489 cases flagged for consensus discussion that were drawn from 54,991 women (median age of 55) who had screening mammography, according to the study.

After consensus discussions, the researchers found that 14.8 percent of cases flagged by one radiologist (263/1,858) proceeded to recall in comparison to 4.6 percent of cases flagged by the AI CAD software (86/1,886). However, the study authors noted that AI CAD recalls had a positive predictive value (PPV) of 22 percent (19/86) for breast cancer in contrast to a 3.4 percent PPV for recalls (9/263) initiated by one radiologist.

AI-Initiated Recalls After Screening Mammography Demonstrate Higher PPV for Breast Cancer

In the above case, a 55-year-old woman was flagged for recall by artificial intelligence computer-aided detection (AI CAD) after assessment of full-field digital screening mammography. She was not recalled by two radiologists. The AI CAD score of 73 preceded subsequent diagnosis of a stage 3 in situ cancer with a 9 mm T1 invasive cancer and lymph node metastasis. (Images courtesy of Radiology.)

Similarly, the researchers pointed out an 18.6 percent higher number of recalls when mammograms were flagged by two radiologists (57.2 percent, 360/629) in comparison to recalls initiated by one radiologist and the AI CAD software (38.6 percent, 244/632). Yet the study authors determined that PPV for the recalls by one radiologist and AI CAD was 90 percent higher than the recalls involving cases flagged by two radiologists (25 percent vs. 2.5 percent).

“The most important finding was that the consensus discussion recalled a smaller proportion of participants if their digital mammograms were initially flagged by AI CAD compared with flagged for suspicion of breast cancer by AI CAD versus by a radiologist … Of note, among the participants whom the consensus discussion decided to recall, the cancer yield was several times higher when the examinations had been initially flagged by AI CAD,” wrote lead study author Karin E. Dembrower, M.D., Ph.D., who is affiliated with the Department of Oncology-Pathology at the Karolinska Institute and the Department of Radiology at the Capio Sankt Gorans Hospital in Stockholm, Sweden, and colleagues.

In a subsequent sensitivity analysis of cases involving invasive breast cancer, the study authors saw similar results with a fourfold higher PPV for recalls from AI CAD alone (13 percent) versus recalls initiated by one radiologist (3.07 percent).

Three Key Takeaways

1. AI-enhanced recall accuracy. AI CAD demonstrated a significantly higher positive predictive value (PPV) for recalls (22 percent) compared to a single radiologist (3.4 percent), suggesting that AI can improve the accuracy of recalls in mammography screening.

2. Improved cancer detection. When AI CAD was involved in recall decisions alongside a radiologist, the PPV was notably higher than when recalls were based on two radiologists alone (25 percent vs. 2.5 percent), indicating AI's potential to enhance breast cancer detection rates.

3. Potential for optimized screening workflows. The study suggests that AI CAD may help reduce unnecessary recalls while improving cancer yield, potentially refining decision-making processes in mammography screening programs.

Comparing recalls for cases flagged by two radiologists in comparison to those initiated by one radiologist and AI CAD software, the study authors noted an eightfold higher PPV when AI CAD was involved in the recall decision (17.9 percent vs. 2.26 percent).

“This suggests a differential reliance on decision support related to whether that originated from AI CAD or from a fellow radiologist. The observed behavior may attenuate and underestimate the potential benefits of AI CAD in screening programs,” wrote lead study author Karin E. Dembrower, M.D., Ph.D., who is affiliated with the Department of Oncology-Pathology at the Karolinska Institute and the Department of Radiology at the Capio Sankt Gorans Hospital in Stockholm, Sweden,

(Editor’s note: For related content, see “Multicenter Mammography Study Shows Greater Than 10 Percent Increase in Breast Cancer Detection with Adjunctive AI,” “New Mammography Studies Assess Image-Based AI Risk Models and Breast Arterial Calcification Detection” and “Study: Mammography AI Leads to 29 Percent Increase in Breast Cancer Detection.”)

Beyond the inherent limitations of a single-center study, the authors acknowledged that the results with their two-radiologist consensus discussion may not be applicable to single-reader settings. The researchers also pointed out that more subtle signs on mammograms may be more of a factor with lower radiologist recalls as opposed to a lack of trust in AI CAD interpretation.

Recent Videos
New Mammography Studies Assess Image-Based AI Risk Models and Breast Arterial Calcification Detection
Can Deep Learning Provide a CT-Less Alternative for Attenuation Compensation with SPECT MPI?
Employing AI in Detecting Subdural Hematomas on Head CTs: An Interview with Jeremy Heit, MD, PhD
Current and Emerging Legislative Priorities for Radiology in 2025
How Will the New FDA Guidance Affect AI Software in Radiology?: An Interview with Nina Kottler, MD, Part 2
How Will the New FDA Guidance Affect AI Software in Radiology?: An Interview with Nina Kottler, MD, Part 1
Teleradiology and Breast Imaging: Keys to Facilitating Personalized Service, Efficiency and Equity
Addressing the Early Impact of National Breast Density Notification for Mammography Reports
Where the USPSTF Breast Cancer Screening Recommendations Fall Short: An Interview with Stacy Smith-Foley, MD
Can Fiber Optic RealShape (FORS) Technology Provide a Viable Alternative to X-Rays for Aortic Procedures?
Related Content
© 2025 MJH Life Sciences

All rights reserved.