What a New Study Reveals About AI, Bias and Mammography Assessment

May 2, 2023

Regardless of experience level, radiologists are likely to be affected by automation bias when utilizing adjunctive artificial intelligence (AI) for mammography interpretation, according to newly published research.

Noting a significant decline in correct Breast Imaging Reporting and Data System (BI-RADS) mammography assessments by radiologists of all experience levels when a purported AI system suggested an incorrect BI-RADS category, the authors of a new prospective study suggested that “all radiologists … can be subject to automation bias.”

In the study, recently published in Radiology, 27 radiologists read 50 mammograms with a purported adjunctive AI system, and researchers assessed radiologist performance, the degree of bias in BI-RADS scoring and radiologist confidence in their own BI-RADS assessments. The researchers classified 11 radiologists as inexperienced (a mean of five months of experience interpreting mammograms), 11 as moderately experienced (a mean of 13 months) and five as very experienced (a mean of 129.6 months).

When the AI system utilized in the study suggested the correct BI-RADS category, the researchers saw “no significant difference” in assessments among inexperienced radiologists (mean of 79 percent correct BI-RADS assessments), moderately experienced radiologists (mean of 81.3 percent) and very experienced radiologists (mean of 82.3 percent).

However, correct radiologist BI-RADS scoring declined significantly when the AI system suggested an incorrect BI-RADS category, with a decrease of nearly 60 percentage points for inexperienced radiologists (19.8 percent correct), over 56 percentage points for moderately experienced radiologists (24.8 percent correct) and over 36 percentage points for very experienced radiologists (45.5 percent correct).
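The declines quoted above match the gap, in percentage points, between the two reported accuracy figures for each group. The short Python sketch below reproduces that arithmetic from the means given in this article and also shows the corresponding relative decline, which is not reported in the article and is included only for comparison. The names `reported` and `declines` are illustrative and do not come from the study.

```python
# Mean percent of correct BI-RADS assessments as reported in this article,
# keyed by reader group: (AI suggestion correct, AI suggestion incorrect).
reported = {
    "inexperienced": (79.0, 19.8),
    "moderately experienced": (81.3, 24.8),
    "very experienced": (82.3, 45.5),
}


def declines(correct_ai: float, incorrect_ai: float) -> tuple[float, float]:
    """Return (percentage-point drop, relative drop in percent)."""
    point_drop = correct_ai - incorrect_ai
    # Relative drop is an added comparison, not a figure from the study.
    relative_drop = 100.0 * point_drop / correct_ai
    return round(point_drop, 1), round(relative_drop, 1)


results = {group: declines(*rates) for group, rates in reported.items()}

for group, (points, relative) in results.items():
    print(f"{group}: {points} percentage points ({relative} percent relative)")
```

The percentage-point drops come out to 59.2, 56.5 and 36.8, consistent with the "nearly 60," "over 56" and "over 36" figures cited above.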

“Inexperienced, moderately experienced, and very experienced radiologists were worse at assigning the correct (BI-RADS) scores for cases in which the purported AI suggested an incorrect BI-RADS category. These results suggest that all radiologists, regardless of expertise, can be subject to automation bias,” wrote lead study author Thomas Dratsch, M.D., who is affiliated with the Institute of Diagnostic and Interventional Radiology at the University of Cologne in Germany, and colleagues.

Images courtesy of Radiology.

The researchers also noted that inexperienced radiologists rated the accuracy of the AI system utilized in the study as a median nine out of 10 on a Likert scale and rated their own BI-RADS assessment skills as a median two out of 10.

“ … Inexperienced radiologists were less confident in their own BI-RADS ratings compared with moderately and very experienced readers, which may potentially make them more vulnerable to following incorrect suggestions by AI,” suggested Dratsch and colleagues.

(Editor’s note: For related content, see “Study: Emerging AI Platform for DBT Shows 23 Percent Increase in Breast Cancer Detection Rate,” “Study: AI Improves Cancer Detection Rate for Digital Mammography and Digital Breast Tomosynthesis” and “Meta-Analysis Finds High Risk of Bias in 83 Percent of AI Neuroimaging Models for Psychiatric Diagnosis.”)

In an accompanying editorial, Pascal A.T. Baltzer, M.D., Ph.D., said the research from Dratsch and colleagues “highlights the need for caution when implementing AI-assisted breast imaging without proper training and knowledge of its reliability.”

To mitigate possible automation bias in breast imaging, Dr. Baltzer emphasized ongoing training, performance benchmarking and continuous feedback for radiologists. Ensuring appropriate validation and transparency with adjunctive AI is also critical, according to Dr. Baltzer, a consultant radiologist in breast imaging with the Department of Biomedical Imaging and Image-guided Therapy at the Medical University of Vienna and an executive board member of the European Society of Breast Imaging.

In regard to study limitations, the researchers acknowledged they did not compare adjunctive use of the AI-based system to radiologist performance without AI. Dratsch and colleagues also noted they did not consider other factors that may have contributed to radiologist assessment and only focused on mammograms that incorporated standard BI-RADS ratings.
