In a recently issued statement from multiple radiology societies, including the RSNA and ACR, researchers offer practical advice for evaluating artificial intelligence (AI) tools, implementing AI into current workflows, and monitoring the technology to help ensure optimal benefit and effectiveness.
Calling artificial intelligence (AI) “the single most disruptive influence on radiology in many decades,” researchers on behalf of five leading radiology societies, including the American College of Radiology (ACR), the Radiological Society of North America (RSNA) and the European Society of Radiology (ESR), have published a multinational statement on practical considerations for assessing, implementing and monitoring AI tools in radiology.
Simultaneously published in five different journals, including Radiology: Artificial Intelligence, Insights into Imaging, and the Journal of the American College of Radiology (JACR), the multi-society statement delves into potential biases with AI use, steps for assessing clinical accuracy, goal setting for monitoring of AI software, cost considerations and long-term viability.
Here are a few key takeaways from the statement.
1. The researchers emphasized diligent cost-benefit and return on investment (ROI) analyses, tailored to the health-care setting and local circumstances, when considering adjunctive AI tools or applications such as AI-enabled opportunistic screening. Tangible benefits of AI in outpatient imaging centers or fee-for-service hospital settings may range from an increased volume of findings that require follow-up exams or management to increased efficiency in emergency departments and shorter lengths of stay, according to the statement authors.
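To make the kind of cost-benefit arithmetic the authors describe concrete, here is a minimal sketch; every figure in it is a hypothetical assumption for illustration, not a number from the statement.

```python
# Hypothetical ROI sketch for an adjunctive AI tool. Every figure below is
# an assumption for illustration, not data from the multi-society statement.

annual_license_cost = 75_000   # assumed yearly AI software license fee
integration_cost = 20_000      # assumed IT/PACS integration cost, amortized over year 1
exams_per_year = 40_000        # assumed annual exam volume run through the model
added_followup_rate = 0.004    # assumed fraction of exams yielding new actionable findings
revenue_per_followup = 900     # assumed average revenue per resulting follow-up exam

annual_cost = annual_license_cost + integration_cost
annual_benefit = exams_per_year * added_followup_rate * revenue_per_followup

roi = (annual_benefit - annual_cost) / annual_cost
print(f"Annual benefit: ${annual_benefit:,.0f}, cost: ${annual_cost:,.0f}, ROI: {roi:.1%}")
```

Under these assumed numbers, 160 additional follow-up exams per year would yield an ROI of roughly 52 percent; swapping in local volumes, rates and costs is the point of the exercise.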
2. While AI’s potential to alleviate an increasing workload burden amid a shortage of radiologists has been widely discussed, the researchers noted that reduced burnout and improved radiologist recruitment tend to be “additive” benefits that are harder to measure against the costs of AI implementation.
3. The researchers pointed out that AI implementations that only send AI results to an existing Picture Archiving and Communication System (PACS) are problematic, given the potential for automation bias among radiologists and referring physicians’ lack of knowledge about the accuracy and other details of the AI model being utilized.
4. The statement authors emphasize the use of a system, such as a cloud-native environment, that enables radiologists to interact with and possibly modify AI results and share feedback with AI vendors.
“This type of interaction is facilitated in a cloud-native environment where both the PACS and AI models can share radiology data and AI results. Additionally, the ability to accept and store AI results along with radiologist feedback, optimize data security, and continuously monitor AI accuracy are crucial technical aspects that are facilitated in cloud-native systems,” wrote lead statement author Adrian Brady, M.D., president of the ESR and a clinical professor of radiology at University College Cork in Cork, Ireland, and colleagues.
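The statement does not prescribe a schema, but as a rough illustration of the kind of record such a system might store, pairing an AI result with radiologist feedback for later monitoring, consider the following sketch; all field names are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime

# Hypothetical record pairing an AI result with radiologist feedback.
# Field names are illustrative, not from the statement or any vendor API.
@dataclass
class AIResultFeedback:
    study_uid: str            # DICOM study instance UID
    model_name: str           # AI model that produced the result
    model_version: str        # version string, so software upgrades can be tracked
    ai_finding: str           # e.g., "pneumothorax: present"
    ai_confidence: float      # model-reported confidence score
    radiologist_agrees: bool  # accept/reject from the reading radiologist
    comment: str              # free-text correction to share back with the vendor
    reported_at: datetime     # timestamp for longitudinal accuracy monitoring
```

Storing the model version and timestamp alongside each accept/reject decision is what later makes it possible to tie a performance change to a specific upgrade, as discussed under continuous monitoring below.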
Other Considerations With Reported Error Rates
5. Reported error rates with AI model testing “may differ substantially” from those encountered in one’s own practice, according to the statement authors. They emphasized consideration of differences in scanner manufacturers, protocols, disease prevalence and the demographics of the local community in which the AI software is being deployed. Beyond error frequency, Brady and colleagues said those evaluating AI models should also consider the detectability and correctability of errors, as well as the potential impact of model errors on patients.
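One practical way to act on this caution is to score the model on a locally labeled sample before relying on vendor-reported figures. The sketch below is a minimal illustration of that comparison; the sample data and variable names are assumptions, not drawn from the statement.

```python
# Minimal local-validation sketch: score an AI model on a locally labeled
# sample and compare against vendor-reported figures. The ten exams below
# are hypothetical placeholders (1 = finding present, 0 = absent).

local_labels   = [1, 0, 0, 1, 0, 1, 0, 0, 0, 1]   # ground truth from local radiologists
ai_predictions = [1, 0, 1, 1, 0, 0, 0, 0, 0, 1]   # model output on the same exams

tp = sum(1 for y, p in zip(local_labels, ai_predictions) if y == 1 and p == 1)
fp = sum(1 for y, p in zip(local_labels, ai_predictions) if y == 0 and p == 1)
fn = sum(1 for y, p in zip(local_labels, ai_predictions) if y == 1 and p == 0)
tn = sum(1 for y, p in zip(local_labels, ai_predictions) if y == 0 and p == 0)

sensitivity = tp / (tp + fn)   # compare against the vendor's reported sensitivity
specificity = tn / (tn + fp)   # compare against the vendor's reported specificity
ppv = tp / (tp + fp)           # depends heavily on local disease prevalence
print(f"Sensitivity {sensitivity:.0%}, specificity {specificity:.0%}, PPV {ppv:.0%}")
```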
Targeted Implementation of AI Software
6. Focusing implementation of AI models on health-care settings where disease prevalence is higher may facilitate improved acceptance of the model in question, according to the statement authors (see the worked example following the quote below).
“For example, pneumothorax (PTX) on chest X-ray (CXR) has a higher prevalence in the inpatient rather than the average population,” pointed out Brady and colleagues. “Limiting a PTX AI model to only inpatient CXRs will provide fewer false positive results and will more likely be accepted by the radiologists from an accuracy standpoint.”
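A short worked example shows why higher prevalence translates into fewer false positives for a fixed model, via the positive predictive value (PPV): the chance that a flagged exam truly has the finding. The sensitivity, specificity and prevalence figures below are assumptions for illustration, not values from the statement.

```python
# Worked example: how disease prevalence changes positive predictive value
# (PPV) for a fixed model. All numbers are assumptions for illustration.

sensitivity = 0.90   # assumed model sensitivity for pneumothorax on CXR
specificity = 0.95   # assumed model specificity

def ppv(prevalence: float) -> float:
    """Bayes' rule: P(finding present | positive AI result) at a given prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)

# Assumed prevalences: higher among inpatients than in the broader CXR population.
print(f"Inpatient CXRs (5% prevalence):    PPV = {ppv(0.05):.0%}")    # ~49%
print(f"All-comer CXRs (0.5% prevalence):  PPV = {ppv(0.005):.0%}")   # ~8%
```

Under these assumptions, the same model goes from roughly one true finding per two alerts in the inpatient setting to roughly one in twelve in a low-prevalence population, which is exactly the acceptance problem the authors describe.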
Recognizing the Benefits of Continuous Monitoring with AI Models
7. The statement authors emphasized continuous monitoring of AI models and sharing of those assessments across multiple sites and geographic regions via an AI data registry. Doing so would allow registry participants to distinguish local issues affecting AI model performance from more systematic problems tied to software updates (see the sketch following the quote below).
“Hypothetically, analysis of the aggregate institutional registry data might show the poor performance to be limited to a single machine,” posited Brady and colleagues. “Further analysis might also show that the performance degradation occurred after a software upgrade to that machine or change in examination protocol.”
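A minimal sketch of the kind of registry analysis the authors hypothesize might look like the following; the record format, agreement metric and alert threshold are all assumptions for illustration.

```python
from collections import defaultdict

# Hypothetical registry records: (machine_id, software_version, agreement_rate),
# where agreement_rate is the fraction of AI results radiologists accepted
# during a monitoring window. All values below are illustrative.
registry = [
    ("CT-01", "v2.1", 0.94), ("CT-01", "v2.2", 0.93),
    ("CT-02", "v2.1", 0.95), ("CT-02", "v2.2", 0.78),  # drop after upgrade
    ("CT-03", "v2.1", 0.92), ("CT-03", "v2.2", 0.91),
]

# Group records by machine, then flag any drop beyond an assumed 10-point
# threshold between consecutive software versions on the same machine.
by_machine = defaultdict(list)
for machine, version, rate in registry:
    by_machine[machine].append((version, rate))

for machine, history in by_machine.items():
    for (v_old, r_old), (v_new, r_new) in zip(history, history[1:]):
        if r_old - r_new > 0.10:
            print(f"{machine}: agreement fell {r_old:.0%} -> {r_new:.0%} "
                  f"after {v_old} -> {v_new} upgrade")
```

Run on the sample data, this flags only CT-02 after its version change, mirroring the authors’ hypothetical of poor performance localized to a single machine following a software upgrade.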
Contributing Factors That Can Affect Acceptance of AI Models
8. Identifying “Wow” cases can help build stakeholder buy-in for AI models. The statement authors noted that cases demonstrating a significant impact on patient outcomes or operational efficiency can vividly convey the potential of AI to stakeholders such as referring physicians and facility administrators.
9. Automation bias, algorithmic bias, and user-interface (UI) design may also factor into the assessment and acceptance of AI models, according to the statement authors. They noted one study in which text-only UI output outperformed radiologist readers for pulmonary nodule detection, while AI image overlays, often preferred by radiologists in this use context, did not enhance the performance of reviewing radiologists.