Image interpretation and accuracy may be effectively measured with a new software program.
The software program RadBench effectively and objectively measures image interpretation performance and image interpretation accuracy, according to a study published in Radiography.
Researchers from the United Kingdom sought to develop an assessment tool, called RadBench, in order to regularly measure and monitor image interpretation performance.
The study used two test banks, Test 1 and Test 2 to investigate the use of the software. Each test contained 20 appendicular musculoskeletal images (17 adult, three child). The images, 10 of which were normal and 10 with fractures, were of the ankle, foot, knee, hand, wrist, and elbow. Forty-two radiology professionals participated: 34 general radiographers, three reporting radiographers, two radiologists, and three medical imaging academics.[[{"type":"media","view_mode":"media_crop","fid":"45798","attributes":{"alt":"","class":"media-image media-image-right","id":"media_crop_7105811328381","media_crop_h":"0","media_crop_image_style":"-1","media_crop_instance":"5263","media_crop_rotate":"0","media_crop_scale_h":"0","media_crop_scale_w":"0","media_crop_w":"0","media_crop_x":"0","media_crop_y":"0","style":"height: 124px; width: 179px; border-width: 0px; border-style: solid; margin: 1px; float: right;","title":"©Bloomua/Shutterstock.com","typeof":"foaf:Image"}}]]
The participants ranked each image on a five-point scale, ranging from one, definitely normal, to five definitely abnormal. These choices were then calculate by RadBench to determine the reader’s sensitivity, specificity, and accuracy.
“The RadBench software generated calculations of sensitivity, specificity, and accuracy in addition to a decision making map for each respondent,” the authors wrote. Among the early findings, the researchers found that there was a 5% mean difference between image banks, confirming that benchmarking must be related to a specific test. “The benchmarking option within the software enabled the user to compare their score with the highest, lowest and mean score of others who had taken the same test.”
The highest scores were among reporting radiographers and radiologists (all scored 95% or above accuracy in both tests), while the general radiographer population scored between 60% and 95%.
The authors concluded that the RadBench tool was capable of providing benchmark measures of image interpretation accuracy, with the potential for comparison across populations.
New Study Examines Short-Term Consistency of Large Language Models in Radiology
November 22nd 2024While GPT-4 demonstrated higher overall accuracy than other large language models in answering ACR Diagnostic in Training Exam multiple-choice questions, researchers noted an eight percent decrease in GPT-4’s accuracy rate from the first month to the third month of the study.
The Reading Room Podcast: Emerging Trends in the Radiology Workforce
February 11th 2022Richard Duszak, MD, and Mina Makary, MD, discuss a number of issues, ranging from demographic trends and NPRPs to physician burnout and medical student recruitment, that figure to impact the radiology workforce now and in the near future.
Can Innovations with AI Help Address the Impact of Staffing Shortages on Radiology Workflow?
October 7th 2024While staffing shortages in radiology continue to persist after the COVID-19 pandemic, current and emerging innovations powered by artificial intelligence (AI) may help facilities navigate these challenges and mitigate rising costs of health care.