An efficient reporting system is a necessity in any PACS environment to provide timely, accurate reports to referring physicians. One high-performance Japanese speech recognition solution was presented in an infoRAD exhibit at the recent RSNA meeting.
An efficient reporting system is a necessity in any PACS environment to provide timely, accurate reports to referring physicians. One high-performance Japanese speech recognition solution was presented in an infoRAD exhibit at the recent RSNA meeting.
AmiVoice from Advanced Media of Tokyo provides continuous speech recognition with a 30,000-word radiological lexicon. The system features speaker independence, meaning users are not required to enroll their voices. Users can dictate reports immediately without first training the system to their voice, the speaker adaptation restriction peculiar to most other speech recognition systems.
Most current voice recognition technologies are based on discrete word recognition, which requires users to remember recognition words, inhibiting their natural speech. AmiVoice allows users to speak naturally, at any speed. According to the exhibit, AmiVoice successfully recognizes any given word at a rate exceeding 95%.
The system allows radiologists to choose typing, transcription, or speech recognition for report generation.
"With speech recognition, we are able to create reports in times equivalent to transcriber and hand operation," said Dr. Hidefumi Fujisawa, of the radiology department of Showa University Northern Yokohama Hospital. "Since the system is available 24 hours a day, seven days a week, we are also able to reduce transcriber costs."
One recent paper (Nippon Acta Radiologica 2002;62:23-36) compared 10 Japanese radiological reports created by two radiologists using conventional typing and the AmiVoice system. Neither had any special training in continuous speech recognition systems.
Total speech input time (56.2 sec) was nearly three times faster than the conventional typing input time (142.8 sec). Word misrecognition occurred in 40 of 1362 words (97.1% rate of accuracy of recognition). The average speech recognition time per report was 31.3 sec, with an additional 25.0 sec required for corrections.
The paper concluded that continuous speech recognition is faster than typing, even considering the additional time required for corrections, and is acceptable in view of the overall reduction in report turnaround time.
AmiVoice was released in 1999 and developed jointly with Pittsburgh-based ISI. It is based on ISI's speech recognition engine designed by Alexander Waibel, Ph.D., director of the Interactive Systems Laboratories at Carnegie Mellon University and a leading expert on speech recognition technology.
Study Reaffirms Low Risk for csPCa with Biopsy Omission After Negative Prostate MRI
December 19th 2024In a new study involving nearly 600 biopsy-naïve men, researchers found that only 4 percent of those with negative prostate MRI had clinically significant prostate cancer after three years of active monitoring.
Study Examines Impact of Deep Learning on Fast MRI Protocols for Knee Pain
December 17th 2024Ten-minute and five-minute knee MRI exams with compressed sequences facilitated by deep learning offered nearly equivalent sensitivity and specificity as an 18-minute conventional MRI knee exam, according to research presented recently at the RSNA conference.
Can Radiomics Bolster Low-Dose CT Prognostic Assessment for High-Risk Lung Adenocarcinoma?
December 16th 2024A CT-based radiomic model offered over 10 percent higher specificity and positive predictive value for high-risk lung adenocarcinoma in comparison to a radiographic model, according to external validation testing in a recent study.