Logo Logo
Hilfe
Hilfe
Switch Language to English

Rudolph, Jan; Hümmer, Christian; Ghesu, Florin-Cristian; Mansoor, Awais; Preuhs, Alexander; Fieselmann, Andreas; Fink, Nicola; Dinkel, Julien; Koliogiannis, Vanessa; Schwarze, Vincent; Goller, Sophia; Fischer, Maximilian; Jörgens, Maximilian; Ben Khaled, Najib; Vishwanath, Reddappagari Suryanarayana; Balachandran, Abishek; Ingrisch, Michael; Ricke, Jens; Sabel, Bastian Oliver und Rueckel, Johannes (2022): Artificial Intelligence in Chest Radiography Reporting Accuracy Added Clinical Value in the Emergency Unit Setting Without 24/7 Radiology Coverage. In: Investigative Radiology, Bd. 57, Nr. 2: S. 90-98

Volltext auf 'Open Access LMU' nicht verfügbar.

Abstract

Objectives: Chest radiographs (CXRs) are commonly performed in emergency units (EUs), but the interpretation requires radiology experience. We developed an artificial intelligence (AI) system (precommercial) that aims to mimic board-certified radiologists' (BCRs') performance and can therefore support non-radiology residents (NRRs) in clinical settings lacking 24/7 radiology coverage. We validated by quantifying the clinical value of our AI system for radiology residents (RRs) and EU-experienced NRRs in a clinically representative EU setting. Materials and Methods: A total of 563 EU CXRs were retrospectively assessed by 3 BCRs, 3 RRs, and 3 EU-experienced NRRs. Suspected pathologies (pleural effusion, pneumothorax, consolidations suspicious for pneumonia, lung lesions) were reported on a 5-step confidence scale (sum of 20,268 reported pathology suspicions [563 images x 9 readers x 4 pathologies]) separately by every involved reader. Board-certified radiologists' confidence scoreswere converted into 4 binary reference standards (RFSs) of different sensitivities. The RRs' and NRRs' performances were statistically compared with our AI system (trained on nonpublic data from different clinical sites) based on receiver operating characteristics (ROCs) and operating point metrics approximated to the maximum sum of sensitivity and specificity (Youden statistics). Results: The NRRs lose diagnostic accuracy to RRs with increasingly sensitive BCRs' RFSs for all considered pathologies. Based on our external validation data set, the AI system/NRRs' consensus mimicked themost sensitive BCRs' RFSswith areas under ROC of 0.940/0.837 (pneumothorax), 0.953/0.823 (pleural effusion), and 0.883/0.747 (lung lesions), which were comparable to experienced RRs and significantly overcomes EU-experienced NRRs' diagnostic performance. For consolidation detection, the AI system performed on the NRRs' consensus level (and overcomes each individual NRR) with an area under ROC of 0.847 referenced to the BCRs' most sensitive RFS. Conclusions: Our AI system matched RRs' performance, meanwhile significantly outperformed NRRs' diagnostic accuracy for most of considered CXR pathologies (pneumothorax, pleural effusion, and lung lesions) and therefore might serve as clinical decision support for NRRs.

Dokument bearbeiten Dokument bearbeiten