Assessment of CNNs, transformers, and hybrid architectures in dental image segmentation

Schneider, Lisa; Krasowski, Aleksander ORCID: https://orcid.org/0000-0003-0192-788X; Pitchika, Vinay ORCID: https://orcid.org/0000-0001-6947-2602; Bombeck, Lisa; Schwendicke, Falk ORCID: https://orcid.org/0000-0003-1223-1669 and Büttner, Martha ORCID: https://orcid.org/0000-0001-9004-213X (May 2025): Assessment of CNNs, transformers, and hybrid architectures in dental image segmentation. In: Journal of Dentistry, Vol. 156, 105668 [PDF, 3MB]

Creative Commons: Attribution 4.0 (CC-BY)

Published version

DOI: 10.1016/j.jdent.2025.105668

Abstract

Objectives

Convolutional Neural Networks (CNNs) have long dominated image analysis in dentistry, achieving remarkable results across a range of tasks. However, Transformer-based architectures, originally proposed for Natural Language Processing, are also promising for dental image analysis. The present study aimed to compare CNNs with Transformers on different image analysis tasks in dentistry.

Methods

Two CNNs (U-Net, DeepLabV3+), two Hybrids (SwinUNETR, UNETR) and two Transformer-based architectures (TransDeepLab, SwinUnet) were compared on three dental segmentation tasks across different image modalities. The datasets consisted of (1) 1881 panoramic radiographs used for tooth segmentation, (2) 1625 bitewings used for tooth structure segmentation, and (3) 2689 bitewings used for caries lesion segmentation. All models were trained and evaluated using 5-fold cross-validation.
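The 5-fold cross-validation named above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the abstract does not say how folds were built (for example, whether images were grouped by patient or stratified), so the shuffled image-level split, the function name `kfold_indices`, and the `seed` parameter are all assumptions.

```python
import numpy as np


def kfold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, val_idx) index arrays for k-fold cross-validation.

    Assumption: a plain shuffled split over image indices. The source does
    not state whether folds were stratified or grouped by patient.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_samples)   # shuffle image indices once
    folds = np.array_split(order, k)     # k nearly equal-sized folds
    for i in range(k):
        val_idx = folds[i]               # fold i is held out for validation
        train_idx = np.concatenate(
            [folds[j] for j in range(k) if j != i]
        )                                # the remaining k-1 folds train the model
        yield train_idx, val_idx


if __name__ == "__main__":
    # 1881 is the panoramic-radiograph dataset size given in the abstract.
    for fold, (train_idx, val_idx) in enumerate(kfold_indices(1881)):
        print(f"fold {fold}: train={len(train_idx)} val={len(val_idx)}")
```

Each image appears in exactly one validation fold, so every model is scored once on every image without ever having been trained on it in that fold.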

Results

CNNs were found to be significantly superior to Hybrids and Transformer-based architectures for all three tasks. (1) Tooth segmentation showed a mean±SD F1-Score of 0.89±0.009 for CNNs, 0.86±0.015 for Hybrids and 0.83±0.22 for Transformer-based architectures. (2) In tooth structure segmentation, CNNs also performed best with 0.85±0.008, compared to 0.84±0.005 for Hybrids and 0.83±0.011 for Transformer-based architectures. (3) The differences were even more pronounced for caries lesion segmentation: 0.49±0.031 for CNNs, 0.39±0.072 for Hybrids and 0.32±0.039 for Transformer-based architectures.
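For readers unfamiliar with the metric, the sketch below shows one common way to compute an F1-Score for a binary segmentation mask, where it coincides with the Dice coefficient, and to summarise several scores as mean±SD. It rests on assumptions: the abstract does not state whether the F1-Score was computed per pixel, per image, or per class, nor which SD estimator was used. The function names `f1_score_mask` and `mean_sd`, the empty-mask convention, and the example values are hypothetical.

```python
import numpy as np


def f1_score_mask(pred, target):
    """Pixel-wise F1 (equal to Dice) between two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    tp = np.logical_and(pred, target).sum()    # predicted and truly foreground
    fp = np.logical_and(pred, ~target).sum()   # predicted foreground, truly background
    fn = np.logical_and(~pred, target).sum()   # missed foreground pixels
    denom = 2 * tp + fp + fn
    if denom == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return float(2 * tp / denom)


def mean_sd(scores):
    """Mean and sample standard deviation (ddof=1, an assumption)."""
    scores = np.asarray(scores, dtype=float)
    return float(scores.mean()), float(scores.std(ddof=1))


if __name__ == "__main__":
    # Toy 2x2 masks, not data from the study.
    pred = [[1, 1], [0, 0]]
    target = [[1, 0], [1, 0]]
    print(f1_score_mask(pred, target))
```

In the toy example there is one true positive, one false positive and one false negative, so the score is 2·1 / (2·1 + 1 + 1) = 0.5.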

Conclusion

CNNs significantly outperformed Transformer-based architectures and their Hybrids on three segmentation tasks (teeth, tooth structures, caries lesions) on varying dental data modalities (panoramic and bitewing radiographs).

Clinical significance

As deep-learning-based image analysis is part of modern dentistry, practitioners and dental researchers should be aware of the strengths and limitations of modern model architectures for dental image analysis. Models that perform best in other domains are not necessarily the best choice for dental imaging.

Document type:	Journal article
Faculty:	Medicine > Klinikum der LMU München > Poliklinik für Zahnerhaltung und Parodontologie
Subjects:	600 Technology, medicine, applied sciences > 610 Medicine and health
URN:	urn:nbn:de:bvb:19-epub-126728-8
ISSN:	0300-5712
Language:	English
Document ID:	126728
Date published on Open Access LMU:	11 Jun 2025 12:32
Last modified:	11 Jun 2025 12:32
