Generative data augmentation by conditional inpainting for multi-class object detection in infrared images

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Wang, Peng; Ma, Zhe; Dong, Bo; Liu, Xiuhua; Ding, Jishiyu; Sun, Kewu und Chen, Ying ORCID: https://orcid.org/0009-0008-8183-6729 (2024): Generative data augmentation by conditional inpainting for multi-class object detection in infrared images. In: Pattern Recognition, Bd. 153, 110501 [PDF, 2MB]

[thumbnail of 1-s2.0-S0031320324002528-main.pdf]

Vorschau

Creative Commons: Namensnennung-Nicht Kommerziell-Keine Bearbeitung 4.0 (CC-BY-NC-ND)

Veröffentlichte Version

DOI: 10.1016/j.patcog.2024.110501

Abstract

Multi-class object detection in infrared images is important in military and civilian use. Deep learning methods can obtain high accuracy but require a large-scale dataset. We propose a generative data augmentation framework DOCI-GAN, for infrared multi-class object detection with limited data. Contributions of this paper are four-folds. Firstly, DOCI-GAN is designed as a conditional image inpainting framework, yielding paired infrared multi-class object image and annotation. Secondly, a text-to-image converter is formulated to transform text-format object annotations to bounding box mask images, leading the augmentation to be mask-image-to-raw-image translation. Thirdly, a multiscale morphological erosion-based loss is created to alleviate the intensity inconsistency between inpainted local backgrounds and global background. Finally, for generating diverse images, artificial multi-class object annotations are integrated with real ones during augmentation. Experimental results demonstrated that DOCI-GAN augments dataset with high-quality infrared multi-class object images, consequently improving the accuracy of object detection baselines.

Dokumententyp:	Zeitschriftenartikel
Fakultät:	Medizin > Institut für Schlaganfall- und Demenzforschung (ISD)
Themengebiete:	600 Technik, Medizin, angewandte Wissenschaften > 610 Medizin und Gesundheit
URN:	urn:nbn:de:bvb:19-epub-120218-3
ISSN:	00313203
Sprache:	Englisch
Dokumenten ID:	120218
Datum der Veröffentlichung auf Open Access LMU:	27. Aug. 2024 13:49
Letzte Änderungen:	27. Aug. 2024 13:49

Dokument bearbeiten