
Scheppach, Amadeu; Gündüz, Hüseyin Anil; Dorigatti, Emilio; Münch, Philipp C.; McHardy, Alice C.; Bischl, Bernd (ORCID: https://orcid.org/0000-0001-6002-6980); Rezaei, Mina and Binder, Martin (2023): Neural Architecture Search for Genomic Sequence Data. 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (IEEE CIBCB), Eindhoven, Netherlands, 29-31 August 2023. In: Nobile, Marco S. and Houghten, Sheridan (eds.): 2023 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), Piscataway, NJ: IEEE. pp. 181-190

Full text not available on 'Open Access LMU'.

Abstract

Deep learning has enabled outstanding progress on bioinformatics datasets and a variety of tasks, such as protein structure prediction, identification of regulatory regions, genome annotation, and interpretation of the noncoding genome. The layout and configuration of neural networks used for these tasks have mostly been developed manually by human experts, which is a time-consuming and error-prone process. Therefore, there is growing interest in automated neural architecture search (NAS) methods in bioinformatics. In this paper, we present a novel search space for NAS algorithms that operate on genome data, thus creating extensions for existing NAS algorithms for sequence data that we name Genome-DARTS, Genome-P-DARTS, Genome-BONAS, Genome-SH, and Genome-RS. Moreover, we introduce two novel NAS algorithms, CWP-DARTS and EDPDARTS, that build on and extend the idea of P-DARTS. We evaluate the presented methods and compare them to manually designed neural architectures on a widely used genome sequence machine learning task to show that NAS methods can be adapted well for bioinformatics sequence datasets. Our experiments show that architectures optimized by our NAS methods outperform manually developed architectures while having significantly fewer parameters.
