Efficient Maximum Likelihood Estimation for Pedigree Data with the Sum-Product Algorithm

Engelhardt, Alexander; Rieger, Anna; Tresch, Achim and Mansmann, Ulrich (19 December 2016): Efficient Maximum Likelihood Estimation for Pedigree Data with the Sum-Product Algorithm. Department of Statistics: Technical Reports, No. 200 [PDF, 541kB]

DOI: 10.5282/ubm/epub.31077

Abstract

In this paper, we analyze data sets consisting of pedigrees where the response is the age at onset of colorectal cancer (CRC). The occurrence of familial clusters of CRC suggests the existence of a latent, inheritable risk factor. We aimed to compute the probability of a family possessing this risk factor, as well as the hazard rate increase for these risk factor carriers. Due to the inheritability of this risk factor, the estimation necessitates a costly marginalization of the likelihood.

We therefore developed an EM algorithm by applying factor graphs and the sum-product algorithm in the E-step, reducing the computational complexity from exponential to linear in the number of family members.

In a simulation study and a real family study on CRC risk, our algorithm is as precise as a direct likelihood maximization. For 250 simulated families of size 19 and 21, our algorithm is faster by a factor of 4 and 29, respectively. On the largest family (23 members) in the real data, our algorithm is 6 times faster.

We introduce a flexible and runtime-efficient tool for statistical inference in biomedical event data that opens the door for advanced analyses of pedigree data.
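
The abstract's claim that the sum-product algorithm turns an exponential marginalization into a linear one can be illustrated with a toy example. The following is a minimal sketch, not the report's implementation: it assumes a simplified pedigree in which each person has a single modelled parent, and the names `parent`, `p_founder`, `p_transmit`, and `lik`, as well as all numeric values, are hypothetical.

```python
from itertools import product

# Hypothetical toy pedigree: parent[i] is the index of person i's single
# modelled parent (None for the founder). Real pedigrees have two parents.
parent = [None, 0, 0, 1, 1, 2]
p_founder = 0.2    # assumed P(founder carries the risk factor)
p_transmit = 0.5   # assumed P(child is a carrier | parent is a carrier)
# lik[i][z]: assumed likelihood of person i's data given carrier status z
lik = [(0.9, 0.4), (0.8, 0.5), (0.7, 0.7), (0.3, 0.9), (0.6, 0.2), (0.5, 0.5)]


def prior(z):
    """Prior probability of the founder's carrier status z."""
    return p_founder if z == 1 else 1.0 - p_founder


def trans(zc, zp):
    """P(child status zc | parent status zp); non-carriers never transmit."""
    p1 = p_transmit if zp == 1 else 0.0
    return p1 if zc == 1 else 1.0 - p1


def brute_force():
    """Marginal likelihood by summing over all 2^n carrier configurations."""
    n = len(parent)
    total = 0.0
    for z in product((0, 1), repeat=n):
        w = prior(z[0])
        for i in range(n):
            w *= lik[i][z[i]]
            if parent[i] is not None:
                w *= trans(z[i], z[parent[i]])
        total += w
    return total


def sum_product():
    """Marginal likelihood by one upward message pass (linear in n)."""
    n = len(parent)
    # belief[i][z]: likelihood of the data of i and i's descendants given z_i = z
    belief = [list(lik[i]) for i in range(n)]
    # Children have larger indices than their parents, so iterating from the
    # last index downward processes every child before its parent.
    for i in range(n - 1, 0, -1):
        p = parent[i]
        for zp in (0, 1):
            msg = sum(trans(zc, zp) * belief[i][zc] for zc in (0, 1))
            belief[p][zp] *= msg
    return sum(prior(z) * belief[0][z] for z in (0, 1))


if __name__ == "__main__":
    print(brute_force(), sum_product())
```

Both functions return the same marginal likelihood, but `brute_force` visits 2^n configurations while `sum_product` does a constant amount of work per family member. In an EM setting, the same messages would also yield the posterior carrier probabilities needed in the E-step; that part is omitted here.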

Document type:	Paper
Keywords:	Colorectal cancer, Personalized medicine, Cancer risk prediction, Pedigrees, EM algorithm, Factor graphs, Sum-product algorithm
Faculty:	Mathematics, Computer Science and Statistics > Statistics > Technical Reports
Subjects:	500 Natural sciences and mathematics > 510 Mathematics
URN:	urn:nbn:de:bvb:19-epub-31077-7
Language:	English
Document ID:	31077
Date published on Open Access LMU:	19 Dec 2016 18:03
Last modified:	04 Nov 2020 13:08
