Characterizing personalized effects of family information on disease risk using graph representation learning

Forskningsoutput: TidskriftsbidragKonferensartikelVetenskapligPeer review

Sammanfattning

Family history is considered a risk factor for many diseases because it implicitly captures shared genetic, environmental and lifestyle factors. Finland’s nationwide electronic health record (EHR) system spanning multiple generations presents new opportunities for studying a connected network of medical histories for entire families. In this work we present a graph-based deep learning approach for learning explainable, supervised representations of how each family member’s longitudinal medical history influences a patient’s disease risk. We demonstrate that this approach is beneficial for predicting 10-year disease onset for 5 complex disease phenotypes, compared to clinically-inspired and deep learning baselines for Finland’s nationwide EHR system comprising 7 million individuals with up to third-degree relatives. Through the use of graph explainability techniques, we illustrate that a graph-based approach enables more personalized modeling of family information and disease risk by identifying important relatives and features for prediction.

Originalspråkengelska
TidskriftProceedings of Machine Learning Research
Volym219
Sidor (från-till)824-845
Antal sidor22
ISSN2640-3498
StatusPublicerad - 2023
MoE-publikationstypA4 Artikel i en konferenspublikation
Evenemang8th Machine Learning for Healthcare Conference, MLHC 2023 - New York, Förenta Staterna (USA)
Varaktighet: 11 aug. 202312 aug. 2023

Bibliografisk information

Publisher Copyright:
© 2023 S. Wharrie, Z. Yang, A. Ganna & S. Kaski.

Vetenskapsgrenar

  • 113 Data- och informationsvetenskap

Citera det här