TY - JOUR
T1 - Recent methodological advances in federated learning for healthcare
AU - BloodCounts! consortium
AU - Zhang, Fan
AU - Kreuter, Daniel
AU - Chen, Yichen
AU - Dittmer, Sören
AU - Tull, Samuel
AU - Shadbahr, Tolou
AU - Schut, Martijn
AU - Asselbergs, Folkert
AU - Kar, Sujoy
AU - Sivapalaratnam, Suthesh
AU - Williams, Sophie
AU - Koh, Mickey
AU - Henskens, Yvonne
AU - de Wit, Bart
AU - D'Alessandro, Umberto
AU - Bah, Bubacarr
AU - Secka, Ousman
AU - Nachev, Parashkev
AU - Gupta, Rajeev
AU - Trompeter, Sara
AU - Boeckx, Nancy
AU - van Laer, Christine
AU - Awandare, Gordon A.
AU - Sarpong, Kwabena
AU - Amenga-Etego, Lucas
AU - Leers, Mathie
AU - Huijskens, Mirelle
AU - McDermott, Samuel
AU - Ouwehand, Willem H.
AU - Rudd, James H.F.
AU - Schӧnlieb, Carola Bibiane
AU - Gleadall, Nicholas
AU - Roberts, Michael
AU - Preller, Jacobus
AU - Rudd, James H.F.
AU - Aston, John A.D.
AU - Schönlieb, Carola Bibiane
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/6/14
Y1 - 2024/6/14
N2 - For healthcare datasets, it is often impossible to combine data samples from multiple sites due to ethical, privacy, or logistical concerns. Federated learning allows for the utilization of powerful machine learning algorithms without requiring the pooling of data. Healthcare data have many simultaneous challenges, such as highly siloed data, class imbalance, missing data, distribution shifts, and non-standardized variables, that require new methodologies to address. Federated learning adds significant methodological complexity to conventional centralized machine learning, requiring distributed optimization, communication between nodes, aggregation of models, and redistribution of models. In this systematic review, we consider all papers on Scopus published between January 2015 and February 2023 that describe new federated learning methodologies for addressing challenges with healthcare data. We reviewed 89 papers meeting these criteria. Significant systemic issues were identified throughout the literature, compromising many methodologies reviewed. We give detailed recommendations to help improve methodology development for federated learning in healthcare.
AB - For healthcare datasets, it is often impossible to combine data samples from multiple sites due to ethical, privacy, or logistical concerns. Federated learning allows for the utilization of powerful machine learning algorithms without requiring the pooling of data. Healthcare data have many simultaneous challenges, such as highly siloed data, class imbalance, missing data, distribution shifts, and non-standardized variables, that require new methodologies to address. Federated learning adds significant methodological complexity to conventional centralized machine learning, requiring distributed optimization, communication between nodes, aggregation of models, and redistribution of models. In this systematic review, we consider all papers on Scopus published between January 2015 and February 2023 that describe new federated learning methodologies for addressing challenges with healthcare data. We reviewed 89 papers meeting these criteria. Significant systemic issues were identified throughout the literature, compromising many methodologies reviewed. We give detailed recommendations to help improve methodology development for federated learning in healthcare.
KW - applications
KW - best practices
KW - deployment
KW - federated learning
KW - healthcare
KW - machine learning
KW - methodological advances
KW - privacy
KW - security
KW - systematic review
KW - 516 Educational sciences
KW - 316 Nursing
U2 - 10.1016/j.patter.2024.101006
DO - 10.1016/j.patter.2024.101006
M3 - Review Article
AN - SCOPUS:85195647024
SN - 2666-3899
VL - 5
JO - Patterns
JF - Patterns
IS - 6
M1 - 101006
ER -