FedGen: Generalizable Federated Learning for Sequential Data

Praveen Venkateswaran; Vatche Isahagian; Vinod Muthusamy; Nalini Venkatasubramanian

doi:10.1109/CLOUD60044.2023.00044

Publication

CLOUD 2023

Conference paper

FedGen: Generalizable Federated Learning for Sequential Data

CLOUD 2023

Download paper

Abstract

Existing federated learning models that follow the standard risk minimization paradigm of machine learning often fail to generalize in the presence of spurious correlations in the training data. In many real-world distributed settings, spurious correlations exist due to biases and data sampling issues on distributed devices or clients that can erroneously influence models. Current generalization approaches are designed for centralized training and attempt to identify features that have an invariant causal relationship with the target, thereby reducing the effect of spurious features. However, such invariant risk minimization approaches rely on apriori knowledge of training data distributions which is hard to obtain in many applications. In this work, we present a generalizable federated learning framework called FedGen, which allows clients to identify and distinguish between spurious and invariant features in a collaborative manner without prior knowledge of training distributions. We evaluate our approach on real-world datasets from different domains and show that FedGen results in models that achieve significantly better generalization and can outperform the accuracy of current federated learning approaches by over 24%.

Date

02 Jul 2023

Publication

CLOUD 2023

Authors

IBM-affiliated at time of publication

Topics

Resources

Publication

Abstract

Date

Publication

Authors

Topics

Resources

Share