Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

Liu, Ji; Ren, Jiaxiang; Jin, Ruoming; Zhang, Zijie; Zhou, Yang; Valduriez, Patrick; Dou, Dejing

arXiv:2410.00131 (cs)

[Submitted on 30 Sep 2024 (v1), last revised 18 Oct 2024 (this version, v2)]

Abstract: As a promising paradigm for collaboratively training models on decentralized data, Federated Learning (FL) can be exploited to fine-tune Large Language Models (LLMs). As LLMs are huge and the scale of the training data grows significantly, fine-tuning incurs tremendous computation and communication costs. The training data is generally non-Independent and Identically Distributed (non-IID), which requires adaptive data processing within each device. Although Low-Rank Adaptation (LoRA) can significantly reduce the number of parameters to update during fine-tuning, transferring the low-rank parameters of all the layers in an LLM still takes an unaffordable amount of time. In this paper, we propose a Fisher Information-based Efficient Curriculum Federated Learning framework (FibecFed) with two novel methods, i.e., adaptive federated curriculum learning and efficient sparse parameter update. First, we propose a Fisher information-based method to adaptively sample data within each device to improve the effectiveness of the FL fine-tuning process. Second, we dynamically select the proper layers for global aggregation and sparse parameters for local update with LoRA so as to improve the efficiency of the FL fine-tuning process. Extensive experimental results based on 10 datasets demonstrate that FibecFed yields excellent performance (up to 45.35% in terms of accuracy) and superb fine-tuning speed (up to 98.61% faster) compared with 17 baseline approaches.
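The abstract does not spell out how the Fisher information-based sampling is computed, so the following is only a minimal sketch of the general idea and not the FibecFed algorithm itself. It assumes a toy logistic-regression model in place of an LLM, scores each local sample by the trace of its empirical Fisher information (the squared norm of the log-likelihood gradient), and treats lower scores as "easier" samples. The function names (`fisher_score`, `curriculum_order`, `curriculum_subset`), the easy-to-hard ordering, and the linear pacing schedule with `start_frac` are all hypothetical choices made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fisher_score(w, x, y):
    """Trace of the per-sample empirical Fisher information.

    For logistic regression, d/dw log p(y|x) = (y - p) * x, so the score
    is the squared norm of that gradient. (Toy stand-in for an LLM.)
    """
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    return sum(((y - p) * xi) ** 2 for xi in x)


def curriculum_order(w, data):
    """Return sample indices sorted by ascending Fisher score.

    Assumption: a low score means an 'easy' sample that should come first.
    """
    scores = [fisher_score(w, x, y) for x, y in data]
    return sorted(range(len(data)), key=lambda i: scores[i])


def curriculum_subset(order, round_idx, total_rounds, start_frac=0.25):
    """Hypothetical linear pacing: expose a growing prefix of the ordering."""
    progress = round_idx / max(1, total_rounds - 1)
    frac = min(1.0, start_frac + (1.0 - start_frac) * progress)
    k = max(1, math.ceil(frac * len(order)))
    return order[:k]


# Each device would run this on its own local data before local training.
local_data = [([1.0, 0.0], 1), ([3.0, 0.0], 1), ([0.5, 0.0], 0)]
weights = [0.0, 0.0]
order = curriculum_order(weights, local_data)
for r in range(3):
    print(r, curriculum_subset(order, r, total_rounds=3))
```

In a federated setting, each device would recompute or reuse such scores locally, so no raw data leaves the device; only the choice of which samples to train on in each round changes.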

Comments:	27 pages, 8 figures, 14 tables, to appear in EMNLP 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2410.00131 [cs.LG]
	(or arXiv:2410.00131v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.00131

Submission history

From: Ji Liu
[v1] Mon, 30 Sep 2024 18:12:18 UTC (533 KB)
[v2] Fri, 18 Oct 2024 05:22:02 UTC (534 KB)
