☆ 4.6 Article

FEDBERT: When Federated Learning Meets Pre-training

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY (2022)

期刊

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY

卷 13, 期 4, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3510033

关键词

Federated learning; pre-training; BERT; NLP

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems

资金

National Natural Science Foundation of China [62102157]
Fundamental Research Funds for the Central Universities [HUST:2020kfyXJJS019]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The fast growth of pre-trained models has revolutionized natural language processing, becoming the dominant technique for various NLP applications. However, pre-training requires significant training data and computing resources, making it impossible for individual clients to conduct. To enable clients with limited computing capability to participate in pre-training large models, FEDBERT proposes a federated learning approach that achieves excellent performance without sharing raw data.

The fast growth of pre-trained models (PTMs) has brought natural language processing to a new era, which has become a dominant technique for various natural language processing (NLP) applications. Every user can download the weights of PTMs, then fine-tune the weights for a task on the local side. However, the pre-training of a model relies heavily on accessing a large-scale of training data and requires a vast amount of computing resources. These strict requirements make it impossible for any single client to pre-train such a model. To grant clients with limited computing capability to participate in pre-training a large model, we propose a new learning approach, FEDBERT, that takes advantage of the federated learning and split learning approaches, resorting to pre-training BERT in a federated way. FEDBERT can prevent sharing the raw data information and obtain excellent performance. Extensive experiments on seven GLUE tasks demonstrate that FEDBERT can maintain its effectiveness without communicating to the sensitive local data of clients.

FEDBERT: When Federated Learning Meets Pre-training

期刊

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

FEDBERT: When Federated Learning Meets Pre-training

期刊

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文