Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tingyang Sun

Communication Optimization for Decentralized Learning atop Bandwidth-limited Edge Networks

Apr 16, 2025

Tingyang Sun, Tuan Nguyen, Ting He

Abstract:Decentralized federated learning (DFL) is a promising machine learning paradigm for bringing artificial intelligence (AI) capabilities to the network edge. Running DFL on top of edge networks, however, faces severe performance challenges due to the extensive parameter exchanges between agents. Most existing solutions for these challenges were based on simplistic communication models, which cannot capture the case of learning over a multi-hop bandwidth-limited network. In this work, we address this problem by jointly designing the communication scheme for the overlay network formed by the agents and the mixing matrix that controls the communication demands between the agents. By carefully analyzing the properties of our problem, we cast each design problem into a tractable optimization and develop an efficient algorithm with guaranteed performance. Our evaluations based on real topology and data show that the proposed algorithm can reduce the total training time by over $80\%$ compared to the baseline without sacrificing accuracy, while significantly improving the computational efficiency over the state of the art.

* arXiv admin note: text overlap with arXiv:2408.04705

Via

Access Paper or Ask Questions

Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Aug 08, 2024

Yudi Huang, Tingyang Sun, Ting He

Figure 1 for Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Figure 2 for Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Figure 3 for Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Figure 4 for Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Abstract:The emerging machine learning paradigm of decentralized federated learning (DFL) has the promise of greatly boosting the deployment of artificial intelligence (AI) by directly learning across distributed agents without centralized coordination. Despite significant efforts on improving the communication efficiency of DFL, most existing solutions were based on the simplistic assumption that neighboring agents are physically adjacent in the underlying communication network, which fails to correctly capture the communication cost when learning over a general bandwidth-limited network, as encountered in many edge networks. In this work, we address this gap by leveraging recent advances in network tomography to jointly design the communication demands and the communication schedule for overlay-based DFL in bandwidth-limited networks without requiring explicit cooperation from the underlying network. By carefully analyzing the structure of our problem, we decompose it into a series of optimization problems that can each be solved efficiently, to collectively minimize the total training time. Extensive data-driven simulations show that our solution can significantly accelerate DFL in comparison with state-of-the-art designs.

Via

Access Paper or Ask Questions