Show simple item record

dc.contributor.author    Chen, Z    en_US
dc.contributor.author    Yi, W    en_US
dc.contributor.author    Shin, H    en_US
dc.contributor.author    Nallanathan, A    en_US
dc.date.accessioned    2024-07-15T08:51:41Z
dc.date.issued    2024-01-01    en_US
dc.identifier.issn    1536-1276    en_US
dc.identifier.uri    https://qmro.qmul.ac.uk/xmlui/handle/123456789/98110
dc.description.abstract    Most existing wireless federated learning (FL) studies focus on homogeneous model settings, where every device trains an identical local model. In this setting, devices with poor communication and computation capabilities may delay the global model update and degrade FL performance. Moreover, in homogeneous model settings the scale of the global model is restricted by the device with the lowest capability. To tackle these challenges, this work proposes an adaptive model pruning-based FL (AMP-FL) framework, in which the edge server dynamically generates sub-models for devices' local training by pruning the global model, adapting to their heterogeneous computation capabilities and time-varying channel conditions. Since involving diversely structured sub-models in the global model update may hinder training convergence, we propose compensating for the gradients of pruned model regions with devices' historical gradients. We then introduce an age of information (AoI) metric to characterize the staleness of local gradients and theoretically analyze the convergence behaviour of AMP-FL. The convergence bound suggests scheduling devices whose gradients have large AoI and, for each device, pruning the model regions with small AoI to improve learning performance. Inspired by this, we define a new objective function, the average AoI of local gradients, to transform the global loss minimization problem, which lacks an explicit form, into a tractable one for device scheduling, model pruning, and resource block (RB) allocation design. Through detailed analysis, we derive the optimal model pruning strategy and transform the RB allocation problem into an equivalent linear program that can be solved efficiently. Experimental results demonstrate the effectiveness and superiority of the proposed approaches: AMP-FL achieves 1.9x and 1.6x speedups for FL on the MNIST and CIFAR-10 datasets, respectively, compared with FL schemes using homogeneous model settings.    en_US
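A minimal NumPy sketch of the per-round update the abstract describes, on a toy least-squares objective: the server prunes the global model into a sub-model for each device, compensates pruned regions with that device's historical gradient, and tracks a per-parameter AoI that resets when a region is freshly trained. The capability-driven random pruning policy and all names here are illustrative assumptions; the paper's derived scheduling, pruning, and RB-allocation strategies are not reproduced.

import numpy as np

rng = np.random.default_rng(0)
d, n_devices, n_rounds, lr = 20, 5, 50, 0.1

# Toy data: each device holds noisy-free samples of a shared linear model.
X = [rng.normal(size=(30, d)) for _ in range(n_devices)]
w_true = rng.normal(size=d)
y = [Xi @ w_true for Xi in X]

w = np.zeros(d)                        # global model
hist_grad = np.zeros((n_devices, d))   # last gradient seen per device/parameter
aoi = np.zeros((n_devices, d))         # age of information per device/parameter

for t in range(n_rounds):
    agg = np.zeros(d)
    for i in range(n_devices):
        # Server prunes the global model to match device capability:
        # device i trains only a random 'keep' fraction of parameters
        # (assumed policy; the paper derives the optimal one).
        keep = rng.random(d) < (0.3 + 0.7 * i / (n_devices - 1))
        g = X[i].T @ (X[i] @ w - y[i]) / len(y[i])  # full local gradient
        # Aggregate: fresh gradient on the sub-model, historical gradient
        # as compensation on the pruned regions.
        agg += np.where(keep, g, hist_grad[i])
        hist_grad[i] = np.where(keep, g, hist_grad[i])
        # AoI: freshly trained regions reset to 0, pruned regions age by 1.
        aoi[i] = np.where(keep, 0.0, aoi[i] + 1.0)
    w -= lr * agg / n_devices

print("final loss:", np.mean([np.mean((Xi @ w - yi) ** 2) for Xi, yi in zip(X, y)]))

The convergence bound's intuition shows up directly in this state: large entries of aoi flag stale compensation gradients, so scheduling those devices (or keeping those regions unpruned) refreshes the most outdated information first.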
dc.format.extent    7582 - 7598    en_US
dc.relation.ispartof    IEEE Transactions on Wireless Communications    en_US
dc.rights    © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.title    Adaptive Model Pruning for Communication and Computation Efficient Wireless Federated Learning    en_US
dc.type    Article
dc.identifier.doi    10.1109/TWC.2023.3342626    en_US
pubs.issue    7    en_US
pubs.notes    Not known    en_US
pubs.publication-status    Published    en_US
pubs.volume    23    en_US
rioxxterms.funder    Default funder    en_US
rioxxterms.identifier.project    Default project    en_US
rioxxterms.funder.project    b215eee3-195d-4c4f-a85d-169a4331c138    en_US

