dc.contributor.author: Bohdal, O
dc.contributor.author: Yang, Y
dc.contributor.author: Hospedales, T
dc.contributor.author: NeurIPS
dc.date.accessioned: 2024-07-16T08:14:00Z
dc.date.available: 2021-01-01
dc.date.available: 2024-07-16T08:14:00Z
dc.date.issued: 2021-01-01
dc.identifier.issn: 1049-5258
dc.identifier.uri: https://qmro.qmul.ac.uk/xmlui/handle/123456789/98160
dc.description.abstract: Gradient-based meta-learning and hyperparameter optimization have seen significant progress recently, enabling practical end-to-end training of neural networks together with many hyperparameters. Nevertheless, existing approaches are relatively expensive, as they need to compute second-order derivatives and store a longer computational graph. This cost prevents scaling them to larger network architectures. We present EvoGrad, a new approach to meta-learning that draws upon evolutionary techniques to compute hypergradients more efficiently. EvoGrad estimates hypergradients with respect to hyperparameters without computing second-order gradients or storing a longer computational graph, leading to significant improvements in efficiency. We evaluate EvoGrad on three substantial recent meta-learning applications, namely cross-domain few-shot learning with feature-wise transformations, noisy label learning with Meta-Weight-Net, and low-resource cross-lingual learning with meta representation transformation. The results show that EvoGrad significantly improves efficiency and enables scaling meta-learning to bigger architectures, for example from ResNet10 to ResNet34. [en_US]
dc.format.extent: 22234 - 22246
dc.title: EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization [en_US]
dc.type: Conference Proceeding [en_US]
pubs.notes: Not known [en_US]
pubs.publication-status: Published [en_US]
pubs.volume: 27 [en_US]
dcterms.dateAccepted: 2021-01-01
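
The abstract above describes the core idea of EvoGrad: estimating hypergradients without second-order derivatives by drawing on evolutionary techniques. The snippet below is a minimal illustrative sketch of that first-order idea, not the authors' released implementation; the functions train_loss_fn and val_loss_fn, the number of perturbations K, the noise scale sigma, and the softmax-based combination of perturbed parameter copies are assumptions made here for the example.

```python
# Minimal sketch of a first-order (EvoGrad-style) hypergradient estimate.
# Illustrative reconstruction only; names and the exact combination rule are assumptions.
import torch

def evograd_style_hypergradient(model_params, hparams, train_loss_fn, val_loss_fn,
                                K=2, sigma=1e-3):
    """Estimate d L_val / d hparams without second-order derivatives.

    model_params : list of tensors (treated as constants for the hypergradient)
    hparams      : tensor of hyperparameters with requires_grad=True
    train_loss_fn(params, hparams) -> scalar training loss that depends on hparams
    val_loss_fn(params) -> scalar validation loss
    """
    # 1) Sample K perturbed copies of the current parameters; no graph is built
    #    through this step, so no long computational graph is stored.
    candidates = [[p.detach() + sigma * torch.randn_like(p) for p in model_params]
                  for _ in range(K)]

    # 2) Training loss of each candidate; these depend on the hyperparameters.
    losses = torch.stack([train_loss_fn(c, hparams) for c in candidates])

    # 3) Softmax weights over negative losses: better candidates get more weight.
    w = torch.softmax(-losses, dim=0)

    # 4) Combine candidates into updated parameters; the only path back to hparams
    #    runs through the weights w, so autodiff stays first-order.
    updated = [sum(w[k] * candidates[k][i] for k in range(K))
               for i in range(len(model_params))]

    # 5) Hypergradient = gradient of the validation loss w.r.t. the hyperparameters.
    val_loss = val_loss_fn(updated)
    return torch.autograd.grad(val_loss, hparams)[0]
```

In an outer loop, one would alternate an ordinary gradient step on the model parameters with a step on the hyperparameters using the hypergradient returned above; because the perturbed copies are detached, memory and compute stay close to plain first-order training, which is the efficiency property the abstract highlights.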

