EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION

Phillip Wallis; Xubo Song

doi:10.1109/ICIP46576.2022.9897314

EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION

Phillip Wallis, Xubo Song

Medical Informatics and Clinical Epidemiology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

It's commonplace in modern deep learning to achieve SOTA performance by fine-tuning a large, pretrained base model. Recent successes in natural language processing, attributed in part to knowledge transfer from large, pretrained, transformer-based language models, have sparked a similar revolution in computer vision via the introduction of Vision Transformers. As modern deep neural networks increase in performance, they also tend to increase in size. Key issues associated with fine-tuning such enormous models include storage overhead, as well as memory and/or latency requirements. Parameter efficient fine-tuning is a fairly recent paradigm which has been evolving alongside massive neural networks in part to address these issues. We showcase the effectiveness of parameter efficient fine-tuning of vision transformers, and introduce a simple yet effective method for learning a non-uniform parameter allocation given a fixed budget. We demonstrate our approach across a range of benchmark tasks in image classification and semantic segmentation.

Original language	English (US)
Title of host publication	2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings
Publisher	IEEE Computer Society
Pages	3510-3514
Number of pages	5
ISBN (Electronic)	9781665496209
DOIs	https://doi.org/10.1109/ICIP46576.2022.9897314
State	Published - 2022
Event	29th IEEE International Conference on Image Processing, ICIP 2022 - Bordeaux, France Duration: Oct 16 2022 → Oct 19 2022

Publication series

Name	Proceedings - International Conference on Image Processing, ICIP
ISSN (Print)	1522-4880

Conference

Conference	29th IEEE International Conference on Image Processing, ICIP 2022
Country/Territory	France
City	Bordeaux
Period	10/16/22 → 10/19/22

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition
Signal Processing

Access to Document

10.1109/ICIP46576.2022.9897314

Cite this

EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION. / Wallis, Phillip; Song, Xubo.
2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings. IEEE Computer Society, 2022. p. 3510-3514 (Proceedings - International Conference on Image Processing, ICIP).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Wallis, P & Song, X 2022, EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION. in 2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings. Proceedings - International Conference on Image Processing, ICIP, IEEE Computer Society, pp. 3510-3514, 29th IEEE International Conference on Image Processing, ICIP 2022, Bordeaux, France, 10/16/22. https://doi.org/10.1109/ICIP46576.2022.9897314

@inproceedings{f6ce3d7e14ab43809d7d1dbbba1f5d6c,

title = "EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION",

abstract = "It's commonplace in modern deep learning to achieve SOTA performance by fine-tuning a large, pretrained base model. Recent successes in natural language processing, attributed in part to knowledge transfer from large, pretrained, transformer-based language models, have sparked a similar revolution in computer vision via the introduction of Vision Transformers. As modern deep neural networks increase in performance, they also tend to increase in size. Key issues associated with fine-tuning such enormous models include storage overhead, as well as memory and/or latency requirements. Parameter efficient fine-tuning is a fairly recent paradigm which has been evolving alongside massive neural networks in part to address these issues. We showcase the effectiveness of parameter efficient fine-tuning of vision transformers, and introduce a simple yet effective method for learning a non-uniform parameter allocation given a fixed budget. We demonstrate our approach across a range of benchmark tasks in image classification and semantic segmentation.",

author = "Phillip Wallis and Xubo Song",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 29th IEEE International Conference on Image Processing, ICIP 2022 ; Conference date: 16-10-2022 Through 19-10-2022",

year = "2022",

doi = "10.1109/ICIP46576.2022.9897314",

language = "English (US)",

series = "Proceedings - International Conference on Image Processing, ICIP",

publisher = "IEEE Computer Society",

pages = "3510--3514",

booktitle = "2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings",

}

TY - GEN

T1 - EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION

AU - Wallis, Phillip

AU - Song, Xubo

PY - 2022

Y1 - 2022

N2 - It's commonplace in modern deep learning to achieve SOTA performance by fine-tuning a large, pretrained base model. Recent successes in natural language processing, attributed in part to knowledge transfer from large, pretrained, transformer-based language models, have sparked a similar revolution in computer vision via the introduction of Vision Transformers. As modern deep neural networks increase in performance, they also tend to increase in size. Key issues associated with fine-tuning such enormous models include storage overhead, as well as memory and/or latency requirements. Parameter efficient fine-tuning is a fairly recent paradigm which has been evolving alongside massive neural networks in part to address these issues. We showcase the effectiveness of parameter efficient fine-tuning of vision transformers, and introduce a simple yet effective method for learning a non-uniform parameter allocation given a fixed budget. We demonstrate our approach across a range of benchmark tasks in image classification and semantic segmentation.

AB - It's commonplace in modern deep learning to achieve SOTA performance by fine-tuning a large, pretrained base model. Recent successes in natural language processing, attributed in part to knowledge transfer from large, pretrained, transformer-based language models, have sparked a similar revolution in computer vision via the introduction of Vision Transformers. As modern deep neural networks increase in performance, they also tend to increase in size. Key issues associated with fine-tuning such enormous models include storage overhead, as well as memory and/or latency requirements. Parameter efficient fine-tuning is a fairly recent paradigm which has been evolving alongside massive neural networks in part to address these issues. We showcase the effectiveness of parameter efficient fine-tuning of vision transformers, and introduce a simple yet effective method for learning a non-uniform parameter allocation given a fixed budget. We demonstrate our approach across a range of benchmark tasks in image classification and semantic segmentation.

UR - http://www.scopus.com/inward/record.url?scp=85146707173&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85146707173&partnerID=8YFLogxK

U2 - 10.1109/ICIP46576.2022.9897314

DO - 10.1109/ICIP46576.2022.9897314

M3 - Conference contribution

AN - SCOPUS:85146707173

T3 - Proceedings - International Conference on Image Processing, ICIP

SP - 3510

EP - 3514

BT - 2022 IEEE International Conference on Image Processing, ICIP 2022 - Proceedings

PB - IEEE Computer Society

T2 - 29th IEEE International Conference on Image Processing, ICIP 2022

Y2 - 16 October 2022 through 19 October 2022

ER -

EFFICIENT FINE-TUNING OF DEEP NEURAL NETWORKS WITH EFFECTIVE PARAMETER ALLOCATION

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this