Paper
2 May 2023 Inner layer hyperparametric meta-learning based on feature reuse
Jihong Cao, Xiangpeng Sun
Author Affiliations +
Proceedings Volume 12642, Second International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT 2023); 126421H (2023) https://doi.org/10.1117/12.2674729
Event: Second International Conference on Electronic Information Engineering, Big Data and Computer Technology (EIBDCT 2023), 2023, Xishuangbanna, China
Abstract
Few-shot learning requires fast learning and adaptation during the learning process, which has been seen as a very challenging problem because of its stringent training requirements. With the development of deep learning, meta-learning is increasingly applied to solve few-shot problems. Model-agnostic meta-learning (MAML) is applied to model-agnostic tasks by finding common initialization parameters applicable to the model through both inner and outer loops. However, the generalization performance of MAML is not strong, and its inner loop is not fully functional. Therefore, we add learnable hyperparameters to the inner loop to make the model training results more relevant to the current task and reduce the number of layers of gradient updates without unnecessary computation. Experimental results show that the inner loop hyperparameter learning based on feature reuse (HLFR), finds better initialization parameters compared to MAML.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jihong Cao and Xiangpeng Sun "Inner layer hyperparametric meta-learning based on feature reuse", Proc. SPIE 12642, Second International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT 2023), 126421H (2 May 2023); https://doi.org/10.1117/12.2674729
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Feature extraction

Mathematical modeling

Mathematical optimization

Network architectures

Deep learning

Image classification

Back to Top