Incremental Learning with Adaptive Augmentation for Image-based Active Learning

Huynh Tra My Nguyen; Viet Cuong TA

doi:10.25073/2588-1086/vnucsce.2396

Huynh Tra My Nguyen, Viet Cuong TA

PDF

Published Sep 6, 2024

DOI: https://doi.org/10.25073/2588-1086/vnucsce.2396

How to Cite

NGUYEN, Huynh Tra My; TA, Viet Cuong. Incremental Learning with Adaptive Augmentation for Image-based Active Learning. VNU Journal of Science: Computer Science and Communication Engineering, [S.l.], v. 40, n. 2, sep. 2024. ISSN 2588-1086. Available at: <//jcsce.vnu.edu.vn/index.php/jcsce/article/view/2396>. Date accessed: 31 july 2025. doi: https://doi.org/10.25073/2588-1086/vnucsce.2396.

ABNT APA BibTeX CBE EndNote - EndNote format (Macintosh & Windows) MLA ProCite - RIS format (Macintosh & Windows) RefWorks Reference Manager - RIS format (Windows only) Turabian

Issue

Vol 40 No 2 (2024)

Section

Original Articles

Abstract

Due to the increasing amount of unlabeled data, a more flexible approach is required to label data efficiently. The aim of active learning is to identify which data samples are the most valuable for learning with the dataset, thus achieving better performance with much fewer samples. Recent works show that although the data augmentation strategies are simple, they have the potential to improve active learning by expanding the input space’s exploration and assisting in the discovery of
more informative samples. By effectively controlling a set of augment operators on each active learning cycle, one could choose promising candidates from the set of unlabeled data for each iteration step of active learning. However, the scoring model is built on a hard reset at each data acquisition cycle, which is time-consuming and missing important information from previous cycles. To address the issues, we propose an incremental training procedure for active learning that avoids retraining the scoring model at each updating cycle. By relying on an augmentation strategy, the model can be used to derive a new score based on the combination between the lowest confidence score with its variance in previous cycles. Thus, the resulting scores give a better approximation of the uncertainty of the samples We evaluate our proposed algorithms on two popular benchmarks, FASHION-MNIST and CIFAR-10, and the results highlight that our method can improve the accuracy from 2% to 4% in comparison with the other baselines.

Article Sidebar

Article Details

Main Article Content

Abstract