CLEAR: Continual LEArning on Real-World Imagery
View DatasetDescription
Continual learning (CL) is considered as one of the next big challenges in AI. However, the existing CL benchmarks, e.g. Permuted-MNIST and Split-CIFAR, are artificially designed to be made continual and do not align with or generalize to real-world. In this paper, we introduce CLEAR, the first continual image recognition benchmark dataset with a natural temporal evolution of visual concepts in the real world that spans a decade (2004-2014). We build CLEAR from existing image collection (YFCC100M) by proposing a novel low-cost visio-linguistic dataset curation approach. It involves using pretrained vision-language models (e.g. CLIP) to quickly build high-quality labeled datasets on a tight budget. Finally, we post-process CLEAR via crowd-sourcing to remove errors and even inappropriate images hidden in original YFCC100M. The major strengths of CLEAR over prior CL benchmarks include (1) smooth and realistic temporal evolution of visual concepts with real-world imagery, enabling a more practical "online" (i.e., train on past, test on future) evaluation protocol (2) high-quality labeled data along with abundant unlabeled samples per time period for continual semi-supervised and unsupervised learning. Our extensive experiments reveal that mainstream "offline" evaluation protocols, which train and test on iid data, artificially inflate performance of CL systems, stressing the need for our "online" protocol since the models we train today will always be tested in future. Moreover, we find that state-of-the-arts CL algorithms that only utilize fully-supervised data fall short whereas unsupervised pretraining provides significant boost. Lastly, we introduce a biased reservoir-sampling algorithm that dynamically caches more recent training data, achieving the new state-of-the-arts while still leaving large room for improvement.
Citations (0)
No citations found
Mentions (0)
No mentions found
Metrics Over Time
Publication Details
Subfield
Computer Vision and Pattern Recognition
Field
Computer Science
Domain
Physical Sciences
Confidence Score
75%
Source
Open Alex