Retrieval Augmentation
Retrieval Augmentation (RA) is an established technique that leverages publicly available data to enhance performances on downstream tasks. It retrieves task-relevant examples and uses them to adapt pretrained models. SWAT reports that the retrieved data has domain gaps compared to task-specific training data, which might be a good thing in terms of enhancing the adapted model's OOD generalization capability.
We adopt string matching-based RA approach to retrieve images from the VLM's pretraining dataset LAION-400M. The results of incorporating RA with PFT show that RA yields significant OOD accuracy gains.