Code to reproduce the experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" (https://arxiv.org/abs/2205.09357).
Why do you think that https://github.com/csinva/gan-vae-pretrained-pytorch is a good alternative to continual-pretraining-nlp-vision?