Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP
Why do you think that https://github.com/rom1504/clip-retrieval is a good alternative to Multi-Modal-Comparators?