This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
Why do you think that https://github.com/FurkanGozukara/Stable-Diffusion is a good alternative to caption-upsampling
This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.
Why do you think that https://github.com/FurkanGozukara/Stable-Diffusion is a good alternative to caption-upsampling