Specialist-Diffusion
sliders
Specialist-Diffusion | sliders | |
---|---|---|
1 | 3 | |
29 | 747 | |
- | - | |
4.6 | 8.3 | |
7 months ago | about 1 month ago | |
Python | Jupyter Notebook | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Specialist-Diffusion
sliders
-
Are we at peak vector database?
> Always felt they're more like hashes/fingerprints for the RAG use cases.
Yes, I see where you’re coming from. Perceptual hashes[0] are pretty similar, the key is that similar documents should have similar embedding (unlike cryptographic hashes, where a single bit flip should produce a completely different hash).
Nice embeddings encode information spatially, a classic example of embedding arithmetic is: king - man + woman = queen[1]. “Concept Sliders” is a cool application of this to image generation [2].
Personally I’ve not had _too_ much trouble with running out of RAM due to embeddings themselves, but I did spend a fair amount of time last week profiling memory usage to make sure I didn’t run out in prod, so it is on my mind!
[0] https://en.m.wikipedia.org/wiki/Perceptual_hashing
[1] https://www.technologyreview.com/2015/09/17/166211/king-man-...
[2] https://github.com/rohitgandikota/sliders
- LoRA Adaptors for Precise Control in Diffusion Models
- List of Stable Diffusion research softwares that I don't think gotten widespread adoption.
What are some alternatives?
ziplora-pytorch - Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
stable-diffusion-reference-only - img2img version of stable diffusion. Anime Character Remix. Line Art Automatic Coloring. Style Transfer.
Rerender_A_Video - [SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
cross-image-attention - Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
SEED - Official implementation of SEED-LLaMA (ICLR 2024).
DemoFusion - Let us democratise high-resolution generation! (CVPR 2024)
RIVAL - [NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain
LAMP - Official implement code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (Few-shot-based text-to-video diffusion)
ComfyUI_experiments - Some experimental custom nodes.