StableVideo
MotionDiffuse
StableVideo | MotionDiffuse | |
---|---|---|
7 | 1 | |
1,327 | 741 | |
- | - | |
6.4 | 10.0 | |
8 months ago | about 1 year ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
StableVideo
-
MagicEdit: High-Fidelity Temporally Coherent Video Editing
Looks like its building on the same concepts as stable video.
https://github.com/rese1f/StableVideo
- StableVideo: Text-driven Consistency-aware Diffusion Video Editing
-
StableVideo: Text-Driven Consistency-Aware Diffusion Video Editing
You can see the source of the github pages site on github: https://github.com/rese1f/StableVideo/tree/web
It seems they forked from somebody else and then changed the content to match their paper.
-
StableVideo
Code: https://github.com/rese1f/StableVideo
MotionDiffuse
-
[R] MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model + Gradio Demo
github: https://github.com/mingyuan-zhang/MotionDiffuse
What are some alternatives?
DiffSinger - DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
text-to-motion - Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."
PaddleNLP - π Easy-to-use and powerful NLP and LLM library with π€ Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including πText Classification, π Neural Search, β Question Answering, βΉοΈ Information Extraction, π Document Intelligence, π Sentiment Analysis etc.
text2room - Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
ReVersion - ReVersion: Diffusion-Based Relation Inversion from Images
AvatarCLIP - [SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
ReuseAndDiffuse - Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Make-It-3D - [ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
LAMP - Official implement code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (Few-shot-based text-to-video diffusion)
MotionGPT - [NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Implicit-Internal-Video-Inpainting - [ICCV 2021]: IIVI: Internal Video Inpainting by Implicit Long-range Propagation
crispy - Crispy is a machine-learning algorithm to make video-games montages efficiently. It uses a neural network to detect highlights in the video-game frames