SadTalker
openscene
SadTalker | openscene | |
---|---|---|
16 | 3 | |
10,611 | 553 | |
9.0% | - | |
5.9 | 4.9 | |
4 days ago | 7 months ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
SadTalker
- Can some expert analyze a github repo and tell us if it's really safe or not?
-
Does the sad talker repo contain a virus/trojan yes or not?
Trojan detected when uncompressing facevid2vid_00189-model.pth · Issue #75 · OpenTalker/SadTalker (github.com)
-
Lip Sync API Service?
I am using SadTalker to create a lipsync of a still image with an audio file. The still image is from Stable Diffusion and the audio is from ChatGPT and then AWS Polly for the voice synthesis. My problem is that even though I like the results it takes one and a half minutes to generate this video. If I use the enhancer it is more like five minutes. I am using a A10 NVIDIA GPU.
-
SD + Augmented Reality
Stable Diffusion A1111 + Sadtalker Extension - https://github.com/OpenTalker/SadTalker.git
- Are there any plugins that allow you to lip-sync/move faces?
-
Judy Collins animation generated with HeyGen
Isn't this just SadTalker?
- [D] Better alternatives to Wav2Lip?
-
😋 AGI (bark 🐶) Smart waitress 🎙️
🎥 OpenTalker/SadTalker
- I just got into SD, and discovering all the different extensions has been a lot of fun. Yesterday, I stumbled across SadTalker...audio source in comments.
- Testing a new prompt-speech to video extension for A1111 stable-diffusion-webui from one single image
openscene
-
OPENSCENE can identify objects, materials, affordances, activities, and room types in complex 3D scenes, all using a single model trained without any labeled 3D data
Project website: github.io/openscene
-
Any recent tools for LiDAR segmentation?
Can anyone recommend recent tools/models that are good at segmenting point cloud data? My interest is semantic segmentation. Particularly segmenting objects in streets, such as traffic lanes, road signs, trees, power lines, etc. I tried some bits of conventional style a few years ago (YOLO and lots of 3D labeling and training, which was a pain), but I wanted to see if there is anything new out there. For example, I noticed Esri offers a power line extraction tool. I haven't tried but looks nice. Also, deep learning and language model fusion is really kicking in these days:https://github.com/pengsongyou/openscene.
What are some alternatives?
bark - 🔊 Text-Prompted Generative Audio Model
Torch-Pruning - [CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
sd-wav2lip-uhq - Wav2Lip UHQ extension for Automatic1111
VideoMAEv2 - [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
GeneFace - GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
IGEV - [CVPR 2023] Iterative Geometry Encoding Volume for Stereo Matching and Multi-View Stereo
Thin-Plate-Spline-Motion-Model - [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Instruct2Act - Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Painter - Painter & SegGPT Series: Vision Foundation Models from BAAI
bark-speaker-directory - Site for sharing Bark voices
elevenlabs-python - The official Python API for ElevenLabs Text to Speech.