CompreFace
TTS
Metric | CompreFace | TTS
---|---|---
Mentions | 28 | 231
Stars | 3,877 | 29,174
Growth | 5.7% | 6.6%
Activity | 8.3 | 9.5
Latest commit | about 1 month ago | 9 days ago
Language | Java | Python
License | Apache License 2.0 | Mozilla Public License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
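As a rough illustration of the growth figure, the sketch below assumes the usual definition of month-over-month growth: the percentage change in stars relative to the count one month earlier. The page does not spell out its exact formula, so the function names and the definition here are assumptions, not the site's method.

```python
def month_over_month_growth(stars_now: float, stars_prev: float) -> float:
    """Percentage change in stars relative to the previous month (assumed definition)."""
    if stars_prev <= 0:
        raise ValueError("previous star count must be positive")
    return 100 * (stars_now - stars_prev) / stars_prev


def implied_previous_stars(stars_now: float, growth_percent: float) -> float:
    """Invert the formula: the star count a month earlier implied by a growth figure."""
    return stars_now / (1 + growth_percent / 100)


# Star counts and growth percentages are taken from the table above.
for name, stars, growth in [("CompreFace", 3877, 5.7), ("TTS", 29174, 6.6)]:
    previous = implied_previous_stars(stars, growth)
    print(f"{name}: roughly {previous:.0f} stars a month earlier")
```

Under that assumption, a similar growth percentage means very different absolute numbers of new stars for the two projects, because their star counts differ by almost an order of magnitude.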
CompreFace
- Double-Take not getting enough events/images from Frigate?
# https://github.com/exadel-inc/CompreFace/blob/master/docs/Face-services-and-plugins.md
- DeepStack (dead?) vs CompreFace (slow?)
I looked at Double Take (a UI that lets you train your face recognition easily) and found CompreFace as one of the models it supports. It looks like what I need, but there is a catch: no OpenVINO (Intel CPU AI acceleration) support. I really like my low-power NVR setup and would like to keep it that way. Running AI on a CPU without acceleration is both power-inefficient and much slower. I have a spare low-end GPU, but if I put it in the system the current AI acceleration breaks... (I know I can probably fix it, but that is a rabbit hole I would prefer to avoid.)
- Self-hosted face recognition software
This has a good UI and also good recognition: https://github.com/exadel-inc/CompreFace
- Do we have good, GPU-accelerated text-to-speech, speech-to-text, image/video-to-text, and face/object recognition that is open source and self-hosted?
- Need help installing software in Ubuntu's terminal
wget -q -O tmp.zip 'https://github.com/exadel-inc/CompreFace/releases/download/v1.1.0/CompreFace_1.1.0.zip' && unzip tmp.zip && rm tmp.zip
- Face comparison in Stable Diffusion
- security camera advice
https://github.com/exadel-inc/CompreFace/ is already a good locally-run API for face detection and facial recognition. Thank you for your suggestion.
- hey guys which is the best tool for making facial recognition using single image in deep learning
If you are looking for open source stuff and are able to self-host it, maybe have a look at CompreFace.
- The CompreFace 1.1. Release: What’s New?
As almost always, on GitHub https://github.com/exadel-inc/CompreFace
- Working with facial recognition
Looking into this I found CompreFace (https://exadel.com/solutions/compreface/), an open source face recognition software. There are already some contrib scripts, like contrib/photils.lua, which take some images, run them through a tool, then tag them with data coming from the tool. Converting this to use CompreFace looks like a promising avenue.
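Several of the posts above describe CompreFace as a locally run API. A minimal sketch of calling it from Python follows, using only the standard library. The endpoint path `/api/v1/recognition/recognize`, the `x-api-key` header, the `file` form field, and the default `http://localhost:8000` address are taken from my recollection of the project's REST documentation and should be checked against the installed version; the function names and the default file name are hypothetical.

```python
import json
import urllib.request
import uuid


def build_recognize_request(image_bytes: bytes, api_key: str,
                            base_url: str = "http://localhost:8000",
                            filename: str = "face.jpg") -> urllib.request.Request:
    """Build (but do not send) a multipart POST carrying one image."""
    boundary = uuid.uuid4().hex
    # Hand-rolled multipart/form-data body with a single "file" part.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return urllib.request.Request(
        f"{base_url}/api/v1/recognition/recognize",  # assumed endpoint path
        data=head + image_bytes + tail,
        method="POST",
        headers={
            "x-api-key": api_key,  # key of a recognition service (assumed header name)
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def recognize(image_bytes: bytes, api_key: str, **kwargs) -> dict:
    """Send the request to a running CompreFace instance and decode the JSON reply."""
    request = build_recognize_request(image_bytes, api_key, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With a running instance and a service API key, something like `recognize(open("photo.jpg", "rb").read(), "<service_api_key>")` would return the parsed JSON response. Building the request separately from sending it keeps the sketch easy to inspect without a server.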
TTS
- OpenAI deems its voice cloning tool too risky for general release
lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.
- What things are happening in ML that we can't hear over the din of LLMs?
Not sure how relevant this is but note that Coqui TTS (the realistic TTS) has already shut down
https://coqui.ai
- Base TTS (Amazon): The largest text-to-speech model to-date
I've used coqui.ai's TTS models[0] and library[1] to great success. I was able to render a cloned voice in about 80% of the audio clip's length, and I believe you can also stream the response. Do note the model license for XTTS: it is one they wrote themselves and has some restrictions.
[0] https://huggingface.co/coqui/XTTS-v2
[1] https://github.com/coqui-ai/TTS
- FLaNK Stack Weekly 12 February 2024
- Coqui Is Shutting Down
- Coqui.ai Is Shutting Down
My only exposure to Coqui was their text to speech software. If I remember correctly the website was a commercialized service with TTS and probably some other related things. I hope the software work continues in the open.
https://github.com/coqui-ai/TTS
- Hello guys, any selfhosted alternative to eleven labs?
Coqui.ai TTS (https://github.com/coqui-ai/TTS)
- Demo of Anagnorisis - completely local recommendation system powered by Llama 2. Radio mode. Work in progress.
"tts_models/multilingual/multi-dataset/xtts_v2" model from https://github.com/coqui-ai/TTS. It gives pretty good results and works with references, so it's pretty easy to change the voice. By the way the source code of the project is open: https://github.com/volotat/Anagnorisis but be ready, the code is pretty raw for now.
- XTTS voice cloning with only seconds of audio
A recent update to their GitHub also has a no-code Gradio UI to facilitate fine-tuning and inference locally. https://github.com/coqui-ai/TTS/releases/tag/v0.21.3
- At a loss trying to get coqui_tts extension to load
No API token found for 🐸Coqui Studio voices - https://coqui.ai
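One of the posts above notes that the `tts_models/multilingual/multi-dataset/xtts_v2` model "works with references", which makes it easy to change the voice. A minimal sketch of that workflow with the coqui-ai/TTS Python package is shown here. It assumes the package is installed (`pip install TTS`) and that the model can be downloaded; the helper names and file names are hypothetical, and the XTTS model license mentioned above still applies.

```python
from pathlib import Path

# Model name as quoted in the post above.
MODEL_NAME = "tts_models/multilingual/multi-dataset/xtts_v2"


def load_xtts(model_name: str = MODEL_NAME):
    """Load the model. The import is deferred so the helpers below work without the package."""
    from TTS.api import TTS  # provided by the coqui-ai/TTS package
    return TTS(model_name)


def speak_with_reference(tts, text: str, reference_wav: str,
                         out_path: str, language: str = "en") -> Path:
    """Synthesize `text` in the voice of `reference_wav` and return the output path."""
    tts.tts_to_file(
        text=text,
        speaker_wav=reference_wav,  # short clip of the voice to imitate
        language=language,
        file_path=out_path,
    )
    return Path(out_path)
```

A typical call would be `speak_with_reference(load_xtts(), "Hello there.", "reference.wav", "hello.wav")`, where `reference.wav` is a short recording of the target voice. Swapping the reference file is all that is needed to change the voice.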
What are some alternatives?
double-take - Unified UI and API for processing and training images for facial recognition.
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
frigate - NVR with realtime local object detection for IP cameras
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
Home Assistant - Open source home automation that puts local control and privacy first.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Face Recognition - The world's simplest facial recognition api for Python and the command line
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
insightface - State-of-the-art 2D and 3D Face Analysis Project
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
librephotos - A self-hosted open source photo management service. This is the repository of the backend.
piper - A fast, local neural text to speech system