ffvideo
insightface
ffvideo | insightface | |
---|---|---|
22 | 34 | |
39 | 21,426 | |
- | 2.5% | |
0.0 | 6.8 | |
over 2 years ago | 4 days ago | |
C++ | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ffvideo
-
Ask HN: What apps have you created for your own use?
I wrote an optimized C++ FFMPEG player as a video surveillance system, initially to watch my pets in my yard, and then kept going adding (human) face detection, and then a DL/ML training scaffold, then Live555 re-encoding, then an embedded web browser, then I added tons of comments and turned it into a learning demo project. It's on Github, I still use it to watch my pets: https://github.com/bsenftner/ffvideo
-
Looking for a VMS and some doubts
Are your camera's ONVIF compatible? You can confirm this by running this free open source software: https://sourceforge.net/projects/onvifdm/ If your cameras appear in this software, then they are ONVIF compatible. If they are, then you can use my free and open source windows video player to view as many stream as you want: https://github.com/bsenftner/ffvideo This player is CPU efficient, intended for use when training video based machine learning models, so it leaves processor available for machine training. Used as a pure video player, I've had 32 video windows playing at 30 fps simultaneously using it on an i9 3.2 Ghz workstation.
-
[Question] I'm running facial recognition code however the video and the detection is extremely slow. Is there a way to reduce the lag of the video
In my ffmpeg playback library, be aware it is optimized for computer vision; therefore any audio is ignored and if playing from a file any timing information is ignored as well. When playing real time streams, such as from an IP camera or USB camera that playback is as close to real time as possible. I seem to remember something like under 20 ms per frame latency. However, IP video services expect timing information to be honored, and because mine ignores timing a YouTube video will fly by a few hundred frames per second. Likewise, playing from a local stored video file will playback as fast as your drive delivers frames. It was designed this way to minimize overhead and delay when training algorithms with video. Here's the essential source to the playback lib: https://github.com/bsenftner/ffvideo/tree/master/ffvideolib_src
-
Anyone have experience using modern OpenGL w/ wxWidgets?
Okay, thanks to u/bsenftner I was able to figure this out by looking at his Github repository. Essentially, if you want to use a specific version of OpenGL with wxGLCanvas you have to specify the major and minor versions in the attribute list passed to the constructor of wxGLCanvas.
-
Parallel programming for computer vision applications
If you take a look at my ffvideo github project (linked in my reply above) you can search for instances of std::thread and see they are fairly self contained, with logical data fencing protecting data shared between threads. Here's an example: a video frame exporter that runs in it's own thread, enabled when the end-user wants video frames written to disk: https://github.com/bsenftner/ffvideo/blob/aed42b5a3e856e24b030e71f6d92bcbabf5d6829/ffvideolib_src/ffvideo_frameExporter.h
- USB camera feed lagging when used with openCV
- Ways to create GUI for computer vision software
-
RTSP program
Try this: https://github.com/bsenftner/ffvideo
-
Video + bounding box coupled stream transmission
I do this in C++ here: https://github.com/bsenftner/ffvideo I think I'm using DLib rather than OpenCV, but at this level the difference between the two is negligible.
-
Doable? Cropping and alignment of photo set based on facial landmarks
I have some code doing this in an open source C++ project here: https://github.com/bsenftner/ffvideo Towards the bottom of the README on that page you'll see an image titled "demonstrating tilted head registration" describing what you're trying to do here.
insightface
-
Indexing iCloud Photos with AI Using LLaVA and Pgvector
I haven't used it for search, but I believe Insightface's embeddings can be used for this purpose. https://insightface.ai/
-
InsightFace are trying to kill off AI competitors on YouTube
This is Picsi.ai/InsightFace Discord server: https://discord.gg/Ym3X8U59ZN The COO of the company, enforcing these strikes is Discord user unmoved.mover This is the github of insightface: https://github.com/deepinsight/insightface
-
FaceFusion: Next generation face swapper and enhancer
Insightface offering improved model but closed source via discord bot.
https://github.com/deepinsight/insightface/issues/2294
https://github.com/deepinsight/insightface/issues/2315
-
Can I detect the physical orientation of a person using OpenCV?
The easiest I've found so far is to use InsightFace. When you call it on an image, it will return both a 2D and 3D mesh showing the orientation of the face.
-
Running Deepsight / Insightface on a linode server
Let's say I want to make a flask api using insightface on a linode server. How much ram do I really need?
-
How can Stable Diffusion help with blurs around edges for face swaps? Any ideas are welcome
I have tried Insightface ( https://insightface.ai/ ). But it struggles with face edges. i have used GFPGAN too, but that still leaves blurs around the face. Notice below https://www.youtube.com/watch?v=jWIAVjVeG1I
-
Open source software has gotten a lot better at having smooth swaps. Below is what i got.
Mainly as the base model. https://insightface.ai/ There was some post processing done to further improve quality. https://github.com/TencentARC/GFPGAN
-
I'm getting this big ass error after install roop extension. It appears as installed in my extensions tab but doesn't show any where under t2i or i2i. Please help.
in this page says that you need the onxruntime to run it https://github.com/deepinsight/insightface/tree/master/python-package
-
The ROOP extension made my face fine-tune/dreambooth efforts almost irrelevant
According to this, they're working on a paper for the 256 model right now.
-
Robert Jordan's Casting Choice for Mat
This is a tongue-in-cheek variation of a WIP for Mat Cauthon. RJ has a list of casting choices and a young James Garner was his selection for Mat Cauthon. The illustration itself is from my own WIP, and no, Mat doesn't look like James Garner in my version. I used Insight Faceswap, https://insightface.ai/ along with a pic of a young James Garner. Made me laugh...
What are some alternatives?
nicegui - Create web-based user interfaces with Python. The nice way.
Face Recognition - The world's simplest facial recognition api for Python and the command line
msdfgen - Multi-channel signed distance field generator
deepface - A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Dlib - A toolkit for making real world machine learning and data analysis applications in C++
facenet - Face recognition using Tensorflow
ffmpeg_shadertoy_filter
sd-webui-roop - roop extension for StableDiffusion web-ui
miniCMS - A document and content managment system for small businesses
CompreFace - Leading free and open-source face recognition system
fastAPI_TDD_Docker - A simple secure blog & basic CMS built with Python FastAPI, JWT, Postgres, TDD & Docker
FaceFusion - Next generation face swapper and enhancer