ffvideo
soundfingerprinting
ffvideo | soundfingerprinting | |
---|---|---|
22 | 6 | |
39 | 912 | |
- | - | |
0.0 | 8.1 | |
over 2 years ago | about 1 month ago | |
C++ | C# | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ffvideo
-
Ask HN: What apps have you created for your own use?
I wrote an optimized C++ FFMPEG player as a video surveillance system, initially to watch my pets in my yard, and then kept going adding (human) face detection, and then a DL/ML training scaffold, then Live555 re-encoding, then an embedded web browser, then I added tons of comments and turned it into a learning demo project. It's on Github, I still use it to watch my pets: https://github.com/bsenftner/ffvideo
-
Looking for a VMS and some doubts
Are your camera's ONVIF compatible? You can confirm this by running this free open source software: https://sourceforge.net/projects/onvifdm/ If your cameras appear in this software, then they are ONVIF compatible. If they are, then you can use my free and open source windows video player to view as many stream as you want: https://github.com/bsenftner/ffvideo This player is CPU efficient, intended for use when training video based machine learning models, so it leaves processor available for machine training. Used as a pure video player, I've had 32 video windows playing at 30 fps simultaneously using it on an i9 3.2 Ghz workstation.
-
[Question] I'm running facial recognition code however the video and the detection is extremely slow. Is there a way to reduce the lag of the video
In my ffmpeg playback library, be aware it is optimized for computer vision; therefore any audio is ignored and if playing from a file any timing information is ignored as well. When playing real time streams, such as from an IP camera or USB camera that playback is as close to real time as possible. I seem to remember something like under 20 ms per frame latency. However, IP video services expect timing information to be honored, and because mine ignores timing a YouTube video will fly by a few hundred frames per second. Likewise, playing from a local stored video file will playback as fast as your drive delivers frames. It was designed this way to minimize overhead and delay when training algorithms with video. Here's the essential source to the playback lib: https://github.com/bsenftner/ffvideo/tree/master/ffvideolib_src
-
Anyone have experience using modern OpenGL w/ wxWidgets?
Okay, thanks to u/bsenftner I was able to figure this out by looking at his Github repository. Essentially, if you want to use a specific version of OpenGL with wxGLCanvas you have to specify the major and minor versions in the attribute list passed to the constructor of wxGLCanvas.
-
Parallel programming for computer vision applications
If you take a look at my ffvideo github project (linked in my reply above) you can search for instances of std::thread and see they are fairly self contained, with logical data fencing protecting data shared between threads. Here's an example: a video frame exporter that runs in it's own thread, enabled when the end-user wants video frames written to disk: https://github.com/bsenftner/ffvideo/blob/aed42b5a3e856e24b030e71f6d92bcbabf5d6829/ffvideolib_src/ffvideo_frameExporter.h
- USB camera feed lagging when used with openCV
- Ways to create GUI for computer vision software
-
RTSP program
Try this: https://github.com/bsenftner/ffvideo
-
Video + bounding box coupled stream transmission
I do this in C++ here: https://github.com/bsenftner/ffvideo I think I'm using DLib rather than OpenCV, but at this level the difference between the two is negligible.
-
Doable? Cropping and alignment of photo set based on facial landmarks
I have some code doing this in an open source C++ project here: https://github.com/bsenftner/ffvideo Towards the bottom of the README on that page you'll see an image titled "demonstrating tilted head registration" describing what you're trying to do here.
soundfingerprinting
-
Ask HN: How many of you are self employed?
Started 10 years ago as an open-source project, building an algorithm for audio fingerprinting. Added a commercial offering, selling storage built specifically for audio fingerprints, targeting enterprise customers. Since the offering was too technical (it's hard to sell solutions to problems that are too narrow and domain-specific), pivoted to more "business-oriented problems". This last year's pivot is a chance to finally grow. Running a business in single-player mode is, at times, too stressful. Aside from the technical part, which I very much enjoy, I need to wear marketing, sales, and customer support hats.
[1] - https://emysound.com
-
Ask HN: What apps have you created for your own use?
The OP said elsewhere they are using this[1] library, which allows you to specify minimum seconds to match, so you'd presumably set it to match 20 seconds or whatever minimum length podcast commercials usually are.
Most other audio fingerprinting libraries I've seen allow you to specify min/max time, as well.
HTH.
1. https://github.com/AddictedCS/soundfingerprinting
- [P] Is it feasible to find a mapping between two non-synthesized audio signals of the same audio sequence?
-
HN: == Happy New Year HN == (What is your “plans” for the new year?)
My goal for the next year is just to work fewer hours. Covid pushed my work habit to the extreme, and I need to rebalance.
1. Read more books, less social media/news.
2. Spend more time with my friends. I haven't seen some of them IRL for more than a year.
3. Exercise more, play tennis with my daughter, spend quality time with my kids.
4. Spend more time with my parents. They've become visibly older in the last years, an observation that scares me.
5. Focus more on marketing for the businesses that I've bootstrapped. All the shiny new features that are developed are not as important as getting more people to use your product.
6. Promote open-source project to 1k GitHub stars[1]. I know it isn't very meaningful, but it's just nice to receive a bit of recognition from the community.
7. Enjoy life, don't stress about all the little things that happen along the way.
Happy new year!
[1]: https://github.com/AddictedCS/soundfingerprinting
- Demonstration of a reverse image search algorithm for detecting transformed images, partial images, and sub-images (link in comments)
What are some alternatives?
nicegui - Create web-based user interfaces with Python. The nice way.
AudioDeviceCmdlets - AudioDeviceCmdlets is a suite of PowerShell Cmdlets to control audio devices on Windows
msdfgen - Multi-channel signed distance field generator
lineiform - A meta-JIT library for Rust interpreters
Dlib - A toolkit for making real world machine learning and data analysis applications in C++
ffmediaelement - FFME: The Advanced WPF MediaElement (based on FFmpeg)
ffmpeg_shadertoy_filter
elastiknn - Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
miniCMS - A document and content managment system for small businesses
Lean - Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
fastAPI_TDD_Docker - A simple secure blog & basic CMS built with Python FastAPI, JWT, Postgres, TDD & Docker
UnityAudioVisualizer - Audio for Smart Assistant.