ffvideo
chatgpt-shell
ffvideo | chatgpt-shell | |
---|---|---|
22 | 25 | |
39 | 768 | |
- | - | |
0.0 | 9.1 | |
over 2 years ago | about 1 month ago | |
C++ | Emacs Lisp | |
- | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ffvideo
-
Ask HN: What apps have you created for your own use?
I wrote an optimized C++ FFMPEG player as a video surveillance system, initially to watch my pets in my yard, and then kept going adding (human) face detection, and then a DL/ML training scaffold, then Live555 re-encoding, then an embedded web browser, then I added tons of comments and turned it into a learning demo project. It's on Github, I still use it to watch my pets: https://github.com/bsenftner/ffvideo
-
Looking for a VMS and some doubts
Are your camera's ONVIF compatible? You can confirm this by running this free open source software: https://sourceforge.net/projects/onvifdm/ If your cameras appear in this software, then they are ONVIF compatible. If they are, then you can use my free and open source windows video player to view as many stream as you want: https://github.com/bsenftner/ffvideo This player is CPU efficient, intended for use when training video based machine learning models, so it leaves processor available for machine training. Used as a pure video player, I've had 32 video windows playing at 30 fps simultaneously using it on an i9 3.2 Ghz workstation.
-
[Question] I'm running facial recognition code however the video and the detection is extremely slow. Is there a way to reduce the lag of the video
In my ffmpeg playback library, be aware it is optimized for computer vision; therefore any audio is ignored and if playing from a file any timing information is ignored as well. When playing real time streams, such as from an IP camera or USB camera that playback is as close to real time as possible. I seem to remember something like under 20 ms per frame latency. However, IP video services expect timing information to be honored, and because mine ignores timing a YouTube video will fly by a few hundred frames per second. Likewise, playing from a local stored video file will playback as fast as your drive delivers frames. It was designed this way to minimize overhead and delay when training algorithms with video. Here's the essential source to the playback lib: https://github.com/bsenftner/ffvideo/tree/master/ffvideolib_src
-
Anyone have experience using modern OpenGL w/ wxWidgets?
Okay, thanks to u/bsenftner I was able to figure this out by looking at his Github repository. Essentially, if you want to use a specific version of OpenGL with wxGLCanvas you have to specify the major and minor versions in the attribute list passed to the constructor of wxGLCanvas.
-
Parallel programming for computer vision applications
If you take a look at my ffvideo github project (linked in my reply above) you can search for instances of std::thread and see they are fairly self contained, with logical data fencing protecting data shared between threads. Here's an example: a video frame exporter that runs in it's own thread, enabled when the end-user wants video frames written to disk: https://github.com/bsenftner/ffvideo/blob/aed42b5a3e856e24b030e71f6d92bcbabf5d6829/ffvideolib_src/ffvideo_frameExporter.h
- USB camera feed lagging when used with openCV
- Ways to create GUI for computer vision software
-
RTSP program
Try this: https://github.com/bsenftner/ffvideo
-
Video + bounding box coupled stream transmission
I do this in C++ here: https://github.com/bsenftner/ffvideo I think I'm using DLib rather than OpenCV, but at this level the difference between the two is negligible.
-
Doable? Cropping and alignment of photo set based on facial landmarks
I have some code doing this in an open source C++ project here: https://github.com/bsenftner/ffvideo Towards the bottom of the README on that page you'll see an image titled "demonstrating tilted head registration" describing what you're trying to do here.
chatgpt-shell
-
Devin, the First AI Software Engineer
I think it is a tooling issue. It is in no way obvious how use LLM's effectively, especially for really good writing results. Tweaking and tinkering can be time consuming indeed, but i use lately the chatgpt-shell [1] and it lends well to an iterative approach. One needs to cycle through some styles first, and then decide how to most effectively prompt for better results.
[1]https://github.com/xenodium/chatgpt-shell/blob/bf2d12ed2ed60...
-
Ask HN: What apps have you created for your own use?
- https://xenodium.com/an-ios-journaling-app-powered-by-org-pl... - Lately, I'm having a go at building a privacy-focused plain-text-based iOS journaling app. I starte building it for someone important in my life but now using it myself.
- https://flathabits.com - After reading Atomic Habits, I wanted a habit tracker but most had more friction than I wanted, required accounts, had distractions, lock-in etc. so I built a privacy-focused app, with little friction and no-lockin (saves to plain text).
- https://plainorg.com - There are a gazillion markdown apps on the App Store, but hardly any supporting org markup, so I built one.
- https://xenodium.com/scratch-a-minimal-scratch-area - I wanted a surface where I could just dump text with as few taps as possible.
- https://github.com/xenodium/macosrec - I wanted to take either screenshots or videos of macOS apps from the command line, so I could integrate anywhere.
- https://github.com/xenodium/chatgpt-shell - I'm far down the Emacs rabbit hole, so I prefer Emacs-integrated tools. Built a ChatGPT Emacs shell to see what the hype was all about ;) tl;dr it really does help.
- https://github.com/xenodium/dwim-shell-command - A way to manage and easily apply the gazillion one-liners (and more complex scripts) I've come across. I got close to 100 utils check-in now https://github.com/xenodium/dwim-shell-command#my-toolbox
- https://github.com/xenodium/ob-swiftui - Play around with SwiftUI layouts from the comfort of my preferd editor.
- https://github.com/xenodium/company-org-block - Org block completion.
- https://xenodium.com - I tend to scratch own itches and post my solutions here.
-
More advanced emacs tutorials
Every so often I scratch an itch to improve my workflow and write it up https://xenodium.com.
-
What I Have Changed My Mind About in Software Development
With lsp, the gap between IDEs vs text editors is narrowing. While I still prefer Emacs, I’m pragmatic enough to jump on to whatever tool does a better job for a specific task. At times, that is Xcode.
Was also sceptical about ChatGPT and changed my mind like OP. I was less pragmatic on this one and brought ChatGPT over to Emacs https://github.com/xenodium/chatgpt-shell. Pretty happy with the result so far.
-
Edit-mode for point-by-point text proofreading, like EditGPT?
There are a handful of chatgpt Emacs packages. I happen to have authored chatgpt-shell. For making a synchronous request, can use chatgpt-shell-post-prompt. For async, use chatgpt-shell-send-to-buffer with a handler.
-
Ask HN: Could you show your personal blog here?
https://xenodium.com will hit 10 years in November. It started as a single org file for personal notes (programming, cooking, Emacs, bookmarks, iOS dev, travel). One day, I decided to export it to HTML and make it accessible to me from anywhere. Sorta just became both notes and blog over time…
While the tone of the posts may have evolved a bit, the blog still serves as personal notes/reference of sorts. The tech behind it hasn’t changed a whole lot. It remains a single org file (https://raw.githubusercontent.com/xenodium/xenodium.github.i...) with my own ugly elisp hacks, but hey does the job ;-)
-
Use emacs as a ChatGPT app
u/xenodium's chatgpt-shell deserves a mention. It uses an intuitive Comint-shell based interaction and includes support for executable code blocks (in the comint-shell) and for org-babel. It's very polished -- I believe it also includes support for saving and restoring sessions, which gptel is yet to add.
-
Do you also write small guides for yourself to remind you of your own emacs workflows?
Yep. Turn some of them into posts https://xenodium.com
-
Is orgmode really that much better than an equivalent workflow using vim + other tools?
For certain concepts that I don't understand fully, I'm using chatgpt-shell. It is beyond fantastic and almost impossible to describe in a single post. This is, for example, just one of my use cases: When I'm writing a comment or a message to my colleague (and of course, yes, I edit just about any text in Emacs), I can select a paragraph and ask chatgpt-shell to improve it. It does, but it also shows me the diff of the changes, that is how I set it up.
-
Twenty Years of Blogging
Mine (https://xenodium.com) will hit 10 years in November. It started as a single org file for personal notes. One day I decided to export it to HTML as my accesible notes from anywhere. Sorta just became both notes and blog over time… While the tone of the posts may have evolved over time, they still serve as a notes/reference of sorts. The tech behind it hasn’t changed a whole lot. It remains is a single org file (https://raw.githubusercontent.com/xenodium/xenodium.github.i...).
What are some alternatives?
nicegui - Create web-based user interfaces with Python. The nice way.
E2B - Secure cloud runtime for AI apps & AI agents. Fully open-source.
msdfgen - Multi-channel signed distance field generator
gptel - A simple LLM client for Emacs
Dlib - A toolkit for making real world machine learning and data analysis applications in C++
emacs-chatgpt-jarvis - press F12 to record, use whisper to transcribe and chatgpt to answer
ffmpeg_shadertoy_filter
ideas - a hundred ideas for computing - a record of ideas - https://samsquire.github.io/ideas/
miniCMS - A document and content managment system for small businesses
go-cleanarchitecture - An example Go application demonstrating The Clean Architecture.
fastAPI_TDD_Docker - A simple secure blog & basic CMS built with Python FastAPI, JWT, Postgres, TDD & Docker
splitter - React component for building split views like in VS Code