tevr-asr-tool
MathAnimation
tevr-asr-tool | MathAnimation | |
---|---|---|
9 | 4 | |
408 | 953 | |
0.0% | - | |
5.9 | 4.1 | |
over 1 year ago | 17 days ago | |
C | C++ | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tevr-asr-tool
-
Ask HN: Why is pay so much higher in the US? (or is it?)
Do some cool open source stuff. Since August 9th, I received about 50 job offers mentioning this repo: https://github.com/DeutscheKI/tevr-asr-tool Most of them were senior engineer or AI researcher, with a few CTO / co-founder offers sprinkled in. I'm not in the market and this was a bit unexpected to me, but those emails sounded like they would pay well. And most was remote for US companies.
-
Ask HN: Who is using C++ as the main language for new project?
Me, because I plan to extend to realtime processing: https://github.com/DeutscheKI/tevr-asr-tool
Also, all the performance critical stuff on my production servers is C++ with JNI or pybind wrappers.
- Show HN: 用284行C++语言实现最先进的德语语音识别 (Show HN: State-of-the-art German speech recognition in 284 lines of C++)
-
Show HN: State-of-the-Art German Speech Recognition in 284 lines of C++
The unique work that makes this speech recognition superior to other tools is in those 284 lines of code: https://github.com/DeutscheKI/tevr-asr-tool/blob/master/tevr...
That's a custom-designed beam search decoder implemented in C++ and based on the research for my TEVR paper. It increases performance by a relative 16% reduction in word error rate.
MathAnimation
-
Ask HN: Tell us about your project that's not done yet but you want feedback on
https://github.com/ambrosiogabe/MathAnimation
If you’ve ever tried to make a mathematical animation (think 3Blue1Brown), it’s a real pain. I was using manim for awhile to make animations for my YT channel, but the whole iteration process felt very slow and repetitive. So I thought I would recreate manim over the weekend, except with a GUI and real-time feedback. It’s been a year and a half and I’m hoping this weekend will be done soon so I can move on and start making videos again.
So far, it does a lot, but it still needs a lot of polish and refinement. The readme gives some gifs and a better idea of the feature set right now.
-
The Worst API Ever Made
Win32 definitely has some stinkers. But, video encoding APIs definitely take the cake for me. I've only tried a couple of FFmpeg's APIs and AV1's API, but my God, these are the worst API's I've ever had to deal with.
Just as an example, all I wanted was an API like this[0] for FFmpeg. In order to implement that API (which in my opinion is reasonable), I had to write this monstrosity[1]. It took me a solid week to find an example of how to do this, then another few days of fiddling until I finally just barely got something working. Then I threw in the towel even though the performance was horrible. I tried again a year later and spent another month wrestling with AV1 :/
The amount of leakage going on in these APIs is absolutely insane. I shouldn't have to know the intimate details of how video encoding works to use your library. If I do, then I may as well write my own encoder at that point.
[0]: https://github.com/ambrosiogabe/MathAnimation/blob/18c004bca...
[1]: https://github.com/ambrosiogabe/MathAnimation/blob/18c004bca...
-
Show HN: Mafs – React components for interactive math
I just so happen to be working on a real-time Gui first replacement for manim :)
It still has a ways to go, but I was able to create one video with it so far and I'm working on all the pain points I ran into while using it. Feel free to check it out if you're interested!
https://github.com/ambrosiogabe/MathAnimation
-
Ask HN: Who is using C++ as the main language for new project?
I'm using C++ for a minecraft clone that I've been tinkering on for the past year[0]. I also plan on using embedded lua for scripting, and I'm using RML UI for game HUDs, ImGui for development tools, and OpenGL for graphics. I use premake for my build system but plan on switching to CMake.
I'm also using it for an animation tool[1]. I've been using 3Blue1Brown's Manim (written in Python) which is amazing, but it lacks real-time editing and proper 3D blending. It also lacks audio synchronization, 3D texture support, and some more complex features that I'd like to add :)
[0]: https://youtu.be/UAUdIQZKV88
[1]: https://github.com/ambrosiogabe/MathAnimation
What are some alternatives?
DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
k8deployer - An experimental deployer for kubernetes apps for developers who are too lazy (or busy) to learn Helm.
wisewriterv3 - From an input, creates a full book, with cover art and sells it on Amazon. Using OpenAI for content, Midjourney for covers, and puppeteer for product input.
SLID-on-Microcontrollers - Speech Classification using a Convolutional Neural Network running on a Microcontroller
israpdead_react - wip react rebuild of israpisdead. v1 is live now
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
shelby_as_a_service - Production-ready LLM Agents. Just add API keys
materialize - The data warehouse for operational workloads.
LoopModels - "Full speed or nothing." - James Hetfield
gui_starter_template - A template CMake project to get you started with C++ and tooling
slint - Slint is a declarative GUI toolkit to build native user interfaces for Rust, C++, or JavaScript apps.