eq_harry
SpeechRecognition
eq_harry | SpeechRecognition | |
---|---|---|
5 | 16 | |
0 | 8,051 | |
- | - | |
0.0 | 8.7 | |
over 2 years ago | 9 days ago | |
Python | Python | |
- | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
eq_harry
-
Cooler Master MH752 Review!
Ya boi just got his dekoni nuggets and slapped it on the mh 752; this in combination with a custom eq sets me up good for a Friday night.
-
Sennheiser HD560s thoughts from a beginner. Do I just not like 'analytical'?
If you'd like to try some of the fruits of my work (560s are my main after eq), the doors are open.
-
Short review for almost every „standard“ headphone up to 350 Euros
Some time went by and I gave it another chance, and with a custom made eq.
-
Need help 🥲 please!
Sidenote, since we are on the headphones subreddit, the headset I use is voiced and tuned to accentuate intelligibility and vocal clarity, so do not expect it to have booming bass or a flat response. With that being said, I find the lack of bass to be a huge plus, since some plantronics headsets I have used in the past have had enough bass to give me headaches during longer calls. Additionally, with the right eq, the Sennheisers can sound relatively impressive as an "open back" on ear kind of headset. HD 800 killer confirmed /s!
-
How to eq headphones?
If you are feeling adventurous and would like to eq headphones according to the HRTF instead of a standard like diffuse or harman, feel free to poke around in this repo.
SpeechRecognition
-
help with script (beginner)
Start and Stop Listening Example
-
MacWhisper: Transcribe audio files on your Mac
There is a great library that has support not only with OpenAIs whisper but many others that also work offline. https://github.com/Uberi/speech_recognition
-
Unpopular Opinion: a lot of Obsidian community make Obsidian sound like something cringey/productivity guru-y
This is the library: https://github.com/Uberi/speech_recognition
-
Nvim-VoiceRec : Add Speech-To-Text To Neovim! (useful for gpt)
It is python remote plugin that is a tin wrapper around speech_recognition package.
- Speech-to-text software
-
Voice commands in Doom Eternal possible?
I am less familiar with speech recognition myself. I have implemented something similar many years ago, back when Google had a REST API that allowed you to upload audio and they would respond with the recognized words/sentence. I think they still have the same API available, though. They limited how much you could send, but for voice commands it was pretty solid. However, SpeechRecognition looks like a library worth trying out for this, as that seems like it could do offline processing depending on the underlying library. They also have some examples to look at.
-
Build Simple CLI-Based Voice Assistant with PyAudio, Speech Recognition, pyttsx3 and SerpApi
SpeechRecognition
- Need help with speech recognition
-
Wiki for the podcast
I found this one here
-
How to use my speaker as input and my mic as output?
https://github.com/Uberi/speech_recognition/blob/master/reference/library-reference.rst this might help. I guess your best bet is to rtfm.
What are some alternatives?
pydub - Manipulate audio with a simple and easy high level interface
pyAudioAnalysis - Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
allosaurus - Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
speech-to-text-websockets-python
speechpy - :speech_balloon: SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Watson Developer Cloud Python SDK - :snake: Client library to use the IBM Watson services in Python and available in pip as watson-developer-cloud
librosa - Python library for audio and music analysis
pysle - Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.
praatIO - A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).
Prosodylab-Aligner - Python interface for forced audio alignment using HTK and SoX
m3u8 - Python m3u8 Parser for HTTP Live Streaming (HLS) Transmissions