transcribe-anything
ai-notes
Our great sponsors
transcribe-anything | ai-notes | |
---|---|---|
11 | 15 | |
342 | 4,510 | |
- | - | |
9.3 | 9.8 | |
17 days ago | 11 days ago | |
Python | HTML | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
transcribe-anything
-
Summarize audio recordings in text
transcribe-anything
-
$620,000 stolen from YouTuber Ethan Klein and the H3 Podcast by MCN BroadbandTV and their CEO Shahrzad Rafati
OpenAI whisper. Here is a tool that has it, a video downloader, and some other things bundled in with it: https://github.com/zackees/transcribe-anything
- 32 Open Source Libraries for Python's 32nd Birthday
-
Show HN: Self-host Whisper As a Service with GUI and queueing
People interested in this might also be interested in transcribe-anything [1].
It automates video fetching and uses whisper to generate .srt, .vtt and .txt files.
[1] https://github.com/zackees/transcribe-anything
-
[P] Free Youtube Subtitles Generator
Nice looks great, link broken but it just needed a hyphen https://github.com/zackees/transcribe-anything
-
Gpu accelerated ML apps will soon get a lot easier to deploy - Pytorch-cuda moving to 100% pypi hosting.
Right now the cuda accelerated whls are hosted outside of pypi which can only be accessed by using `--extra-index-url`, when installing from a requirements file (pip install -r requirements.txt). However pip install doesn't allow --extra-index-url for security reasons, which means deploying cuda accelerated ML apps on python is a complicated affair, see this [script](https://github.com/zackees/transcribe-anything/blob/main/install_cuda.py) as an example of what needs to be done to uninstall conflicting cpu only version of pytorch and replace it with cuda acceleration.
- Bro, listen: Interact with OpenAI using voice
- Convert YouTube to Text with OpenAI Whisper
-
Draw an owl
Transcribe Anything
-
Transcribe Video/Audio on the web using `transcribe-anything`, a front end to WhisperAI
Code Repo: https://github.com/zackees/transcribe-anything (please give my repo a like)
ai-notes
-
Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch
the field just moves fast. I have curated a list of non-hypey writers and youtubers who explain these things for a typical SWE audience if you are interested. https://github.com/swyxio/ai-notes/blob/main/Resources/Good%...
- SDXL Turbo: A Real-Time Text-to-Image Generation Model
-
DeepEval – Unit Testing for LLMs
added to my notes! https://github.com/swyxio/ai-notes/
- ChatGPT Code Interpreter Capabilities
-
Google just released a 100% free learning path on Generative AI with 9 Courses
and here are mine, organized by beginner/intermediate/advanced
https://github.com/swyxio/ai-notes/blob/main/README.md#top-a...
and then you can go into the individual modality specific notes for more reading
- Show HN: Self-host Whisper As a Service with GUI and queueing
-
Show HN: YouTube Summaries Using GPT
there's https://learnprompting.org/
i've also been keeping a popular series of notes https://github.com/sw-yx/ai-notes/blob/main/TEXT_PROMPTS.md
-
Show HN: I reverse prompt engineered every Notion AI feature
Direct link to the source prompts are here: https://github.com/sw-yx/ai-notes/blob/main/Resources/Notion...
- GitHub - sw-yx/prompt-eng: notes for prompt engineering
- My hand-curated list of major distros and forks of Stable Diffusion. Please suggest anything I missed!
What are some alternatives?
frogbase - Transform audio-visual content into navigable knowledge.
text2image-gui - Somewhat modular text2image GUI, initially just for Stable Diffusion
Hentai-Diffusion - The official place for the best A.I.
diffusionbee-stable-diffusion-ui - Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
whisperX - WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
m1_huggingface_diffusers_demo - Demo of how to get HuggingFace Diffusers working on an M1 Mac
subtitle-generator - Generate subtitles for youtube videos for free with https://text-generator.io
stable-diffusion-ui - Easiest 1-click way to install and use Stable Diffusion on your computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image. [Moved to: https://github.com/easydiffusion/easydiffusion]
static_ffmpeg - Installs FFMPEG v5 On Win32/Ubuntu/MacOS
perceiver-pytorch - Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
DeepSpeech-examples - Examples of how to use or integrate DeepSpeech
stable-diffusion - A latent text-to-image diffusion model