a-to-p
LiveWhisper
a-to-p | LiveWhisper | |
---|---|---|
2 | 2 | |
2 | 306 | |
- | - | |
9.5 | 0.0 | |
2 months ago | 5 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
a-to-p
-
Oration (iOS) Turns PDFs into Audiobooks
Congrats on the launch! This is pretty similar to a project i'm working on that turns articles into podcasts. https://a-to-p.vercel.app. Feel free to give it a try.
I'd love to chat about how you generate your audiobooks if you're open to sharing.
-
Show HN: A-to-P – Convert articles into podcasts
Hi HN – I’m excited to share my open source project a-to-p (article to podcast) available at https://github.com/collinc777/a-to-p. A to P is a simple web app that converts any article to a conversational podcast experience. You can try it out here: https://a-to-p.vercel.app/
I’ve explored text-to-speech (TTS) for articles using tools like Speechify. However, I noticed a gap: TTS lacked the engagement I found in podcasts. I think there’s two reasons why:
1. Articles are visually oriented, while podcasts are audio-centric.
LiveWhisper
-
Speech Recognition module in Python
I've run into this EXACT SAME problem, and ended up creating my own SpeechRecognition alternative, using sounddevice (which unlike pyaudio IS compatible with my Linux Mint's audio drivers), and OpenAI's Whisper model.. Cause that was my only option, other than risking messing up my audio drivers.. heh
-
How to install and deploy OpenAI Whisper with Python
If anyone's interested, I took a wack at making Whisper transcribe semi-live, to the terminal: https://github.com/Nikorasu/LiveWhisper
What are some alternatives?
RasaGPT - 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram
whisper-openai-gradio-implementation - Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
whisper-standalone-win - Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
SwiftWhisper - 🎤 The easiest way to transcribe audio in Swift
FlorenceBot - A fully interactive domain-specific chatbot implemented using Prolog and PySwip.
whisper-subtitles-webui - A gradio interface for making transcribed and translated subtitles for videos
Semi-Automated-Youtube-Channel - Semi automated youtube channel that has a lot of cool features for someone to use in their content generating project
subvert - Generate subtitles, summaries, and chapters from videos in seconds
tldwol - Web API that summarizes multimedia from various sources using modern AI tools.
malayalam_english_subtitle_generator - Malayalam to English Subtitle Generator for audio files using OpenAI's Whisper.
shorthanddictation - Dictation program, which uses the reading speed unit syllables per minute