modal-examples VS whisper_mic

Compare modal-examples vs whisper_mic and see what are their differences.

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
modal-examples whisper_mic
9 2
595 642
9.2% -
9.5 7.2
4 days ago 1 day ago
Python Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

modal-examples

Posts with mentions or reviews of modal-examples. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-16.

whisper_mic

Posts with mentions or reviews of whisper_mic. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-23.
  • I've built a few tools on top of GPT-3.5 (text generation, q&a with embeddings). AMA about resources and AI dev stacks for building with OpenAI's APIs
    2 projects | /r/learnmachinelearning | 23 Feb 2023
  • Whispers AI Modular Future
    14 projects | news.ycombinator.com | 20 Feb 2023
    What utilities related to Whisper do you wish existed? What have you had to build yourself?

    On the end user application side, I wish there was something that let me pick a podcast of my choosing, get it fully transcribed, and get an embeddings search plus answer q&a on top of that podcast or set of chosen podcasts. I've seen ones for specific podcasts, but I'd like one where I can choose the podcast. (Probably won't build it)

    Also on the end user side, I wish there was an Otter alternative (still paid $30/mo, but unlimited minutes per month) that had longer transcription limits. (Started building this, not much interest from users though)

    Things I've seen on the dev tool side:

    Gladia (API call version of Whisper)

    Whisper.cpp

    Whisper webservice (https://github.com/ahmetoner/whisper-asr-webservice) - via this thread

    Live microphone demo (not real time, it still does it in chunks) https://github.com/mallorbc/whisper_mic

    Streamlit UI https://github.com/hayabhay/whisper-ui

    Whisper playground https://github.com/saharmor/whisper-playground

    Real time whisper https://github.com/shirayu/whispering

    Whisper as a service https://github.com/schibsted/WAAS

    Improved timestamps and speaker identification https://github.com/m-bain/whisperX

    MacWhisper https://goodsnooze.gumroad.com/l/macwhisper

    Crossplatform desktop Whisper that supports semi-realtime https://github.com/chidiwilliams/buzz

What are some alternatives?

When comparing modal-examples and whisper_mic you can also consider the following projects:

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

WAAS - Whisper as a Service (GUI and API with queuing for OpenAI Whisper)

FlexGen - Running large language models on a single GPU for throughput-oriented scenarios.

frogbase - Transform audio-visual content into navigable knowledge.

whisperX - WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

EasyLM - Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

mlc-llm - Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

whisper-playground - Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

brev-cli - Connect your laptop to cloud computers. Follow to stay updated about our product