transcription

Top 23 Open-Source Transcription Projects

  • basic-pitch

    A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

  • Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    spotify/basic-pitch: Audio to midi converter

  • awesome-whisper

    🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

  • Project mention: Whisper as a PUSH to STT to Clipboard solution? | /r/OpenAI | 2023-08-26
  • whishper

    Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

  • Project mention: Whishper – Open-source, local-first audio transcription and subtitling suite | news.ycombinator.com | 2024-04-19
  • subvert

    Generate subtitles, summaries, and chapters from videos in seconds

  • generate-subtitles

    Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads via yt-dlp integration

  • Project mention: Is there any plug in that creates subtitle file out of video files? | /r/ChatGPT | 2023-10-18

    They generate .SRT files, and you can view their repository there.

  • audapolis

    An editor for spoken-word audio with automatic transcription

  • Project mention: Audapolis: An editor for spoken-word audio with automatic transcription | news.ycombinator.com | 2023-08-23
  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  • textra

    A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

  • Project mention: Easy-to-Use Apple Vision wrapper for text extraction and clustering | news.ycombinator.com | 2024-01-28

    I’ve been using textra for a wrapper on the Apple vision sdk

    https://github.com/freedmand/textra

    But this project calls torch and a bunch of other ML libs. So it’s not using Apple vision?

  • react-transcript-editor

    A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

  • SwiftWhisper

    🎤 The easiest way to transcribe audio in Swift

  • subaligner

    Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

  • leopard

    On-device speech-to-text engine powered by deep learning

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

  • hear

    Command line speech recognition and transcription for macOS

  • noScribe

    Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)

  • Project mention: Getting noScribe to work with game porting toolkit (or any other WINE distro) | /r/macgaming | 2023-06-11

    I am trying to run noScribe (for AI supported transcription of audio files) with the Game porting toolkit (I know, it's not a game, but to my understanding it should still run, since it's based on Crossover. And since it's AI based it might benefit from GPU usage). I can install the software (and my gpt generally works for games). However, if I try to start the complete app, it crashes with this error log:

  • gecko

    Gecko - A Tool for Effective Annotation of Human Conversations

  • Synthalingua

    Synthalingua - Real Time Translation

  • Project mention: Whisper as a PUSH to STT to Clipboard solution? | /r/OpenAI | 2023-08-26

    I've been working on this for a live translation/transcription sort of thing: https://github.com/cyberofficial/Synthalingua

  • parlatype

    GNOME audio player for transcription

  • vid2cleantxt

    Python API & command-line tool to easily transcribe speech-based video files into clean text

  • concordia

    Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov

  • realtime-transcription-playground

    A real-time transcription project using React and socketio

  • Project mention: Best Practices for Streaming Speech Recognition / gRPC | /r/googlecloud | 2023-06-16

    I was able to find this repo https://github.com/saharmor/realtime-transcription-playground/tree/main which uses web sockets instead, but this seems suboptimal/not gRPC. Is this a viable approach?

  • go-subgen

    Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr

  • Project mention: Self hosted call transcribing and searchable text solution | /r/selfhosted | 2023-06-09

    you might be able to use go-subgen (it's my project) if you're willing/able to trigger it via http posts. There might also be a whisper frontend that can do filesystem watching that could work better, but I don't know of any off the top of my head.

  • glaemscribe

    Glaemscribe, the Tolkienian languages/writings transcription engine.

  • Project mention: Text Editors in the Lord of the Rings (2011) | news.ycombinator.com | 2023-11-23

    in the world of text editors for the LotR, there's Glǽmscribe: https://glaemscrafu.jrrvf.com/english/glaemscribe.html

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

transcription-related posts

  • Easy-to-Use Apple Vision wrapper for text extraction and clustering

    2 projects | news.ycombinator.com | 28 Jan 2024
  • Text Editors in the Lord of the Rings (2011)

    1 project | news.ycombinator.com | 23 Nov 2023
  • Would the forum be kind enough to check if this is alright? I would like to have it engraved as a small gift for a struggling friend.

    2 projects | /r/Quenya | 21 Jun 2023
  • Would it be possible to confirm this says nazgûl and that I'm correct in using westron?

    1 project | /r/Tengwar | 17 Jun 2023
  • Getting noScribe to work with game porting toolkit (or any other WINE distro)

    2 projects | /r/macgaming | 11 Jun 2023
  • Best Practices for Streaming Speech Recognition / gRPC

    1 project | /r/googlecloud | 16 Jun 2023
  • Easiest way or tool to use Whisper to transcribe mp4 video file?

    2 projects | /r/OpenAI | 28 Apr 2023

Index

What are some of the best open-source transcription projects? This list will help you:

#   Project                            Stars
1   basic-pitch                        2,961
2   awesome-whisper                    1,022
3   whishper                             895
4   subvert                              727
5   generate-subtitles                   671
6   audapolis                            641
7   cheetah                              556
8   textra                               536
9   react-transcript-editor              536
10  SwiftWhisper                         522
11  subaligner                           418
12  leopard                              411
13  LiveWhisper                          293
14  hear                                 283
15  noScribe                             285
16  gecko                                259
17  Synthalingua                         178
18  parlatype                            169
19  vid2cleantxt                         156
20  concordia                            151
21  realtime-transcription-playground    142
22  go-subgen                             53
23  glaemscribe                           40
