I Built an AI Search Engine that can find exact timestamps for anything on Youtube using OpenAI Whisper

This page summarizes the projects mentioned and recommended in the original post on /r/OpenAI

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • atila-core-service

    The primary backend service for Atila apps.

  • Yeah, that's in the roadmap! I've actually just created an issue for this based on your feedback. https://github.com/atilatech/atila-core-service/issues/9

  • atlas-ui

    Browser extension to help users find and manage scholarships.

  • Frontend: https://github.com/atilatech/atlas-ui

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • youtube-transcript-api

    This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!

  • Get the transcript of a Youtube video using the URL from the youtube transcript api.

  • pytube

    A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

  • If a transcript doesn’t exist, download the audio of the video as an mp3 file with Pytube and use our first ML model, OpenAI Whisper to transcribe

  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

  • If a transcript doesn’t exist, download the audio of the video as an mp3 file with Pytube and use our first ML model, OpenAI Whisper to transcribe

  • sentence-transformers

    Multilingual Sentence & Image Embeddings with BERT

  • Break up transcript into shorter segments and convert segments to a 768 vector array. Use a process known as embedding using our second ML model, UKP Labs BERT’s sentence transformer model.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts