Show HN: Semantic Search for Any Video

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • generate-subtitles

    Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

  • 0. Excellent space, much needed. I have scripted this out for my own needs (still need to automate even more), but a web services that accepts web URLs is much appreciated and could help others.

    0b. Compare to https://freesubtitles.ai from https://news.ycombinator.com/item?id=33663486 and https://github.com/mayeaux/generate-subtitles

    Constructive critique out of love:

    1. The 10 minute length limitation makes this useless for most buried spoken content, because, you know, the longer videos have the more buried spoken content. Most detailed presentations are 15-60+ minutes where content would want to be searched IMO. A 2 hour limit would make more sense, but ideally none.

    2. The heading says "any video", but the form specifies YouTube link. This needs to support Bitchute, Odysee, and Rumble, at least, because much interesting spoken content (which is totally reasonable content, if not to say, critically important, BTW) has been repeatedly banned from YouTube. (At least Youtube gives auto-transcripts if enabled for the video, that can be downloaded with tools. This web service actually most needs to exist for video platforms OTHER THAN YouTube, which have not implemented built-in machine-transcription options yet.)

    3. The site needs SSL, or else the traffic (video links, titles, search terms) is communicated in plain text and plainly visible to all looking.

    Other platform I'm sure you can figure out, but I don't know how you will make this support larger videos affordably, but that would be a requirement for it to be really useful IMO.

    MOST IMPORTANT IDEA: I think you could support searching GRUOPS of videos. So I want to provide URLs of 500 videos, and get their transcript indexed and searchable (with good search options). And I want to share the link to upload more to that group, or to search just that group, with friends. But I do NOT want to simultaneously search the 50,000 videos that other people have listed on your service.

    Are you using whisper for transcription?

    For https://github.com/transitive-bullshit/yt-semantic-search, I'm using YouTube's built-in transcriptions which definitely aren't as high quality, but they work well enough to power the semantic search.

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
  • codesearch

    Semantic Code Search tool. Query your codebases using natural language (by rahuldan)

  • Hey really cool project!!

    I also built a semantic search engine for codebases using Openai's embedding and FAISS https://github.com/rahuldan/codesearch

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts