Seamless: Meta's New Speech Models

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • seamless_communication

    Foundational Models for State-of-the-Art Speech and Text Translation

  • The license details are listed on the project GitHub

    https://github.com/facebookresearch/seamless_communication#l...

  • gpt-tutor

    Generate personalized audio lessons for learning languages with GPT and Azure AI speech.

  • I built just this a month ago with the Azure AI speech API, which is already pretty good at multilingual speech: https://github.com/adrianmfi/gpt-tutor. I look forward to testing whether switching to Seamless can improve it further. Seamless supporting nearly 100 languages is a nice improvement.

  • dragonfly

    Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx (by dictation-toolbox)

  • https://github.com/dictation-toolbox/dragonfly

  • I work on seamless and you can find sample code here: https://github.com/fairinternal/seamless_communication or in the HuggingFace space.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • OpenInterpreter – Natural language interface to your computer

    1 project | news.ycombinator.com | 23 Apr 2024
  • OpenAI deems its voice cloning tool too risky for general release

    1 project | news.ycombinator.com | 31 Mar 2024
  • What things are happening in ML that we can't hear over the din of LLMs?

    3 projects | news.ycombinator.com | 28 Mar 2024
  • The Next Generation of Claude (Claude 3)

    8 projects | news.ycombinator.com | 4 Mar 2024
  • Simulatrex, an open-source Large Language Model based simulation framework

    1 project | news.ycombinator.com | 17 Feb 2024