Make your own custom wakeword and other FOSS voice assistant solutions

This page summarizes the projects mentioned and recommended in the original post on /r/selfhosted

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • secret_sauce_ai

    Secret Sauce AI: a coordinated community of tech minded AI enthusiasts

  • Secret Sauce AI is a coordinated community of AI enthusiasts. We have come together as many individuals and projects in the FOSS voice assistant space to solve big AI problems for everyone out there.

  • wakeword-data-collector

    A prototype CLI in Python where a user can collect all of the recordings needed to produce a wakeword

  • We have released a prototype Wakeword Data Collector in Python that runs a user through the collection process.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • precise-wakeword-model-maker

    Automated, end-to-end wakeword model maker using the Precise Wakeword Engine

  • We experimentally figured out the best recipe while keeping the data sparse and made the Precise Wakeword Model Maker to do it all for you automatically.

  • precise-rs

    Precise hotword listener on Tract and Rust

  • This release represents the first phase in the wakeword project, we are working on a Rust wakeword engine based on Precise and a SpeechPy MFCC port in Rust so that user's can run the wakeword easily on their phone and other devices. It is hard to believe that there aren't any current good solutions to running a modern FOSS wakeword engine on a phone in real time. We want to change that and allow everyone access to this technology, with their own wakeword of choice.

  • Leon

    🧠 Leon is your open-source personal assistant.

  • Leon AI

  • mfcc-rust

  • This release represents the first phase in the wakeword project, we are working on a Rust wakeword engine based on Precise and a SpeechPy MFCC port in Rust so that user's can run the wakeword easily on their phone and other devices. It is hard to believe that there aren't any current good solutions to running a modern FOSS wakeword engine on a phone in real time. We want to change that and allow everyone access to this technology, with their own wakeword of choice.

  • NLU-engine-prototype-benchmarks

    Demo and benchmarks for building an NLU engine similar to those in voice assistants. Several intent classifiers are implemented and benchmarked. Conditional Random Fields (CRFs) are used for entity extraction.

  • This will be the next project we focus on. We will benchmark current solutions, improving general data sets, and publish information to help everyone improve upon their current NLU-NLG use cases. All of this is still a heavy work in progress. * NLU engine prototype benchmark and examples * Snips data set converter

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • NLU-snips-converter

    Quick and dirty solution to convert CSV training data into Snips JSON format and train an engine

  • This will be the next project we focus on. We will benchmark current solutions, improving general data sets, and publish information to help everyone improve upon their current NLU-NLG use cases. All of this is still a heavy work in progress. * NLU engine prototype benchmark and examples * Snips data set converter

  • voice-assistant-protocol

    A bus for voice assistants, including benchmarks.

  • We are working on a universal Voice Assistant Protocol (VAP)

  • lily

    Open source voice assistance for humans

  • Lily

  • glados-voice-assistant

    DIY Voice Assistant based on the GLaDOS character from Portal video game series. Works with home assistant!

  • GLaDOS Voice Assistant

  • VISOR---Android-Version-Assistant

    My in-development personal Android assistant

  • V.I.S.O.R.

  • DeepSpeech

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts