Coral TPU Dev Board for speech-to-text and NVIDIA AGX as host running LLaMA?

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA.

  • project-keyword-spotter

    Audio Keyphrase Detector

  • I'm looking to replace Alexa. I own the hardware and have started putting things together, but I haven't sorted it all out yet. I have an audio classification model (a keyphrase detector) running on the Coral TPU board next to our Echo Dot (Alexa). It is a modified, cobbled-together, absolute hack, but it's not too bad. I'd say it almost hears better than Alexa at times.

  • whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  • I'm currently using Whisper with the whisper.cpp engine. It is hands down the best speech-to-text model, and it is the only one I tried that handles background noise such as music, TV, or traffic.

  • mycroft-core

    Mycroft Core, the Mycroft Artificial Intelligence platform.

  • But I would recommend writing some proper glue logic in Python and using sockets for communication. If you really want to get rid of Alexa, though, it's probably worth setting up mycroft.ai or another open-source assistant.
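
The socket-based glue logic suggested above could look roughly like the sketch below. It is only an illustration under stated assumptions, not the poster's setup: the event format (one JSON object per line), the field names, and the helper names `serve_once`, `send_event`, and `demo` are all hypothetical. The idea is that the board running the keyphrase detector sends a small event over TCP, and the host machine receives it and decides what to do next (for example, start recording audio for transcription).

```python
import json
import socket
import threading


def serve_once(server_sock, handler):
    """Host side: accept one connection and pass each JSON event to handler."""
    conn, _addr = server_sock.accept()
    # Reading through a file wrapper gives simple newline-delimited framing.
    with conn, conn.makefile("r", encoding="utf-8") as reader:
        for line in reader:
            line = line.strip()
            if line:
                handler(json.loads(line))


def send_event(host, port, event):
    """Detector side: send one event as a single JSON line, then disconnect."""
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall((json.dumps(event) + "\n").encode("utf-8"))


def demo():
    """Run both ends on localhost to show the round trip."""
    received = []
    server_sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server_sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    server_sock.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    server_sock.listen(1)
    port = server_sock.getsockname()[1]

    worker = threading.Thread(target=serve_once, args=(server_sock, received.append))
    worker.start()

    # Hypothetical event; the field names and values are assumptions.
    send_event("127.0.0.1", port, {"event": "keyphrase", "phrase": "hey computer", "score": 0.93})

    worker.join(timeout=5)
    server_sock.close()
    return received


if __name__ == "__main__":
    print(demo())
```

On real hardware, the host would bind to an address reachable from the detector board instead of `127.0.0.1`, and `serve_once` would sit inside a loop so that it keeps accepting new connections. Newline-delimited JSON is used here only because it is easy to debug by hand; any framing that both ends agree on would work.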

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
